NVIDIA: Llama 3.1 Nemotron 70B Instruct
🇺🇸 NVIDIA · Llama 3.1
Input Price $1.20 per million tokens NT$38.4
Output Price $1.20 per million tokens NT$38.4
Context Window 131K tokens Output limit: 16K
OpenRouter Route Price Please verify with official pricing pages
| Dimension | Unit | Price (USD) |
|---|---|---|
| Input | per 1M tokens | $1.20 |
| Output | per 1M tokens | $1.20 |
- Provider
- NVIDIA (NVIDIA)
- Model Family
- Llama 3.1
- Version String
- nvidia/llama-3.1-nemotron-70b-instruct
- Status
- active
- Modality
- text
- Context Window
- 131,072 tokens
- Output Limit
- 16,384 tokens
Index Metrics
Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis
Agentic Index 8 F Measured: 2026-05-27
Coding Index 11 F Measured: 2026-05-27
Intelligence Index 13 F Measured: 2026-05-27
Benchmark Scores
Data source: Artificial Analysis
AA-LCR 7.0% F Measured: 2026-05-27
GPQA Diamond 46.5% C Measured: 2026-05-27
HLE 4.6% D Measured: 2026-05-27
HLE 4.6% D Measured: 2026-05-27
IFBench 30.8% D Measured: 2026-05-27
Non-Hallucination 31.2% Measured: 2026-05-27
Omniscience Accuracy 16.4% Measured: 2026-05-27
SciCode 23.3% C Measured: 2026-05-27
Tau2 23.1% Measured: 2026-05-27
TerminalBench 4.5% Measured: 2026-05-27
Performance Metrics
Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis
First Token Latency 0.5s Measured: 2026-05-27
Output Speed 292 t/s Measured: 2026-05-27
Response Time 2.2s Measured: 2026-05-27
90-Day Price Trend
Input / Output price (USD per 1M tokens)
Past 90 days of records; every price change is shown here
| Date | Dimension | Price (USD) | Source |
|---|---|---|---|
| 2026-05-07 | Output | $1.20 | openrouter |
| 2026-05-07 | Input | $1.20 | openrouter |
| 2026-05-07 | Output | $1.20 | openrouter |
| 2026-05-07 | Input | $1.20 | openrouter |
| 2026-05-07 | Output | $1.20 | openrouter |
| 2026-05-07 | Input | $1.20 | openrouter |
| 2026-05-06 | Output | $1.20 | openrouter |
| 2026-05-06 | Input | $1.20 | openrouter |
| 2026-05-05 | Output | $1.20 | openrouter |
| 2026-05-05 | Input | $1.20 | openrouter |
| 2026-05-05 | Output | $1.20 | openrouter |
| 2026-05-05 | Input | $1.20 | openrouter |
| 2026-05-05 | Output | $1.20 | openrouter |
| 2026-05-05 | Input | $1.20 | openrouter |
| 2026-05-05 | Output | $1.20 | openrouter |
| 2026-05-05 | Input | $1.20 | openrouter |
| 2026-05-05 | Output | $1.20 | openrouter |
| 2026-05-05 | Input | $1.20 | openrouter |
| 2026-05-05 | Output | $1.20 | openrouter |
| 2026-05-05 | Input | $1.20 | openrouter |
| 2026-05-05 | Output | $1.20 | openrouter |
| 2026-05-05 | Input | $1.20 | openrouter |
| 2026-05-04 | Output | $1.20 | openrouter |
| 2026-05-04 | Input | $1.20 | openrouter |
| 2026-05-04 | Output | $1.20 | openrouter |
| 2026-05-04 | Input | $1.20 | openrouter |
| 2026-05-04 | Output | $1.20 | openrouter |
| 2026-05-04 | Input | $1.20 | openrouter |
| 2026-05-04 | Output | $1.20 | openrouter |
| 2026-05-04 | Input | $1.20 | openrouter |
| 2026-05-03 | Output | $1.20 | openrouter |
| 2026-05-03 | Input | $1.20 | openrouter |
| 2026-05-03 | Output | $1.20 | openrouter |
| 2026-05-03 | Input | $1.20 | openrouter |
| 2026-05-03 | Output | $1.20 | openrouter |
| 2026-05-03 | Input | $1.20 | openrouter |
Description
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...
Key Insights
Key data points from this page for quick reference and citation.
- NVIDIA: Llama 3.1 Nemotron 70B Instruct Input price: $1.2/M tokens
- NVIDIA: Llama 3.1 Nemotron 70B Instruct Output price: $1.2/M tokens
- Context window: 131,072 tokens
- Provider: NVIDIA
- Model family: Llama 3.1
- Modalities: text
- Data source: OpenRouter, updated daily