← All Models

NVIDIA: Llama 3.1 Nemotron 70B Instruct

🇺🇸 NVIDIA · Llama 3.1

Input Price $1.20 per million tokens NT$38.4
Output Price $1.20 per million tokens NT$38.4
Context Window 131K tokens Output limit: 16K
OpenRouter Route Price Please verify with official pricing pages
Use this model via OpenRouter →

Dimension Unit Price (USD) Price (TWD) Effective From
Input per 1M tokens $1.20 NT$38.4 2026-05-03
Output per 1M tokens $1.20 NT$38.4 2026-05-03

Provider
NVIDIA (NVIDIA)
Model Family
Llama 3.1
Version String
nvidia/llama-3.1-nemotron-70b-instruct
Status
active
Modality
text
Context Window
131,072 tokens
Output Limit
16,384 tokens

Index Metrics

Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis

Agentic Index 8 F Measured: 2026-05-27
Coding Index 11 F Measured: 2026-05-27
Intelligence Index 13 F Measured: 2026-05-27

Benchmark Scores

Data source: Artificial Analysis

AA-LCR 7.0% F Measured: 2026-05-27
GPQA Diamond 46.5% C Measured: 2026-05-27
HLE 4.6% D Measured: 2026-05-27
HLE 4.6% D Measured: 2026-05-27
IFBench 30.8% D Measured: 2026-05-27
Non-Hallucination 31.2% Measured: 2026-05-27
Omniscience Accuracy 16.4% Measured: 2026-05-27
SciCode 23.3% C Measured: 2026-05-27
Tau2 23.1% Measured: 2026-05-27
TerminalBench 4.5% Measured: 2026-05-27

Performance Metrics

Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis

First Token Latency 0.5s Measured: 2026-05-27
Output Speed 292 t/s Measured: 2026-05-27
Response Time 2.2s Measured: 2026-05-27

90-Day Price Trend

Input / Output price (USD per 1M tokens)

Past 90 days of records; every price change is shown here

Date Dimension Price (USD) Source
2026-05-07 Output $1.20 openrouter
2026-05-07 Input $1.20 openrouter
2026-05-07 Output $1.20 openrouter
2026-05-07 Input $1.20 openrouter
2026-05-07 Output $1.20 openrouter
2026-05-07 Input $1.20 openrouter
2026-05-06 Output $1.20 openrouter
2026-05-06 Input $1.20 openrouter
2026-05-05 Output $1.20 openrouter
2026-05-05 Input $1.20 openrouter
2026-05-05 Output $1.20 openrouter
2026-05-05 Input $1.20 openrouter
2026-05-05 Output $1.20 openrouter
2026-05-05 Input $1.20 openrouter
2026-05-05 Output $1.20 openrouter
2026-05-05 Input $1.20 openrouter
2026-05-05 Output $1.20 openrouter
2026-05-05 Input $1.20 openrouter
2026-05-05 Output $1.20 openrouter
2026-05-05 Input $1.20 openrouter
2026-05-05 Output $1.20 openrouter
2026-05-05 Input $1.20 openrouter
2026-05-04 Output $1.20 openrouter
2026-05-04 Input $1.20 openrouter
2026-05-04 Output $1.20 openrouter
2026-05-04 Input $1.20 openrouter
2026-05-04 Output $1.20 openrouter
2026-05-04 Input $1.20 openrouter
2026-05-04 Output $1.20 openrouter
2026-05-04 Input $1.20 openrouter
2026-05-03 Output $1.20 openrouter
2026-05-03 Input $1.20 openrouter
2026-05-03 Output $1.20 openrouter
2026-05-03 Input $1.20 openrouter
2026-05-03 Output $1.20 openrouter
2026-05-03 Input $1.20 openrouter

Description

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

Key Insights

Key data points from this page for quick reference and citation.

  • NVIDIA: Llama 3.1 Nemotron 70B Instruct Input price: $1.2/M tokens
  • NVIDIA: Llama 3.1 Nemotron 70B Instruct Output price: $1.2/M tokens
  • Context window: 131,072 tokens
  • Provider: NVIDIA
  • Model family: Llama 3.1
  • Modalities: text
  • Data source: OpenRouter, updated daily