Skip to content
Lambda Ai

Llama3.1 Nemotron 70b Instruct Fp8

Llama3.1 Nemotron 70b Instruct Fp8 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

Llama3.1 Nemotron 70b Instruct Fp8 Pricing & Specifications

Input Price$0.12 per 1M tokens
Output Price$0.30 per 1M tokens
Context Window131,072 tokens (131K)
Max Output131,072 tokens
ProviderLambda Ai

What is Llama3.1 Nemotron 70b Instruct Fp8?

Llama3.1 Nemotron 70b Instruct Fp8 is a large language model by Lambda Ai with a 131K context window and up to 131,072 output tokens. It costs $0.12 per 1M input tokens and $0.30 per 1M output tokens. Llama3.1 Nemotron 70b Instruct Fp8 is available via Lambda Ai with a 131K context window and up to 131,072 output tokens. Pricing: $0.1200/1M input tokens, $0.3000/1M output tokens.

Capabilities

text function calling

Llama3.1 Nemotron 70b Instruct Fp8 Cost Examples

Short prompt (500 tokens)

$0.000060

Medium prompt (2K tokens)

$0.00024

Long output (4K tokens)

$0.00120

Count tokens for Llama3.1 Nemotron 70b Instruct Fp8

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Llama3.1 Nemotron 70b Instruct Fp8

Lambda Ai

Hermes3 70b

$0.12/1M in 131K ctx

Lambda Ai

Llama3.1 70b Instruct Fp8

$0.12/1M in 131K ctx

Lambda Ai

Llama3.3 70b Instruct Fp8

$0.12/1M in 131K ctx

Lambda Ai

Lfm 40b

$0.10/1M in 131K ctx

Frequently Asked Questions

How much does Llama3.1 Nemotron 70b Instruct Fp8 cost per token? +
Llama3.1 Nemotron 70b Instruct Fp8 costs $0.12 per 1M input tokens and $0.30 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000270.
What is the context window for Llama3.1 Nemotron 70b Instruct Fp8? +
Llama3.1 Nemotron 70b Instruct Fp8 supports a context window of 131,072 tokens (131K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Llama3.1 Nemotron 70b Instruct Fp8? +
Llama3.1 Nemotron 70b Instruct Fp8 can generate up to 131,072 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Llama3.1 Nemotron 70b Instruct Fp8 good for coding tasks? +
Yes, Llama3.1 Nemotron 70b Instruct Fp8 supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Lambda Ai Models