Skip to content
Llamagate

Llama 3.1 8b

Llama 3.1 8b is available via Llamagate with a 131K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0500/1M output tokens.

Llama 3.1 8b Pricing & Specifications

Input Price$0.030 per 1M tokens
Output Price$0.050 per 1M tokens
Context Window131,072 tokens (131K)
Max Output8,192 tokens
ProviderLlamagate

What is Llama 3.1 8b?

Llama 3.1 8b is a large language model by Llamagate with a 131K context window and up to 8,192 output tokens. It costs $0.030 per 1M input tokens and $0.050 per 1M output tokens. Llama 3.1 8b is available via Llamagate with a 131K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0500/1M output tokens.

Capabilities

text function calling json mode

Llama 3.1 8b Cost Examples

Short prompt (500 tokens)

$0.000015

Medium prompt (2K tokens)

$0.00006

Long output (4K tokens)

$0.00020

Count tokens for Llama 3.1 8b

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Llama 3.1 8b

Llamagate

Gemma3 4b

$0.030/1M in 128K ctx

Llamagate

Llama 3.2 3b

$0.040/1M in 131K ctx

Llamagate

Qwen3 8b

$0.040/1M in 33K ctx

Llamagate

Qwen2.5 Coder 7b

$0.060/1M in 33K ctx

Frequently Asked Questions

How much does Llama 3.1 8b cost per token? +
Llama 3.1 8b costs $0.030 per 1M input tokens and $0.050 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000055.
What is the context window for Llama 3.1 8b? +
Llama 3.1 8b supports a context window of 131,072 tokens (131K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Llama 3.1 8b? +
Llama 3.1 8b can generate up to 8,192 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Llama 3.1 8b good for coding tasks? +
Yes, Llama 3.1 8b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Llamagate Models