Skip to content
Cerebras

Llama3.1 8b

Llama3.1 8b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Llama3.1 8b Pricing & Specifications

Input Price$0.10 per 1M tokens
Output Price$0.10 per 1M tokens
Context Window128,000 tokens (128K)
Max Output128,000 tokens
ProviderCerebras

What is Llama3.1 8b?

Llama3.1 8b is a large language model by Cerebras with a 128K context window and up to 128,000 output tokens. It costs $0.10 per 1M input tokens and $0.10 per 1M output tokens. Llama3.1 8b is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Capabilities

text function calling

Llama3.1 8b Cost Examples

Short prompt (500 tokens)

$0.000050

Medium prompt (2K tokens)

$0.00020

Long output (4K tokens)

$0.00040

Count tokens for Llama3.1 8b

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Llama3.1 8b

Cerebras

Gpt Oss 120b

$0.35/1M in 131K ctx

Cerebras

Qwen 3 32b

$0.40/1M in 128K ctx

Cerebras

Llama3.1 70b

$0.60/1M in 128K ctx

Cerebras

Llama 3.3 70b

$0.85/1M in 128K ctx

Frequently Asked Questions

How much does Llama3.1 8b cost per token? +
Llama3.1 8b costs $0.10 per 1M input tokens and $0.10 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000150.
What is the context window for Llama3.1 8b? +
Llama3.1 8b supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Llama3.1 8b? +
Llama3.1 8b can generate up to 128,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Llama3.1 8b good for coding tasks? +
Yes, Llama3.1 8b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Cerebras Models