Groq

Llama 3.1 8b Instant

Llama 3.1 8b Instant is available via Groq with a 128K context window and up to 8,192 output tokens. Pricing: $0.0500/1M input tokens, $0.0800/1M output tokens.

Llama 3.1 8b Instant Pricing & Specifications

Input Price: $0.050 per 1M tokens
Output Price: $0.080 per 1M tokens
Context Window: 128,000 tokens (128K)
Max Output: 8,192 tokens
Provider: Groq

What is Llama 3.1 8b Instant?

Llama 3.1 8b Instant is a large language model developed by Meta and served by Groq, with a 128K context window and up to 8,192 output tokens. It costs $0.050 per 1M input tokens and $0.080 per 1M output tokens.

Capabilities

Text, function calling

Llama 3.1 8b Instant Cost Examples

Short prompt (500 input tokens): $0.000025
Medium prompt (2K input tokens): $0.00010
Long output (4K output tokens): $0.00032
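
The figures above follow directly from the listed per-token prices. As a minimal sketch, the arithmetic can be reproduced as below; the function name `estimate_cost` and the constant names are illustrative, not part of any Groq API, and the prices are the ones quoted on this page.

```python
# Prices quoted on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.08


def estimate_cost(input_tokens: int, output_tokens: int = 0) -> float:
    """Return the estimated USD cost of a single request."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    print(f"Short prompt:  ${estimate_cost(500):.6f}")        # 500 input tokens
    print(f"Medium prompt: ${estimate_cost(2_000):.6f}")      # 2K input tokens
    print(f"Long output:   ${estimate_cost(0, 4_000):.6f}")   # 4K output tokens
```

Real bills depend on the token counts the provider reports, so treat this as an estimate only.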

Similar Models to Llama 3.1 8b Instant

Gemma 7b It (Groq): $0.050/1M input tokens, 8K context

Openai/Gpt Oss 20b (Groq): $0.075/1M input tokens, 131K context

Openai/Gpt Oss Safeguard 20b (Groq): $0.075/1M input tokens, 131K context

Meta Llama/Llama 4 Scout 17b 16e Instruct (Groq): $0.11/1M input tokens, 131K context

Frequently Asked Questions

How much does Llama 3.1 8b Instant cost per token?
Llama 3.1 8b Instant costs $0.050 per 1M input tokens and $0.080 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000090.
What is the context window for Llama 3.1 8b Instant?
Llama 3.1 8b Instant supports a context window of 128,000 tokens (128K). This caps the combined length of your prompt and conversation history in a single API call; generated tokens typically count toward the same limit.
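
A quick budget check before sending a request might look like the sketch below. It assumes that generated tokens share the 128,000-token window with the prompt, which is common for chat models but is not confirmed by this page; `fits_in_context` and `max_prompt_tokens` are hypothetical helper names.

```python
CONTEXT_WINDOW = 128_000   # tokens, as listed on this page
MAX_OUTPUT = 8_192         # tokens, as listed on this page


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Tokens left for prompt plus history after reserving room for the reply."""
    return CONTEXT_WINDOW - reserved_output


def fits_in_context(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """True if the prompt and the reserved output fit in the window together.

    Assumption: output tokens count against the same window as input tokens.
    """
    return prompt_tokens + reserved_output <= CONTEXT_WINDOW
```

For example, `fits_in_context(100_000)` holds under these numbers, while a 125,000-token prompt leaves too little room for a full-length reply.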
What is the maximum output length for Llama 3.1 8b Instant?
Llama 3.1 8b Instant can generate up to 8,192 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
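
The multi-call approach can be sketched as a simple loop. The `generate` callable here is hypothetical: it stands in for whatever client call you use and is assumed to return the response text plus a flag saying whether the reply was cut off at the output limit. Feeding the accumulated text back as context is one possible continuation strategy, not a documented Groq feature.

```python
from typing import Callable, Tuple


def generate_long(
    prompt: str,
    generate: Callable[[str], Tuple[str, bool]],
    max_calls: int = 5,
) -> str:
    """Call `generate` repeatedly and concatenate the pieces.

    `generate` is a hypothetical wrapper around your API client that
    returns (text, truncated). The loop stops when a reply is not
    truncated or when max_calls is reached.
    """
    parts = []
    current_prompt = prompt
    for _ in range(max_calls):
        text, truncated = generate(current_prompt)
        parts.append(text)
        if not truncated:
            break
        # Ask the model to continue from everything produced so far.
        current_prompt = prompt + "".join(parts)
    return "".join(parts)


def make_stub(chunks):
    """Build a fake `generate` that yields fixed chunks, for local testing."""
    remaining = iter(enumerate(chunks))

    def stub(_prompt: str) -> Tuple[str, bool]:
        index, chunk = next(remaining)
        return chunk, index < len(chunks) - 1

    return stub
```

Each continuation call resends the earlier text as input, so input-token cost grows with every round; the `max_calls` cap keeps that bounded.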
Is Llama 3.1 8b Instant good for coding tasks?
Llama 3.1 8b Instant can be used for coding tasks such as code generation, debugging, and refactoring, and its function-calling support suits tool-driven workflows.