Llama 3.1 8B Instruct

Llama 3.1 8B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Llama 3.1 8B Instruct Pricing & Specifications

Input Price: $0.10 per 1M tokens
Output Price: $0.10 per 1M tokens
Context Window: 131,000 tokens (131K)
Max Output: 131,000 tokens
Provider: Ovhcloud

What is Llama 3.1 8B Instruct?

Llama 3.1 8B Instruct is a large language model developed by Meta and served via Ovhcloud, with a 131K context window and up to 131,000 output tokens. It costs $0.10 per 1M input tokens and $0.10 per 1M output tokens.

Capabilities

Text, function calling, JSON mode

Llama 3.1 8B Instruct Cost Examples

Short prompt (500 tokens): $0.000050
Medium prompt (2K tokens): $0.00020
Long output (4K tokens): $0.00040
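
The figures above follow from multiplying the token count by the per-token price. A minimal Python sketch of that arithmetic, assuming the listed prices of $0.10 per 1M tokens for both input and output; the function name `estimate_cost` is a hypothetical helper, not part of any Ovhcloud API:

```python
from decimal import Decimal

# Prices taken from the specifications on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.10")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Reproduce the three examples: short prompt, medium prompt, long output.
    for label, tokens_in, tokens_out in [
        ("Short prompt", 500, 0),
        ("Medium prompt", 2_000, 0),
        ("Long output", 0, 4_000),
    ]:
        print(f"{label}: ${estimate_cost(tokens_in, tokens_out):.6f}")
```

Decimal is used instead of float so that very small per-request amounts are not distorted by binary rounding.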


Similar Models to Llama 3.1 8B Instruct

Mistral 7B Instruct V0.3 (Ovhcloud): $0.10/1M input, 127K context
Mistral Small 3.2 24B Instruct 2506 (Ovhcloud): $0.090/1M input, 128K context
Qwen3 32B (Ovhcloud): $0.080/1M input, 32K context
Gpt Oss 120b (Ovhcloud): $0.080/1M input, 131K context

Frequently Asked Questions

How much does Llama 3.1 8B Instruct cost per token?
Llama 3.1 8B Instruct costs $0.10 per 1M input tokens and $0.10 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000150.
What is the context window for Llama 3.1 8B Instruct?
Llama 3.1 8B Instruct supports a context window of 131,000 tokens (131K). This caps the combined length of your prompt and conversation history in a single API call; the generated response typically counts against the same window.
What is the maximum output length for Llama 3.1 8B Instruct?
Llama 3.1 8B Instruct can generate up to 131,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Llama 3.1 8B Instruct good for coding tasks?
Llama 3.1 8B Instruct can be used for coding tasks such as code generation, debugging, and refactoring. Its function calling and JSON mode support also help when the output must feed into tools or be parsed as structured data.
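
The context window and maximum output figures in the answers above can be turned into a quick budget check before sending a request. The sketch below assumes that prompt and response tokens share the same 131,000-token window, which is common for language model APIs but is not stated explicitly on this page; `max_response_tokens` and `fits_in_context` are hypothetical helper names:

```python
# Limits taken from the specifications on this page.
CONTEXT_WINDOW = 131_000  # tokens
MAX_OUTPUT = 131_000      # tokens


def max_response_tokens(prompt_tokens: int) -> int:
    """Tokens left for the response.

    Assumption: prompt and response share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(remaining, MAX_OUTPUT))


def fits_in_context(prompt_tokens: int, requested_output: int) -> bool:
    """True if the prompt plus the requested output fits in the window."""
    return 0 <= requested_output <= max_response_tokens(prompt_tokens)


if __name__ == "__main__":
    prompt = 120_000
    print("Room left for the response:", max_response_tokens(prompt))
    print("Can request 20,000 output tokens:", fits_in_context(prompt, 20_000))
```

Under this assumption, a long prompt leaves less room for the response than the headline maximum output suggests.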