Question 1

How much does Llama 3.1 8B Instruct cost per token?

Accepted Answer

Llama 3.1 8B Instruct costs $0.10 per 1M input tokens and $0.10 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000150.

Question 2

What is the context window for Llama 3.1 8B Instruct?

Accepted Answer

Llama 3.1 8B Instruct supports a context window of 131,000 tokens (131K). This determines the maximum combined length of your prompt and conversation history in a single API call.

Question 3

What is the maximum output length for Llama 3.1 8B Instruct?

Accepted Answer

Llama 3.1 8B Instruct can generate up to 131,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.

Question 4

Is Llama 3.1 8B Instruct good for coding tasks?

Accepted Answer

Yes, Llama 3.1 8B Instruct supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.

Input Price	$0.10 per 1M tokens
Output Price	$0.10 per 1M tokens
Context Window	131,000 tokens (131K)
Max Output	131,000 tokens
Provider	Ovhcloud

Llama 3.1 8B Instruct

Llama 3.1 8B Instruct Pricing & Specifications

What is Llama 3.1 8B Instruct?

Capabilities

Llama 3.1 8B Instruct Cost Examples

Count tokens for Llama 3.1 8B Instruct

Similar Models to Llama 3.1 8B Instruct

Mistral 7B Instruct V0.3

Mistral Small 3.2 24B Instruct 2506

Qwen3 32B

Gpt Oss 120b

Frequently Asked Questions