FriendliAI

Meta Llama 3.1 8b Instruct

Meta Llama 3.1 8b Instruct is available via FriendliAI with an 8K context window and up to 8,192 output tokens. Pricing: $0.10 per 1M input tokens and $0.10 per 1M output tokens.

Meta Llama 3.1 8b Instruct Pricing & Specifications

Input Price: $0.10 per 1M tokens
Output Price: $0.10 per 1M tokens
Context Window: 8,192 tokens (8K)
Max Output: 8,192 tokens
Provider: FriendliAI

What is Meta Llama 3.1 8b Instruct?

Meta Llama 3.1 8b Instruct is a large language model developed by Meta and served through FriendliAI, with an 8K context window and up to 8,192 output tokens. It costs $0.10 per 1M input tokens and $0.10 per 1M output tokens.

Capabilities

Text, function calling, JSON mode

Meta Llama 3.1 8b Instruct Cost Examples

Short prompt (500 tokens)

$0.000050

Medium prompt (2K tokens)

$0.00020

Long output (4K tokens)

$0.00040
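
The figures above follow directly from the per-token prices. As a minimal sketch of that arithmetic (the function name `estimate_cost` and the constants are illustrative, not part of any FriendliAI API, and the prices are simply the ones listed on this page):

```python
from decimal import Decimal

# Prices copied from the listing above, in USD per 1M tokens.
# These are assumptions that may change; check the provider's current pricing.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.10")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Return the estimated USD cost of a single request."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# The three examples listed above.
print(estimate_cost(500))        # short prompt, 500 input tokens
print(estimate_cost(2_000))      # medium prompt, 2K input tokens
print(estimate_cost(0, 4_000))   # long output, 4K output tokens
```

Decimal is used instead of float so that very small dollar amounts are not distorted by binary rounding.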

Similar Models to Meta Llama 3.1 8b Instruct

FriendliAI: Meta Llama 3.1 70b Instruct, $0.60 per 1M input tokens, 8K context

Azure OpenAI: Gpt 4.1 Nano, $0.10 per 1M input tokens, 1M context

Azure OpenAI: Gpt 4.1 Nano 2025 04 14, $0.10 per 1M input tokens, 1M context

Azure AI: Mistral Small 2503, $0.10 per 1M input tokens, 128K context

Frequently Asked Questions

How much does Meta Llama 3.1 8b Instruct cost per token?
Meta Llama 3.1 8b Instruct costs $0.10 per 1M input tokens and $0.10 per 1M output tokens. For a 1,000-token request with a 500-token response, that works out to $0.00015: $0.00010 for input plus $0.00005 for output.
What is the context window for Meta Llama 3.1 8b Instruct?
Meta Llama 3.1 8b Instruct supports a context window of 8,192 tokens (8K). This caps the total number of tokens, including your prompt and conversation history, that the model can process in a single API call.
What is the maximum output length for Meta Llama 3.1 8b Instruct?
Meta Llama 3.1 8b Instruct can generate up to 8,192 tokens in a single response. Because this limit equals the context window, the output available in practice is typically smaller once the prompt is counted. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Meta Llama 3.1 8b Instruct good for coding tasks?
Meta Llama 3.1 8b Instruct can be used for coding tasks such as code generation, debugging, and refactoring, and its function calling and JSON mode support help when structured output is needed. With an 8K context window, it is better suited to short, self-contained snippets than to large codebases.
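
The context-window and max-output answers above can be combined into a quick pre-flight check. This is a rough sketch under one stated assumption: that prompt tokens and generated tokens share the same 8,192-token window, which is common but not confirmed on this page for FriendliAI. The name `max_output_budget` is hypothetical.

```python
# Limits copied from the listing above.
CONTEXT_WINDOW = 8_192  # tokens
MAX_OUTPUT = 8_192      # tokens


def max_output_budget(prompt_tokens: int) -> int:
    """Return the largest output-token limit that still fits.

    Assumes prompt and output tokens share one context window
    (an assumption, not something this listing states).
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the model's output cap, and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


print(max_output_budget(1_000))  # room left after a 1,000-token prompt
print(max_output_budget(9_000))  # prompt already exceeds the window
```

A result of 0 means the prompt needs to be shortened or split before any output can be generated.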