SambaNova

Meta Llama 3.1 405B Instruct

Meta Llama 3.1 405B Instruct is available via SambaNova with a 16K context window and up to 16,384 output tokens. Pricing: $5.00/1M input tokens, $10.00/1M output tokens.

Meta Llama 3.1 405B Instruct Pricing & Specifications

Input Price: $5.00 per 1M tokens
Output Price: $10.00 per 1M tokens
Context Window: 16,384 tokens (16K)
Max Output: 16,384 tokens
Provider: SambaNova

What is Meta Llama 3.1 405B Instruct?

Meta Llama 3.1 405B Instruct is a large language model developed by Meta and served by SambaNova with a 16K context window and up to 16,384 output tokens. It costs $5.00 per 1M input tokens and $10.00 per 1M output tokens.

Capabilities

Text, function calling, JSON mode

Meta Llama 3.1 405B Instruct Cost Examples

Short prompt (500 input tokens): $0.0025

Medium prompt (2K input tokens): $0.01

Long output (4K output tokens): $0.04
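These figures follow directly from the per-token rates listed on this page. As a minimal sketch, the arithmetic can be reproduced as below; the function name `estimate_cost` and the constant names are illustrative and not part of any SambaNova SDK.

```python
# Listed SambaNova rates for Meta Llama 3.1 405B Instruct (USD per 1M tokens).
INPUT_PRICE_PER_M = 5.00
OUTPUT_PRICE_PER_M = 10.00


def estimate_cost(input_tokens: int, output_tokens: int = 0) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    print(f"short prompt:  ${estimate_cost(500):.4f}")       # 500 input tokens
    print(f"medium prompt: ${estimate_cost(2_000):.4f}")     # 2K input tokens
    print(f"long output:   ${estimate_cost(0, 4_000):.4f}")  # 4K output tokens
```

The prompt examples are billed at the input rate and the long-output example at the output rate, so a real request that has both a prompt and a response costs the sum of the two parts.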

Similar Models to Meta Llama 3.1 405B Instruct

DeepSeek R1 (SambaNova): $5.00/1M input tokens, 33K context

DeepSeek V3 0324 (SambaNova): $3.00/1M input tokens, 33K context

DeepSeek V3.1 (SambaNova): $3.00/1M input tokens, 33K context

gpt-oss-120b (SambaNova): $3.00/1M input tokens, 131K context

Frequently Asked Questions

How much does Meta Llama 3.1 405B Instruct cost per token?
Meta Llama 3.1 405B Instruct costs $5.00 per 1M input tokens and $10.00 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.01 ($0.005 for the input plus $0.005 for the output).
What is the context window for Meta Llama 3.1 405B Instruct?
Meta Llama 3.1 405B Instruct supports a context window of 16,384 tokens (16K). This caps the combined length of your prompt and conversation history in a single API call; the generated response typically counts against the same window as well.
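What is the maximum output length for Meta Llama 3.1 405B Instruct?
Meta Llama 3.1 405B Instruct can generate up to 16,384 tokens in a single response. Because that limit equals the 16K context window, the output available in practice is likely to shrink as the prompt grows. If you need longer outputs, you can make multiple API calls and concatenate the results.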
Is Meta Llama 3.1 405B Instruct good for coding tasks?
Yes. Meta Llama 3.1 405B Instruct can be used for coding tasks including code generation, debugging, and refactoring. Its function calling and JSON mode support can also help when a tool expects structured output.
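Since the context window and the maximum output are both listed as 16,384 tokens, a quick budget check before sending a request can be useful. The sketch below assumes that the prompt and the generated response share the same window, which is a common convention but is not confirmed on this page for SambaNova. `max_completion_tokens` is a hypothetical helper, and the prompt token count is assumed to come from whichever tokenizer or token counter you use.

```python
CONTEXT_WINDOW = 16_384  # tokens, as listed for this model
MAX_OUTPUT = 16_384      # tokens, as listed for this model


def max_completion_tokens(prompt_tokens: int) -> int:
    """Largest response size that still fits (hypothetical helper).

    Assumes prompt and response tokens count against the same context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the listed output cap, and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (0, 1_000, 12_000, 20_000):
        print(prompt, max_completion_tokens(prompt))
```

Under that assumption, a result of zero means the prompt alone fills or exceeds the window and needs to be shortened or split before the call.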