Skip to content
DeepInfra

Meta Llama/Llama 4 Scout 17B 16E Instruct

Meta Llama/Llama 4 Scout 17B 16E Instruct is available via DeepInfra with a 328K context window and up to 327,680 output tokens. Pricing: $0.0800/1M input tokens, $0.3000/1M output tokens.

Meta Llama/Llama 4 Scout 17B 16E Instruct Pricing & Specifications

Input Price$0.080 per 1M tokens
Output Price$0.30 per 1M tokens
Context Window327,680 tokens (328K)
Max Output327,680 tokens
ProviderDeepInfra

What is Meta Llama/Llama 4 Scout 17B 16E Instruct?

Meta Llama/Llama 4 Scout 17B 16E Instruct is a large language model by DeepInfra with a 328K context window and up to 327,680 output tokens. It costs $0.080 per 1M input tokens and $0.30 per 1M output tokens. Meta Llama/Llama 4 Scout 17B 16E Instruct is available via DeepInfra with a 328K context window and up to 327,680 output tokens. Pricing: $0.0800/1M input tokens, $0.3000/1M output tokens.

Capabilities

text function calling

Meta Llama/Llama 4 Scout 17B 16E Instruct Cost Examples

Short prompt (500 tokens)

$0.000040

Medium prompt (2K tokens)

$0.00016

Long output (4K tokens)

$0.00120

Count tokens for Meta Llama/Llama 4 Scout 17B 16E Instruct

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Meta Llama/Llama 4 Scout 17B 16E Instruct

DeepInfra

Gryphe/MythoMax L2 13b

$0.080/1M in 4K ctx

DeepInfra

Qwen/Qwen3 30B A3B

$0.080/1M in 41K ctx

DeepInfra

Mistralai/Mistral Small 3.2 24B Instruct 2506

$0.075/1M in 128K ctx

DeepInfra

Microsoft/Phi 4

$0.070/1M in 16K ctx

Frequently Asked Questions

How much does Meta Llama/Llama 4 Scout 17B 16E Instruct cost per token? +
Meta Llama/Llama 4 Scout 17B 16E Instruct costs $0.080 per 1M input tokens and $0.30 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000230.
What is the context window for Meta Llama/Llama 4 Scout 17B 16E Instruct? +
Meta Llama/Llama 4 Scout 17B 16E Instruct supports a context window of 327,680 tokens (328K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Meta Llama/Llama 4 Scout 17B 16E Instruct? +
Meta Llama/Llama 4 Scout 17B 16E Instruct can generate up to 327,680 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Meta Llama/Llama 4 Scout 17B 16E Instruct good for coding tasks? +
Yes, Meta Llama/Llama 4 Scout 17B 16E Instruct supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All DeepInfra Models