Google Vertex AI

Meta/Llama 4 Scout 17b 128e Instruct Maas

Meta/Llama 4 Scout 17b 128e Instruct Maas is available via Google Vertex AI with a 10M-token context window and a maximum output of 10,000,000 tokens. Pricing: $0.25 per 1M input tokens, $0.70 per 1M output tokens.

Meta/Llama 4 Scout 17b 128e Instruct Maas Pricing & Specifications

Input Price: $0.25 per 1M tokens
Output Price: $0.70 per 1M tokens
Context Window: 10,000,000 tokens (10M)
Max Output: 10,000,000 tokens
Provider: Google Vertex AI

What is Meta/Llama 4 Scout 17b 128e Instruct Maas?

Meta/Llama 4 Scout 17b 128e Instruct Maas is a large language model from Meta, served through Google Vertex AI, with a 10M-token context window and a maximum output of 10,000,000 tokens. It costs $0.25 per 1M input tokens and $0.70 per 1M output tokens.

Capabilities

text, function calling

Meta/Llama 4 Scout 17b 128e Instruct Maas Cost Examples

Short prompt (500 input tokens): $0.000125

Medium prompt (2K input tokens): $0.0005

Long output (4K output tokens): $0.0028
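
The figures above follow directly from the per-token rates, and the arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not an official calculator: the function name `estimate_cost` and the constant names are hypothetical, and the rates are simply the ones listed on this page.

```python
from decimal import Decimal

# Rates as listed on this page (USD per 1M tokens).
INPUT_USD_PER_MILLION = Decimal("0.25")
OUTPUT_USD_PER_MILLION = Decimal("0.70")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


# The three examples above: two input-only prompts and one output-only case.
short_prompt = estimate_cost(500)
medium_prompt = estimate_cost(2_000)
long_output = estimate_cost(0, 4_000)
```

Decimal is used instead of float so that very small per-request amounts compare exactly. A combined request is priced the same way, for example `estimate_cost(1_000, 500)` for a 1,000-token prompt with a 500-token response.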

Similar Models to Meta/Llama 4 Scout 17b 128e Instruct Maas

Google Vertex AI

Gemini 3.1 Flash Lite Preview

$0.25 per 1M input tokens, 1.0M context

Google Vertex AI

Claude 3 Haiku

$0.25 per 1M input tokens, 200K context

Frequently Asked Questions

How much does Meta/Llama 4 Scout 17b 128e Instruct Maas cost per token?
Meta/Llama 4 Scout 17b 128e Instruct Maas costs $0.25 per 1M input tokens and $0.70 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to $0.00025 for input plus $0.00035 for output, or $0.0006 in total.
What is the context window for Meta/Llama 4 Scout 17b 128e Instruct Maas?
Meta/Llama 4 Scout 17b 128e Instruct Maas supports a context window of 10,000,000 tokens (10M). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Meta/Llama 4 Scout 17b 128e Instruct Maas?
Meta/Llama 4 Scout 17b 128e Instruct Maas can generate up to 10,000,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Meta/Llama 4 Scout 17b 128e Instruct Maas good for coding tasks?
Yes. Its listed capabilities, text and function calling, cover coding tasks such as code generation, debugging, and refactoring.
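
As the context-window answer above notes, the 10,000,000-token limit applies to the prompt and conversation history combined. One way to stay under it is sketched below. This is an illustration under stated assumptions: the helper name `trim_history` is hypothetical, token counts are assumed to be already measured per message, and dropping the oldest messages first is just one possible policy, not something this page prescribes.

```python
CONTEXT_WINDOW_TOKENS = 10_000_000  # context window listed on this page


def trim_history(
    history_token_counts: list[int],
    prompt_tokens: int,
    limit: int = CONTEXT_WINDOW_TOKENS,
) -> list[int]:
    """Drop the oldest history entries until prompt + history fits in `limit`.

    `history_token_counts` holds one token count per past message, oldest
    first. Returns the counts of the messages that are kept.
    """
    if prompt_tokens > limit:
        raise ValueError("prompt alone exceeds the context window")
    kept = list(history_token_counts)
    total = prompt_tokens + sum(kept)
    # Assumption: the oldest messages are the least important, so drop them first.
    while kept and total > limit:
        total -= kept.pop(0)
    return kept
```

The `limit` parameter is exposed so the same check can be exercised with small numbers, for example `trim_history([4, 3, 2], prompt_tokens=5, limit=10)`, which keeps only the two most recent messages.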