Google Vertex AI

Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is available via Google Vertex AI with a context window of 1,048,576 tokens (1.0M) and up to 65,536 output tokens. Pricing: $0.25 per 1M input tokens and $1.50 per 1M output tokens.

Gemini 3.1 Flash Lite Preview Pricing & Specifications

Input Price: $0.25 per 1M tokens
Output Price: $1.50 per 1M tokens
Context Window: 1,048,576 tokens (1.0M)
Max Output: 65,536 tokens
Provider: Google Vertex AI

What is Gemini 3.1 Flash Lite Preview?

Gemini 3.1 Flash Lite Preview is a large language model available through Google Vertex AI, with a 1.0M-token context window and up to 65,536 output tokens per response. It costs $0.25 per 1M input tokens and $1.50 per 1M output tokens.

Capabilities

Text, vision, function calling, reasoning, audio, PDF, web search, JSON mode

Gemini 3.1 Flash Lite Preview Cost Examples

Short prompt (500 input tokens): $0.000125

Medium prompt (2K input tokens): $0.0005

Long output (4K output tokens): $0.006
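The figures above follow directly from the per-token prices. A minimal sketch of that arithmetic, assuming costs scale linearly with token count and that no other charges apply; the function name `estimate_cost` is hypothetical and not part of any Vertex AI SDK:

```python
from decimal import Decimal

# Prices from the specification table, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.25")
OUTPUT_PRICE_PER_M = Decimal("1.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int = 0, output_tokens: int = 0) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    print("short prompt :", estimate_cost(input_tokens=500))
    print("medium prompt:", estimate_cost(input_tokens=2_000))
    print("long output  :", estimate_cost(output_tokens=4_000))
```

`Decimal` is used instead of `float` so that very small per-request amounts are not distorted by binary rounding.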


Similar Models to Gemini 3.1 Flash Lite Preview

Google Vertex AI

Gemini 3.1 Flash Lite Preview

$0.25/1M input, 1.0M context

Google Vertex AI

Claude 3 Haiku

$0.25/1M input, 200K context

Google Vertex AI

Meta/Llama 4 Scout 17b 128e Instruct Maas

$0.25/1M input, 10M context

Frequently Asked Questions

How much does Gemini 3.1 Flash Lite Preview cost per token?
Gemini 3.1 Flash Lite Preview costs $0.25 per 1M input tokens and $1.50 per 1M output tokens. For a typical request with a 1,000-token prompt and a 500-token response, that works out to $0.0010.
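As a worked check of that example, assuming input and output tokens are billed independently at the listed rates:

```latex
\[
\underbrace{1000 \times \frac{\$0.25}{10^{6}}}_{\text{input}}
+ \underbrace{500 \times \frac{\$1.50}{10^{6}}}_{\text{output}}
= \$0.00025 + \$0.00075
= \$0.0010
\]
```

Although the response is half the length of the prompt, it accounts for three quarters of the cost, because output tokens are priced six times higher than input tokens.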
What is the context window for Gemini 3.1 Flash Lite Preview?
Gemini 3.1 Flash Lite Preview supports a context window of 1,048,576 tokens (1.0M). This determines the maximum combined length of your prompt and conversation history in a single API call.
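One common way to stay inside that limit is to drop the oldest conversation turns first. The sketch below illustrates the idea under stated assumptions: token counts per turn are already known (for example from a token counter), only the prompt and history count toward the window, and `trim_history` is a hypothetical helper rather than part of any Vertex AI API.

```python
CONTEXT_WINDOW = 1_048_576  # tokens, from the specification above


def trim_history(turn_token_counts: list[int], prompt_tokens: int) -> list[int]:
    """Drop the oldest turns until prompt + history fits the context window.

    Returns the token counts of the turns that are kept, oldest first.
    """
    if prompt_tokens > CONTEXT_WINDOW:
        raise ValueError("prompt alone exceeds the context window")
    kept = list(turn_token_counts)
    while kept and prompt_tokens + sum(kept) > CONTEXT_WINDOW:
        kept.pop(0)  # discard the oldest turn first
    return kept


if __name__ == "__main__":
    # Three earlier turns plus a new 50,000-token prompt.
    print(trim_history([600_000, 400_000, 100_000], prompt_tokens=50_000))
```

Dropping whole turns keeps each remaining message intact; summarizing old turns instead would be an alternative design that this sketch does not cover.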
What is the maximum output length for Gemini 3.1 Flash Lite Preview?
Gemini 3.1 Flash Lite Preview can generate up to 65,536 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Gemini 3.1 Flash Lite Preview good for coding tasks?
Gemini 3.1 Flash Lite Preview lists function calling, reasoning, and JSON mode among its capabilities, all of which are relevant to coding tasks such as code generation, debugging, and refactoring.