Google Vertex AI

Gemini 2.0 Flash 001

Gemini 2.0 Flash 001 is available via Google Vertex AI with a 1,048,576-token (1.0M) context window and up to 8,192 output tokens. Pricing: $0.15 per 1M input tokens and $0.60 per 1M output tokens.

Gemini 2.0 Flash 001 Pricing & Specifications

Input Price: $0.15 per 1M tokens
Output Price: $0.60 per 1M tokens
Context Window: 1,048,576 tokens (1.0M)
Max Output: 8,192 tokens
Provider: Google Vertex AI

What is Gemini 2.0 Flash 001?

Gemini 2.0 Flash 001 is a large language model from Google, served through Google Vertex AI, with a 1.0M context window and up to 8,192 output tokens. It costs $0.15 per 1M input tokens and $0.60 per 1M output tokens.

Capabilities

text, vision, function calling, web search, JSON mode

Gemini 2.0 Flash 001 Cost Examples

Short prompt (500 input tokens): $0.000075
Medium prompt (2K input tokens): $0.00030
Long output (4K output tokens): $0.00240
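The figures above follow from multiplying token counts by the per-1M rates. A minimal sketch of that arithmetic is shown here, assuming the rates listed on this page; `estimate_cost` is a hypothetical helper name, not part of any Google SDK.

```python
from decimal import Decimal

# Rates copied from the pricing table on this page, in USD per 1M tokens.
INPUT_USD_PER_1M = Decimal("0.15")
OUTPUT_USD_PER_1M = Decimal("0.60")
ONE_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_USD_PER_1M
             + Decimal(output_tokens) * OUTPUT_USD_PER_1M)
    return total / ONE_MILLION


if __name__ == "__main__":
    # The three examples listed above: (label, input tokens, output tokens).
    examples = [
        ("Short prompt", 500, 0),
        ("Medium prompt", 2_000, 0),
        ("Long output", 0, 4_000),
    ]
    for label, tokens_in, tokens_out in examples:
        print(label, estimate_cost(tokens_in, tokens_out))
```

Decimal arithmetic is used so that very small per-request amounts are not distorted by binary floating-point rounding.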

Similar Models to Gemini 2.0 Flash 001

Mistral Nemo@Latest (Google Vertex AI): $0.15 per 1M input tokens, 128K context

Openai/Gpt Oss 120b Maas (Google Vertex AI): $0.15 per 1M input tokens, 131K context

Qwen/Qwen3 Next 80b A3b Instruct Maas (Google Vertex AI): $0.15 per 1M input tokens, 262K context

Qwen/Qwen3 Next 80b A3b Thinking Maas (Google Vertex AI): $0.15 per 1M input tokens, 262K context

Frequently Asked Questions

How much does Gemini 2.0 Flash 001 cost per token?
Gemini 2.0 Flash 001 costs $0.15 per 1M input tokens and $0.60 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to $0.00045 ($0.00015 for input plus $0.00030 for output).
What is the context window for Gemini 2.0 Flash 001?
Gemini 2.0 Flash 001 supports a context window of 1,048,576 tokens (1.0M). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Gemini 2.0 Flash 001?
Gemini 2.0 Flash 001 can generate up to 8,192 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Gemini 2.0 Flash 001 good for coding tasks?
Yes. Gemini 2.0 Flash 001 can be used for code generation, debugging, and refactoring, and its function calling and JSON mode support helps when integrating it into developer tooling.
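
The context-window and output-length answers above can be turned into a quick planning check. The sketch below is illustrative only: it assumes the limits listed on this page, treats the context window as a cap on input tokens (prompt plus history) as described above, and uses hypothetical helper names. It does not call any API, and it ignores the extra input tokens that follow-up calls would resend.

```python
# Limits copied from the specifications on this page.
CONTEXT_WINDOW_TOKENS = 1_048_576
MAX_OUTPUT_TOKENS = 8_192


def fits_context(input_tokens: int) -> bool:
    """Return True if prompt plus history fits in the context window."""
    return 0 <= input_tokens <= CONTEXT_WINDOW_TOKENS


def calls_needed(target_output_tokens: int) -> int:
    """Return how many responses are needed to cover a target output length.

    Hypothetical helper: integer ceiling division by the per-response cap.
    """
    if target_output_tokens < 0:
        raise ValueError("target_output_tokens must be non-negative")
    return (target_output_tokens + MAX_OUTPUT_TOKENS - 1) // MAX_OUTPUT_TOKENS


if __name__ == "__main__":
    print(fits_context(900_000))
    print(fits_context(1_200_000))
    print(calls_needed(20_000))
```

For example, a 20,000-token target output exceeds the 8,192-token cap, so it would have to be split across several calls and concatenated.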