Skip to content
Llamagate

Gemma3 4b

Gemma3 4b is available via Llamagate with a 128K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0800/1M output tokens.

Gemma3 4b Pricing & Specifications

Input Price$0.030 per 1M tokens
Output Price$0.080 per 1M tokens
Context Window128,000 tokens (128K)
Max Output8,192 tokens
ProviderLlamagate

What is Gemma3 4b?

Gemma3 4b is a large language model by Llamagate with a 128K context window and up to 8,192 output tokens. It costs $0.030 per 1M input tokens and $0.080 per 1M output tokens. Gemma3 4b is available via Llamagate with a 128K context window and up to 8,192 output tokens. Pricing: $0.0300/1M input tokens, $0.0800/1M output tokens.

Capabilities

text vision function calling json mode

Gemma3 4b Cost Examples

Short prompt (500 tokens)

$0.000015

Medium prompt (2K tokens)

$0.00006

Long output (4K tokens)

$0.00032

Count tokens for Gemma3 4b

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Gemma3 4b

Llamagate

Llama 3.1 8b

$0.030/1M in 131K ctx

Llamagate

Llama 3.2 3b

$0.040/1M in 131K ctx

Llamagate

Qwen3 8b

$0.040/1M in 33K ctx

Llamagate

Qwen2.5 Coder 7b

$0.060/1M in 33K ctx

Frequently Asked Questions

How much does Gemma3 4b cost per token? +
Gemma3 4b costs $0.030 per 1M input tokens and $0.080 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000070.
What is the context window for Gemma3 4b? +
Gemma3 4b supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Gemma3 4b? +
Gemma3 4b can generate up to 8,192 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Gemma3 4b good for coding tasks? +
Yes, Gemma3 4b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Llamagate Models