Question 1

How much does Gemma3 4b cost per token?

Accepted Answer

Gemma3 4b costs $0.030 per 1M input tokens and $0.080 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000070.

Question 2

What is the context window for Gemma3 4b?

Accepted Answer

Gemma3 4b supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.

Question 3

What is the maximum output length for Gemma3 4b?

Accepted Answer

Gemma3 4b can generate up to 8,192 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.

Question 4

Is Gemma3 4b good for coding tasks?

Accepted Answer

Yes, Gemma3 4b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.

Input Price	$0.030 per 1M tokens
Output Price	$0.080 per 1M tokens
Context Window	128,000 tokens (128K)
Max Output	8,192 tokens
Provider	Llamagate

Gemma3 4b

Gemma3 4b Pricing & Specifications

What is Gemma3 4b?

Capabilities

Gemma3 4b Cost Examples

Count tokens for Gemma3 4b

Similar Models to Gemma3 4b

Llama 3.1 8b

Llama 3.2 3b

Qwen3 8b

Qwen2.5 Coder 7b

Frequently Asked Questions