Skip to content
Cerebras

Zai Glm 4.6

Zai Glm 4.6 is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $2.25/1M input tokens, $2.75/1M output tokens.

Zai Glm 4.6 Pricing & Specifications

Input Price$2.25 per 1M tokens
Output Price$2.75 per 1M tokens
Context Window128,000 tokens (128K)
Max Output128,000 tokens
ProviderCerebras

What is Zai Glm 4.6?

Zai Glm 4.6 is a large language model by Cerebras with a 128K context window and up to 128,000 output tokens. It costs $2.25 per 1M input tokens and $2.75 per 1M output tokens. Zai Glm 4.6 is available via Cerebras with a 128K context window and up to 128,000 output tokens. Pricing: $2.25/1M input tokens, $2.75/1M output tokens.

Capabilities

text function calling reasoning

Zai Glm 4.6 Cost Examples

Short prompt (500 tokens)

$0.001125

Medium prompt (2K tokens)

$0.00450

Long output (4K tokens)

$0.01100

Count tokens for Zai Glm 4.6

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Zai Glm 4.6

Cerebras

Zai Glm 4.7

$2.25/1M in 128K ctx

Cerebras

Llama 3.3 70b

$0.85/1M in 128K ctx

Cerebras

Llama3.1 70b

$0.60/1M in 128K ctx

Cerebras

Qwen 3 32b

$0.40/1M in 128K ctx

Frequently Asked Questions

How much does Zai Glm 4.6 cost per token? +
Zai Glm 4.6 costs $2.25 per 1M input tokens and $2.75 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.003625.
What is the context window for Zai Glm 4.6? +
Zai Glm 4.6 supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Zai Glm 4.6? +
Zai Glm 4.6 can generate up to 128,000 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Zai Glm 4.6 good for coding tasks? +
Yes, Zai Glm 4.6 supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All Cerebras Models