Question 1

How much does Gpt Oss 20b Mxfp4 GGUF cost per token?

Accepted Answer

Gpt Oss 20b Mxfp4 GGUF costs $0.000 per 1M input tokens and $0.000 per 1M output tokens. A typical 1,000-token request with a 500-token response therefore works out to $0.000000, so there is no per-token charge at the listed price.
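The arithmetic behind that figure can be sketched in a few lines of Python. This is only an illustration: the function name `request_cost` and the constant names are made up for this example, and the prices are the ones listed on this page.

```python
# Listed prices in USD per 1M tokens (from this page).
INPUT_PRICE_PER_1M = 0.000
OUTPUT_PRICE_PER_1M = 0.000


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = INPUT_PRICE_PER_1M,
                 output_price: float = OUTPUT_PRICE_PER_1M) -> float:
    """Return the USD cost of one request given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# The example from the answer: 1,000 input tokens and 500 output tokens.
cost = request_cost(1_000, 500)
print(f"${cost:.6f}")  # prints $0.000000
```

Passing different `input_price` and `output_price` values lets the same helper be reused to compare against a paid model.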

Question 2

What is the context window for Gpt Oss 20b Mxfp4 GGUF?

Accepted Answer

Gpt Oss 20b Mxfp4 GGUF supports a context window of 131,072 tokens (131K). This caps the total number of tokens the model can process in a single API call: your prompt and conversation history, plus (in most serving setups) the tokens it generates in response.
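As a rough sketch, a request can be checked against the window before it is sent. The helper names below are hypothetical, and the code assumes that tokens reserved for the response also count against the same window, which is common but is an assumption here rather than something this page states. Token counts must come from your own tokenizer.

```python
CONTEXT_WINDOW = 131_072  # tokens, from the specification on this page
MAX_OUTPUT = 32_768       # tokens, from the specification on this page


def fits_in_context(prompt_tokens: int, history_tokens: int,
                    reserved_output: int = 0) -> bool:
    """True if prompt + history (+ reserved output) fit in the window."""
    return prompt_tokens + history_tokens + reserved_output <= CONTEXT_WINDOW


def max_prompt_budget(reserved_output: int = MAX_OUTPUT) -> int:
    """Tokens left for prompt and history after reserving room for output."""
    return CONTEXT_WINDOW - reserved_output


print(fits_in_context(100_000, 20_000))          # True: 120,000 <= 131,072
print(fits_in_context(100_000, 20_000, 32_768))  # False: 152,768 > 131,072
print(max_prompt_budget())                       # 98304
```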

Question 3

What is the maximum output length for Gpt Oss 20b Mxfp4 GGUF?

Accepted Answer

Gpt Oss 20b Mxfp4 GGUF can generate up to 32,768 tokens in a single response. If you need a longer output, make multiple API calls, passing the text generated so far back as context, and concatenate the results.
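The multi-call approach might look roughly like the sketch below. It deliberately avoids any real client library: `generate` is a hypothetical callable that you supply (taking the text so far and a token limit, and returning the next chunk of text), and treating an empty chunk as "finished" is an assumption of this example, not documented behavior of the model or provider.

```python
from typing import Callable

MAX_OUTPUT_TOKENS = 32_768  # per-response limit from this page


def generate_long(prompt: str,
                  generate: Callable[[str, int], str],
                  max_calls: int = 4) -> str:
    """Call `generate` repeatedly and concatenate the returned chunks."""
    parts: list[str] = []
    for _ in range(max_calls):
        # Feed back everything written so far so the next call continues it.
        chunk = generate(prompt + "".join(parts), MAX_OUTPUT_TOKENS)
        if not chunk:  # assumption: an empty chunk means nothing is left
            break
        parts.append(chunk)
    return "".join(parts)


# Usage with a stand-in generator instead of a real API call.
canned = iter(["Hello, ", "world.", ""])
result = generate_long("Say hello: ", lambda text, limit: next(canned))
print(result)  # Hello, world.
```

Note that the prompt grows with each call, so the combined text still has to fit within the context window.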

Question 4

Is Gpt Oss 20b Mxfp4 GGUF good for coding tasks?

Accepted Answer

Yes. Gpt Oss 20b Mxfp4 GGUF is well suited to coding tasks, including code generation, debugging, and refactoring.

Input Price	$0.000 per 1M tokens
Output Price	$0.000 per 1M tokens
Context Window	131,072 tokens (131K)
Max Output	32,768 tokens
Provider	Lemonade

Gpt Oss 20b Mxfp4 GGUF

Gpt Oss 20b Mxfp4 GGUF Pricing & Specifications

Similar Models to Gpt Oss 20b Mxfp4 GGUF

Qwen3 Coder 30B A3B Instruct GGUF

Gpt Oss 120b Mxfp GGUF

Gemma 3 4b It GGUF

Qwen3 4B Instruct 2507 GGUF

Frequently Asked Questions