Gpt Oss 20b Mxfp4 GGUF
Gpt Oss 20b Mxfp4 GGUF is available via Lemonade with a 131K context window and up to 32,768 output tokens. Pricing: $0.000000/1M input tokens, $0.000000/1M output tokens.
Gpt Oss 20b Mxfp4 GGUF Pricing & Specifications
What is Gpt Oss 20b Mxfp4 GGUF?
Gpt Oss 20b Mxfp4 GGUF is a large language model available via Lemonade with a 131K context window and up to 32,768 output tokens. It costs $0.000 per 1M input tokens and $0.000 per 1M output tokens.
Capabilities
Text, function calling, JSON mode
Gpt Oss 20b Mxfp4 GGUF Cost Examples
Short prompt (500 tokens)
$0.000000
Medium prompt (2K tokens)
$0.000000
Long output (4K tokens)
$0.000000
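The figures above follow from simple per-token arithmetic. As a minimal sketch (the function name and parameters here are illustrative, not part of any Lemonade API), the cost of a request can be estimated from the listed per-1M-token prices:

```python
def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = 0.0,   # listed price: $0.00 per 1M input tokens
    output_price_per_m: float = 0.0,  # listed price: $0.00 per 1M output tokens
) -> float:
    """Return the estimated cost in USD for one request.

    Prices are expressed per 1,000,000 tokens, so the weighted token
    counts are divided by 1,000,000.
    """
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# The three examples above, using the listed (zero) prices.
for label, inp, out in [
    ("Short prompt", 500, 0),
    ("Medium prompt", 2_000, 0),
    ("Long output", 0, 4_000),
]:
    print(f"{label}: ${estimate_cost(inp, out):.6f}")
```

Because both listed rates are zero, every request evaluates to $0.000000; passing non-zero prices to the same function lets you compare against other models.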
Frequently Asked Questions
How much does Gpt Oss 20b Mxfp4 GGUF cost per token?
Gpt Oss 20b Mxfp4 GGUF costs $0.000 per 1M input tokens and $0.000 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000000.
What is the context window for Gpt Oss 20b Mxfp4 GGUF?
Gpt Oss 20b Mxfp4 GGUF supports a context window of 131,072 tokens (131K). This is the maximum number of tokens the model can process in a single API call, covering your prompt, the conversation history, and, in most serving setups, the generated response.
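A quick budget check can show how much room a prompt has. The sketch below assumes that generated tokens count against the same 131,072-token window as the prompt (common, but worth verifying for your setup); the function names are illustrative only:

```python
CONTEXT_WINDOW = 131_072     # tokens, as listed above
MAX_OUTPUT_TOKENS = 32_768   # maximum tokens per response, as listed


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Tokens left for prompt plus history after reserving room for output.

    Assumption: output tokens share the context window with the prompt.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT_TOKENS:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT_TOKENS")
    return CONTEXT_WINDOW - reserved_output


def fits_in_context(prompt_tokens: int, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """True if the prompt fits once the output reservation is subtracted."""
    return prompt_tokens <= max_prompt_tokens(reserved_output)


print(max_prompt_tokens())            # 98304
print(fits_in_context(100_000))       # False: exceeds 98,304 with full output reserved
print(fits_in_context(100_000, 4_096))  # True: smaller output reservation
```

Reserving the full 32,768 output tokens leaves 98,304 tokens for the prompt and history under this assumption.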
What is the maximum output length for Gpt Oss 20b Mxfp4 GGUF?
Gpt Oss 20b Mxfp4 GGUF can generate up to 32,768 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
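The multi-call approach can be sketched as a small loop. Here `generate` is a hypothetical callable standing in for whatever client you use; it is assumed to return the new text plus a flag saying whether the response was cut off at the output limit. Neither the name nor that return shape comes from Lemonade's API:

```python
from typing import Callable, List, Tuple


def generate_long(
    prompt: str,
    generate: Callable[[str], Tuple[str, bool]],
    max_calls: int = 4,
) -> str:
    """Concatenate successive responses until the model finishes.

    `generate(context)` is a hypothetical function returning
    (new_text, truncated), where `truncated` is True when the response
    stopped because it hit the per-response output limit.
    """
    parts: List[str] = []
    context = prompt
    for _ in range(max_calls):
        text, truncated = generate(context)
        parts.append(text)
        if not truncated:
            break
        # Feed the output so far back in so the next call can continue it.
        context += text
    return "".join(parts)
```

Note that each continuation call resends the earlier output as input, so the accumulated text still has to fit within the context window; `max_calls` caps the loop in case the model never signals completion.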
Is Gpt Oss 20b Mxfp4 GGUF good for coding tasks?
Yes. Gpt Oss 20b Mxfp4 GGUF supports function calling and JSON mode, which help with coding workflows that rely on tool use or structured output, such as code generation, debugging, and refactoring.