Models Available: 244
Cheapest Input (per 1M tokens): $0.001
Largest Context: 262K
What is Fireworks AI?
Fireworks AI is an AI model provider offering 244 models to developers via API. Its cheapest model starts at $0.001 per 1M input tokens, and its largest context window reaches 262K tokens.
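Because every price on this page is quoted per 1M tokens, the cost of a single request can be estimated by scaling the input and output token counts separately. The following is a minimal sketch, not an official Fireworks AI billing formula: the function name `estimate_cost` and the example token counts are hypothetical, and actual billing may differ.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price_per_1m, output_price_per_1m):
    """Estimate the USD cost of one request from per-1M-token prices.

    Prices are passed as strings (e.g. "0.0500") so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_price_per_1m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_price_per_1m)
    return input_cost + output_cost


# Hypothetical request: 10,000 input tokens and 2,000 output tokens at
# $0.0500 / $0.2000 per 1M tokens (the Gpt Oss 20b prices listed on this page).
cost = estimate_cost(10_000, 2_000, "0.0500", "0.2000")
print(f"${cost:.4f}")  # prints "$0.0009"
```

Input and output tokens are priced separately because many models listed here charge more for output than for input.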
All Fireworks AI Models
Model Details
Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union
Accounts/Fireworks/Models/Flux 1 Dev Controlnet Union is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.0010/1M input tokens, $0.0010/1M output tokens.
Accounts/Fireworks/Models/Gpt Oss 20b
Accounts/Fireworks/Models/Gpt Oss 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.0500/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p1 8b Instruct
Accounts/Fireworks/Models/Llama V3p1 8b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p2 1b Instruct
Accounts/Fireworks/Models/Llama V3p2 1b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p2 3b Instruct
Accounts/Fireworks/Models/Llama V3p2 3b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Codegemma 2b
Accounts/Fireworks/Models/Codegemma 2b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b
Accounts/Fireworks/Models/Cogito V1 Preview Llama 3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder 1b Base
Accounts/Fireworks/Models/Deepseek Coder 1b Base is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 1p5b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt
Accounts/Fireworks/Models/Ernie 4p5 21b A3b Pt is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt
Accounts/Fireworks/Models/Ernie 4p5 300b A47b Pt is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Flux 1 Dev
Accounts/Fireworks/Models/Flux 1 Dev is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Flux 1 Schnell
Accounts/Fireworks/Models/Flux 1 Schnell is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Gemma 2b It
Accounts/Fireworks/Models/Gemma 2b It is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama Guard 3 1b
Accounts/Fireworks/Models/Llama Guard 3 1b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama V2 70b
Accounts/Fireworks/Models/Llama V2 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long
Accounts/Fireworks/Models/Llama V3p1 405b Instruct Long is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b
Accounts/Fireworks/Models/Llama V3p1 70b Instruct 1b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p2 1b
Accounts/Fireworks/Models/Llama V3p2 1b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p2 3b
Accounts/Fireworks/Models/Llama V3p2 3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Minimax M1 80k
Accounts/Fireworks/Models/Minimax M1 80k is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512
Accounts/Fireworks/Models/Ministral 3 3b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl
Accounts/Fireworks/Models/Nemotron Nano V2 12b Vl is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Phi 2 3b
Accounts/Fireworks/Models/Phi 2 3b is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct
Accounts/Fireworks/Models/Phi 3 Mini 128k Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct
Accounts/Fireworks/Models/Qwen2 Vl 2b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct
Accounts/Fireworks/Models/Qwen2p5 0p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct
Accounts/Fireworks/Models/Qwen2p5 1p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct
Accounts/Fireworks/Models/Qwen2p5 Coder 0p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct
Accounts/Fireworks/Models/Qwen2p5 Coder 1p5b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 3b
Accounts/Fireworks/Models/Qwen2p5 Coder 3b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct
Accounts/Fireworks/Models/Qwen2p5 Coder 3b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 0p6b
Accounts/Fireworks/Models/Qwen3 0p6b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 1p7b
Accounts/Fireworks/Models/Qwen3 1p7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 131072 is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960
Accounts/Fireworks/Models/Qwen3 1p7b Fp8 Draft 40960 is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Stablecode 3b
Accounts/Fireworks/Models/Stablecode 3b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Starcoder2 3b
Accounts/Fireworks/Models/Starcoder2 3b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.
Accounts/Fireworks/Models/Gpt Oss 120b
Accounts/Fireworks/Models/Gpt Oss 120b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Accounts/Fireworks/Models/Llama4 Scout Instruct Basic
Accounts/Fireworks/Models/Llama4 Scout Instruct Basic is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 30b A3b
Accounts/Fireworks/Models/Qwen3 30b A3b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct
Accounts/Fireworks/Models/Qwen3 Coder 30b A3b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking
Accounts/Fireworks/Models/Qwen3 Vl 30b A3b Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct
Accounts/Fireworks/Models/Llama V3p2 11b Vision Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Chronos Hermes 13b
Accounts/Fireworks/Models/Chronos Hermes 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 13b
Accounts/Fireworks/Models/Code Llama 13b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 13b Instruct
Accounts/Fireworks/Models/Code Llama 13b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 13b Python
Accounts/Fireworks/Models/Code Llama 13b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 7b
Accounts/Fireworks/Models/Code Llama 7b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 7b Instruct
Accounts/Fireworks/Models/Code Llama 7b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 7b Python
Accounts/Fireworks/Models/Code Llama 7b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Code Qwen 1p5 7b
Accounts/Fireworks/Models/Code Qwen 1p5 7b is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Codegemma 7b
Accounts/Fireworks/Models/Codegemma 7b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b
Accounts/Fireworks/Models/Cogito V1 Preview Llama 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder 7b Base
Accounts/Fireworks/Models/Deepseek Coder 7b Base is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5
Accounts/Fireworks/Models/Deepseek Coder 7b Base V1p5 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5
Accounts/Fireworks/Models/Deepseek Coder 7b Instruct V1p5 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b
Accounts/Fireworks/Models/Deepseek R1 0528 Distill Qwen3 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b
Accounts/Fireworks/Models/Dobby Mini Unhinged Plus Llama 3 1 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Firellava 13b
Accounts/Fireworks/Models/Firellava 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Firesearch Ocr
Accounts/Fireworks/Models/Firesearch Ocr is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Gemma 7b
Accounts/Fireworks/Models/Gemma 7b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Gemma 7b It
Accounts/Fireworks/Models/Gemma 7b It is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Gemma2 9b It
Accounts/Fireworks/Models/Gemma2 9b It is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b
Accounts/Fireworks/Models/Hermes 2 Pro Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Internvl3 8b
Accounts/Fireworks/Models/Internvl3 8b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama Guard 2 8b
Accounts/Fireworks/Models/Llama Guard 2 8b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama Guard 3 8b
Accounts/Fireworks/Models/Llama Guard 3 8b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama V2 13b
Accounts/Fireworks/Models/Llama V2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama V2 13b Chat
Accounts/Fireworks/Models/Llama V2 13b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama V2 7b
Accounts/Fireworks/Models/Llama V2 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama V2 7b Chat
Accounts/Fireworks/Models/Llama V2 7b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama V3 8b
Accounts/Fireworks/Models/Llama V3 8b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llama V3 8b Instruct Hf
Accounts/Fireworks/Models/Llama V3 8b Instruct Hf is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Llamaguard 7b
Accounts/Fireworks/Models/Llamaguard 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512
Accounts/Fireworks/Models/Ministral 3 14b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512
Accounts/Fireworks/Models/Ministral 3 8b Instruct 2512 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mistral 7b
Accounts/Fireworks/Models/Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mistral 7b Instruct 4k
Accounts/Fireworks/Models/Mistral 7b Instruct 4k is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mistral 7b Instruct V0p2
Accounts/Fireworks/Models/Mistral 7b Instruct V0p2 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mistral 7b Instruct
Accounts/Fireworks/Models/Mistral 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mistral 7b V0p2
Accounts/Fireworks/Models/Mistral 7b V0p2 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mistral Nemo Base 2407
Accounts/Fireworks/Models/Mistral Nemo Base 2407 is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mistral Nemo Instruct 2407
Accounts/Fireworks/Models/Mistral Nemo Instruct 2407 is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Mythomax L2 13b
Accounts/Fireworks/Models/Mythomax L2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Nous Capybara 7b V1p9
Accounts/Fireworks/Models/Nous Capybara 7b V1p9 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Nous Hermes Llama2 13b
Accounts/Fireworks/Models/Nous Hermes Llama2 13b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Nous Hermes Llama2 7b
Accounts/Fireworks/Models/Nous Hermes Llama2 7b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b
Accounts/Fireworks/Models/Nvidia Nemotron Nano 12b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b
Accounts/Fireworks/Models/Nvidia Nemotron Nano 9b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Openchat 3p5 0106 7b
Accounts/Fireworks/Models/Openchat 3p5 0106 7b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Openhermes 2 Mistral 7b
Accounts/Fireworks/Models/Openhermes 2 Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b
Accounts/Fireworks/Models/Openhermes 2p5 Mistral 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Openorca 7b
Accounts/Fireworks/Models/Openorca 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct
Accounts/Fireworks/Models/Phi 3 Vision 128k Instruct is available via Fireworks AI with a 32K context window and up to 32,064 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Pythia 12b
Accounts/Fireworks/Models/Pythia 12b is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen V2p5 14b Instruct
Accounts/Fireworks/Models/Qwen V2p5 14b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen V2p5 7b
Accounts/Fireworks/Models/Qwen V2p5 7b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2 7b Instruct
Accounts/Fireworks/Models/Qwen2 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct
Accounts/Fireworks/Models/Qwen2 Vl 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 14b
Accounts/Fireworks/Models/Qwen2p5 14b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 7b Instruct
Accounts/Fireworks/Models/Qwen2p5 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 14b
Accounts/Fireworks/Models/Qwen2p5 Coder 14b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct
Accounts/Fireworks/Models/Qwen2p5 Coder 14b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 7b
Accounts/Fireworks/Models/Qwen2p5 Coder 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct
Accounts/Fireworks/Models/Qwen2p5 Coder 7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct
Accounts/Fireworks/Models/Qwen2p5 Vl 3b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct
Accounts/Fireworks/Models/Qwen2p5 Vl 7b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 14b
Accounts/Fireworks/Models/Qwen3 14b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 4b
Accounts/Fireworks/Models/Qwen3 4b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 4b Instruct 2507
Accounts/Fireworks/Models/Qwen3 4b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 8b
Accounts/Fireworks/Models/Qwen3 8b is available via Fireworks AI with a 41K context window and up to 40,960 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct
Accounts/Fireworks/Models/Qwen3 Vl 8b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Rolm Ocr
Accounts/Fireworks/Models/Rolm Ocr is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo
Accounts/Fireworks/Models/Snorkel Mistral 7b Pairrm Dpo is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Starcoder 16b
Accounts/Fireworks/Models/Starcoder 16b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Starcoder 7b
Accounts/Fireworks/Models/Starcoder 7b is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Starcoder2 15b
Accounts/Fireworks/Models/Starcoder2 15b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Starcoder2 7b
Accounts/Fireworks/Models/Starcoder2 7b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Toppy M 7b
Accounts/Fireworks/Models/Toppy M 7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Yi 6b
Accounts/Fireworks/Models/Yi 6b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Zephyr 7b Beta
Accounts/Fireworks/Models/Zephyr 7b Beta is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.2000/1M input tokens, $0.2000/1M output tokens.
Accounts/Fireworks/Models/Glm 4p5 Air
Accounts/Fireworks/Models/Glm 4p5 Air is available via Fireworks AI with a 128K context window and up to 96,000 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.
Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic
Accounts/Fireworks/Models/Llama4 Maverick Instruct Basic is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.
Accounts/Fireworks/Models/Qwen3 235b A22b
Accounts/Fireworks/Models/Qwen3 235b A22b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.
Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507
Accounts/Fireworks/Models/Qwen3 235b A22b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.
Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507
Accounts/Fireworks/Models/Qwen3 235b A22b Thinking 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking
Accounts/Fireworks/Models/Qwen3 Vl 235b A22b Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.2200/1M input tokens, $0.8800/1M output tokens.
Accounts/Fireworks/Models/Minimax M2p1
Accounts/Fireworks/Models/Minimax M2p1 is available via Fireworks AI with a 205K context window and up to 204,800 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Minimax M2
Accounts/Fireworks/Models/Minimax M2 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.3000/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct
Accounts/Fireworks/Models/Qwen3 Coder 480b A35b Instruct is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.4500/1M input tokens, $1.80/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Base is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct
Accounts/Fireworks/Models/Deepseek Coder V2 Lite Instruct is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Deepseek V2 Lite Chat
Accounts/Fireworks/Models/Deepseek V2 Lite Chat is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b
Accounts/Fireworks/Models/Dolphin 2p6 Mixtral 8x7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Firefunction
Accounts/Fireworks/Models/Firefunction is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Gpt Oss Safeguard 20b
Accounts/Fireworks/Models/Gpt Oss Safeguard 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Mixtral 8x7b
Accounts/Fireworks/Models/Mixtral 8x7b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Mixtral 8x7b Instruct
Accounts/Fireworks/Models/Mixtral 8x7b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf
Accounts/Fireworks/Models/Mixtral 8x7b Instruct Hf is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo
Accounts/Fireworks/Models/Nous Hermes 2 Mixtral 8x7b Dpo is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507
Accounts/Fireworks/Models/Qwen3 30b A3b Instruct 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.5000/1M input tokens, $0.5000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 Basic
Accounts/Fireworks/Models/Deepseek R1 Basic is available via Fireworks AI with a 128K context window and up to 20,480 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.
Accounts/Fireworks/Models/Glm 4p5
Accounts/Fireworks/Models/Glm 4p5 is available via Fireworks AI with a 128K context window and up to 96,000 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.
Accounts/Fireworks/Models/Glm 4p6
Accounts/Fireworks/Models/Glm 4p6 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.5500/1M input tokens, $2.19/1M output tokens.
Accounts/Fireworks/Models/Deepseek V3p1
Accounts/Fireworks/Models/Deepseek V3p1 is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.
Accounts/Fireworks/Models/Deepseek V3p1 Terminus
Accounts/Fireworks/Models/Deepseek V3p1 Terminus is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.
Accounts/Fireworks/Models/Deepseek V3p2
Accounts/Fireworks/Models/Deepseek V3p2 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5600/1M input tokens, $1.68/1M output tokens.
Accounts/Fireworks/Models/Glm 4p7
Accounts/Fireworks/Models/Glm 4p7 is available via Fireworks AI with a 203K context window and up to 202,800 output tokens. Pricing: $0.6000/1M input tokens, $2.20/1M output tokens.
Accounts/Fireworks/Models/Kimi K2 Instruct
Accounts/Fireworks/Models/Kimi K2 Instruct is available via Fireworks AI with a 131K context window and up to 16,384 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.
Accounts/Fireworks/Models/Kimi K2 Instruct 0905
Accounts/Fireworks/Models/Kimi K2 Instruct 0905 is available via Fireworks AI with a 262K context window and up to 32,768 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.
Accounts/Fireworks/Models/Kimi K2 Thinking
Accounts/Fireworks/Models/Kimi K2 Thinking is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $2.50/1M output tokens.
Accounts/Fireworks/Models/Kimi K2p5
Accounts/Fireworks/Models/Kimi K2p5 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.
Accounts/Fireworks/Models/Deepseek
Accounts/Fireworks/Models/Deepseek is available via Fireworks AI with a 128K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Deepseek V3 0324
Accounts/Fireworks/Models/Deepseek V3 0324 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Firefunction
Accounts/Fireworks/Models/Firefunction is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct
Accounts/Fireworks/Models/Llama V3p2 90b Vision Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2 72b Instruct
Accounts/Fireworks/Models/Qwen2 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 34b
Accounts/Fireworks/Models/Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 34b Instruct
Accounts/Fireworks/Models/Code Llama 34b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 34b Python
Accounts/Fireworks/Models/Code Llama 34b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 70b
Accounts/Fireworks/Models/Code Llama 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 70b Instruct
Accounts/Fireworks/Models/Code Llama 70b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Code Llama 70b Python
Accounts/Fireworks/Models/Code Llama 70b Python is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b
Accounts/Fireworks/Models/Cogito V1 Preview Llama 70b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b
Accounts/Fireworks/Models/Cogito V1 Preview Qwen 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder 33b Instruct
Accounts/Fireworks/Models/Deepseek Coder 33b Instruct is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b
Accounts/Fireworks/Models/Deepseek R1 Distill Llama 70b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b
Accounts/Fireworks/Models/Deepseek R1 Distill Qwen 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Devstral Small 2505
Accounts/Fireworks/Models/Devstral Small 2505 is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New
Accounts/Fireworks/Models/Dobby Unhinged Llama 3 3 70b New is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b
Accounts/Fireworks/Models/Dolphin 2 9 2 Qwen2 72b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Fare 20b
Accounts/Fireworks/Models/Fare 20b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Gemma 3 27b It
Accounts/Fireworks/Models/Gemma 3 27b It is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Internvl3 38b
Accounts/Fireworks/Models/Internvl3 38b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Internvl3 78b
Accounts/Fireworks/Models/Internvl3 78b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Kat Coder
Accounts/Fireworks/Models/Kat Coder is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Kat Dev 32b
Accounts/Fireworks/Models/Kat Dev 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Kat Dev 72b Exp
Accounts/Fireworks/Models/Kat Dev 72b Exp is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llama V2 70b Chat
Accounts/Fireworks/Models/Llama V2 70b Chat is available via Fireworks AI with a 2K context window and up to 2,048 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llama V3 70b Instruct
Accounts/Fireworks/Models/Llama V3 70b Instruct is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llama V3 70b Instruct Hf
Accounts/Fireworks/Models/Llama V3 70b Instruct Hf is available via Fireworks AI with an 8K context window and up to 8,192 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p1 70b Instruct
Accounts/Fireworks/Models/Llama V3p1 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct
Accounts/Fireworks/Models/Llama V3p1 Nemotron 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llama V3p3 70b Instruct
Accounts/Fireworks/Models/Llama V3p3 70b Instruct is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Llava Yi 34b
Accounts/Fireworks/Models/Llava Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501
Accounts/Fireworks/Models/Mistral Small 24b Instruct 2501 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b
Accounts/Fireworks/Models/Nous Hermes 2 Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Nous Hermes Llama2 70b
Accounts/Fireworks/Models/Nous Hermes Llama2 70b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Phind Code Llama 34b Python
Accounts/Fireworks/Models/Phind Code Llama 34b Python is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Phind Code Llama 34b
Accounts/Fireworks/Models/Phind Code Llama 34b is available via Fireworks AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen Qwq 32b Preview
Accounts/Fireworks/Models/Qwen Qwq 32b Preview is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen1p5 72b Chat
Accounts/Fireworks/Models/Qwen1p5 72b Chat is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct
Accounts/Fireworks/Models/Qwen2 Vl 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 32b
Accounts/Fireworks/Models/Qwen2p5 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 32b Instruct
Accounts/Fireworks/Models/Qwen2p5 32b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 72b
Accounts/Fireworks/Models/Qwen2p5 72b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 72b Instruct
Accounts/Fireworks/Models/Qwen2p5 72b Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 32b
Accounts/Fireworks/Models/Qwen2p5 Coder 32b is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 128k is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 32k Rope is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k
Accounts/Fireworks/Models/Qwen2p5 Coder 32b Instruct 64k is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct
Accounts/Fireworks/Models/Qwen2p5 Math 72b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct
Accounts/Fireworks/Models/Qwen2p5 Vl 32b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct
Accounts/Fireworks/Models/Qwen2p5 Vl 72b Instruct is available via Fireworks AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507
Accounts/Fireworks/Models/Qwen3 30b A3b Thinking 2507 is available via Fireworks AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 32b
Accounts/Fireworks/Models/Qwen3 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16
Accounts/Fireworks/Models/Qwen3 Coder 480b Instruct Bf16 is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking
Accounts/Fireworks/Models/Qwen3 Next 80b A3b Thinking is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct
Accounts/Fireworks/Models/Qwen3 Vl 32b Instruct is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Qwq 32b
Accounts/Fireworks/Models/Qwq 32b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Yi 34b
Accounts/Fireworks/Models/Yi 34b is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Yi 34b 200k Capybara
Accounts/Fireworks/Models/Yi 34b 200k Capybara is available via Fireworks AI with a 200K context window and up to 200,000 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Yi 34b Chat
Accounts/Fireworks/Models/Yi 34b Chat is available via Fireworks AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.
Accounts/Fireworks/Models/Deepseek Coder V2 Instruct
Accounts/Fireworks/Models/Deepseek Coder V2 Instruct is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf
Accounts/Fireworks/Models/Mixtral 8x22b Instruct Hf is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Cogito 671b V2 P1
Accounts/Fireworks/Models/Cogito 671b V2 P1 is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Dbrx Instruct
Accounts/Fireworks/Models/Dbrx Instruct is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Deepseek Prover
Accounts/Fireworks/Models/Deepseek Prover is available via Fireworks AI with a 164K context window and up to 163,840 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Deepseek V2p5
Accounts/Fireworks/Models/Deepseek V2p5 is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Glm 4p5v
Accounts/Fireworks/Models/Glm 4p5v is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Gpt Oss Safeguard 120b
Accounts/Fireworks/Models/Gpt Oss Safeguard 120b is available via Fireworks AI with a 131K context window and up to 131,072 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Mistral Large 3 Fp8
Accounts/Fireworks/Models/Mistral Large 3 Fp8 is available via Fireworks AI with a 256K context window and up to 256,000 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Mixtral 8x22b
Accounts/Fireworks/Models/Mixtral 8x22b is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Mixtral 8x22b Instruct
Accounts/Fireworks/Models/Mixtral 8x22b Instruct is available via Fireworks AI with a 66K context window and up to 65,536 output tokens. Pricing: $1.20/1M input tokens, $1.20/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1
Accounts/Fireworks/Models/Deepseek R1 is available via Fireworks AI with a 128K context window and up to 20,480 output tokens. Pricing: $3.00/1M input tokens, $8.00/1M output tokens.
Accounts/Fireworks/Models/Deepseek R1 0528
Accounts/Fireworks/Models/Deepseek R1 0528 is available via Fireworks AI with a 160K context window and up to 160,000 output tokens. Pricing: $3.00/1M input tokens, $8.00/1M output tokens.
Accounts/Fireworks/Models/Llama V3p1 405b Instruct
Accounts/Fireworks/Models/Llama V3p1 405b Instruct is available via Fireworks AI with a 128K context window and up to 16,384 output tokens. Pricing: $3.00/1M input tokens, $3.00/1M output tokens.
Accounts/Fireworks/Models/Yi Large
Accounts/Fireworks/Models/Yi Large is available via Fireworks AI with a 33K context window and up to 32,768 output tokens. Pricing: $3.00/1M input tokens, $3.00/1M output tokens.
Compare Fireworks AI model pricing
Use our pricing calculator to find the cheapest Fireworks AI model for your workload.
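The arithmetic behind such a comparison is simple enough to sketch by hand. The Python snippet below is an illustration only, not the site's calculator: the function names, the three-model sample, and the workload size (2M input tokens, 500K output tokens) are assumptions chosen for the example. The prices are copied from the listing above and may change.

```python
# Prices in USD per 1M tokens (input, output), copied from the listing above.
# The selection of models is an arbitrary sample for illustration.
PRICES = {
    "Glm 4p5 Air": (0.22, 0.88),
    "Deepseek V3p1": (0.56, 1.68),
    "Llama V3p1 405b Instruct": (3.00, 3.00),
}


def workload_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the USD cost of a workload given per-1M-token prices."""
    return (input_tokens / 1_000_000) * input_price + (
        output_tokens / 1_000_000
    ) * output_price


def cheapest_model(prices, input_tokens, output_tokens):
    """Return (model name, cost) for the lowest-cost model in `prices`."""
    costs = {
        name: workload_cost(input_tokens, output_tokens, in_price, out_price)
        for name, (in_price, out_price) in prices.items()
    }
    name = min(costs, key=costs.get)
    return name, costs[name]


# Hypothetical workload: 2M input tokens and 500K output tokens.
name, cost = cheapest_model(PRICES, 2_000_000, 500_000)
print(f"{name}: ${cost:.2f}")
```

Because input and output tokens are priced separately for many models, the cheapest choice depends on the input/output ratio of the workload, so it is worth running the comparison with realistic token counts. Price alone also ignores context window and output limits, which can rule a model out regardless of cost.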