58
Models Available
$0.040
Cheapest Input / 1M
10M
Largest Context
What is Azure AI?
Azure AI is an AI model provider offering 58 large language models for developers. Their cheapest model starts at $0.040 per 1M input tokens, and their largest context window reaches 10M. Azure AI provides 58 AI models accessible via API.
Azure AI Strengths
All Azure AI Models
| Model | Input $/1M | Output $/1M | Context | Max Output | Released |
|---|---|---|---|---|---|
| Ministral 3b | $0.040 | $0.040 | 128K | 4,096 | — |
| Phi 4 Mini Instruct | $0.075 | $0.30 | 131K | 4,096 | — |
| Phi 4 Multimodal Instruct | $0.080 | $0.32 | 131K | 4,096 | — |
| Phi 4 Mini Reasoning | $0.080 | $0.32 | 131K | 4,096 | — |
| Mistral Small 2503 | $0.10 | $0.30 | 128K | 128,000 | — |
| Phi 4 | $0.13 | $0.50 | 16K | 16,384 | — |
| Phi 4 Reasoning | $0.13 | $0.50 | 33K | 4,096 | — |
| Phi 3 Mini 128k Instruct | $0.13 | $0.52 | 128K | 4,096 | — |
| Phi 3 Mini 4k Instruct | $0.13 | $0.52 | 4K | 4,096 | — |
| Phi 3.5 Mini Instruct | $0.13 | $0.52 | 128K | 4,096 | — |
| Phi 3.5 Vision Instruct | $0.13 | $0.52 | 128K | 4,096 | — |
| Gpt Oss 120b | $0.15 | $0.60 | 131K | 131,072 | — |
| Phi 3 Small 128k Instruct | $0.15 | $0.60 | 128K | 4,096 | — |
| Phi 3 Small 8k Instruct | $0.15 | $0.60 | 8K | 4,096 | — |
| Mistral Nemo | $0.15 | $0.15 | 131K | 4,096 | — |
| Phi 3.5 MoE Instruct | $0.16 | $0.64 | 128K | 4,096 | — |
| Phi 3 Medium 128k Instruct | $0.17 | $0.68 | 128K | 4,096 | — |
| Phi 3 Medium 4k Instruct | $0.17 | $0.68 | 4K | 4,096 | — |
| Llama 4 Scout 17B 16E Instruct | $0.20 | $0.78 | 10M | 16,384 | — |
| Grok 4 Fast Non Reasoning | $0.20 | $0.50 | 131K | 131,072 | — |
| Grok 4 Fast Reasoning | $0.20 | $0.50 | 131K | 131,072 | — |
| Grok 4 1 Fast Non Reasoning | $0.20 | $0.50 | 131K | 131,072 | — |
| Grok 4 1 Fast Reasoning | $0.20 | $0.50 | 131K | 131,072 | — |
| Grok Code Fast 1 | $0.20 | $1.50 | 131K | 131,072 | — |
| Global/Grok 3 Mini | $0.25 | $1.27 | 131K | 131,072 | — |
| Grok 3 Mini | $0.25 | $1.27 | 131K | 131,072 | — |
| Meta Llama 3.1 8B Instruct | $0.30 | $0.61 | 128K | 2,048 | — |
| Llama 3.2 11B Vision Instruct | $0.37 | $0.37 | 128K | 2,048 | — |
| Mistral Medium 2505 | $0.40 | $2.00 | 131K | 8,191 | — |
| Jamba Instruct | $0.50 | $0.70 | 70K | 4,096 | — |
| Mistral Large 3 | $0.50 | $1.50 | 256K | 8,191 | — |
| Deepseek V3.2 | $0.58 | $1.68 | 164K | 163,840 | — |
| Deepseek V3.2 Speciale | $0.58 | $1.68 | 164K | 163,840 | — |
| Kimi K2.5 | $0.60 | $3.00 | 262K | 262,144 | — |
| Llama 3.3 70B Instruct | $0.71 | $0.71 | 128K | 2,048 | — |
| Claude Haiku 4 5 | $1.00 | $5.00 | 200K | 64,000 | — |
| Mistral Small | $1.00 | $3.00 | 32K | 8,191 | — |
| Meta Llama 3 70B Instruct | $1.10 | $0.37 | 8K | 2,048 | — |
| Deepseek | $1.14 | $4.56 | 128K | 8,192 | — |
| Deepseek V3 0324 | $1.14 | $4.56 | 128K | 8,192 | — |
| MAI DS R1 | $1.35 | $5.40 | 128K | 8,192 | — |
| Deepseek R1 | $1.35 | $5.40 | 128K | 8,192 | — |
| Llama 4 Maverick 17B 128E Instruct FP8 | $1.41 | $0.35 | 1M | 16,384 | — |
| Mistral Large 2407 | $2.00 | $6.00 | 128K | 4,096 | — |
| Mistral Large Latest | $2.00 | $6.00 | 128K | 4,096 | — |
| Llama 3.2 90B Vision Instruct | $2.04 | $2.04 | 128K | 2,048 | — |
| Meta Llama 3.1 70B Instruct | $2.68 | $3.54 | 128K | 2,048 | — |
| Claude Sonnet 4 5 | $3.00 | $15.00 | 200K | 64,000 | — |
| Claude Sonnet 4 6 | $3.00 | $15.00 | 1M | 64,000 | — |
| Global/Grok 3 | $3.00 | $15.00 | 131K | 131,072 | — |
| Grok 3 | $3.00 | $15.00 | 131K | 131,072 | — |
| Grok 4 | $3.00 | $15.00 | 131K | 131,072 | — |
| Mistral Large | $4.00 | $12.00 | 32K | 8,191 | — |
| Claude Opus 4 5 | $5.00 | $25.00 | 200K | 64,000 | — |
| Claude Opus 4 6 | $5.00 | $25.00 | 200K | 128,000 | — |
| Meta Llama 3.1 405B Instruct | $5.33 | $16.00 | 128K | 2,048 | — |
| Claude Opus 4 1 | $15.00 | $75.00 | 200K | 32,000 | — |
| Jais 30b Chat | $3200.00 | $9710.00 | 8K | 8,192 | — |
Model Details
Ministral 3b
Ministral 3b is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.0400/1M input tokens, $0.0400/1M output tokens.
Phi 4 Mini Instruct
Phi 4 Mini Instruct is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.
Phi 4 Multimodal Instruct
Phi 4 Multimodal Instruct is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.3200/1M output tokens.
Phi 4 Mini Reasoning
Phi 4 Mini Reasoning is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.3200/1M output tokens.
Mistral Small 2503
Mistral Small 2503 is available via Azure AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.
Phi 4
Phi 4 is available via Azure AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1250/1M input tokens, $0.5000/1M output tokens.
Phi 4 Reasoning
Phi 4 Reasoning is available via Azure AI with a 33K context window and up to 4,096 output tokens. Pricing: $0.1250/1M input tokens, $0.5000/1M output tokens.
Phi 3 Mini 128k Instruct
Phi 3 Mini 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.
Phi 3 Mini 4k Instruct
Phi 3 Mini 4k Instruct is available via Azure AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.
Phi 3.5 Mini Instruct
Phi 3.5 Mini Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.
Phi 3.5 Vision Instruct
Phi 3.5 Vision Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.
Gpt Oss 120b
Gpt Oss 120b is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Phi 3 Small 128k Instruct
Phi 3 Small 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Phi 3 Small 8k Instruct
Phi 3 Small 8k Instruct is available via Azure AI with a 8K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.
Mistral Nemo
Mistral Nemo is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.
Phi 3.5 MoE Instruct
Phi 3.5 MoE Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1600/1M input tokens, $0.6400/1M output tokens.
Phi 3 Medium 128k Instruct
Phi 3 Medium 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6800/1M output tokens.
Phi 3 Medium 4k Instruct
Phi 3 Medium 4k Instruct is available via Azure AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6800/1M output tokens.
Llama 4 Scout 17B 16E Instruct
Llama 4 Scout 17B 16E Instruct is available via Azure AI with a 10M context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.7800/1M output tokens.
Grok 4 Fast Non Reasoning
Grok 4 Fast Non Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.
Grok 4 Fast Reasoning
Grok 4 Fast Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.
Grok 4 1 Fast Non Reasoning
Grok 4 1 Fast Non Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.
Grok 4 1 Fast Reasoning
Grok 4 1 Fast Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.
Grok Code Fast 1
Grok Code Fast 1 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $1.50/1M output tokens.
Global/Grok 3 Mini
Global/Grok 3 Mini is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2500/1M input tokens, $1.27/1M output tokens.
Grok 3 Mini
Grok 3 Mini is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2500/1M input tokens, $1.27/1M output tokens.
Meta Llama 3.1 8B Instruct
Meta Llama 3.1 8B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.3000/1M input tokens, $0.6100/1M output tokens.
Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.3700/1M input tokens, $0.3700/1M output tokens.
Mistral Medium 2505
Mistral Medium 2505 is available via Azure AI with a 131K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.
Jamba Instruct
Jamba Instruct is available via Azure AI with a 70K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $0.7000/1M output tokens.
Mistral Large 3
Mistral Large 3 is available via Azure AI with a 256K context window and up to 8,191 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.
Deepseek V3.2
Deepseek V3.2 is available via Azure AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5800/1M input tokens, $1.68/1M output tokens.
Deepseek V3.2 Speciale
Deepseek V3.2 Speciale is available via Azure AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5800/1M input tokens, $1.68/1M output tokens.
Kimi K2.5
Kimi K2.5 is available via Azure AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.
Llama 3.3 70B Instruct
Llama 3.3 70B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.7100/1M input tokens, $0.7100/1M output tokens.
Claude Haiku 4 5
Claude Haiku 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.
Mistral Small
Mistral Small is available via Azure AI with a 32K context window and up to 8,191 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.
Meta Llama 3 70B Instruct
Meta Llama 3 70B Instruct is available via Azure AI with a 8K context window and up to 2,048 output tokens. Pricing: $1.10/1M input tokens, $0.3700/1M output tokens.
Deepseek
Deepseek is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.14/1M input tokens, $4.56/1M output tokens.
Deepseek V3 0324
Deepseek V3 0324 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.14/1M input tokens, $4.56/1M output tokens.
MAI DS R1
MAI DS R1 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.
Deepseek R1
Deepseek R1 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.
Llama 4 Maverick 17B 128E Instruct FP8
Llama 4 Maverick 17B 128E Instruct FP8 is available via Azure AI with a 1M context window and up to 16,384 output tokens. Pricing: $1.41/1M input tokens, $0.3500/1M output tokens.
Mistral Large 2407
Mistral Large 2407 is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.
Mistral Large Latest
Mistral Large Latest is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.
Llama 3.2 90B Vision Instruct
Llama 3.2 90B Vision Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $2.04/1M input tokens, $2.04/1M output tokens.
Meta Llama 3.1 70B Instruct
Meta Llama 3.1 70B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $2.68/1M input tokens, $3.54/1M output tokens.
Claude Sonnet 4 5
Claude Sonnet 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.
Claude Sonnet 4 6
Claude Sonnet 4 6 is available via Azure AI with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.
Global/Grok 3
Global/Grok 3 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.
Grok 3
Grok 3 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.
Grok 4
Grok 4 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.
Mistral Large
Mistral Large is available via Azure AI with a 32K context window and up to 8,191 output tokens. Pricing: $4.00/1M input tokens, $12.00/1M output tokens.
Claude Opus 4 5
Claude Opus 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.
Claude Opus 4 6
Claude Opus 4 6 is available via Azure AI with a 200K context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.
Meta Llama 3.1 405B Instruct
Meta Llama 3.1 405B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $5.33/1M input tokens, $16.00/1M output tokens.
Claude Opus 4 1
Claude Opus 4 1 is available via Azure AI with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.
Jais 30b Chat
Jais 30b Chat is available via Azure AI with a 8K context window and up to 8,192 output tokens. Pricing: $3200.00/1M input tokens, $9710.00/1M output tokens.
Compare Azure AI model pricing
Use our pricing calculator to find the cheapest Azure AI model for your workload.