Skip to content

Azure AI Models

Azure AI provides 58 AI models accessible via API.

Visit Azure AI →

58

Models Available

$0.040

Cheapest Input / 1M

10M

Largest Context

What is Azure AI?

Azure AI is an AI model provider offering 58 large language models for developers. Their cheapest model starts at $0.040 per 1M input tokens, and their largest context window reaches 10M. Azure AI provides 58 AI models accessible via API.

Azure AI Strengths

All Azure AI Models

Model Input $/1M Output $/1M Context Max Output Released
Ministral 3b $0.040 $0.040 128K 4,096
Phi 4 Mini Instruct $0.075 $0.30 131K 4,096
Phi 4 Multimodal Instruct $0.080 $0.32 131K 4,096
Phi 4 Mini Reasoning $0.080 $0.32 131K 4,096
Mistral Small 2503 $0.10 $0.30 128K 128,000
Phi 4 $0.13 $0.50 16K 16,384
Phi 4 Reasoning $0.13 $0.50 33K 4,096
Phi 3 Mini 128k Instruct $0.13 $0.52 128K 4,096
Phi 3 Mini 4k Instruct $0.13 $0.52 4K 4,096
Phi 3.5 Mini Instruct $0.13 $0.52 128K 4,096
Phi 3.5 Vision Instruct $0.13 $0.52 128K 4,096
Gpt Oss 120b $0.15 $0.60 131K 131,072
Phi 3 Small 128k Instruct $0.15 $0.60 128K 4,096
Phi 3 Small 8k Instruct $0.15 $0.60 8K 4,096
Mistral Nemo $0.15 $0.15 131K 4,096
Phi 3.5 MoE Instruct $0.16 $0.64 128K 4,096
Phi 3 Medium 128k Instruct $0.17 $0.68 128K 4,096
Phi 3 Medium 4k Instruct $0.17 $0.68 4K 4,096
Llama 4 Scout 17B 16E Instruct $0.20 $0.78 10M 16,384
Grok 4 Fast Non Reasoning $0.20 $0.50 131K 131,072
Grok 4 Fast Reasoning $0.20 $0.50 131K 131,072
Grok 4 1 Fast Non Reasoning $0.20 $0.50 131K 131,072
Grok 4 1 Fast Reasoning $0.20 $0.50 131K 131,072
Grok Code Fast 1 $0.20 $1.50 131K 131,072
Global/Grok 3 Mini $0.25 $1.27 131K 131,072
Grok 3 Mini $0.25 $1.27 131K 131,072
Meta Llama 3.1 8B Instruct $0.30 $0.61 128K 2,048
Llama 3.2 11B Vision Instruct $0.37 $0.37 128K 2,048
Mistral Medium 2505 $0.40 $2.00 131K 8,191
Jamba Instruct $0.50 $0.70 70K 4,096
Mistral Large 3 $0.50 $1.50 256K 8,191
Deepseek V3.2 $0.58 $1.68 164K 163,840
Deepseek V3.2 Speciale $0.58 $1.68 164K 163,840
Kimi K2.5 $0.60 $3.00 262K 262,144
Llama 3.3 70B Instruct $0.71 $0.71 128K 2,048
Claude Haiku 4 5 $1.00 $5.00 200K 64,000
Mistral Small $1.00 $3.00 32K 8,191
Meta Llama 3 70B Instruct $1.10 $0.37 8K 2,048
Deepseek $1.14 $4.56 128K 8,192
Deepseek V3 0324 $1.14 $4.56 128K 8,192
MAI DS R1 $1.35 $5.40 128K 8,192
Deepseek R1 $1.35 $5.40 128K 8,192
Llama 4 Maverick 17B 128E Instruct FP8 $1.41 $0.35 1M 16,384
Mistral Large 2407 $2.00 $6.00 128K 4,096
Mistral Large Latest $2.00 $6.00 128K 4,096
Llama 3.2 90B Vision Instruct $2.04 $2.04 128K 2,048
Meta Llama 3.1 70B Instruct $2.68 $3.54 128K 2,048
Claude Sonnet 4 5 $3.00 $15.00 200K 64,000
Claude Sonnet 4 6 $3.00 $15.00 1M 64,000
Global/Grok 3 $3.00 $15.00 131K 131,072
Grok 3 $3.00 $15.00 131K 131,072
Grok 4 $3.00 $15.00 131K 131,072
Mistral Large $4.00 $12.00 32K 8,191
Claude Opus 4 5 $5.00 $25.00 200K 64,000
Claude Opus 4 6 $5.00 $25.00 200K 128,000
Meta Llama 3.1 405B Instruct $5.33 $16.00 128K 2,048
Claude Opus 4 1 $15.00 $75.00 200K 32,000
Jais 30b Chat $3200.00 $9710.00 8K 8,192

Model Details

Ministral 3b

Ministral 3b is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.0400/1M input tokens, $0.0400/1M output tokens.

Input: $0.040/1M Output: $0.040/1M Context: 128K
text function calling

Phi 4 Mini Instruct

Phi 4 Mini Instruct is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0750/1M input tokens, $0.3000/1M output tokens.

Input: $0.075/1M Output: $0.30/1M Context: 131K
text function calling

Phi 4 Multimodal Instruct

Phi 4 Multimodal Instruct is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.3200/1M output tokens.

Input: $0.080/1M Output: $0.32/1M Context: 131K
text vision function calling audio

Phi 4 Mini Reasoning

Phi 4 Mini Reasoning is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.0800/1M input tokens, $0.3200/1M output tokens.

Input: $0.080/1M Output: $0.32/1M Context: 131K
text function calling

Mistral Small 2503

Mistral Small 2503 is available via Azure AI with a 128K context window and up to 128,000 output tokens. Pricing: $0.1000/1M input tokens, $0.3000/1M output tokens.

Input: $0.10/1M Output: $0.30/1M Context: 128K
text vision function calling

Phi 4

Phi 4 is available via Azure AI with a 16K context window and up to 16,384 output tokens. Pricing: $0.1250/1M input tokens, $0.5000/1M output tokens.

Input: $0.13/1M Output: $0.50/1M Context: 16K
text function calling

Phi 4 Reasoning

Phi 4 Reasoning is available via Azure AI with a 33K context window and up to 4,096 output tokens. Pricing: $0.1250/1M input tokens, $0.5000/1M output tokens.

Input: $0.13/1M Output: $0.50/1M Context: 33K
text function calling reasoning

Phi 3 Mini 128k Instruct

Phi 3 Mini 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

Input: $0.13/1M Output: $0.52/1M Context: 128K
text

Phi 3 Mini 4k Instruct

Phi 3 Mini 4k Instruct is available via Azure AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

Input: $0.13/1M Output: $0.52/1M Context: 4K
text

Phi 3.5 Mini Instruct

Phi 3.5 Mini Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

Input: $0.13/1M Output: $0.52/1M Context: 128K
text

Phi 3.5 Vision Instruct

Phi 3.5 Vision Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1300/1M input tokens, $0.5200/1M output tokens.

Input: $0.13/1M Output: $0.52/1M Context: 128K
text vision

Gpt Oss 120b

Gpt Oss 120b is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 131K
text function calling json mode

Phi 3 Small 128k Instruct

Phi 3 Small 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 128K
text

Phi 3 Small 8k Instruct

Phi 3 Small 8k Instruct is available via Azure AI with a 8K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.6000/1M output tokens.

Input: $0.15/1M Output: $0.60/1M Context: 8K
text

Mistral Nemo

Mistral Nemo is available via Azure AI with a 131K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 131K
text function calling

Phi 3.5 MoE Instruct

Phi 3.5 MoE Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1600/1M input tokens, $0.6400/1M output tokens.

Input: $0.16/1M Output: $0.64/1M Context: 128K
text

Phi 3 Medium 128k Instruct

Phi 3 Medium 128k Instruct is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6800/1M output tokens.

Input: $0.17/1M Output: $0.68/1M Context: 128K
text

Phi 3 Medium 4k Instruct

Phi 3 Medium 4k Instruct is available via Azure AI with a 4K context window and up to 4,096 output tokens. Pricing: $0.1700/1M input tokens, $0.6800/1M output tokens.

Input: $0.17/1M Output: $0.68/1M Context: 4K
text

Llama 4 Scout 17B 16E Instruct

Llama 4 Scout 17B 16E Instruct is available via Azure AI with a 10M context window and up to 16,384 output tokens. Pricing: $0.2000/1M input tokens, $0.7800/1M output tokens.

Input: $0.20/1M Output: $0.78/1M Context: 10M
text vision function calling

Grok 4 Fast Non Reasoning

Grok 4 Fast Non Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

Input: $0.20/1M Output: $0.50/1M Context: 131K
text function calling web search json mode

Grok 4 Fast Reasoning

Grok 4 Fast Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

Input: $0.20/1M Output: $0.50/1M Context: 131K
text function calling web search json mode

Grok 4 1 Fast Non Reasoning

Grok 4 1 Fast Non Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

Input: $0.20/1M Output: $0.50/1M Context: 131K
text function calling web search json mode

Grok 4 1 Fast Reasoning

Grok 4 1 Fast Reasoning is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $0.5000/1M output tokens.

Input: $0.20/1M Output: $0.50/1M Context: 131K
text function calling reasoning web search json mode

Grok Code Fast 1

Grok Code Fast 1 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2000/1M input tokens, $1.50/1M output tokens.

Input: $0.20/1M Output: $1.50/1M Context: 131K
text function calling web search json mode

Global/Grok 3 Mini

Global/Grok 3 Mini is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2500/1M input tokens, $1.27/1M output tokens.

Input: $0.25/1M Output: $1.27/1M Context: 131K
text function calling reasoning web search

Grok 3 Mini

Grok 3 Mini is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $0.2500/1M input tokens, $1.27/1M output tokens.

Input: $0.25/1M Output: $1.27/1M Context: 131K
text function calling reasoning web search

Meta Llama 3.1 8B Instruct

Meta Llama 3.1 8B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.3000/1M input tokens, $0.6100/1M output tokens.

Input: $0.30/1M Output: $0.61/1M Context: 128K
text

Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.3700/1M input tokens, $0.3700/1M output tokens.

Input: $0.37/1M Output: $0.37/1M Context: 128K
text vision function calling

Mistral Medium 2505

Mistral Medium 2505 is available via Azure AI with a 131K context window and up to 8,191 output tokens. Pricing: $0.4000/1M input tokens, $2.00/1M output tokens.

Input: $0.40/1M Output: $2.00/1M Context: 131K
text function calling

Jamba Instruct

Jamba Instruct is available via Azure AI with a 70K context window and up to 4,096 output tokens. Pricing: $0.5000/1M input tokens, $0.7000/1M output tokens.

Input: $0.50/1M Output: $0.70/1M Context: 70K
text

Mistral Large 3

Mistral Large 3 is available via Azure AI with a 256K context window and up to 8,191 output tokens. Pricing: $0.5000/1M input tokens, $1.50/1M output tokens.

Input: $0.50/1M Output: $1.50/1M Context: 256K
text vision function calling

Deepseek V3.2

Deepseek V3.2 is available via Azure AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5800/1M input tokens, $1.68/1M output tokens.

Input: $0.58/1M Output: $1.68/1M Context: 164K
text function calling reasoning

Deepseek V3.2 Speciale

Deepseek V3.2 Speciale is available via Azure AI with a 164K context window and up to 163,840 output tokens. Pricing: $0.5800/1M input tokens, $1.68/1M output tokens.

Input: $0.58/1M Output: $1.68/1M Context: 164K
text function calling reasoning

Kimi K2.5

Kimi K2.5 is available via Azure AI with a 262K context window and up to 262,144 output tokens. Pricing: $0.6000/1M input tokens, $3.00/1M output tokens.

Input: $0.60/1M Output: $3.00/1M Context: 262K
text vision function calling

Llama 3.3 70B Instruct

Llama 3.3 70B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $0.7100/1M input tokens, $0.7100/1M output tokens.

Input: $0.71/1M Output: $0.71/1M Context: 128K
text function calling

Claude Haiku 4 5

Claude Haiku 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $1.00/1M input tokens, $5.00/1M output tokens.

Input: $1.00/1M Output: $5.00/1M Context: 200K
text vision function calling reasoning pdf computer use json mode

Mistral Small

Mistral Small is available via Azure AI with a 32K context window and up to 8,191 output tokens. Pricing: $1.00/1M input tokens, $3.00/1M output tokens.

Input: $1.00/1M Output: $3.00/1M Context: 32K
text function calling

Meta Llama 3 70B Instruct

Meta Llama 3 70B Instruct is available via Azure AI with a 8K context window and up to 2,048 output tokens. Pricing: $1.10/1M input tokens, $0.3700/1M output tokens.

Input: $1.10/1M Output: $0.37/1M Context: 8K
text

Deepseek

Deepseek is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.14/1M input tokens, $4.56/1M output tokens.

Input: $1.14/1M Output: $4.56/1M Context: 128K
text

Deepseek V3 0324

Deepseek V3 0324 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.14/1M input tokens, $4.56/1M output tokens.

Input: $1.14/1M Output: $4.56/1M Context: 128K
text function calling

MAI DS R1

MAI DS R1 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.

Input: $1.35/1M Output: $5.40/1M Context: 128K
text reasoning

Deepseek R1

Deepseek R1 is available via Azure AI with a 128K context window and up to 8,192 output tokens. Pricing: $1.35/1M input tokens, $5.40/1M output tokens.

Input: $1.35/1M Output: $5.40/1M Context: 128K
text reasoning

Llama 4 Maverick 17B 128E Instruct FP8

Llama 4 Maverick 17B 128E Instruct FP8 is available via Azure AI with a 1M context window and up to 16,384 output tokens. Pricing: $1.41/1M input tokens, $0.3500/1M output tokens.

Input: $1.41/1M Output: $0.35/1M Context: 1M
text vision function calling

Mistral Large 2407

Mistral Large 2407 is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

Input: $2.00/1M Output: $6.00/1M Context: 128K
text function calling

Mistral Large Latest

Mistral Large Latest is available via Azure AI with a 128K context window and up to 4,096 output tokens. Pricing: $2.00/1M input tokens, $6.00/1M output tokens.

Input: $2.00/1M Output: $6.00/1M Context: 128K
text function calling

Llama 3.2 90B Vision Instruct

Llama 3.2 90B Vision Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $2.04/1M input tokens, $2.04/1M output tokens.

Input: $2.04/1M Output: $2.04/1M Context: 128K
text vision function calling

Meta Llama 3.1 70B Instruct

Meta Llama 3.1 70B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $2.68/1M input tokens, $3.54/1M output tokens.

Input: $2.68/1M Output: $3.54/1M Context: 128K
text

Claude Sonnet 4 5

Claude Sonnet 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

Input: $3.00/1M Output: $15.00/1M Context: 200K
text vision function calling reasoning pdf computer use json mode

Claude Sonnet 4 6

Claude Sonnet 4 6 is available via Azure AI with a 1M context window and up to 64,000 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

Input: $3.00/1M Output: $15.00/1M Context: 1M
text vision function calling reasoning pdf computer use json mode

Global/Grok 3

Global/Grok 3 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

Input: $3.00/1M Output: $15.00/1M Context: 131K
text function calling web search

Grok 3

Grok 3 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

Input: $3.00/1M Output: $15.00/1M Context: 131K
text function calling web search

Grok 4

Grok 4 is available via Azure AI with a 131K context window and up to 131,072 output tokens. Pricing: $3.00/1M input tokens, $15.00/1M output tokens.

Input: $3.00/1M Output: $15.00/1M Context: 131K
text function calling web search json mode

Mistral Large

Mistral Large is available via Azure AI with a 32K context window and up to 8,191 output tokens. Pricing: $4.00/1M input tokens, $12.00/1M output tokens.

Input: $4.00/1M Output: $12.00/1M Context: 32K
text function calling

Claude Opus 4 5

Claude Opus 4 5 is available via Azure AI with a 200K context window and up to 64,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

Input: $5.00/1M Output: $25.00/1M Context: 200K
text vision function calling reasoning pdf computer use json mode

Claude Opus 4 6

Claude Opus 4 6 is available via Azure AI with a 200K context window and up to 128,000 output tokens. Pricing: $5.00/1M input tokens, $25.00/1M output tokens.

Input: $5.00/1M Output: $25.00/1M Context: 200K
text vision function calling reasoning pdf computer use json mode

Meta Llama 3.1 405B Instruct

Meta Llama 3.1 405B Instruct is available via Azure AI with a 128K context window and up to 2,048 output tokens. Pricing: $5.33/1M input tokens, $16.00/1M output tokens.

Input: $5.33/1M Output: $16.00/1M Context: 128K
text

Claude Opus 4 1

Claude Opus 4 1 is available via Azure AI with a 200K context window and up to 32,000 output tokens. Pricing: $15.00/1M input tokens, $75.00/1M output tokens.

Input: $15.00/1M Output: $75.00/1M Context: 200K
text vision function calling reasoning pdf computer use json mode

Jais 30b Chat

Jais 30b Chat is available via Azure AI with a 8K context window and up to 8,192 output tokens. Pricing: $3200.00/1M input tokens, $9710.00/1M output tokens.

Input: $3200.00/1M Output: $9710.00/1M Context: 8K
text

Compare Azure AI model pricing

Use our pricing calculator to find the cheapest Azure AI model for your workload.

Pricing Calculator Compare Models All Models Directory

Related Reading

OpenAI vs Anthropic vs Google: Which AI API Should You Choose? → Cheapest LLM API in 2026: Complete Pricing Comparison → OpenAI API Pricing Guide 2026 →