Ovhcloud Models

Ovhcloud provides 15 AI models accessible via API.

Models Available

$0.040

Cheapest Input / 1M

256K

Largest Context

What is Ovhcloud?

Ovhcloud is an AI model provider offering 15 large language models for developers. Their cheapest model starts at $0.040 per 1M input tokens, and their largest context window reaches 256K. Ovhcloud provides 15 AI models accessible via API.

Ovhcloud Strengths

All Ovhcloud Models

Model	Input $/1M	Output $/1M	Context	Max Output	Released
Gpt Oss 20b	$0.040	$0.15	131K	131,000	—
Qwen3 32B	$0.080	$0.23	32K	32,000	—
Gpt Oss 120b	$0.080	$0.40	131K	131,000	—
Mistral Small 3.2 24B Instruct 2506	$0.090	$0.28	128K	128,000	—
Llama 3.1 8B Instruct	$0.10	$0.10	131K	131,000	—
Mistral 7B Instruct V0.3	$0.10	$0.10	127K	127,000	—
Mistral Nemo Instruct 2407	$0.13	$0.13	118K	118,000	—
Mamba Codestral 7B V0.1	$0.19	$0.19	256K	256,000	—
Llava V1.6 Mistral 7b Hf	$0.29	$0.29	32K	32,000	—
Mixtral 8x7B Instruct V0.1	$0.63	$0.63	32K	32,000	—
DeepSeek R1 Distill Llama 70B	$0.67	$0.67	131K	131,000	—
Meta Llama 3 1 70B Instruct	$0.67	$0.67	131K	131,000	—
Meta Llama 3 3 70B Instruct	$0.67	$0.67	131K	131,000	—
Qwen2.5 Coder 32B Instruct	$0.87	$0.87	32K	32,000	—
Qwen2.5 VL 72B Instruct	$0.91	$0.91	32K	32,000	—

Model Details

Gpt Oss 20b

Gpt Oss 20b is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.0400/1M input tokens, $0.1500/1M output tokens.

Input: $0.040/1M Output: $0.15/1M Context: 131K

text reasoning json mode

Qwen3 32B

Qwen3 32B is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.0800/1M input tokens, $0.2300/1M output tokens.

Input: $0.080/1M Output: $0.23/1M Context: 32K

text function calling reasoning json mode

Gpt Oss 120b

Gpt Oss 120b is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.0800/1M input tokens, $0.4000/1M output tokens.

Input: $0.080/1M Output: $0.40/1M Context: 131K

text reasoning json mode

Mistral Small 3.2 24B Instruct 2506

Mistral Small 3.2 24B Instruct 2506 is available via Ovhcloud with a 128K context window and up to 128,000 output tokens. Pricing: $0.0900/1M input tokens, $0.2800/1M output tokens.

Input: $0.090/1M Output: $0.28/1M Context: 128K

text vision function calling json mode

Llama 3.1 8B Instruct

Llama 3.1 8B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K

text function calling json mode

Mistral 7B Instruct V0.3

Mistral 7B Instruct V0.3 is available via Ovhcloud with a 127K context window and up to 127,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 127K

text function calling json mode

Mistral Nemo Instruct 2407

Mistral Nemo Instruct 2407 is available via Ovhcloud with a 118K context window and up to 118,000 output tokens. Pricing: $0.1300/1M input tokens, $0.1300/1M output tokens.

Input: $0.13/1M Output: $0.13/1M Context: 118K

text function calling json mode

Mamba Codestral 7B V0.1

Mamba Codestral 7B V0.1 is available via Ovhcloud with a 256K context window and up to 256,000 output tokens. Pricing: $0.1900/1M input tokens, $0.1900/1M output tokens.

Input: $0.19/1M Output: $0.19/1M Context: 256K

text json mode

Llava V1.6 Mistral 7b Hf

Llava V1.6 Mistral 7b Hf is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.2900/1M input tokens, $0.2900/1M output tokens.

Input: $0.29/1M Output: $0.29/1M Context: 32K

text vision json mode

Mixtral 8x7B Instruct V0.1

Mixtral 8x7B Instruct V0.1 is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.6300/1M input tokens, $0.6300/1M output tokens.

Input: $0.63/1M Output: $0.63/1M Context: 32K

text json mode

DeepSeek R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

Input: $0.67/1M Output: $0.67/1M Context: 131K

text function calling reasoning json mode

Meta Llama 3 1 70B Instruct

Meta Llama 3 1 70B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

Input: $0.67/1M Output: $0.67/1M Context: 131K

text

Meta Llama 3 3 70B Instruct

Meta Llama 3 3 70B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

Input: $0.67/1M Output: $0.67/1M Context: 131K

text function calling json mode

Qwen2.5 Coder 32B Instruct

Qwen2.5 Coder 32B Instruct is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.8700/1M input tokens, $0.8700/1M output tokens.

Input: $0.87/1M Output: $0.87/1M Context: 32K

text json mode

Qwen2.5 VL 72B Instruct

Qwen2.5 VL 72B Instruct is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.9100/1M input tokens, $0.9100/1M output tokens.

Input: $0.91/1M Output: $0.91/1M Context: 32K

text vision json mode

Compare Ovhcloud model pricing

Use our pricing calculator to find the cheapest Ovhcloud model for your workload.

Pricing Calculator Compare Models All Models Directory

Ovhcloud Models

What is Ovhcloud?

Ovhcloud Strengths

All Ovhcloud Models

Model Details

Gpt Oss 20b

Qwen3 32B

Gpt Oss 120b

Mistral Small 3.2 24B Instruct 2506

Llama 3.1 8B Instruct

Mistral 7B Instruct V0.3

Mistral Nemo Instruct 2407

Mamba Codestral 7B V0.1

Llava V1.6 Mistral 7b Hf

Mixtral 8x7B Instruct V0.1

DeepSeek R1 Distill Llama 70B

Meta Llama 3 1 70B Instruct

Meta Llama 3 3 70B Instruct

Qwen2.5 Coder 32B Instruct

Qwen2.5 VL 72B Instruct

Compare Ovhcloud model pricing

Related Reading