Skip to content

Ovhcloud Models

Ovhcloud provides 15 AI models accessible via API.

Visit Ovhcloud →

15

Models Available

$0.040

Cheapest Input / 1M

256K

Largest Context

What is Ovhcloud?

Ovhcloud is an AI model provider offering 15 large language models for developers. Their cheapest model starts at $0.040 per 1M input tokens, and their largest context window reaches 256K. Ovhcloud provides 15 AI models accessible via API.

Ovhcloud Strengths

All Ovhcloud Models

Model Input $/1M Output $/1M Context Max Output Released
Gpt Oss 20b $0.040 $0.15 131K 131,000
Qwen3 32B $0.080 $0.23 32K 32,000
Gpt Oss 120b $0.080 $0.40 131K 131,000
Mistral Small 3.2 24B Instruct 2506 $0.090 $0.28 128K 128,000
Llama 3.1 8B Instruct $0.10 $0.10 131K 131,000
Mistral 7B Instruct V0.3 $0.10 $0.10 127K 127,000
Mistral Nemo Instruct 2407 $0.13 $0.13 118K 118,000
Mamba Codestral 7B V0.1 $0.19 $0.19 256K 256,000
Llava V1.6 Mistral 7b Hf $0.29 $0.29 32K 32,000
Mixtral 8x7B Instruct V0.1 $0.63 $0.63 32K 32,000
DeepSeek R1 Distill Llama 70B $0.67 $0.67 131K 131,000
Meta Llama 3 1 70B Instruct $0.67 $0.67 131K 131,000
Meta Llama 3 3 70B Instruct $0.67 $0.67 131K 131,000
Qwen2.5 Coder 32B Instruct $0.87 $0.87 32K 32,000
Qwen2.5 VL 72B Instruct $0.91 $0.91 32K 32,000

Model Details

Gpt Oss 20b

Gpt Oss 20b is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.0400/1M input tokens, $0.1500/1M output tokens.

Input: $0.040/1M Output: $0.15/1M Context: 131K
text reasoning json mode

Qwen3 32B

Qwen3 32B is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.0800/1M input tokens, $0.2300/1M output tokens.

Input: $0.080/1M Output: $0.23/1M Context: 32K
text function calling reasoning json mode

Gpt Oss 120b

Gpt Oss 120b is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.0800/1M input tokens, $0.4000/1M output tokens.

Input: $0.080/1M Output: $0.40/1M Context: 131K
text reasoning json mode

Mistral Small 3.2 24B Instruct 2506

Mistral Small 3.2 24B Instruct 2506 is available via Ovhcloud with a 128K context window and up to 128,000 output tokens. Pricing: $0.0900/1M input tokens, $0.2800/1M output tokens.

Input: $0.090/1M Output: $0.28/1M Context: 128K
text vision function calling json mode

Llama 3.1 8B Instruct

Llama 3.1 8B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 131K
text function calling json mode

Mistral 7B Instruct V0.3

Mistral 7B Instruct V0.3 is available via Ovhcloud with a 127K context window and up to 127,000 output tokens. Pricing: $0.1000/1M input tokens, $0.1000/1M output tokens.

Input: $0.10/1M Output: $0.10/1M Context: 127K
text function calling json mode

Mistral Nemo Instruct 2407

Mistral Nemo Instruct 2407 is available via Ovhcloud with a 118K context window and up to 118,000 output tokens. Pricing: $0.1300/1M input tokens, $0.1300/1M output tokens.

Input: $0.13/1M Output: $0.13/1M Context: 118K
text function calling json mode

Mamba Codestral 7B V0.1

Mamba Codestral 7B V0.1 is available via Ovhcloud with a 256K context window and up to 256,000 output tokens. Pricing: $0.1900/1M input tokens, $0.1900/1M output tokens.

Input: $0.19/1M Output: $0.19/1M Context: 256K
text json mode

Llava V1.6 Mistral 7b Hf

Llava V1.6 Mistral 7b Hf is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.2900/1M input tokens, $0.2900/1M output tokens.

Input: $0.29/1M Output: $0.29/1M Context: 32K
text vision json mode

Mixtral 8x7B Instruct V0.1

Mixtral 8x7B Instruct V0.1 is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.6300/1M input tokens, $0.6300/1M output tokens.

Input: $0.63/1M Output: $0.63/1M Context: 32K
text json mode

DeepSeek R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

Input: $0.67/1M Output: $0.67/1M Context: 131K
text function calling reasoning json mode

Meta Llama 3 1 70B Instruct

Meta Llama 3 1 70B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

Input: $0.67/1M Output: $0.67/1M Context: 131K
text

Meta Llama 3 3 70B Instruct

Meta Llama 3 3 70B Instruct is available via Ovhcloud with a 131K context window and up to 131,000 output tokens. Pricing: $0.6700/1M input tokens, $0.6700/1M output tokens.

Input: $0.67/1M Output: $0.67/1M Context: 131K
text function calling json mode

Qwen2.5 Coder 32B Instruct

Qwen2.5 Coder 32B Instruct is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.8700/1M input tokens, $0.8700/1M output tokens.

Input: $0.87/1M Output: $0.87/1M Context: 32K
text json mode

Qwen2.5 VL 72B Instruct

Qwen2.5 VL 72B Instruct is available via Ovhcloud with a 32K context window and up to 32,000 output tokens. Pricing: $0.9100/1M input tokens, $0.9100/1M output tokens.

Input: $0.91/1M Output: $0.91/1M Context: 32K
text vision json mode

Compare Ovhcloud model pricing

Use our pricing calculator to find the cheapest Ovhcloud model for your workload.

Pricing Calculator Compare Models All Models Directory

Related Reading

OpenAI vs Anthropic vs Google: Which AI API Should You Choose? → Cheapest LLM API in 2026: Complete Pricing Comparison → OpenAI API Pricing Guide 2026 →