Skip to content

Anyscale Models

Anyscale provides 12 AI models accessible via API.

Visit Anyscale →

12

Models Available

$0.15

Cheapest Input / 1M

66K

Largest Context

What is Anyscale?

Anyscale is an AI model provider offering 12 large language models for developers. Their cheapest model starts at $0.15 per 1M input tokens, and their largest context window reaches 66K. Anyscale provides 12 AI models accessible via API.

Anyscale Strengths

All Anyscale Models

Model Input $/1M Output $/1M Context Max Output Released
HuggingFaceH4/Zephyr 7b Beta $0.15 $0.15 16K 16,384
Google/Gemma 7b It $0.15 $0.15 8K 8,192
Meta Llama/Llama 2 7b Chat Hf $0.15 $0.15 4K 4,096
Meta Llama/Meta Llama 3 8B Instruct $0.15 $0.15 8K 8,192
Mistralai/Mistral 7B Instruct V0.1 $0.15 $0.15 16K 16,384
Mistralai/Mixtral 8x7B Instruct V0.1 $0.15 $0.15 16K 16,384
Meta Llama/Llama 2 13b Chat Hf $0.25 $0.25 4K 4,096
Mistralai/Mixtral 8x22B Instruct V0.1 $0.90 $0.90 66K 65,536
Codellama/CodeLlama 34b Instruct Hf $1.00 $1.00 4K 4,096
Codellama/CodeLlama 70b Instruct Hf $1.00 $1.00 4K 4,096
Meta Llama/Llama 2 70b Chat Hf $1.00 $1.00 4K 4,096
Meta Llama/Meta Llama 3 70B Instruct $1.00 $1.00 8K 8,192

Model Details

HuggingFaceH4/Zephyr 7b Beta

HuggingFaceH4/Zephyr 7b Beta is available via Anyscale with a 16K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 16K
text

Google/Gemma 7b It

Google/Gemma 7b It is available via Anyscale with a 8K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 8K
text

Meta Llama/Llama 2 7b Chat Hf

Meta Llama/Llama 2 7b Chat Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 4K
text

Meta Llama/Meta Llama 3 8B Instruct

Meta Llama/Meta Llama 3 8B Instruct is available via Anyscale with a 8K context window and up to 8,192 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 8K
text

Mistralai/Mistral 7B Instruct V0.1

Mistralai/Mistral 7B Instruct V0.1 is available via Anyscale with a 16K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 16K
text function calling

Mistralai/Mixtral 8x7B Instruct V0.1

Mistralai/Mixtral 8x7B Instruct V0.1 is available via Anyscale with a 16K context window and up to 16,384 output tokens. Pricing: $0.1500/1M input tokens, $0.1500/1M output tokens.

Input: $0.15/1M Output: $0.15/1M Context: 16K
text function calling

Meta Llama/Llama 2 13b Chat Hf

Meta Llama/Llama 2 13b Chat Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $0.2500/1M input tokens, $0.2500/1M output tokens.

Input: $0.25/1M Output: $0.25/1M Context: 4K
text

Mistralai/Mixtral 8x22B Instruct V0.1

Mistralai/Mixtral 8x22B Instruct V0.1 is available via Anyscale with a 66K context window and up to 65,536 output tokens. Pricing: $0.9000/1M input tokens, $0.9000/1M output tokens.

Input: $0.90/1M Output: $0.90/1M Context: 66K
text function calling

Codellama/CodeLlama 34b Instruct Hf

Codellama/CodeLlama 34b Instruct Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

Input: $1.00/1M Output: $1.00/1M Context: 4K
text

Codellama/CodeLlama 70b Instruct Hf

Codellama/CodeLlama 70b Instruct Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

Input: $1.00/1M Output: $1.00/1M Context: 4K
text

Meta Llama/Llama 2 70b Chat Hf

Meta Llama/Llama 2 70b Chat Hf is available via Anyscale with a 4K context window and up to 4,096 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

Input: $1.00/1M Output: $1.00/1M Context: 4K
text

Meta Llama/Meta Llama 3 70B Instruct

Meta Llama/Meta Llama 3 70B Instruct is available via Anyscale with a 8K context window and up to 8,192 output tokens. Pricing: $1.00/1M input tokens, $1.00/1M output tokens.

Input: $1.00/1M Output: $1.00/1M Context: 8K
text

Compare Anyscale model pricing

Use our pricing calculator to find the cheapest Anyscale model for your workload.

Pricing Calculator Compare Models All Models Directory

Related Reading

OpenAI vs Anthropic vs Google: Which AI API Should You Choose? → Cheapest LLM API in 2026: Complete Pricing Comparison → OpenAI API Pricing Guide 2026 →