
Cohere Models

Cohere focuses on enterprise AI with retrieval-augmented generation (RAG) and search as first-class features. Command R+ is optimized for business workflows that combine generation with structured data retrieval.


7

Models Available

$0.15

Cheapest Input / 1M

256K

Largest Context

What is Cohere?

Cohere is an AI model provider offering 7 large language models for developers. Its cheapest model starts at $0.15 per 1M input tokens, and its largest context window is 256K tokens.

Cohere Strengths

Best-in-class RAG support
Enterprise-focused features
Strong multilingual embeddings
Grounded generation with citations

All Cohere Models

Model                    Input $/1M   Output $/1M   Context   Max Output
Command R                $0.15        $0.60         128K      4,096
Command R 08 2024        $0.15        $0.60         128K      4,096
Command R7b 12 2024      $0.15        $0.0375       128K      4,096
Command Light            $0.30        $0.60         4K        4,096
Command A 03 2025        $2.50        $10.00        256K      8,000
Command R Plus           $2.50        $10.00        128K      4,096
Command R Plus 08 2024   $2.50        $10.00        128K      4,096
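The per-1M prices listed here translate into a request cost as (input tokens × input price + output tokens × output price) / 1,000,000. The following is a minimal sketch of that arithmetic, not an official calculator. The names `ModelPrice`, `MODELS`, `estimate_cost`, and `cheapest_model` are hypothetical, the figures are copied from this page, "K" is assumed to mean 1,000 tokens, and the context window is assumed to cover input plus output tokens.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelPrice:
    name: str
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens
    context_tokens: int        # context window (assumes "K" = 1,000 tokens)
    max_output_tokens: int     # maximum output tokens per response


# Figures copied from this page; the Command R7b output price uses the
# $0.0375 value given in its Model Details entry.
MODELS = [
    ModelPrice("Command R", 0.15, 0.60, 128_000, 4_096),
    ModelPrice("Command R 08 2024", 0.15, 0.60, 128_000, 4_096),
    ModelPrice("Command R7b 12 2024", 0.15, 0.0375, 128_000, 4_096),
    ModelPrice("Command Light", 0.30, 0.60, 4_000, 4_096),
    ModelPrice("Command A 03 2025", 2.50, 10.00, 256_000, 8_000),
    ModelPrice("Command R Plus", 2.50, 10.00, 128_000, 4_096),
    ModelPrice("Command R Plus 08 2024", 2.50, 10.00, 128_000, 4_096),
]


def estimate_cost(model: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    return (
        input_tokens * model.input_per_million
        + output_tokens * model.output_per_million
    ) / 1_000_000


def cheapest_model(input_tokens: int, output_tokens: int) -> Optional[ModelPrice]:
    """Return the lowest-cost model that can hold the request, or None.

    Assumption: input and output tokens must both fit in the context window.
    """
    candidates = [
        m for m in MODELS
        if output_tokens <= m.max_output_tokens
        and input_tokens + output_tokens <= m.context_tokens
    ]
    if not candidates:
        return None
    return min(candidates, key=lambda m: estimate_cost(m, input_tokens, output_tokens))
```

For example, under these listed prices `cheapest_model(10_000, 1_000)` selects Command R7b 12 2024, while a request with 200,000 input tokens only fits Command A 03 2025.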

Model Details

Command R

Command R is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $0.15/1M input tokens, $0.60/1M output tokens.

Input: $0.15/1M; Output: $0.60/1M; Context: 128K
Capabilities: text, function calling

Command R 08 2024

Command R 08 2024 is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $0.15/1M input tokens, $0.60/1M output tokens.

Input: $0.15/1M; Output: $0.60/1M; Context: 128K
Capabilities: text, function calling

Command R7b 12 2024

Command R7b 12 2024 is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $0.15/1M input tokens, $0.0375/1M output tokens.

Input: $0.15/1M; Output: $0.0375/1M; Context: 128K
Capabilities: text, function calling

Command Light

Command Light is available via Cohere with a 4K context window and up to 4,096 output tokens. Pricing: $0.30/1M input tokens, $0.60/1M output tokens.

Input: $0.30/1M; Output: $0.60/1M; Context: 4K
Capabilities: text

Command A 03 2025

Command A 03 2025 is available via Cohere with a 256K context window and up to 8,000 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

Input: $2.50/1M; Output: $10.00/1M; Context: 256K
Capabilities: text, function calling

Command R Plus

Command R Plus is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

Input: $2.50/1M; Output: $10.00/1M; Context: 128K
Capabilities: text, function calling

Command R Plus 08 2024

Command R Plus 08 2024 is available via Cohere with a 128K context window and up to 4,096 output tokens. Pricing: $2.50/1M input tokens, $10.00/1M output tokens.

Input: $2.50/1M; Output: $10.00/1M; Context: 128K
Capabilities: text, function calling

