Skip to content
Gradient Ai

Llama3.3 70b Instruct

Llama3.3 70b Instruct is available via Gradient Ai with a 128K context window and up to 2,048 output tokens. Pricing: $0.6500/1M input tokens, $0.6500/1M output tokens.

Llama3.3 70b Instruct Pricing & Specifications

Input Price$0.65 per 1M tokens
Output Price$0.65 per 1M tokens
Context Window128,000 tokens (128K)
Max Output2,048 tokens
ProviderGradient Ai

What is Llama3.3 70b Instruct?

Llama3.3 70b Instruct is a large language model by Gradient Ai with a 128K context window and up to 2,048 output tokens. It costs $0.65 per 1M input tokens and $0.65 per 1M output tokens. Llama3.3 70b Instruct is available via Gradient Ai with a 128K context window and up to 2,048 output tokens. Pricing: $0.6500/1M input tokens, $0.6500/1M output tokens.

Capabilities

text

Llama3.3 70b Instruct Cost Examples

Short prompt (500 tokens)

$0.000325

Medium prompt (2K tokens)

$0.00130

Long output (4K tokens)

$0.00260

Count tokens for Llama3.3 70b Instruct

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Llama3.3 70b Instruct

Gradient Ai

Anthropic Claude 3.5 Haiku

$0.80/1M in 200K ctx

Gradient Ai

Deepseek R1 Distill Llama 70b

$0.99/1M in 33K ctx

Gradient Ai

Mistral Nemo Instruct 2407

$0.30/1M in 128K ctx

Gradient Ai

Llama3 8b Instruct

$0.20/1M in 8K ctx

Frequently Asked Questions

How much does Llama3.3 70b Instruct cost per token? +
Llama3.3 70b Instruct costs $0.65 per 1M input tokens and $0.65 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000975.
What is the context window for Llama3.3 70b Instruct? +
Llama3.3 70b Instruct supports a context window of 128,000 tokens (128K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Llama3.3 70b Instruct? +
Llama3.3 70b Instruct can generate up to 2,048 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Llama3.3 70b Instruct good for coding tasks? +
Llama3.3 70b Instruct can handle basic coding tasks, but there are models specifically optimized for code generation that may perform better on complex programming problems.
Token Counter | Pricing Calculator | Model Comparison | All Gradient Ai Models