Skip to content
AWS Bedrock

Nvidia.Nemotron Super 3 120b

Nvidia.Nemotron Super 3 120b is available via AWS Bedrock with a 256K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $0.6500/1M output tokens.

Nvidia.Nemotron Super 3 120b Pricing & Specifications

Input Price$0.15 per 1M tokens
Output Price$0.65 per 1M tokens
Context Window256,000 tokens (256K)
Max Output32,768 tokens
ProviderAWS Bedrock

What is Nvidia.Nemotron Super 3 120b?

Nvidia.Nemotron Super 3 120b is a large language model by AWS Bedrock with a 256K context window and up to 32,768 output tokens. It costs $0.15 per 1M input tokens and $0.65 per 1M output tokens. Nvidia.Nemotron Super 3 120b is available via AWS Bedrock with a 256K context window and up to 32,768 output tokens. Pricing: $0.1500/1M input tokens, $0.6500/1M output tokens.

Capabilities

text function calling reasoning

Nvidia.Nemotron Super 3 120b Cost Examples

Short prompt (500 tokens)

$0.000075

Medium prompt (2K tokens)

$0.00030

Long output (4K tokens)

$0.00260

Count tokens for Nvidia.Nemotron Super 3 120b

Paste your prompt to see exact token counts and API cost estimates.

Open Token Counter

Similar Models to Nvidia.Nemotron Super 3 120b

AWS Bedrock

Us East 1/Mistral.Mistral 7b Instruct

$0.15/1M in 32K ctx

AWS Bedrock

Us West 2/Mistral.Mistral 7b Instruct

$0.15/1M in 32K ctx

AWS Bedrock

Meta.Llama3 2 3b Instruct

$0.15/1M in 128K ctx

AWS Bedrock

Mistral.Ministral 3 8b Instruct

$0.15/1M in 128K ctx

Frequently Asked Questions

How much does Nvidia.Nemotron Super 3 120b cost per token? +
Nvidia.Nemotron Super 3 120b costs $0.15 per 1M input tokens and $0.65 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.000475.
What is the context window for Nvidia.Nemotron Super 3 120b? +
Nvidia.Nemotron Super 3 120b supports a context window of 256,000 tokens (256K). This determines the maximum combined length of your prompt and conversation history in a single API call.
What is the maximum output length for Nvidia.Nemotron Super 3 120b? +
Nvidia.Nemotron Super 3 120b can generate up to 32,768 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
Is Nvidia.Nemotron Super 3 120b good for coding tasks? +
Yes, Nvidia.Nemotron Super 3 120b supports capabilities well-suited for coding tasks including code generation, debugging, and refactoring.
Token Counter | Pricing Calculator | Model Comparison | All AWS Bedrock Models