Llama 3.1 405B Instruct FP8
Llama 3.1 405B Instruct FP8 is available via Lambda with a 131K context window and up to 131,072 output tokens. Pricing: $0.80/1M input tokens, $0.80/1M output tokens.
Llama 3.1 405B Instruct FP8 Pricing & Specifications
What is Llama 3.1 405B Instruct FP8?
Llama 3.1 405B Instruct FP8 is Meta's 405-billion-parameter Llama 3.1 instruction-tuned model, served in FP8 precision via Lambda with a 131K (131,072-token) context window and up to 131,072 output tokens. It costs $0.80 per 1M input tokens and $0.80 per 1M output tokens.
Capabilities
text, function calling
Llama 3.1 405B Instruct FP8 Cost Examples
Short prompt (500 input tokens): $0.0004
Medium prompt (2K input tokens): $0.0016
Long output (4K output tokens): $0.0032
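Per-request cost scales linearly with token counts, so the examples above reduce to one multiplication per side. A minimal sketch of that arithmetic, using the prices listed on this page:

```python
# Cost arithmetic for Llama 3.1 405B Instruct FP8 at the listed
# rate of $0.80 per 1M tokens for both input and output.
INPUT_PRICE_PER_M = 0.80   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.80  # USD per 1M output tokens

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of a single API request."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

# Reproduces the examples above:
print(estimate_cost(500, 0))    # short prompt (500 input tokens)
print(estimate_cost(2_000, 0))  # medium prompt (2K input tokens)
print(estimate_cost(0, 4_000))  # long output (4K output tokens)
```

Because input and output are priced identically here, only the total token count matters; for models with asymmetric pricing the same function applies with different constants.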
Count tokens for Llama 3.1 405B Instruct FP8
Paste your prompt to see exact token counts and API cost estimates.
Open Token Counter
Frequently Asked Questions
How much does Llama 3.1 405B Instruct FP8 cost per token?
Llama 3.1 405B Instruct FP8 costs $0.80 per 1M input tokens and $0.80 per 1M output tokens. For a typical 1,000-token request with a 500-token response, that works out to roughly $0.0012.
What is the context window for Llama 3.1 405B Instruct FP8?
Llama 3.1 405B Instruct FP8 supports a context window of 131,072 tokens (131K). This determines the maximum combined length of your prompt and conversation history in a single API call.
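In practice you also need to leave room in the window for the tokens you expect back. A small sketch of that budget check (the reserved-output default is an illustrative assumption, not part of the API):

```python
# Check whether a request fits the 131,072-token context window
# while reserving headroom for the model's response.
CONTEXT_WINDOW = 131_072  # tokens, per this page

def fits_context(prompt_tokens: int,
                 history_tokens: int,
                 reserved_output: int = 4_096) -> bool:
    """True if prompt + history + reserved output fit in the window."""
    return prompt_tokens + history_tokens + reserved_output <= CONTEXT_WINDOW

# A 100K-token document plus 20K of history still fits:
print(fits_context(100_000, 20_000))
# A 130K-token prompt leaves no room for output:
print(fits_context(130_000, 0))
```

When the check fails, the usual remedies are truncating the oldest conversation turns or summarizing history before resending.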
What is the maximum output length for Llama 3.1 405B Instruct FP8?
Llama 3.1 405B Instruct FP8 can generate up to 131,072 tokens in a single response. If you need longer outputs, you can make multiple API calls and concatenate the results.
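The multi-call pattern mentioned above can be sketched as follows. The `generate` function here is a hypothetical stand-in for whatever client call you use to reach the model; it is not a real Lambda SDK function:

```python
# Sketch: build a longer output from several calls, feeding each
# chunk back in as context for the next call.
def generate(prompt: str, max_tokens: int) -> str:
    # Hypothetical placeholder for a real API call to the model.
    return "chunk "

def generate_long(prompt: str,
                  chunks: int = 3,
                  max_tokens: int = 4_096) -> str:
    """Concatenate the output of several sequential requests."""
    parts: list[str] = []
    for _ in range(chunks):
        # Include everything generated so far so the model continues
        # coherently from where the previous chunk stopped.
        context = prompt + "".join(parts)
        parts.append(generate(context, max_tokens))
    return "".join(parts)
```

Note that each follow-up call re-sends the accumulated output as input, so input-token costs grow with every chunk; the earlier cost formula applies per call.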
Is Llama 3.1 405B Instruct FP8 good for coding tasks?
Yes. Llama 3.1 405B Instruct FP8 supports text generation and function calling, which cover common coding workflows such as code generation, debugging, refactoring, and tool-driven agent tasks.