Pricing comparison

Anthropic vs Meta Llama Pricing

Per-million-token pricing for Anthropic and Meta Llama, with side-by-side flagship models, cheapest tiers, and context windows. Pricing data syncs weekly from a continuously updated model catalog and was last updated July 6, 2026.

As of July 6, 2026, Meta Llama offers the lowest output-token price: $0.02/1M for Llama 3.2 3B Instruct.

Who wins on what

Cheapest input tokens

$0.02/1M

Meta Llama

Llama 3.1 8B Instruct — $0.02/1M input

Cheapest output tokens

$0.02/1M

Meta Llama

Llama 3.2 3B Instruct — $0.02/1M output

Longest context window

1.0M

Meta Llama

Llama 4 Maverick — 1.0M input tokens

Lowest average output cost

$0.67/1M

Meta Llama

Provider-wide average across 22 models

Largest model catalog

35 models

Anthropic

More options to match cost vs capability

Most reasoning models

3 models

Anthropic

Models with dedicated reasoning / thinking support

Most vision models

3 models

Anthropic

Models that accept image input

Side-by-side

35 models

Anthropic

Cheapest input

$0.250

Claude 3 Haiku

Cheapest output

$1.25

Claude 3 Haiku

Longest context

1.0M

Claude 4 Sonnet (2025-05-14)

Avg output / 1M

$26.14

Across catalog

Cheapest cached input

$0.200

Claude Sonnet 5

Model	In/1M	Out/1M	Ctx
Claude Sonnet 5 (Vision, Reasoning, Tools, Cache)	$2.00	$10.00	1.0M
Claude Fable 5 (Vision, Reasoning, Tools, Cache)	$10.00	$50.00	1.0M
Claude Opus 4.8 (Vision, Reasoning, Tools, Cache)	$5.00	$25.00	1.0M
Claude Opus 4.7	$5.00	$25.00	200K
Claude Opus 4.6	$5.00	$25.00	200K
Claude 3 Haiku	$0.250	$1.25	200K

22 models

Meta Llama

Cheapest input

$0.020

Llama 3.1 8B Instruct

Cheapest output

$0.020

Llama 3.2 3B Instruct

Longest context

1.0M

Llama 4 Maverick

Avg output / 1M

$0.674

Across catalog

Model	In/1M	Out/1M	Ctx
Llama 3.1 405B (base)	$4.00	$4.00	33K
Llama 3.1 405B Instruct	$3.50	$3.50	131K
Llama 4 Maverick	$0.150	$0.600	1.0M
Llama 3 70B Instruct	$0.300	$0.400	8K
Llama 3.1 70B Instruct	$0.400	$0.400	131K
Llama 3.2 3B Instruct	$0.020	$0.020	131K

All prices in USD per 1 million tokens. Showing top 6 models per provider, sorted by output cost.
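Because every figure above is quoted per 1 million tokens, the cost of a single request is the token count times the price, divided by one million. The sketch below illustrates that arithmetic with a few prices copied from the tables. The `PRICES` dictionary and the `request_cost` helper are hypothetical names introduced for this example, and the 2,000-input / 500-output request size is an assumed workload, not a figure from the catalog.

```python
# Prices in USD per 1M tokens, copied from the tables above.
# The dictionary layout and function name are illustrative assumptions.
PRICES = {
    "Claude 3 Haiku": {"input": 0.25, "output": 1.25},
    "Claude Sonnet 5": {"input": 2.00, "output": 10.00},
    "Llama 4 Maverick": {"input": 0.15, "output": 0.60},
    "Llama 3.2 3B Instruct": {"input": 0.02, "output": 0.02},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at per-million-token prices."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed request size: 2,000 input tokens and 500 output tokens.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 2_000, 500):.6f}")
```

For the assumed request, this works out to $0.001125 on Claude 3 Haiku and $0.000600 on Llama 4 Maverick.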

Run the numbers for your workload

Calcaas multiplies per-token costs by your real usage patterns — inputs, outputs, retries, and conversation history — across both providers in one model.
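As a rough illustration of how usage patterns change the bill, the sketch below estimates the cost of one multi-turn conversation. It is not Calcaas's actual model: the `conversation_cost` function, its parameters, and its two simplifying assumptions (every turn resends the full prior history as input, and a fraction `retry_rate` of the spend is repeated once) are hypothetical and chosen only to show why history and retries matter.

```python
def conversation_cost(
    in_price: float,
    out_price: float,
    turns: int,
    user_tokens: int,
    reply_tokens: int,
    retry_rate: float = 0.0,
) -> float:
    """Estimate the USD cost of one multi-turn conversation.

    Prices are USD per 1M tokens. Assumptions (illustrative only):
    each turn resends all earlier messages as input, and retries
    add retry_rate times the base cost.
    """
    total_in = 0
    total_out = 0
    history = 0  # tokens accumulated from earlier turns
    for _ in range(turns):
        total_in += history + user_tokens   # history is billed again as input
        total_out += reply_tokens
        history += user_tokens + reply_tokens
    base = (total_in * in_price + total_out * out_price) / 1_000_000
    return base * (1 + retry_rate)


if __name__ == "__main__":
    # Assumed workload: 3 turns, 200-token prompts, 300-token replies, 10% retries.
    haiku = conversation_cost(0.25, 1.25, 3, 200, 300, retry_rate=0.1)
    maverick = conversation_cost(0.15, 0.60, 3, 200, 300, retry_rate=0.1)
    print(f"Claude 3 Haiku:   ${haiku:.7f}")
    print(f"Llama 4 Maverick: ${maverick:.7f}")
```

Under these assumptions the three turns bill 2,100 input tokens rather than 600, because the history is resent each time. That is why input price can dominate long conversations even when output is priced higher per token.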