Anthropic vs Google Vertex AI Pricing
Per-million-token pricing for Anthropic and Google Vertex AI, with side-by-side flagship models, cheapest tiers, and context windows. Pricing data syncs weekly from a continuously-updated model catalog — last updated June 22, 2026.
As of June 22, 2026, Google Vertex AI offers the lowest output-token price at $0.04/1M (Gemma 3n 4B — $0.04/1M output).
Who wins on what
Cheapest input tokens
$0.02/1MGoogle Vertex AI
Gemma 3 4B — $0.02/1M input
Cheapest output tokens
$0.04/1MGoogle Vertex AI
Gemma 3n 4B — $0.04/1M output
Longest context window
2.0MGoogle Vertex AI
Gemini 3/3.1 (> 200k context) — 2.0M input tokens
Lowest average output cost
$8.73/1MGoogle Vertex AI
Provider-wide average across 50 models
Largest model catalog
50 modelsGoogle Vertex AI
More options to match cost vs capability
Cheapest cached input
$0.03/1MGoogle Vertex AI
Gemini 3.1 Flash Lite — $0.03/1M cached read
Most reasoning models
11 modelsGoogle Vertex AI
Models with dedicated reasoning / thinking support
Most vision models
11 modelsGoogle Vertex AI
Models that accept image input
Side-by-side
Anthropic
Cheapest input
$0.250
Claude 3 Haiku
Cheapest output
$1.25
Claude 3 Haiku
Longest context
1.0M
Claude 4 Sonnet (2025-05-14)
Avg output / 1M
$25.49
Across catalog
Cheapest cached input
$0.030
Claude Haiku 3
| Model | In/1M | Out/1M | Ctx |
|---|---|---|---|
| Claude Fable 5 VisionReasoningToolsCache | $10.00 | $50.00 | 1.0M |
| Claude Opus 4.8 VisionReasoningToolsCache | $5.00 | $25.00 | 1.0M |
| Claude Opus 4.7 | $5.00 | $25.00 | 200K |
| Claude Opus 4.6 | $5.00 | $25.00 | 200K |
| Claude Opus 4.5 | $5.00 | $25.00 | 200K |
| Claude 3 Haiku | $0.250 | $1.25 | 200K |
Google Vertex AI
Cheapest input
$0.017
Gemma 3 4B
Cheapest output
$0.040
Gemma 3n 4B
Longest context
2.0M
Gemini 3/3.1 (> 200k context)
Avg output / 1M
$8.73
Across catalog
Cheapest cached input
$0.025
Gemini 3.1 Flash Lite
| Model | In/1M | Out/1M | Ctx |
|---|---|---|---|
| Gemini 3.5 Flash VisionReasoningToolsCache | $1.50 | $9.00 | 1.0M |
| Gemini 3.1 Flash Lite VisionReasoningToolsCache | $0.250 | $1.50 | 1.0M |
| Gemini 3.1 Flash Lite Preview VisionReasoningToolsCache | $0.250 | $1.50 | 1.0M |
| Nano Banana 2 VisionReasoning | $0.500 | $60.00 | 66K |
| Gemini 3.1 Pro Preview VisionReasoningToolsCache | $2.00 | $12.00 | 1.0M |
| Gemma 3n 4B | $0.020 | $0.040 | 33K |
All prices in USD per 1 million tokens. Showing top 6 models per provider, sorted by output cost.
Related comparisons
Run the numbers for your workload
Calcaas multiplies per-token costs by your real usage patterns — inputs, outputs, retries, and conversation history — across both providers in one model.