Pricing comparison

Google Vertex AI vs DeepSeek Pricing

Per-million-token pricing for Google Vertex AI and DeepSeek, with side-by-side flagship models, cheapest tiers, and context windows. Pricing data syncs weekly from a continuously-updated model catalog — last updated July 6, 2026.

As of July 6, 2026, Google Vertex AI offers the lowest output-token price at $0.04/1M (Gemma 3n 4B — $0.04/1M output).

Who wins on what

Cheapest input tokens

$0.02/1M

Google Vertex AI

Gemma 3 4B — $0.02/1M input

Cheapest output tokens

$0.04/1M

Google Vertex AI

Gemma 3n 4B — $0.04/1M output

Longest context window

2.0M

Google Vertex AI

Gemini 3/3.1 (> 200k context) — 2.0M input tokens

Lowest average output cost

$0.79/1M

DeepSeek

Provider-wide average across 21 models

Largest model catalog

50 models

Google Vertex AI

More options to match cost vs capability

Cheapest cached input

$0.00/1M

DeepSeek

DeepSeek Chat — $0.00/1M cached read

Most reasoning models

11 models

Google Vertex AI

Models with dedicated reasoning / thinking support

Most vision models

11 models

Google Vertex AI

Models that accept image input

Open-weights available

Yes

DeepSeek

Offers open-weight models you can self-host

Side-by-side

50 models

Google Vertex AI

Cheapest input

$0.017

Gemma 3 4B

Cheapest output

$0.040

Gemma 3n 4B

Longest context

2.0M

Gemini 3/3.1 (> 200k context)

Avg output / 1M

$8.73

Across catalog

Cheapest cached input

$0.025

Gemini 3.1 Flash Lite

Model	In/1M	Out/1M	Ctx
Gemini 3.5 Flash VisionReasoningToolsCache	$1.50	$9.00	1.0M
Gemini 3.1 Flash Lite VisionReasoningToolsCache	$0.250	$1.50	1.0M
Gemini 3.1 Flash Lite Preview VisionReasoningToolsCache	$0.250	$1.50	1.0M
Nano Banana 2 VisionReasoning	$0.500	$60.00	66K
Gemini 3.1 Pro Preview VisionReasoningToolsCache	$2.00	$12.00	1.0M
Gemma 3n 4B	$0.020	$0.040	33K

21 models

DeepSeek

Cheapest input

$0.020

DeepSeek R1 0528 Qwen3 8B

Cheapest output

$0.100

DeepSeek R1 0528 Qwen3 8B

Longest context

1.0M

DeepSeek Chat

Avg output / 1M

$0.790

Across catalog

Cheapest cached input

$0.0028

DeepSeek Chat

Model	In/1M	Out/1M	Ctx
DeepSeek V4 Pro ReasoningToolsCache	$0.435	$0.870	1.0M
DeepSeek V4 Flash ReasoningToolsCache	$0.140	$0.280	1.0M
DeepSeek Chat ToolsCache	$0.140	$0.280	1.0M
DeepSeek Reasoner ReasoningToolsCache	$0.140	$0.280	1.0M
DeepSeek R1	$0.550	$2.19	66K
DeepSeek R1 0528 Qwen3 8B	$0.020	$0.100	33K

All prices in USD per 1 million tokens. Showing top 6 models per provider, sorted by output cost.

Related comparisons

OpenAI vs Google Vertex AI Anthropic vs Google Vertex AI OpenAI vs DeepSeek

See the full AI model leaderboard — every model ranked by intelligence, real-world usage, and value.

Run the numbers for your workload

Calcaas multiplies per-token costs by your real usage patterns — inputs, outputs, retries, and conversation history — across both providers in one model.