
Model cost calculator

Pricing verified 2026-06-04.

Input tokens per month: 10M
Output tokens per month: 2M
Cache hit rate: 60%
Cheapest for this workload
DeepSeek V4
DeepSeek · cheapest frontier model on the chart; coding + math punch well above price
$1.14
per month
Saves $0.903/mo vs next cheapest (Gemini 3.5 Flash, +79%)


Ranked cheapest → priciest


How the math works

For each model, the input cost depends on how many of your input tokens are served from the prefix cache, because cached tokens are billed at a lower rate. The effective input rate is therefore a weighted average:

effective_input_$_per_Mtok =
    cache_hit_rate × cached_input_price
  + (1 − cache_hit_rate) × full_input_price

monthly_cost =
    input_tokens × effective_input_$_per_Mtok / 1_000_000
  + output_tokens × output_price / 1_000_000
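
The two formulas above can be sketched in a few lines of Python. This is a minimal illustration, not the calculator's actual source: the function names are made up for this example, and the prices in the usage line are hypothetical placeholders rather than any vendor's real rates.

```python
def effective_input_price(cache_hit_rate: float,
                          cached_input_price: float,
                          full_input_price: float) -> float:
    """Blended $ per million input tokens, weighted by the cache hit rate."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return (cache_hit_rate * cached_input_price
            + (1.0 - cache_hit_rate) * full_input_price)


def monthly_cost(input_tokens: int,
                 output_tokens: int,
                 cache_hit_rate: float,
                 cached_input_price: float,
                 full_input_price: float,
                 output_price: float) -> float:
    """Monthly cost in dollars; all prices are $ per million tokens."""
    input_rate = effective_input_price(
        cache_hit_rate, cached_input_price, full_input_price)
    return (input_tokens * input_rate / 1_000_000
            + output_tokens * output_price / 1_000_000)


# Hypothetical prices: $1.00 full input, $0.10 cached input, $4.00 output.
example = monthly_cost(10_000_000, 2_000_000, 0.60, 0.10, 1.00, 4.00)
print(f"${example:.2f} per month")  # prints $12.60 per month
```

With these placeholder prices, the blended input rate is 0.6 × $0.10 + 0.4 × $1.00 = $0.46 per million tokens, so 10M input tokens cost $4.60 and 2M output tokens cost $8.00.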

If you're running RAG or any workflow with a stable system prompt, your real hit rate is often 60-90%, far above the 0% the vendor price table implicitly assumes. That gap is the main reason a Sonnet workload that "should" cost $X can cost closer to $X/3 in practice, especially when input tokens dominate the bill. The calculator above models that gap explicitly.
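
To see how large that gap can get, here is a rough sketch that compares the cost at several hit rates against the 0% list-price estimate. Everything in it is an assumption for illustration: the prices are hypothetical (cached input at one tenth of the full rate), and the workload is deliberately input-heavy, which is where caching matters most.

```python
def monthly_cost(input_tokens, output_tokens, hit_rate,
                 cached_price, full_price, output_price):
    """Monthly cost in dollars; prices are $ per million tokens."""
    effective = hit_rate * cached_price + (1 - hit_rate) * full_price
    return (input_tokens * effective + output_tokens * output_price) / 1_000_000


# Hypothetical prices and workload, chosen only for illustration.
FULL, CACHED, OUTPUT = 3.00, 0.30, 15.00
INPUT_TOKENS, OUTPUT_TOKENS = 100_000_000, 2_000_000


def cost_ratios(hit_rates):
    """Cost at each hit rate as a fraction of the 0% (list-price) estimate."""
    list_price_cost = monthly_cost(
        INPUT_TOKENS, OUTPUT_TOKENS, 0.0, CACHED, FULL, OUTPUT)
    return {
        h: monthly_cost(INPUT_TOKENS, OUTPUT_TOKENS, h,
                        CACHED, FULL, OUTPUT) / list_price_cost
        for h in hit_rates
    }


ratios = cost_ratios([0.0, 0.6, 0.9])
for hit_rate, ratio in ratios.items():
    print(f"hit rate {hit_rate:.0%}: {ratio:.0%} of the list-price estimate")
```

Under these assumptions the list-price estimate is $330 per month, while a 90% hit rate brings it to $87, roughly a quarter. A more output-heavy workload would see a smaller reduction, since output tokens are never cached.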

Caveats before you trust these numbers
  • Prices verified against vendor pricing pages on 2026-06-04. Tap any model row in the table to open its current vendor page and sanity-check before committing budget.
  • API list pricing only. Enterprise discounts, committed-spend contracts, and provisioned-throughput pricing all differ.
  • Long-context surcharges (some vendors charge a higher rate above 200K or 500K tokens per request) aren't modeled here. If you're routinely sending huge prompts, treat the result as a floor.
  • Tool calls, embeddings for retrieval, fine-tuning, and image/audio modalities are billed separately and aren't included.
  • Cache hit rate is the % of input tokens served from a prefix cache. Output is never cached.


Use this tool anywhere

Free to link, cite, or embed; a credit link back is all we ask.

Embed this tool on your site
<iframe src="https://thecounterbrief.com/tools/model-cost-calculator?embed=1" title="AI Model Cost Calculator | The Counter Brief" width="100%" height="640" loading="lazy" style="border:1px solid #e4e6eb;border-radius:12px"></iframe>
<p style="font:14px/1.4 system-ui,sans-serif;color:#5a6070"><a href="https://thecounterbrief.com/briefing?source=embed-calc&utm_source=signal-embed&utm_medium=calculator">Live pricing by The Counter Brief — get cost-change alerts</a></p>