Claude Sonnet 5 Launch: $2/$10 Promo Pricing Undercuts Opus 4.8 for Coding Agents
By Eric Bush · July 1, 2026 · 9 min read
What Anthropic Shipped Today
Anthropic released Claude Sonnet 5 on July 1, 2026. The public pitch: reasoning, tool use, coding and knowledge-work performance approaching Opus 4.8, delivered at a fraction of the cost. The launch numbers to remember are the ones on the invoice, not the benchmark chart:
| Model | Input / M tokens | Output / M tokens | Window |
|---|---|---|---|
| Claude Sonnet 5 (promo) | $2.00 | $10.00 | Now → Aug 31, 2026 |
| Claude Sonnet 5 (standard) | $3.00 | $15.00 | From Sep 1, 2026 |
| Claude Sonnet 4.6 | $3.00 | $15.00 | Ongoing |
| Claude Opus 4.8 | $5.00 | $25.00 | Ongoing |
The promo undercuts Sonnet 4.6 by 33% flat, and undercuts Opus 4.8 by roughly 2.5× on both dimensions. Anthropic almost never launches an intelligence upgrade at a discount to the previous generation. The two-month window is the story.
Why the Promo Is Meaningful for Coding Agents
Coding agents are output-heavy relative to chat. A typical Claude Code session runs 8:1 to 12:1 input-to-output tokens over a 30-minute session, because the agent reads the codebase, holds context, and writes back diffs. When output prices drop, coding costs drop with them. Sonnet 5 promo cuts output by exactly the same 33% as input compared to Sonnet 4.6 — but versus Opus 4.8 the drop is 2.5× on both meters, which for output-heavy workloads compounds fast.
Concretely: an agent that consumed 4M input tokens and 350K output tokens per developer-day on Opus 4.8 was billing $28.75/day at Opus pricing. The same workload on Sonnet 5 promo pricing runs $11.50/day. Even if you have to burn 20% more tokens because Sonnet 5 misses some Opus-only reasoning wins, you are still at $13.80/day. The two-month promo period is 61 days; that is a $910–$1,050 delta per developer if you can route successfully.
Where Sonnet 5 Actually Replaces Opus
Anthropic's launch materials call out three capability upgrades over Sonnet 4.6: planning, browser tool use, and terminal tool use. In practice this means Sonnet 5 becomes the default for:
- Multi-file edits — Sonnet 4.6 tended to lose track around the 8-file mark; Sonnet 5 holds through mid-teens on typical monorepos.
- Terminal-driven agents — bash tool sequences with dependency between steps, where Sonnet 4.6 sometimes retried against stale state.
- Long-form refactors — the 200K context is unchanged, but tool-use stability makes it usable for the full window.
Where Opus 4.8 still earns its price: architecture design under ambiguity, adversarial code review, novel algorithm work. If your agent's job is to make judgement calls rather than execute a plan, Opus is still the right router destination. If the agent is executing a well-scoped diff, Sonnet 5 wins on total cost with acceptable quality.
Routing Strategy for the Promo Window
A defensible routing setup for the next 61 days:
- Set Sonnet 5 as the default coding model for all agents.
- Keep Opus 4.8 available as an escalation for
/deep-think-style flagged tasks. - Enable prompt caching on both — cache hits price at 10% of write, so long system prompts amortize even faster.
- Instrument per-model spend so you can measure the actual output-quality delta rather than assuming.
The trap to avoid: assuming Sonnet 5 promo pricing sticks. It will not. On September 1, standard $3/$15 kicks in and Sonnet 5 becomes identical on price to Sonnet 4.6. Any workload you shifted onto Sonnet 5 for cost reasons needs a September re-evaluation.
Two-Month Playbook
If your team has been budget-blocked from doing an expensive migration, refactor, or codebase reorganization, the promo window is the cheapest time in a year to do it. A migration that would have burned $12K on Opus can plausibly run $4K on Sonnet 5 during promo. Two months is enough to plan, execute, and clean up. After September, you are back to normal pricing math.
Also worth doing during promo: rebuilding eval suites, generating documentation across your codebase, and any “we should really automate this someday” task that stalled on cost.
Want to calculate exact costs for your project?
Frequently Asked Questions
How long does the $2/$10 Claude Sonnet 5 promo price last?
Through August 31, 2026. Starting September 1, 2026, Sonnet 5 reverts to standard pricing at $3/M input and $15/M output — the same as Sonnet 4.6.
Should I switch coding agents from Opus 4.8 to Sonnet 5?
For most execution-focused coding tasks (multi-file edits, terminal-driven agents, refactors) — yes, during the promo. Keep Opus 4.8 as an escalation path for tasks requiring novel judgement or adversarial reasoning. Reevaluate the split in September when promo ends.
What's the expected daily cost per developer using Sonnet 5?
Roughly $11–14/day during promo based on typical Claude Code usage patterns (4M input / 350K output tokens/day), versus $28–35/day on Opus 4.8 at $5/$25 pricing.
Does Sonnet 5 support prompt caching?
Yes. Cache reads price at approximately 10% of the input write cost, so long system prompts and stable context blocks amortize cheaply on both promo and standard pricing.
Does Sonnet 5 have a larger context window than Sonnet 4.6?
No — both use a 200K token context window. The upgrade is in tool-use reliability and planning within that window, not raw size.
Related Articles
Claude Fable 5 Pricing: $10/$50 Per Million Tokens — Is Anthropic's Strongest Model Worth It for Coding?
Claude Fable 5 launched at $10 input / $50 output per million tokens — less than half of Mythos Preview pricing. We analyze when the premium over Opus 4.8 at $5/$25 is justified for coding workflows.
Claude Fable 5 and Mythos 5 Pricing: $10/$50 Per Million Tokens Is a 50% Price Cut
Anthropic launches Claude Fable 5 and Mythos 5 at $10/$50 per million tokens — a 50% price cut from Mythos Preview. Full pricing analysis comparing with Opus 4.8 and Sonnet 4.6 for AI coding workflows.
Claude Fable 5 Is Back: Export Controls Lifted, What the Return Means for Coding Costs
Anthropic announced on June 30, 2026 that US export controls on Claude Fable 5 have been lifted. The model returns to global availability on July 1 with a 50% weekly-quota inclusion window through July 7. We break down what changed, the new safeguard, and the coding-cost implications.