Business & Funding Models & Releases·The Decoder·May 23

Deepseek makes its 75 percent discount permanent, pricing output tokens at least 34x below GPT-5.5

Deepseek's permanent 75 percent pricing cut reshapes the LLM cost structure, pushing output tokens to $0.0015 per million, a threshold that forces Western incumbents to recalibrate margins. The move targets agentic workloads where token volume compounds costs, signaling a shift from model capability competition toward infrastructure economics. For enterprises building token-intensive systems, this pricing floor may accelerate adoption of Chinese models and pressure OpenAI, Anthropic, and Google to defend market share through either aggressive repricing or differentiated performance claims.

Modelwire context

Analyst take

The permanence of the cut is the operative word here. Temporary promotional pricing can be dismissed as a land-grab; a permanent floor forces competitors to model against it in their own unit economics indefinitely, which is a different kind of pressure.

This is largely disconnected from recent activity in our archive, as we have no prior coverage to anchor it to. That gap is itself worth noting: Deepseek's pricing moves have been accelerating faster than most Western trade coverage has tracked them. The story belongs to a longer arc of Chinese frontier labs using infrastructure cost advantages to compete on price rather than benchmark headlines, a strategy that sidesteps the capability arms race entirely and targets the procurement conversation instead. For enterprises already running token-intensive agentic pipelines, the relevant comparison is not raw model quality but total cost per completed workflow, and at $0.0015 per million output tokens, Deepseek V4-Pro changes that calculation materially.

Watch whether OpenAI or Anthropic announce API price reductions within the next 60 days. If neither moves, it signals they are betting on performance differentiation over price parity, which would confirm a deliberate two-tier market strategy rather than a reactive one.

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsDeepseek · Deepseek V4-Pro · OpenAI · GPT-5.5 · Anthropic · Google

Read full story at The Decoder →(the-decoder.com)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on the-decoder.com. If you’re a publisher and want a different summarization policy for your work, see our takedown page.