Research Opinion & Analysis·The Decoder·May 5

Anthropic co-founder maps out how recursive AI improvement could outpace the humans meant to supervise it

Anthropic co-founder Jack Clark has outlined a technical pathway for recursive self-improvement in AI systems, arguing that the foundational components already exist. His analysis assigns 60 percent probability to systems capable of training their own successors by end of 2028. This directly challenges the assumption that human oversight can scale with AI capability gains, reshaping how the field thinks about supervision bottlenecks and the timeline for autonomous capability iteration. The claim matters because it moves recursive improvement from theoretical concern to near-term engineering problem.

Modelwire context

Analyst take

Clark's 60 percent figure is doing real work here: it's not a vague warning but a dated, probabilistic claim from someone with direct visibility into Anthropic's internal capability roadmap, which makes it materially different from the recursive-improvement discourse that has circulated in academic safety literature for years.

The timing sits uncomfortably against two threads in recent coverage. First, the ARC-AGI-3 analysis from The Decoder (May 2) found that frontier models including Opus 4.7 still fail on basic reasoning tasks humans solve intuitively, which cuts against the premise that current systems are close to training competent successors. Second, the UK AI Security Institute finding that GPT-5.5 now matches Claude Mythos in autonomous cyber attack simulations (The Decoder, May 1) shows that offensive capability parity is arriving faster than governance structures anticipated, which is exactly the supervision-gap problem Clark is describing in a different domain. Together, these stories suggest the field is simultaneously hitting reasoning ceilings and capability thresholds, a combination that makes the oversight question harder, not easier.

Watch whether Anthropic publishes a formal technical report or eval framework tied to this 2028 claim before end of 2026. If they do, it signals internal alignment on the timeline and forces competitors to respond publicly. If the claim stays in co-founder commentary without institutional backing, treat it as a positioning move rather than a roadmap.

Coverage we drew on

Even the latest AI models make three systematic reasoning errors, ARC-AGI-3 analysis shows · The Decoder

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsAnthropic · Jack Clark · The Decoder

Read full story at The Decoder →(the-decoder.com)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on the-decoder.com. If you’re a publisher and want a different summarization policy for your work, see our takedown page.