The Predictive-Causal Gap: An Impossibility Theorem and Large-Scale Neural Evidence

A new theoretical result exposes a fundamental tension in how neural networks learn from data. Researchers tested 2695 configurations and found that predictive models systematically ignore the causal structure they're meant to capture, instead tracking environmental noise. The optimal encoder achieves lower prediction error by focusing on spurious correlations rather than true system dynamics, a failure that worsens dramatically in high dimensions. The paper proves this is not a training quirk but an inherent property of the predictive objective itself. This challenges a core assumption in representation learning: that minimizing prediction loss yields interpretable, causally grounded features. The finding has implications for any system relying on self-supervised pretraining to extract meaningful structure from observations.
Modelwire context
ExplainerThe paper's most pointed implication isn't about model quality in general but about self-supervised pretraining specifically: the objective function used to train the representations that underpin nearly every modern foundation model is, by this proof, structurally incapable of recovering causal structure regardless of scale or architecture.
This connects directly to two threads already on the site. The Impossibility Triangle of Long-Context Modeling (arXiv, May 6) established a similar pattern: what looks like an engineering gap is actually a formal constraint baked into the objective. Together, these two papers suggest a broader moment where theoretical work is catching up to empirical intuitions about what current training regimes cannot do. That framing also sharpens the ARC-AGI-3 analysis from The Decoder (May 2), which found systematic reasoning failures that persisted despite scale. If predictive objectives are provably blind to causal structure, the three repeatable error patterns identified there may not be fixable by more pretraining data or larger models. They may require a different objective entirely.
Watch whether any major self-supervised pretraining effort (Meta's JEPA line or DeepMind's successor to Gato are the most plausible candidates) explicitly incorporates causal objectives in a public release within the next 12 months. If they do, this theorem will have moved from theory to design constraint faster than most impossibility results do.
This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.
MentionsNeural networks · Representation learning · Self-supervised learning · Causal inference
Modelwire Editorial
This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.
Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.