Learning Over-Relaxation Policies for ADMM with Convergence Guarantees

Researchers propose a learned approach to tuning relaxation parameters in ADMM, a foundational optimization algorithm widely deployed in control systems and structured ML problems. By framing parameter adaptation as an online learning task rather than manual tuning, the work targets repeated problem-solving scenarios like Model Predictive Control where problem structure remains fixed but data shifts. The contribution matters for practitioners building optimization-heavy AI systems: it sidesteps expensive matrix refactorizations while maintaining convergence guarantees, directly improving throughput in solvers like OSQP that power embedded and real-time ML inference pipelines.
Modelwire context
ExplainerThe genuinely tricky contribution here is not the learning itself but the convergence guarantee: most learned optimizer work trades theoretical safety for empirical speed, so preserving provable convergence while adapting parameters online is the harder constraint the summary gestures at but doesn't unpack.
The closest thread in recent coverage is the Hyper Input Convex Neural Networks paper from the same day, which also sits at the intersection of constrained learning and optimization theory. Both papers are working on the same underlying tension: how do you bring learned flexibility into optimization pipelines without discarding the structural guarantees practitioners depend on? HyCNNs do it through architectural constraints on function shape; this ADMM work does it through constrained online policy learning over relaxation parameters. Neither paper cites the other, but together they sketch a pattern worth tracking: learned components are moving deeper into the solver stack, not just sitting on top as warm-start heuristics.
The real test is whether OSQP or a comparable open-source solver ships an optional learned relaxation module within the next 12 months. If that happens, it signals the research has cleared the reproducibility bar practitioners require; if it stays in arXiv benchmarks, the convergence guarantees may not hold robustly enough outside controlled problem distributions.
Coverage we drew on
This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.
MentionsADMM · OSQP · Model Predictive Control
Modelwire Editorial
This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.
Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.