Research Models & Releases·arXiv cs.LG·1d ago

ART for Diffusion Sampling: Continuous-Time Control and Actor-Critic Learning

Researchers propose Adaptive Reparameterized Time (ART), a learned control mechanism that optimizes how diffusion models allocate computational steps during image generation. Rather than relying on fixed schedules, ART treats sampling speed as a learnable variable, enabling the model to spend more steps where they matter most. This addresses a fundamental efficiency bottleneck in diffusion sampling that affects inference cost across computer vision applications. The work bridges optimal control theory with reinforcement learning, offering a principled path to faster generation without sacrificing quality, with implications for real-time deployment of diffusion-based systems.

Modelwire context

Explainer

ART treats the sampling trajectory itself as a learnable optimization variable rather than a fixed hyperparameter. This inverts the typical approach: instead of designing better schedules offline, the model learns to allocate steps dynamically during inference based on what each generation actually needs.

This is part of a coherent pattern across recent work on adaptive compute. CAT (from July 1st) lets reasoning models adjust chain-of-thought depth per problem; Valdi (same day) addresses the inference bottleneck in diffusion-based world models by rethinking the sampling loop itself. ART extends this logic specifically to the step-allocation problem within a single diffusion trajectory. All three papers share a common insight: uniform resource allocation wastes computation on easy cases. The difference here is that ART applies actor-critic learning to a continuous control problem (sampling speed) rather than discrete decisions (token depth or single-step inference).

If ART reduces wall-clock inference time by more than 25% on standard benchmarks (COCO, ImageNet) while maintaining FID parity with fixed schedules, and if this holds across different model sizes (base, large, XL), then the approach is robust enough for production deployment. If the gains collapse below 10% or require model retraining per task, it's a marginal optimization rather than a general principle.

Coverage we drew on

CAT: Confidence-Adaptive Thinking for Efficient Reasoning of Large Reasoning Models · arXiv cs.CL

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsAdaptive Reparameterized Time (ART) · ART-RL · diffusion models · score-based diffusion

Read full story at arXiv cs.LG →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.