Models & Releases Research·arXiv cs.LG·2d ago

TiRex-2: Generalizing TiRex to Multivariate Data and Streaming

TiRex-2 advances time series forecasting by extending recurrent xLSTM architecture to handle multivariate data and streaming inference. The key innovation addresses a critical pain point in production forecasting: existing Transformer-based foundation models suffer quadratic complexity as context grows and require full recomputation when new observations arrive. TiRex-2's memory-centric design maintains constant per-patch cost during streaming while capturing cross-variable dependencies through bidirectional and asymmetric attention mechanisms. This matters for practitioners deploying real-time forecasting systems where latency and computational efficiency directly impact feasibility.

Modelwire context

Explainer

The paper doesn't just extend TiRex to multivariate data; it solves a specific architectural problem that Transformers inherit: quadratic complexity forces practitioners to either truncate context windows or accept recomputation overhead on every new observation. TiRex-2's constant per-patch cost during streaming is the actual contribution, not the multivariate support alone.

This connects directly to the Aionoscope work from the same day, which exposed a gap between what time-series models measure (raw accuracy) and what production systems need (interpretable latent state). TiRex-2 addresses the efficiency half of that problem. It also echoes the clinical NLP deployment lesson from Dynamic Bidirectional Pattern Memory: production constraints (latency, recomputation cost) often matter more than benchmark gains. The xLSTM architecture choice here is a bet that recurrence scales better than attention for streaming inference, which is testable but not yet proven at scale.

If TiRex-2 matches or beats Transformer foundation models on standard multivariate benchmarks (like ETTm2 or Weather) while maintaining sub-100ms inference latency on streaming windows, the efficiency claim holds. If latency advantage disappears on real-world data with >50 variables or >1000-step context, the constant-cost property may not survive contact with production complexity.

Coverage we drew on

Aionoscope: Debugging Latent-State Accessibility in Time-Series Representations · arXiv cs.LG

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsTiRex-2 · TiRex · xLSTM · Transformer

Read full story at arXiv cs.LG →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.