Fixation Sequences as Time Series: A Topological Approach to Dyslexia Detection

Researchers applied persistent homology, a topological data analysis technique, to eye-tracking sequences to detect dyslexia, combining topological features with statistical methods on the Copenhagen Corpus. The hybrid approach outperformed existing baselines for distinguishing dyslexic readers across native and non-native speakers.

Modelwire context

Explainer

The real methodological bet here is treating fixation sequences as geometric objects rather than raw time series, which lets topological features capture reading rhythm irregularities that standard statistical summaries tend to flatten out. The Copenhagen Corpus is also worth flagging: it includes both native and non-native Danish readers, making cross-linguistic generalization a built-in test rather than an afterthought.

This sits at a distance from most recent Modelwire coverage, which has focused on LLM inference and evaluation. The closest conceptual neighbor is the DiscoTrace paper from April 16, which also compared human and model behavior through structured sequence representations, though DiscoTrace targets rhetorical strategy rather than clinical signals. More broadly, this work belongs to a quieter thread in the archive: using formal mathematical structure to extract meaning from behavioral traces that neural approaches tend to treat as noise.

The benchmark to track is whether this hybrid topological-statistical approach holds up when applied to corpora outside Scandinavian languages, particularly ones with orthographically irregular writing systems like English, where dyslexic fixation patterns are known to differ. If a replication on an English-language corpus like the Provo or MECO dataset shows comparable separation, the method has real generalization legs.

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsPersistent Homology · Copenhagen Corpus · Topological Data Analysis

Read full story at arXiv cs.CL →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.