Introducing talkie: a 13B vintage language model from 1930

Researchers including Alec Radford (GPT, Whisper) have released talkie, a 13B language model trained exclusively on pre-1931 English text. This specialized historical model opens a new frontier in temporal domain adaptation, enabling researchers to study how language models behave when constrained to specific linguistic eras. The release signals growing interest in controllable pretraining as a research lever, with implications for understanding model behavior across distributional shifts and for building domain-specific variants without massive compute budgets.
Modelwire context
ExplainerThe buried detail is the team composition: Alec Radford, whose prior work on GPT and Whisper shaped the modern pretraining playbook, is now contributing to a project that deliberately inverts the usual scaling logic by narrowing the data distribution rather than expanding it. That choice is the actual research signal here.
This is largely disconnected from recent activity in our archive, which has no prior coverage to anchor against. The work belongs to a quieter but growing thread in the research community around controlled pretraining, where the question is not how big a model can get but how precisely its knowledge can be bounded. That framing connects to ongoing debates about data provenance, copyright, and distributional shift that have surfaced repeatedly in discussions of foundation model training, even if we have not yet covered those threads directly.
Watch whether other groups publish evaluations using talkie as a baseline for temporal generalization experiments within the next six months. If that happens, it signals the model is being adopted as a shared research artifact rather than remaining a one-off curiosity.
This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.
MentionsAlec Radford · Nick Levine · David Duvenaud · talkie · Hugging Face
Modelwire Editorial
This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.
Modelwire summarizes, we don’t republish. The full content lives on simonwillison.net. If you’re a publisher and want a different summarization policy for your work, see our takedown page.