Models & Releases Tools & Code·Simon Willison·May 4

Granite 4.1 3B SVG Pelican Gallery

IBM's Granite 4.1 family expands the open-weight LLM landscape with Apache 2.0 licensed models across three scales (3B, 8B, 30B), directly challenging the closed-model dominance of frontier labs. The 3B variant's rapid quantization by Unsloth into 21 GGUF variants signals strong developer adoption momentum for efficient inference, positioning Granite as a credible alternative for cost-conscious deployments and on-device applications where model size and licensing clarity matter.

Modelwire context

Analyst take

The real signal here isn't the model family itself but the licensing choice. Apache 2.0 removes the commercial-use friction that has quietly limited adoption of otherwise capable open-weight models, and that distinction matters more for enterprise procurement than any benchmark number IBM will publish.

This lands in the middle of a rapidly forming tier of capable, cost-conscious alternatives to frontier labs. Xiaomi's MiMo-V2.5-Pro (covered May 3rd) made the same play from a different angle, competing on token efficiency rather than raw scores. xAI's Grok 4.3 (May 2nd) is pursuing the same cost-sensitive segment through price cuts rather than open weights. What's emerging is a multi-front squeeze on the mid-market: open-weight players offering licensing clarity, and closed-weight players offering aggressive pricing. IBM's 3B variant, with Unsloth already shipping 21 GGUF variants within days of release, suggests the developer community is treating this as a serious on-device candidate rather than a corporate vanity release.

Watch whether enterprise tooling vendors (LangChain, LlamaIndex, Ollama) add first-class Granite 4.1 support within the next 60 days. Broad integration there would confirm genuine adoption momentum rather than benchmark tourism.

Coverage we drew on

Xiaomi's open-weight MiMo-V2.5-Pro takes aim at Claude Opus with hours-long autonomous coding · The Decoder

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsIBM · Granite 4.1 · Unsloth · Yousaf Shah

Read full story at Simon Willison →(simonwillison.net)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on simonwillison.net. If you’re a publisher and want a different summarization policy for your work, see our takedown page.