Siri AI at WWDC 2026

Apple's 2026 Siri overhaul pivots toward on-device and private-cloud LLM inference using a custom Gemini-derived model, marking a strategic shift away from cloud-dependent AI. The approach leverages vision capabilities to parse screen context, reducing reliance on explicit user queries. Willison's skepticism reflects justified caution after Apple Intelligence's 2024 underdelivery, but the technical architecture now appears grounded in feasible infrastructure rather than speculative claims. This signals how major platforms are decoupling from third-party model providers while maintaining privacy guarantees through hybrid compute.
Modelwire context
Analyst takeThe detail worth sitting with is the Gemini-derived lineage. Apple isn't building from scratch, it's forking or licensing from Google, which means two of the largest platform competitors are now in a quiet technical partnership that neither has incentive to publicize loudly.
This is largely disconnected from recent activity in our archive, as we have no prior coverage to anchor against. In the broader space, though, this story belongs alongside the ongoing pattern of large platform companies internalizing model development rather than routing user data through third-party APIs. Apple's move mirrors what we've seen from Meta and Amazon in different product contexts: the calculus is that owning inference infrastructure is worth the engineering cost once user volume is high enough to justify it. The privacy framing via Private Cloud Compute is doing real work here, giving Apple a differentiator that pure model performance cannot.
Watch whether Apple discloses the specific terms of its arrangement with Google around the Gemini-derived model before WWDC 2027. If no disclosure comes and the model continues shipping in production, that silence will tell us something about how platform AI licensing deals are being structured to avoid regulatory scrutiny.
This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.
MentionsApple · Siri · Google Gemini · Apple Intelligence · Private Cloud Compute · Simon Willison
Modelwire Editorial
This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.
Modelwire summarizes, we don’t republish. The full content lives on simonwillison.net. If you’re a publisher and want a different summarization policy for your work, see our takedown page.