Products & Apps Research·The Decoder·Jun 7

Perplexity's "Search as Code" lets AI models write their own search pipelines instead of calling fixed APIs

Perplexity has shifted from fixed API-based search to a model-driven architecture where AI agents compose their own search logic in Python within a sandboxed environment. This architectural pivot addresses a fundamental inefficiency in current agentic systems: rigid pipelines force models to make suboptimal calls and waste tokens on redundant operations. The reported 85 percent token reduction and benchmark wins over OpenAI and Anthropic suggest the approach unlocks material efficiency gains, signaling a broader industry move toward flexible, agent-controlled data retrieval rather than predefined tool interfaces.

Modelwire context

Analyst take

The 85 percent token reduction figure is doing a lot of work here, but Perplexity hasn't disclosed which benchmarks, what task distribution, or whether the comparison baselines reflect current OpenAI and Anthropic production configurations. That number needs independent replication before it anchors any competitive narrative.

This move lands directly on top of the argument Hugging Face made in 'Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic' (June 1): the bottleneck in production AI is shifting from model quality to how well agents orchestrate tools and data retrieval. Perplexity is essentially operationalizing that thesis at the infrastructure layer, betting that model-composed pipelines beat fixed APIs before the larger labs can respond. The K-BrowseComp benchmark story from the same week is also relevant context: it showed that agentic web interaction is brittle even for frontier models, which raises a real question about whether Perplexity's sandbox approach holds up across languages and query types beyond its reported test set.

Watch whether OpenAI or Anthropic ships a comparable agent-controlled retrieval interface within the next two quarters. If neither responds and Perplexity's token efficiency claims get third-party validation on a public agentic benchmark, the architectural bet is confirmed. If the claims stay proprietary, treat them as unverified.

Coverage we drew on

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic · Hugging Face

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsPerplexity · OpenAI · Anthropic · Search as Code

Read full story at The Decoder →(the-decoder.com)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on the-decoder.com. If you’re a publisher and want a different summarization policy for your work, see our takedown page.