Modelwire

Why Aren’t We Measuring How AI Affects Humans?

The AI industry has built sophisticated measurement frameworks for model capabilities while largely ignoring systematic assessment of how these systems reshape human cognition, relationships, and behavior. Imran Khan at the Center for Humane Technology argues this gap represents a critical blind spot as deployment accelerates. The absence of psychosocial evaluation metrics mirrors historical regulatory failures in other technologies, leaving organizations to deploy tools with unknown downstream effects on users and society. This framing positions human-impact measurement as an urgent infrastructure gap rather than a peripheral concern.

Modelwire context

Analyst take

The Center for Humane Technology's framing treats human-impact measurement as infrastructure, not ethics theater, which subtly shifts the ask from voluntary best-practice adoption to something closer to a mandatory baseline. That distinction matters for how organizations respond.

The timing here is pointed. Florida's lawsuit against OpenAI, covered here on June 1st, is essentially a legal stress-test of exactly the gap Khan describes: a court is now being asked to determine causal harm from AI deployment in the absence of any systematic psychosocial monitoring that might have flagged risk earlier. Separately, the Amazon leaderboard story from the same day illustrates that even capability-side measurement is fragile under competitive pressure, which makes the prospect of building rigorous human-impact metrics look even harder. And Anthropic's IPO filing raises the same underlying tension from a different direction: public markets will price safety commitments, but only if there are numbers to anchor them to.

Watch whether the Florida lawsuit's discovery phase forces OpenAI to disclose what user-impact monitoring, if any, existed around the relevant interactions. If that record is thin, it will do more to accelerate psychosocial measurement standards than any advocacy paper.

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting.

Mentions: Imran Khan · Center for Humane Technology · IEEE Spectrum

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Modelwire summarizes; we don’t republish. The full content lives on spectrum.ieee.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.
