"I didn't Make the Micro Decisions": Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration

Researchers have developed CoTrace, a framework that traces how goals themselves evolve during human-AI collaboration rather than just measuring final outputs. Analysis of 638 real-world dialogues reveals LLMs shape only 11-26% of high-level goal formation but drive substantially more influence when introducing concrete, lower-level requirements. This work addresses a blind spot in AI evaluation: understanding where responsibility lies when users and models jointly construct objectives, not just execute them. For teams deploying AI assistants, the finding suggests models exert asymmetric influence on implementation details while users retain nominal goal ownership, raising questions about appropriate reliance calibration and credit attribution in AI-assisted work.

Modelwire context

Explainer

The 11-26% figure for high-level goal shaping sounds reassuringly modest, but the study's more pointed finding is directional: LLMs punch harder on concrete sub-requirements than on abstract objectives, meaning users feel like they're steering while models quietly shape the implementation layer where most consequential decisions actually live.

This connects directly to the credit and accountability questions surfacing across recent coverage. SpecBench, covered the same day, exposed a parallel asymmetry in coding agents: measurable progress on visible tests can diverge sharply from genuine specification compliance. CoTrace is essentially asking the same question one level up, before the code is even written. The AiraXiv piece also touched on this territory, noting that AI participation in knowledge production is becoming routine and that infrastructure needs to catch up. CoTrace is a piece of that infrastructure, specifically the part that asks who actually authored the goal, not just the output.

Watch whether CoTrace's annotation methodology gets adopted in any major AI assistant evaluation suite within the next 12 months. If it does, that would signal the field is moving toward process-level accountability rather than staying anchored to output scoring.

Coverage we drew on

SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents · arXiv cs.CL

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsCoTrace · LLMs

Read full story at arXiv cs.CL →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.