Persona Non Grata: LLM Persona-Driven Generations in MCQA are Unstable in Distinct Dimensions

Researchers have identified a critical stability gap in how large language models maintain consistent personas during multiple-choice question answering, a task type largely absent from prior robustness studies. The work introduces three new metrics spanning performance, outcome, and correctness dimensions, revealing that instability patterns correlate predictably with model family and scale. This matters for production systems relying on persona-driven LLM outputs in structured tasks, where inconsistency could undermine reliability in customer-facing or safety-critical applications.

Modelwire context

Explainer

The paper doesn't just confirm personas drift in LLMs; it separates instability into three distinct failure modes (performance, outcome, correctness) and shows they correlate differently with model scale and family. This granularity matters because a system might fail consistently on one dimension while succeeding on another, which prior monolithic robustness studies would have missed.

This connects directly to the persona-adaptive work from early July (Behavior-Adaptive Conversational Agents), which proposed dynamic personality calibration as a fix for fixed-persona limitations. That paper showed adaptive behavior improves outcomes; this new work reveals the underlying instability problem that makes static personas risky in the first place. Together they frame a design tension: personas are unstable by default, but the field has been deploying them as fixed anyway. The affective gap paper (Quantifying the Affective Gap) also fits here, since emotion recognition is a persona-adjacent task where inconsistency would compound safety risks in mental health applications.

If the same three metrics are adopted in production LLM evaluations within the next 6 months (watch for mentions in Claude, GPT, or Gemini safety documentation), that signals the industry is moving beyond binary persona pass/fail to dimensional stability tracking. If not, this remains a research finding without downstream adoption pressure.

Coverage we drew on

Behavior-Adaptive Conversational Agents: Toward a Fluid Personality Framework · arXiv cs.CL

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsLLM · MCQA · persona-driven generation

Read full story at arXiv cs.CL →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.

Research

Persona Non Grata: LLM Persona-Driven Generations in MCQA are Unstable in Distinct Dimensions

Modelwire context

Coverage we drew on

Modelwire Editorial

Related

MSQA: A Natively Sourced Multilingual and Multicultural SimpleQA Benchmark

LLMs are stuck in a groupthink groove. This startup is trying to get them out.

Behavior-Adaptive Conversational Agents: Toward a Fluid Personality Framework