Learning to Think Like a Cartoon Captionist: Incongruity-Resolution Supervision for Multimodal Humor Understanding

Researchers introduce Incongruity-Resolution Supervision (IRS), a framework that decomposes humor understanding into incongruity detection, resolution modeling, and preference alignment. It is grounded in the incongruity-resolution theory of humor and tested on the New Yorker Cartoon Caption Contest benchmark.

Mentions: New Yorker Cartoon Caption Contest · IRS · incongruity-resolution theory

Read full story at arXiv cs.CL → (arxiv.org)


Research

IG-Search: Step-Level Information Gain Rewards for Search-Augmented Reasoning

arXiv cs.CL·3d ago

Research

Structural interpretability in SVMs with truncated orthogonal polynomial kernels

arXiv cs.LG·2d ago

Research

DiscoTrace: Representing and Comparing Answering Strategies of Humans and LLMs in Information-Seeking Question Answering

arXiv cs.CL·3d ago
