Research Tools & Code·arXiv cs.CL·Apr 23

EVENT5Ws: A Large Dataset for Open-Domain Event Extraction from Documents

Researchers released EVENT5Ws, a manually annotated dataset of 5,000+ documents for training open-domain event extraction systems. The work addresses a gap in NLP benchmarks by providing large-scale, verified annotations across diverse event types, enabling better automated analysis of real-world incidents.

Modelwire context

Explainer

The significance here isn't the annotation count alone but the 'open-domain' scope: most prior event extraction benchmarks constrain models to a fixed ontology of event types, which means systems trained on them fail badly when encountering novel real-world incidents outside that predefined list. EVENT5Ws is designed to break that ceiling.

The closest parallel in recent Modelwire coverage is MADE, the living medical adverse-event benchmark covered in mid-April from arXiv cs.CL. Both releases are responding to the same structural problem: NLP evaluation suites that are too narrow, too contaminated, or too static to reflect deployment conditions. MADE addressed label imbalance and data contamination in a high-stakes vertical; EVENT5Ws targets breadth across event types rather than depth in one domain. These are complementary pressures on the benchmark ecosystem, not competing ones. Recent coverage of LLM judge bias (the 'Context Over Content' paper, also from arXiv cs.CL in mid-April) adds another layer: better training data only helps if the evaluation pipeline used to score models trained on it is itself trustworthy.

Watch whether any of the major information-extraction leaderboards (ACE, ERE, or the emerging RAMS variants) adopt EVENT5Ws as a standard split within the next two conference cycles. Adoption there would confirm the dataset fills a real gap rather than duplicating existing resources.

Coverage we drew on

MADE: A Living Benchmark for Multi-Label Text Classification with Uncertainty Quantification of Medical Device Adverse Events · arXiv cs.CL

This analysis is generated by Modelwire’s editorial layer from our archive and the summary above. It is not a substitute for the original reporting. How we write it.

MentionsEVENT5Ws

Read full story at arXiv cs.CL →(arxiv.org)

Modelwire Editorial

This synthesis and analysis was prepared by the Modelwire editorial team. We use advanced language models to read, ground, and connect the day’s most significant AI developments, providing original strategic context that helps practitioners and leaders stay ahead of the frontier.

Our mission How we write

Modelwire summarizes, we don’t republish. The full content lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.