
Beyond Rating: A Comprehensive Evaluation and Benchmark for AI Reviews
Researchers introduce Beyond Rating, a framework that evaluates AI-generated peer reviews on five dimensions beyond numeric scores—including argumentative quality and question constructiveness. The work includes a curated dataset and Max-Recall strategy to handle expert disagreement, shifting focus from rating prediction to the substance of textual critique.58



























