
Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners
Researchers developed a proficiency-aligned framework that adapts LLM outputs to match K-12 English learners' abilities, using China's national curriculum as a test case. The core contribution is DDPO, a policy optimization algorithm that maintains dialogue diversity while improving quality across multi-turn conversations.52




























