iRuler: Intelligible Rubric-Based User-Defined Llm Evaluation For Revision

Authors

Jingwen Bai,

Wei Soon Cheong,

Philippe Muller,

Brian Y Lim

Date

02/2026

Publisher

arXiv

Link

https://arxiv.org/pdf/2602.12779v1

Large Language Models (LLMs) have become indispensable for evaluating writing. However, text feedback they provide is often unintelligible, generic, and not specific to user criteria. Inspired by structured rubrics in education and intelligible AI explanations, we propose iRULER following identified design guidelines to \textit{scaffold} the review process by \textit{specific} criteria, providing \textit{justification} for score selection, and offering \textit{actionable} revisions to target different quality levels. To \textit{qualify} user-defined criteria, we recursively used iRULER with a rubric-of-rubrics to iteratively \textit{refine} rubrics. In controlled experiments on writing revision and rubric creation, iRULER most improved validated LLM-judged review scores and was perceived as most helpful and aligned compared to read-only rubric and text-based LLM feedback. Qualitative findings further support how iRULER satisfies the design guidelines for user-defined feedback. This work contributes interactive rubric tools for intelligible LLM-based review and revision of writing, and user-defined rubric creation.

What is the application?

Teaching – Assessment and Feedback

Who is the user?

Student,

Who age?

Why use AI?

Outcomes – Durable Skills

Study design