Publisher: arXiv
The rapid adoption of AI-powered coding assistants like ChatGPT and other
coding copilots is transforming programming education, raising questions about
assessment practices, academic integrity, and skill development. As educators
seek alternatives to traditional grading methods susceptible to AI-enabled
plagiarism, structured peer assessment has emerged as a promising strategy.
This paper presents an empirical study of a rubric-based, anonymized peer
review process implemented in a large introductory programming course.
Students evaluated each other's final projects (a 2D game), and their
assessments were compared with instructor grades using correlation, mean
absolute error (MAE), and root mean square error (RMSE). Additionally,
reflective surveys from 47 teams captured student perceptions of fairness,
grading behavior, and preferences regarding grade aggregation. Results show
that peer review can approximate instructor evaluation with moderate accuracy
and foster student engagement, evaluative thinking, and interest in providing
good feedback to peers. We discuss the implications of these findings for
designing scalable, trustworthy peer assessment systems in the age of
AI-assisted coding.
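The agreement metrics named above (correlation, MAE, RMSE) can be computed
directly from per-team score lists. The sketch below is illustrative only: the
function name, inputs, and sample numbers are assumptions, not taken from the
paper's data.

```python
import numpy as np
from scipy import stats

def compare_to_instructor(peer_scores, instructor_scores):
    """Compare aggregated peer scores to instructor scores for the same teams."""
    peer = np.asarray(peer_scores, dtype=float)
    instr = np.asarray(instructor_scores, dtype=float)

    r, p_value = stats.pearsonr(peer, instr)        # Pearson correlation
    mae = np.mean(np.abs(peer - instr))             # mean absolute error
    rmse = np.sqrt(np.mean((peer - instr) ** 2))    # root mean square error
    return {"pearson_r": r, "p_value": p_value, "mae": mae, "rmse": rmse}

# Hypothetical example: peer vs. instructor grades for four teams
print(compare_to_instructor([85, 78, 92, 66], [88, 75, 90, 70]))
```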
