Breadcrumb
- Home
- Generative AI For Education Hub
- Research Study Repository
- Teaching – Assessment and Feedback
Search and Filter
Submit a research study
Contribute to the repository:
Teaching – Assessment and Feedback
Research synthesis is AI-generated, human reviewed. Updated 05/2025.
Displaying 1 - 30 of 291
Pedagogy-R1: Pedagogically-Aligned Reasoning Model with Balanced Educational Benchmark
Unggi Lee, Jaeyong Lee, Jiyeong Bae, Yeil Jeong, Junbo Koh, Gyeonggeon Lee, Gunho Lee, Taekyung Ahn, Hyeoncheol Kim. (05/2025). arXiv. http://arxiv.org/pdf/2505.18467v1
Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning
Yanhao Jia, Xinyi Wu, Qinglin Zhang, Yiran Qin, Luwei Xiao, Shuai Zhao. (05/2025). arXiv. http://arxiv.org/pdf/2505.17050v1
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring
Jiamin Su, Yibo Yan, Zhuoran Gao, Han Zhang, Xiang Liu, Xuming Hu. (05/2025). arXiv. http://arxiv.org/pdf/2505.13965v1
Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach
Jiabei Cheng, Zhen-Qun Yang, Jiannong Cao, Yu Yang, Xinzhe Zheng. (05/2025). arXiv. http://arxiv.org/pdf/2505.11119v1
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models
Peichao Lai, Kexuan Zhang, Yi Lin, Linyihan Zhang, Feiyang Ye, Jinhao Yan, Yanwei Xu, Conghui He, Yilei Wang, Wentao Zhang, Bin Cui. (05/2025). arXiv. http://arxiv.org/pdf/2505.07247v2
Human-AI Collaboration or Academic Misconduct? Measuring AI Use in Student Writing Through Stylometric Evidence
Eduardo Araujo Oliveira, Madhavi Mohoni, Sonsoles López-Pernas, Mohammed Saqr. (05/2025). arXiv. http://arxiv.org/pdf/2505.08828v1
VTutor for High-Impact Tutoring at Scale: Managing Engagement and Real-Time Multi-Screen Monitoring with P2P Connections
Eason Chen, Xinyi Tang, Aprille Xi, Chenyu Lin, Conrad Borchers, Jionghao Lin, Shivang Gupta, Kenneth R Koedinger. (05/2025). arXiv. http://arxiv.org/pdf/2505.07736v2
An End-to-End Approach for Child Reading Assessment in the Xhosa Language
Sergio Chevtchenko, Nikhil Navas, Rafaella Vale, Franco Ubaudi, Sipumelele Lucwaba, Cally Ardington, Soheil Afshar, Mark Antoniou, Saeed Afshar. (05/2025). arXiv. http://arxiv.org/pdf/2505.17371v1
From First Draft to Final Insight: A Multi-Agent Approach for Feedback Generation
Jie Cao, Chloe Qianhui Zhao, Xian Chen, Shuman Wang, Christian Schunn, Kenneth R. Koedinger, Jionghao Lin. (05/2025). arXiv. http://arxiv.org/pdf/2505.04869v1
KCluster: An LLM-based Clustering Approach to Knowledge Component Discovery
Yumou Wei, Paulo Carvalho, John Stamper. (05/2025). arXiv. http://arxiv.org/pdf/2505.06469v1
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue
Jannatun Naim, Jie Cao, Fareen Tasneem, Jennifer Jacobs, Brent Milne, Tamara Sumner, James Martin. (05/2025). arXiv. http://arxiv.org/pdf/2505.07161v1
SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions
Chloe Qianhui Zhao, Jie Cao, Eason Chen, Kenneth R. Koedinger, Jionghao Lin. (05/2025). arXiv. http://arxiv.org/pdf/2505.04584v1
Exploring LLM-Generated Feedback for Economics Essays: How Teaching Assistants Evaluate and Envision Its Use
Xinyi Lu, Aditya Mahesh, Zejia Shen, Mitchell Dudley, Larissa Sano, Xu Wang. (05/2025). arXiv. http://arxiv.org/pdf/2505.15596v1
Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval-Augmented Generation Across Learning Styles
Debdeep Sanyal, Agniva Maiti, Umakanta Maharana, Dhruv Kumar, Ankur Mali, C. Lee Giles, Murari Mandal. (05/2025). arXiv. http://arxiv.org/pdf/2505.19173v1
STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation
Aniket Deroy, Subhankar Maity. (04/2025). arXiv. https://arxiv.org/pdf/2504.05693v1
Beyond Answers: How LLMs Can Pursue Strategic Thinking in Education
Eleonora Grassucci, Gualtiero Grassucci, Aurelio Uncini, Danilo Comminiello. (04/2025). arXiv. https://arxiv.org/pdf/2504.04815v1
Beyond Tools: Generative AI as Epistemic Infrastructure in Education
Bodong Chen. (04/2025). arXiv. https://arxiv.org/pdf/2504.06928v1
Evaluating Trust in AI, Human, and Co-produced Feedback Among Undergraduate Students
Audrey Zhang, Yifei Gao, Wannapon Suraworachet, Tanya Nazaretsky, Mutlu Cukurova. (04/2025). arXiv. https://arxiv.org/pdf/2504.10961v1
LLM-based Automated Grading with Human-in-the-Loop
Hang Li, Yucheng Chu, Kaiqi Yang, Yasemin Copur-Gencturk, and Jiliang Tang. (04/2025). arXiv. https://arxiv.org/pdf/2504.05239v2
Single-Agent vs. Multi-Agent LLM Strategies for Automated Student Reflection Assessment
Gen Li, Li Chen, Cheng Tang, Valdemar Svabensky, Daisuke Deguchi, Takayoshi Yamashita, Atsushi Shimada. (04/2025). arXiv. https://arxiv.org/pdf/2504.05716v1
Inclusive Education with AI: Supporting Special Needs and Tackling Language Barriers
Ricardo Fitas. (04/2025). arXiv. https://arxiv.org/pdf/2504.14120v1
Fostering Self-Directed Growth with Generative AI: Toward a New Learning Analytics Framework
Qianrun Mao. (04/2025). arXiv. https://arxiv.org/pdf/2504.20851v1
Evolution of AI in Education: Agentic Workflows
Firuz Kamalov, David Santandreu Calonge, Linda Smail, Dilshod Azizov, Dimple R. Thadani, Theresa Kwong, Amara Atif. (04/2025). arXiv. https://arxiv.org/pdf/2504.20082v1
Multilingual Performance Biases of Large Language Models in Education
Vansh Gupta, Sankalan Pal Chowdhury, Vilem Zouhar, Donya Rooein, Mrinmaya Sachan. (04/2025). arXiv. https://arxiv.org/pdf/2504.17720v1
Al for Accessible Education: Personalized Audio-Based Learning for Blind Students
Crystal Yang, Paul Taele. (04/2025). arXiv. https://arxiv.org/pdf/2504.17117v1
EduPlanner: LLM-Based Multi-Agent Systems for Customized and Intelligent Instructional Design
Xueqiao Zhang, Chao Zhang, Jianwen Sun, Jun Xiao, Yi Yang, and Yawei Luo. (04/2025). arXiv. https://arxiv.org/pdf/2504.05370v1
Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room
Danial Hooshyar, Gustav Sir, Yeongwook Yang, Eve Kikas, Raija Hamalainen, Tommi Karkkainen, Dragan Gasevic, Roger Azevedo. (04/2025). arXiv. https://arxiv.org/pdf/2504.16148v1
Mathematical Capabilities of Large Language Models in Finnish Matriculation Examination
Mika Setala, Pieta Sikstrom, Ville Heilala, Tommi Karkkainen. (04/2025). arXiv. https://arxiv.org/pdf/2504.12347v1