Breadcrumb
- Home
- Generative AI For Education Hub
- Research Study Repository
- Teaching – Assessment and Feedback
Search and Filter
Submit a research study
Contribute to the repository:
Teaching – Assessment and Feedback
Research synthesis is AI-generated, human reviewed. Updated 09/2025.
Displaying 271 - 300 of 476
Use Me Wisely: AI-Driven Assessment for LLM Prompting Skills Development
Dimitri Ognibene, Gregor Donabauer, Emily Theophilou, Cansu Koyuturk, Mona Yavari, Sathya Bursic, Alessia Telari, Alessia Testa, Raffaele Boiano, Davide Taibi, Davinia Hernandez-Leo, Udo Kruschwitz and Martin Ruskov. (03/2025). arXiv. https://arxiv.org/pdf/2503.02532
AIGCodeSet: A New Annotated Dataset for AI Generated Code Detection
Basak Demirok, Mucahid Kutlu. (03/2025). arXiv. https://arxiv.org/pdf/2412.16594
LLMs as Educational Analysts: Transforming Multimodal Data Traces into Actionable Reading Assessment Reports
Eduardo Davalos, Yike Zhang, Namrata Srivastaval, Jorge Alberto Salas, Sara McFadden, Sun-Joo Cho, Gautam Biswas, Amanda Goodwin. (03/2025). arXiv. https://arxiv.org/pdf/2503.02099
"Don't Forget the Teachers": Towards an Educator-Centered Understanding of Harms from Large Language Models in Education
Emma Harvey, Allison Koenecke, RenŽ F. Kizilcec. (02/2025). arXiv. https://arxiv.org/pdf/2502.14592v1
Exploring the Potential of Large Language Models for Estimating the Reading Comprehension Question Difficulty*
Yoshee Jain, John Hollander, Amber He, Sunny Tang, Liang Zhang, John Sabatini. (02/2025). arXiv. https://arxiv.org/pdf/2502.17785
Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
Kaushal Kumar Maurya, KV Aditya Srivatsa, Kseniia Petukhova and Ekaterina Kochmar. (02/2025). arXiv. https://arxiv.org/pdf/2412.09416
Autograding Mathematical Induction Proofs with Natural Language Processing
Chenyan Zhao, Mariana Silva, Seth Poulsen. (02/2025). arXiv. http://arxiv.org/pdf/2406.10268v2
Co-designing Large Language Model Tools for Project-Based Learning with K-12 Educators
Prerna Ravi, John Masla, Gisella Kakoti, Grace C. Lin, Emma Anderson, Matt Taylor, Anastasia K. Ostrowski, Cynthia Breazeal, Eric Klopfer, Hal Abelson. (02/2025). arXiv. https://arxiv.org/pdf/2502.09799
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models
Jiamin Su, Yibo Yan, Fangteng Fu, Han Zhang, Jingheng Ye, Xiang Liu, Jiahao Huo, Huiyu Zhou, Xuming Hu. (02/2025). arXiv. https://arxiv.org/pdf/2502.11916
Event Segmentation Applications In Large Language Model Enabled Automated Recall Assessments
Ryan A. Panela, Alexander J. Barnett, Morgan D. Barense, Bjornn Herrmann. (02/2025). arXiv. https://arxiv.org/pdf/2502.13349
From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education
Yi-Fan Zhang, Hang Li, Dingjie Song, Lichao Sun, Tianlong Xu, Qingsong Wen. (02/2025). arXiv. https://arxiv.org/pdf/2502.13789
Improve LLM-based Automatic Essay Scoring with Linguistic Features
Zhaoyi Joey Hou, Alejandro Ciuba, Xiang Lorraine Li. (02/2025). arXiv. https://arxiv.org/pdf/2502.09497
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Xuansheng Wu, Padmaja Pravin Saraf, Gyeonggeon Lee, Ehsan Latif, Ninghao Liu, Xiaoming Zhai. (02/2025). arXiv. http://arxiv.org/pdf/2407.18328v2
Towards Adaptive Feedback with AI: Comparing the Feedback Quality of LLMs and Teachers on Experimentation Protocols
Kathrin Sessler, Arne Bewersdorff, Claudia Nerdel, Enkelejda Kasneci. (02/2025). arXiv. https://arxiv.org/pdf/2502.12842
Position: LLMs Can be Good Tutors in Foreign Language Education
Jingheng Ye, Shen Wang, Deqing Zhou, Yibo Yan, Kun Wang, Hai-Tao Zheng, Zenglin Xu, Irwin King, Philip S. Yu, Qingsong Wen. (02/2025). arXiv. https://arxiv.org/pdf/2502.05467
The Responsible Development of Automated Student Feedback with Generative AI
Euan D Lindsay, Mike Zhang, Aditya Johri, Johannes Bjerva. (02/2025). arXiv. http://arxiv.org/pdf/2308.15334v3
SocratiQ: A Generative AI-Powered Learning Companion for Personalized Education and Broader Accessibility
Jason Jabbour, Kai Kleinbard, Olivia Miller, Robert Haussman, Vijay Janapa Reddi. (02/2025). arXiv. https://arxiv.org/pdf/2502.00341
The Advancement of Personalized Learning Potentially Accelerated by Generative AI
Yuang Wei, Yuan-Hao Jiang, Jiayi Liu, Changyong Qi, Linzhao Jia, Rui Jia. (02/2025). arXiv. http://arxiv.org/pdf/2412.00691v2
The Imitation Game for Educational AI
Shashank Sonkar, Naiming Liu, Xinghe Chen, Richard G. Baraniuk. (02/2025). arXiv. https://arxiv.org/pdf/2502.15127
iLLuMinaTE: An LLM-XAI Framework Leveraging Social Science Explanation Theories Towards Actionable Student Performance Feedback
Vinitra Swamy, Davide Romano, Bhargav Srinivasa Desikan, Oana-Maria Camburu, Tanja Kaser. (01/2025). arXiv. http://arxiv.org/pdf/2409.08027v2
Education in the Era of Generative Artificial Intelligence (AI): Understanding the Potential Benefits of ChatGPT in Promoting Teaching and Learning
David Baidoo-Anu, Leticia Owusu Ansah. (01/2025). SSRN. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4337484
A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education
Ziqing Li, Mutlu Cukurova, Sahan Bulathwela. (01/2025). arXiv. http://arxiv.org/pdf/2501.05220v1
A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT*
Yizhou Zhou, Mengqiao Zhang, Yuan-Hao Jiang, Xinyu Gao, Naijie Liu, Bo Jiang. (01/2025). arXiv. http://arxiv.org/pdf/2501.06819v1
A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education
Calvin Yeung, Jeff Yu, King Chau Cheung, Tat Wing Wong, Chun Man Chan, Kin Chi Wong, Keisuke Fujii. (01/2025). arXiv. https://arxiv.org/pdf/2501.14305
Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness
Audrey Salmon, Katie Hammer, Eddie Antonio Santos, Brett A. Becker. (01/2025). arXiv. http://arxiv.org/pdf/2501.05706v1
Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models
Qiming Bao, Juho Leinonen, Alex Yuxuan Peng, Wanjun Zhong, Ga‘l Gendron, Tim Pistotti, Alice Huang, Paul Denny, Michael Witbrock, Jiamou Liu. (01/2025). arXiv. http://arxiv.org/pdf/2309.10444v5
Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese
Jie Yang, Ehsan Latif, Yuze He, Xiaoming Zhai. (01/2025). arXiv. http://arxiv.org/pdf/2501.06704v1
Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning
Xiangen Hu, Sheng Xu, Richard Tong, & Art Graesser. (01/2025). arXiv. http://arxiv.org/pdf/2501.06682v1
ChatGPT In Lesson Preparation: A Teacher Choices Trial Evaluation Report
Palak Roy, Helen Poet, Ruth Staunton, Katherine Aston, David Thomas. (12/2024). Education Endowment Foundation. https://d2tic4wvo1iusb.cloudfront.net/production/documents/projects/chatgpt_in_…
You're (Not) My Type - Can LLMs Generate Feedback of Specific Types for Introductory Programming Tasks?
Dominic Lohr, Hieke Keuning, Natalie Kiesler. (12/2024). arXiv. http://arxiv.org/pdf/2412.03516v1
