Breadcrumb
- Home
- AI Hub For Education
- Research Study Repository
- Teaching – Assessment and Feedback
Teaching – Assessment and Feedback
Research synthesis is AI-generated, human reviewed. Updated 09/2025.
Displaying 331 - 360 of 584
An End-to-End Approach for Child Reading Assessment in the Xhosa Language
Sergio Chevtchenko, Nikhil Navas, Rafaella Vale, Franco Ubaudi, Sipumelele Lucwaba, Cally Ardington, Soheil Afshar, Mark Antoniou, Saeed Afshar. (06/2025). arXiv. http://arxiv.org/pdf/2505.17371v2
Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment
Mohammad Saleh Torkestani, Taha Mansouri. (05/2025). arXiv. http://arxiv.org/pdf/2506.02046v1
Encouraging Students' Responsible Use of GenAI in Software Engineering Education: A Causal Model and Two Institutional Applications
Vahid Garousi, Zafar Jafarov, Aytan Movsumova, Atif Namazov, Huseyn Mirzeyev. (05/2025). arXiv. http://arxiv.org/pdf/2506.00682v1
Evaluating Gemini in an Arena for Learning
LearnLM Team, Google. (05/2025). arXiv. http://arxiv.org/pdf/2505.24477v1
Enhancing Marker Scoring Accuracy through Ordinal Confidence Modelling in Educational Assessments
Abhirup Chakravarty, Mark Brenchley, Trevor Breakspear, Ian Lewin, Yan Huang. (05/2025). arXiv. http://arxiv.org/pdf/2505.23315v1
Distinguishing Fact from Fiction: Student Traits, Attitudes, and AI Hallucination Detection in Business School Assessment
Dr Canh Thien Dang, Dr An Nguyen. (05/2025). arXiv. http://arxiv.org/pdf/2506.00050v1
A Human-Centric Approach to Explainable AI for Personalized Education
Vinitra Swamy. (05/2025). arXiv. http://arxiv.org/pdf/2505.22541v1
RATAS: A Generative AI Framework for Explainable and Scalable Automated Answer Grading
Masoud Safilian, Amin Beheshti, Stephen Elbourn. (05/2025). arXiv. http://arxiv.org/pdf/2505.23818v1
LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners
Yu He, Zihan Yao, Chentao Song, Tianyu Qi, Jun Liu, Ming Li, Qing Huang. (05/2025). arXiv. http://arxiv.org/pdf/2505.21239v1
Evaluating Software Plagiarism Detection in the Age of AI Automated Obfuscation and Lessons for Academic Integrity
Timur SaŸlam, Larissa Schmid. (05/2025). arXiv. http://arxiv.org/pdf/2505.20158v1
Automated evaluation of children's speech fluency for low-resource languages
Bowen Zhang, Nur Afiqah Abdul Latiff, Justin Kan, Rong Tong, Donny Soh, Xiaoxiao Miao, Ian McLoughlin. (05/2025). arXiv. http://arxiv.org/pdf/2505.19671v1
Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval-Augmented Generation Across Learning Styles
Debdeep Sanyal, Agniva Maiti, Umakanta Maharana, Dhruv Kumar, Ankur Mali, C. Lee Giles, Murari Mandal. (05/2025). arXiv. http://arxiv.org/pdf/2505.19173v1
A qualitative systematic review on Al empowered self-regulated learning in higher education
Min Lan, Xiaofeng Zhou. (05/2025). npj Science of Learning. https://www.nature.com/articles/s41539-025-00319-0
Human-AI Collaboration or Academic Misconduct? Measuring AI Use in Student Writing Through Stylometric Evidence
Eduardo Araujo Oliveira, Madhavi Mohoni, Sonsoles L¬ópez-Pernas, Mohammed Saqr. (05/2025). arXiv. http://arxiv.org/pdf/2505.08828v1
Exploring LLM-Generated Feedback for Economics Essays: How Teaching Assistants Evaluate and Envision Its Use
Xinyi Lu, Aditya Mahesh, Zejia Shen, Mitchell Dudley, Larissa Sano, Xu Wang. (05/2025). arXiv. http://arxiv.org/pdf/2505.15596v1
MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors
Baraa Hikal, Mohamed Basem, Islam Oshallah, Ali Hamdi. (05/2025). arXiv. http://arxiv.org/pdf/2505.18549v1
SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions
Chloe Qianhui Zhao, Jie Cao, Eason Chen, Kenneth R. Koedinger, Jionghao Lin. (05/2025). arXiv. http://arxiv.org/pdf/2505.04584v1
From First Draft to Final Insight: A Multi-Agent Approach for Feedback Generation
Jie Cao, Chloe Qianhui Zhao, Xian Chen, Shuman Wang, Christian Schunn, Kenneth R. Koedinger, Jionghao Lin. (05/2025). arXiv. http://arxiv.org/pdf/2505.04869v1
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models
Peichao Lai, Kexuan Zhang, Yi Lin, Linyihan Zhang, Feiyang Ye, Jinhao Yan, Yanwei Xu, Conghui He, Yilei Wang, Wentao Zhang, Bin Cui. (05/2025). arXiv. http://arxiv.org/pdf/2505.07247v2
From Recall to Reasoning: Automated Question Generation for Deeper Math Learning through Large Language Models
Yongan Yu, Alexandre Krantz, Nikki G. Lobczowski. (05/2025). arXiv. http://arxiv.org/pdf/2505.11899v1
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring
Jiamin Su, Yibo Yan, Zhuoran Gao, Han Zhang, Xiang Liu, Xuming Hu. (05/2025). arXiv. http://arxiv.org/pdf/2505.13965v1
Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning
Yanhao Jia, Xinyi Wu, Qinglin Zhang, Yiran Qin, Luwei Xiao, Shuai Zhao. (05/2025). arXiv. http://arxiv.org/pdf/2505.17050v1
Pedagogy-R1: Pedagogically-Aligned Reasoning Model with Balanced Educational Benchmark
Unggi Lee, Jaeyong Lee, Jiyeong Bae, Yeil Jeong, Junbo Koh, Gyeonggeon Lee, Gunho Lee, Taekyung Ahn, Hyeoncheol Kim. (05/2025). arXiv. http://arxiv.org/pdf/2505.18467v1
When the prompting stops: exploring teachers' work around the educational frailties of generative AI tools
Neil Selwyn, Marita Ljungqvist, Anders Sonesson. (04/2025). Learning, Media and Technology. https://www.tandfonline.com/doi/full/10.1080/17439884.2025.2537959#abstract
Assessing AI-Generated Questions' Alignment with Cognitive Frameworks in Educational Assessment
Antoun Yaacoub, Jerome Da-Rugna, Zainab Assaghir. (04/2025). arXiv. https://arxiv.org/pdf/2504.14232v1
Inclusive Education with AI: Supporting Special Needs and Tackling Language Barriers
Ricardo Fitas. (04/2025). arXiv. https://arxiv.org/pdf/2504.14120v1
Beyond Tools: Generative AI as Epistemic Infrastructure in Education
Bodong Chen. (04/2025). arXiv. https://arxiv.org/pdf/2504.06928v1
Single-Agent vs. Multi-Agent LLM Strategies for Automated Student Reflection Assessment
Gen Li, Li Chen, Cheng Tang, Valdemar Svabensky, Daisuke Deguchi, Takayoshi Yamashita, Atsushi Shimada. (04/2025). arXiv. https://arxiv.org/pdf/2504.05716v1
STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation
Aniket Deroy, Subhankar Maity. (04/2025). arXiv. https://arxiv.org/pdf/2504.05693v1

