Breadcrumb
- Home
- Generative AI For Education Hub
- Research Study Repository
- Teaching – Assessment and Feedback
Search and Filter
Submit a research study
Contribute to the repository:
Teaching – Assessment and Feedback
Research synthesis is AI-generated, human reviewed. Updated 09/2025.
Displaying 331 - 360 of 476
Automated Feedback in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses
Sami Baral, Eamon Worden, Wen-Chiang Lim, Zhuang Luo, Christopher Santorelli, Ashish Gurung, Neil Heffernan. (10/2024). arXiv. http://arxiv.org/pdf/2411.08910v1
Beyond Scores: A Modular Rag-Based System For Automatic Short Answer Scoring with Feedback
Menna Fateen, Bo Wang, Tsunenori Mine. (10/2024). arXiv. http://arxiv.org/pdf/2409.20042v2
MalAlgoQA: Pedagogical Evaluation of Counterfactual Reasoning in Large Language Models and Implications for AI in Education
Naiming Liu, MyCo Le, Shashank Sonkar, Richard G. Baraniuk. (10/2024). arXiv. https://arxiv.org/pdf/2407.00938
Hey AI Can You Grade My Essay?: Automatic Essay Grading
Maisha Maliha, Vishal Pramanik. (10/2024). arXiv. https://arxiv.org/pdf/2410.09319
AI Governance in Higher Education: Case Studies of Guidance at Big Ten Universities
Chuhao Wu, He Zhang, John M. Carroll. (09/2024). arXiv. https://arxiv.org/pdf/2409.02017
Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information
Hyeongdon Moon, Richard Davis, Seyed Parsa Neshaei, Pierre Dillenbourg. (09/2024). arXiv. http://arxiv.org/pdf/2409.20167v1
Are Large Language Models Good Essay Graders?
Kundu, Anindita, Barbosa, Denilson. (09/2024). arXiv. http://arxiv.org/pdf/2409.13120v1
A Comprehensive Review on Generative AI for Education
Uday Mittal, Siva Sai, Vinay Chamola, Devika Sangwan. (09/2024). IEEE. https://ieeexplore.ieee.org/document/10695056
How to Align Large Language Models for Teaching English? Designing and Developing LLM based-Chatbot for Teaching English Conversation in EFL, Findings and Limitations
Jaekwon Park, Jiyoung Bae, Unggi Lee, Taekyung Ahn, Sookbun Lee, Dohee Kim, Aram Choi, Yeil Jeong, Jewoong Moon, Hyeoncheol Kim. (09/2024). arXiv. http://arxiv.org/pdf/2409.04987v1
Taking the Next Step with Generative Artificial Intelligence: The Transformative Role of Multimodal Large Language Models in Science Education
Arne Bewersdorff, Christian Hartmann, Marie Hornberger, Kathrin Sessler, Maria Bannert, Enkelejda Kasneci, Gjergji Kasneci, Xiaoming Zhai, Claudia Nerdel. (09/2024). arXiv. http://arxiv.org/pdf/2401.00832v3
Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy
Owen Henkel, Hannah Horne-Robinson, Maria Dyshel, Nabil Ch, Baptiste Moreau-Pernet, Ralph Abood. (09/2024). arXiv. http://arxiv.org/pdf/2409.17904v1
"My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays
Shengxin Hong, Chang Cai, Sixuan Du, Haiyue Feng, Siyuan Liu, Xiuyi Fan. (09/2024). arXiv. http://arxiv.org/pdf/2409.07453v1
StuGPTViz: A Visual Analytics Approach to Understand Student-ChatGPT Interactions
Zixin Chen, Jiachen Wang, Meng Xia, Kento Shigyo, Dingdong Liu, Rong Zhang, Huamin Qu. (09/2024). arXiv. https://arxiv.org/pdf/2407.12423
The Feedback Prize: A Case Study in Assisted Writing Feedback Tools
Perpetual Baffour, Scott Crossley, Yu Tian, Alex Franklin, Natalie Rambis, Meg Benner, Ulrich Boser. (08/2024). The Learning Agency Lab. https://the-learning-agency-lab.com/wp-content/uploads/2023/08/TLA-Lab_Whitepap…
Facilitating Holistic Evaluations with LLMs Insights from Scenario-Based Experiments
Toru Ishida, Tongxi Liu, Hailong Wang & William K. Cheung. (08/2024). arXiv. http://arxiv.org/pdf/2405.17728v2
Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation
Tung Phung, Victor-Alexandru Padurean, Anjali Singh, Christopher Brooks, Jose Cambronero, Sumit Gulwani, Adish Singla, Gustavo Soares. (08/2024). arXiv. http://arxiv.org/pdf/2310.03780v4
Generative Language Models With Retrieval Augmented Generation for Automated Short Answer Scoring
Zifan Wang, Christopher Ormerod. (08/2024). arXiv. http://arxiv.org/pdf/2408.03811v1
How do students use ChatGPT as a writing support?
Sarah Levine, Sarah W. Beck, Chris Mah, Lena Phalen, Jaylen Pittman. (07/2024). International Literacy Association. https://ila.onlinelibrary.wiley.com/doi/full/10.1002/jaal.1373
Building a Domain-specific Guardrail Model in Production
Mohammad Niknazar, Paul V Haley, Latha Ramanan, Sang T. Truong, Yedendra Shrinivasan, Ayan Kumar Bhowmick, Prasenjit Dey, Ashish Jagmohan, Hema Maheshwari, Shom Ponoth, Robert Smith, Aditya Vempaty, Nick Haber, Sanmi Koyejo, Sharad Sundararajan. (07/2024). arXiv. http://arxiv.org/pdf/2408.01452v1
Generative AI in Higher Education: Seeing ChatGPT Through Universities' Policies, Resources, and Guidelines
Hui Wang, Anh Dang, Zihao Wu, Son Mac. (07/2024). arXiv. https://arxiv.org/pdf/2312.05235
To accept or not to accept? An IRT-TOE Framework to Understand Educators' Resistance to Generative AI in Higher Education
Jan-Erik Kalmus, Anastasija Nikiforova. (07/2024). arXiv. https://arxiv.org/pdf/2407.20130
Bringing Generative AI to Adaptive Learning in Education
Hang Li, Tianlong Xu, Chaoli Zhang, Eason Chen, Jing Liang, Xing Fan, Haoyang Li, Jiliang Tang, Qingsong Wen. (06/2024). arXiv. https://arxiv.org/pdf/2402.14601
How Effective is GPT-4 Turbo in Generating School-Level Questions from Textbooks Based on Bloom's Revised Taxonomy?
Subhankar Maity, Aniket Deroy, Sudeshna Sarkar. (06/2024). arXiv. http://arxiv.org/pdf/2406.15211v1
Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs
Changrong Xiao, Wenxing Ma, Qingping Song, Sean Xin Xu, Kunpeng Zhang, Yufang Wang, Qi Fu. (06/2024). arXiv. https://arxiv.org/pdf/2401.06431
Knowledge Distillation of LLMs for Automatic Scoring of Science Assessments
Ehsan Latif, Luyang Fang, Ping Ma, Xiaoming Zhai. (06/2024). arXiv. http://arxiv.org/pdf/2312.15842v3
Generative AI for Enhancing Active Learning in Education: A Comparative Study of GPT-3.5 and GPT-4 in Crafting Customized Test Questions
Hamidreza Rouzegar, Masoud Makrehchit. (06/2024). arXiv. https://arxiv.org/pdf/2406.13903
Generative Al and Digital Neocolonialism in Global Education: Towards an Equitable Framework
Matthew Nyaaba, Alyson Wright, Gyu Lim Choi. (06/2024). arXiv. http://arxiv.org/pdf/2406.02966v3
Responsible Adoption of Generative AI in Higher Education: Developing a “Points to Consider” Approach Based on Faculty Perspectives
Ravit Dotan, Lisa S. Parker, John G. Radzilowicz. (06/2024). arXiv. https://arxiv.org/pdf/2406.01930
AI AGENTS AND EDUCATION: SIMULATED PRACTICE AT SCALE
Ethan Mollick, Lilach Mollick, Natalie Bach, LJ Ciccarelli, Ben Przystanski, Daniel Ravipinto. (06/2024). arXiv. https://arxiv.org/pdf/2407.12796
