Breadcrumb
- Home
- AI Hub For Education
- Research Study Repository
- Teaching – Assessment and Feedback
Teaching – Assessment and Feedback
Research synthesis is AI-generated, human reviewed. Updated 09/2025.
Displaying 91 - 120 of 584
Problems With Large Language Models For Learner Modelling: Why Llms Alone Fall Short For Responsible Tutoring In K-12 Education
Danial Hooshyar, Yeongwook Yang, Gustav Sir, Tommi Karkkainen, Raija Hamalainen, Mutlu Cukurova, Roger Azevedo. (12/2025). arXiv. https://arxiv.org/pdf/2512.23036v1
Unveiling The Learning Mind Of Language Models: A Cognitive Framework And Empirical Study
Zhengyu Hu, Jianxun Lian, Zheyuan Xiao, Seraphina Zhang, Tianfu Wang, Nicholas Jing Yuan, Xing Xie, Hui Xiong. (12/2025). arXiv. https://arxiv.org/pdf/2506.13464v3
Teaching With Ai: A Systematic Review Of Chatbots, Generative Tools, And Tutoring Systems In Programming Education
Said Elnaffar, Farzad Rashidi, Abedallah Zaid Abualkishik. (12/2025). arXiv. https://arxiv.org/pdf/2510.03884v2
An Exploration Of Higher Education Course Evaluation By Large Language Models
Bo Yuan, Jiazi Hu. (12/2025). arXiv. https://arxiv.org/pdf/2411.02455v2
Bidirectional Human-Ai Alignment In Education For Trustworthy Learning Environments
Hua Shen, NYU Shanghai, New York University. (12/2025). arXiv. https://arxiv.org/pdf/2512.21552v1
Agenttutor: Empowering Personalized Learning With Multi-Turn Interactive Teaching In Intelligent Education Systems
Yuxin Liu, Zeqing Song, Jiong Lou, Chentao Wu, Jie Li. (12/2025). arXiv. https://arxiv.org/pdf/2601.04219v1
From Pilots To Practices: A Scoping Review Of Genai-Enabled Personalization In Computer Science Education
Iman Reihanian, Yunfei Hou and Qingquan Sun. (12/2025). arXiv. https://arxiv.org/pdf/2512.20714v1
Can Llms Estimate Student Struggles? Human-Ai Difficulty Alignment With Proficiency Simulation For Item Difficulty Prediction
Ming Li, Han Chen, Yunze Xiao, Jian Chen, Hong Jiao, Tianyi Zhou. (12/2025). arXiv. https://arxiv.org/pdf/2512.18880v1
Subjective Question Generation And Answer Evaluation Using Nlp
G. M. Refatul Islam, Safwan Shaheer, Yaseen Nur, Mohammad Rafid Hamid. (12/2025). arXiv. https://arxiv.org/pdf/2512.17289v1
Bridging Psychometric And Content Development Practices With Ai: A Community-Based Workflow For Augmenting Hawaiian Language Assessments
Pōhai Kūkea-Shultz, Frank Brockmann. (12/2025). arXiv. https://arxiv.org/pdf/2512.17140v1
Comprehensive Ai Literacy: The Case For Centering Human Agency
Sri Yash Tadimalla, Justin Cary, Gordon Hull, Jordan Register, Tina Heafner, Daniel Maxwell, David Pugalee. (12/2025). arXiv. https://arxiv.org/pdf/2512.16656v1
Snapclass: An Ai-Enhanced Classroom Management System For Block-Based Programming
Bahare Riahi, Xiaoyi Tian, Ally Limke, Viktoriia Storozhevykh, Veronica Cateté, Tiffany Barnes, Nicholas Lytle, Khushbu Singh. (12/2025). arXiv. https://arxiv.org/pdf/2512.15825v1
Beyond Tools: Generative Al As Epistemic Infrastructure In Education
Bodong Chen. (12/2025). arXiv. https://arxiv.org/pdf/2504.06928v2
How K-12 Educators Use Ai: Llm-Assisted Qualitative Analysis At Scale
Alex Liu, Lief Esbenshade, Shawon Sarkar, Zewei (Victor) Tian, Zachary Zhang, Kevin He, Min Sun. (12/2025). arXiv. https://arxiv.org/pdf/2507.17985v3
On Emerging Paradigm Of Teaching Measurement Science And Technology In Times Of Ubiquitous Use Of Ai Tools
Roman Z. Morawski. (12/2025). arXiv. https://arxiv.org/pdf/2512.13028v1
Kidsartbench: Multi-Dimensional Children's Art Evaluation With Attribute-Aware MLLMs
Mingrui Ye, Chanjin Zheng, Zengyi Yu, Chenyu Xiang, Zhixue Zhao, Zheng Yuan, Helen Yannakoudakis. (12/2025). arXiv. https://arxiv.org/pdf/2512.12503v1
From Co-Design To Metacognitive Laziness: Evaluating Generative Ai In Vocational Education
Amir Yunus, Gay Peng Rend, Lee Oon Teng. (12/2025). arXiv. https://arxiv.org/pdf/2512.12306v1
Baid: A Benchmark For Bias Assessment Of Ai Detectors
Priyam Basu, Yunfeng Zhang, Vipul Raheja. (12/2025). arXiv. https://arxiv.org/pdf/2512.11505v1
Unveiling User Perceptions In The Generative Ai Era: A Sentiment-Driven Evaluation Of Ai Educational Apps' Role In Digital Transformation Of E-Teaching
Adeleh Mazaherian, Erfan Nourbakhsh. (12/2025). arXiv. https://arxiv.org/pdf/2512.11934v1
Developing And Evaluating A Large Language Model-Based Automated Feedback System Grounded In Evidence-Centered Design For Supporting Physics Problem Solving
Holger Maus, Paul Tschisgale, Fabian Kieser, Stefan Petersen, Peter Wulff. (12/2025). arXiv. https://arxiv.org/pdf/2512.10785v1
Examining Student Interactions With A Pedagogical Ai-Assistant For Essay Writing And Their Impact On Students' Writing Quality
Wicaksono Febriantoro, Qi Zhou, Wannapon Suraworachet, Sahan Bulathwela, Andrea Gauthier, Eva Millan, Mutlu Cukurova. (12/2025). arXiv. https://arxiv.org/abs/2512.08596v1
Report On The Scoping Workshop On AI In Science Education Research
Marcus Kubsch, Marit Kastaun, Peter Wulff, Nicole Graulich, Moriah Ariely, Alexander Bergmann-Gering, Sebastian Gombert, Bor Gregorcic, Hendrik Härtig, Benedikt Heuckmann, Andrea Horbach, Christina Krist, Gerd Kortemeyer, Ben Münch, Samuel Pazicni, Joshua M. Rosenberg, Sascha Schanze, Gena Sbeglia, Vidar Skogvoll, Christophe Speroni, Christoph Thyssen, Lars-Jochen Thoms, Brandon J. Yik, Xiaoming Zhai. (12/2025). arXiv. https://arxiv.org/abs/2511.14318v3
Flora: An Advanced AI-Powered Engine To Facilitate Hybrid Human-AI Regulated Learning
Xinyu Li, Tongguang Li, Lixiang Yan, Yuheng Li, Linxuan Zhao, Mladen Rakovic, Inge Molenaar, Dragan Gasevic, Yizhou Fan. (12/2025). arXiv. https://arxiv.org/abs/2507.07362v3
Teacher-AI Collaboration For Curating And Customizing Lesson Plans In Low-Resource Schools
Deepak Varuvel Dennison, Bakhtawar Ahtisham, Kavyansh Chourasia, Nirmit Arora, Rahul Singh, Rene F. Kizilcec, Akshay Nambi, Tanuja Ganu, Aditya Vashistha. (12/2025). arXiv. https://arxiv.org/abs/2507.00456v2
Learning To Use AI For Learning: Teaching Responsible Use Of AI Chatbot To K-12 Students Through An AI Literacy Module
Ruiwei Xiao, Xinying Hou, Ying-Jui Tseng, Hsuan Nieu, Guanze Liao, John Stamper, Kenneth R. Koedinger. (12/2025). arXiv. https://arxiv.org/abs/2508.13962v2
Classifying German Language Proficiency Levels Using Large Language Models
Elias-Leander Ahlers, Witold Brunsmann, Malte Schilling. (12/2025). arXiv. https://arxiv.org/abs/2512.06483v1
Towards Responsible Development Of Generative AI For Education: An Evaluation-Driven Approach
Irina Jurenka, Markus Kunesch, Kevin R. McKee, Daniel Gillick, Shaojian Zhu, Sara Wiltberger, Shubham Milind Phal, Katherine Hermann, Daniel Kasenberg, Avishkar Bhoopchand, Ankit Anand, Miruna Pîslar, Stephanie Chan, Lisa Wang, Jennifer She, Parsa Mahmoudieh, Aliya Rysbek, Wei-Jen Ko, Andrea Huber, Brett Wiltshire, Gal Elidan, Roni Rabin, Jasmin Rubinovitz, Amit Pitaru, Mac McAllister, Julia Wilkowski, David Choi, Roee Engelberg, Lidan Hackmon, Adva Levin, Rachel Griffin, Michael Sears, Filip Bar, Mia Mesar, Mana Jabbour, Arslan Chaudhry, James Cohan, Sridhar Thiagarajan, Nir Levine, Ben Brown, Dilan Gorur, Svetlana Grant, Rachel Hashimshoni, Laura Weidinger, Jieru Hu, Dawn Chen, Kuba Dolecki, Canfer Akbulut, Maxwell Bileschi, Laura Culp, Wen-Xin Dong, Nahema Marchal, Kelsie Van Deman, Hema Bajaj Misra, Michael Duah, Moran Ambar, Avi Caciularu, Sandra Lefdal, Chris Summerfield, James An, Pierre-Alexandre Kamienny, Abhinit Mohdi, Theofilos Strinopoulous, Annie Hale, Wayne Anderson, Luis C. Cobo, Niv Efron, Muktha Ananda, Shakir Mohamed, Maureen Heymans, Zoubin Ghahramani, Yossi Matias, Ben Gomes, Lila Ibrahim. (12/2025). arXiv. https://arxiv.org/abs/2407.12687v4
Feed-O-Meter: Investigating Ai-Generated Mentee Personas As Interactive Agents For Scaffolding Design Feedback Practice
Hyunseung Lim, Dasom Choi, DaEun Choi, Sooyohn Nam, Hwajung Hong. (12/2025). arXiv. https://arxiv.org/abs/2509.07424
Advisingwise: Supporting Academic Advising In Higher Education Settings Through A Human-In-The-Loop Multi-Agent Framework
Wendan Jiang, Shiyuan Wang, Hiba Eltigani, Rukhshan Haroon, Abdullah Bin Faisal, Fahad Dogar. (12/2025). arXiv. https://arxiv.org/abs/2511.05706v2
Ai-Enabled Grading With Near-Domain Data For Scaling Feedback With Human-Level Accuracy
Shyam Agarwall, Ali Moghimi, Kevin C. Haudek. (12/2025). arXiv. https://arxiv.org/abs/2512.04113v1

