Breadcrumb
- Home
- Generative AI For Education Hub
- Research Study Repository
- Outcomes – Numeracy
Search and Filter
Submit a research study
Contribute to the repository:
Outcomes – Numeracy
Research synthesis is AI-generated, human reviewed. Updated 09/2025.
Displaying 1 - 30 of 180
Large Language Models For Education And Research: An Empirical And User Survey-Based Analysis
Md Mostafizer Rahman, Ariful Islam Shiplu, Md Faizul Ibne Amin, Yutaka Watanobe, Lu Peng. (12/2025). arXiv. https://arxiv.org/abs/2512.08057v1
Aitutor-Evalkit: Exploring The Capabilities Of Ai Tutors
Numaan Naeem, Kaushal Kumar Maurya, Kseniia Petukhova, Ekaterina Kochmar. (12/2025). arXiv. https://arxiv.org/abs/2512.03688v1
Ezyer: A Simulacrum Of High School With Generative Agent
Jinming Yang, Zimu Ji, Weiqi Luo, Gaoxi Wang, Bin Ma, Yueling Deng. (12/2025). arXiv. https://arxiv.org/abs/2512.02561v1
Confident Rag: Enhancing The Performance Of Llms For Mathematics Question Answering Through Multi-Embedding And Confidence Scoring
Shiting Chen, Zijian Zhao and Jinsong Chen. (11/2025). arXiv. https://arxiv.org/abs/2507.17442v3
Edueval: A Hierarchical Cognitive Benchmark For Evaluating Large Language Models In Chinese Education
Guoqing Ma, Jia Zhu, Hanghui Guo, Weijie Shi, Yue Cui, Jiawei Shen, Zilong Li, Yidan Liang. (11/2025). arXiv. https://arxiv.org/abs/2512.00290v1
Exploring Student Interactions With Ai-Powered Learning Tools: A Qualitative Study Connecting Interaction Patterns To Educational Learning Theories
Prathamesh Muzumdar, Sumanth Cheemalapati. (11/2025). arXiv. https://arxiv.org/abs/2512.00519v1
Training For Obsolescence? The Ai-Driven Education Trap
Andrew J. Peterson. (11/2025). arXiv. https://arxiv.org/abs/2508.19625v2
Magma-Edu: Multi-Agent Generative Multimodal Framework For Text-Diagram Educational Question Generation
Zhenyu Wu, Jian Li, Hua Huang. (11/2025). arXiv. https://arxiv.org/abs/2511.18714v1
Exploring Families' Use And Mediation Of Generative Ai: A Multi-User Perspective
Shirley Zhang, Dakota Sullivan, Jennica Li, Bengisu Cagiltay, Bilge Mutlu, Heather Kirkorian, Kassem Fawaz. (11/2025). arXiv. https://arxiv.org/abs/2504.09004v3
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Rose E. Wang, Ana T. Ribeiro, Carly D. Robinson, Susanna Loeb, Dora Demszky. (11/2025). EdWorkingPapers. https://edworkingpapers.com/ai24-1054
Llms4All: A Review Of Large Language Models Across Academic Disciplines
Yanfang (Fanny) Ye, Zheyuan Zhang, Tianyi Ma, Zehong Wang, Yiyang Li, Shifu Hou, Weixiang Sun, Kaiwen Shi, Yijun Ma, Wei Song, Ahmed Abbasi, Ying Cheng, Jane Cleland-Huang, Steven Corcelli, Robert Goulding, Ming Hu, Ting Hua, John Lalor, Fang Liu, Tengfei Luo, Edward Maginn, Nuno Moniz, Jason Rohr, Brett Savoie, Daniel Slate, Matthew Webber, Olaf Wiest, Johnny Zhang, Nitesh V Chawla. (11/2025). arXiv. https://arxiv.org/abs/2509.19580v5
Simulated Human Learning In A Dynamic, Partially-Observed, Time-Series Environment
Jeffrey Jiang, Kevin Hong, Gregory Pottie, Emily Kuczynski. (11/2025). arXiv. https://arxiv.org/abs/2511.15032v1
Explain With Visual Keypoints Like A Real Mentor! A Benchmark For Multimodal Solution Explanation
Jaewoo Park, Jungyang Park, Dongju Jang, Jiwan Chung, Byungwoo Yoo, Jaewoo Shin, Seonjoon Park, Taehyeong Kim, Youngjae Yu. (11/2025). arXiv. https://arxiv.org/abs/2504.03197v4
Autosynth: Automated Workflow Optimization For High-Quality Synthetic Dataset Generation Via Monte Carlo Tree Search
Shuzhen Bi, Chang Song, Siyu Song, Jinze Lv, Jian Chen, Xinyun Wang, Aimin Zhou, Hao Hao. (11/2025). arXiv. https://arxiv.org/abs/2511.09488v1
Uco: A Multi-Turn Interactive Reinforcement Learning Method For Adaptive Teaching With Large Language Models
Shouang Wei, Min Zhang, Xin Lin, Bo Jiang, Kun Kuang, Zhongxiang Dai. (11/2025). arXiv. https://arxiv.org/abs/2511.08873v1
A Matter Of Interest: Understanding Interestingness Of Math Problems In Humans And Language Models
Shubhra Mishra, Yuka Machino, Gabriel Poesia, Albert Jiang, Joy Hsu, Adrian Weller, Challenger Mishra, David Broman, Joshua B. Tenenbaum, Mateja Jamnik, Cedegao E. Zhang, Katherine M. Collins. (11/2025). arXiv. https://arxiv.org/abs/2511.08548v1
Diagramir: An Automatic Pipeline For Educational Math Diagram Evaluation
Vishal Kumar, Shubhra Mishra, Rebecca Hao, Rizwaan Malik, David Broman, Dorottya Demszky. (11/2025). arXiv. https://arxiv.org/abs/2511.08283v1
Ai Tutoring Can Safely And Effectively Support Students: An Exploratory Rct In Uk Classrooms
LearnLM Team, Google & Eedi. (11/2025). Google. https://storage.googleapis.com/deepmind-media/LearnLM/learnLM_nov25.pdf
Computational Blueprints: Generating Isomorphic Mathematics Problems With Large Language Models
Jeong-Hoon Kim, Jinwoo Nam, Geunsik Jo. (11/2025). arXiv. https://arxiv.org/abs/2511.07932v1
Eduagentqg: A Multi-Agent Workflow Framework For Personalized Question Generation
Rui Jia, Min Zhang, Fengrui Liu, Bo Jiang, Kun Kuang, Zhongxiang Dai. (11/2025). arXiv. https://arxiv.org/abs/2511.11635v1
Artificial Intelligence In Elementary Stem Education: A Systematic Review Of Current Applications And Future Challenges
Majid Memari, Krista Ruggles. (11/2025). arXiv. https://arxiv.org/abs/2511.00105v2
Extracting Causal Relations In Deep Knowledge Tracing
Kevin Hong, Kia Karbasi, Gregory Pottie. (11/2025). arXiv. https://arxiv.org/abs/2511.03948v1
Physicseval: Inference-Time Techniques To Improve The Reasoning Proficiency Of Large Language Models On Physics Problems
Oshayer Siddique, J. M Areeb Uzair Alam, Md Jobayer Rahman Rafy, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan. (11/2025). arXiv. https://arxiv.org/abs/2508.00079v2
Next Token Knowledge Tracing: Exploiting Pretrained Llm Representations To Decode Student Behaviour.
Max Norris, Kobi Gal and Sahan Bulathwela. (11/2025). arXiv. https://arxiv.org/abs/2511.02599v1
From Superficial Outputs To Superficial Learning: Risks Of Large Language Models In Education
IRIS DELIKOURA, YI R. (MAY) FUNG, PAN HUI. (11/2025). arXiv. https://arxiv.org/abs/2509.21972v2
Improving Human Verification Of Llm Reasoning Through Interactive Explanation Interfaces
Runtao Zhou, Anh Totti Nguyen, Giang Nguyen, Nikita Kharya, Chirag Agarwal. (10/2025). arXiv. https://arxiv.org/abs/2510.22922v2
Measuring Teaching With Llms
Michael Hardy. (10/2025). arXiv. https://arxiv.org/abs/2510.22968v1
"Learning Together": Ai-Mediated Support For Parental Involvement In Everyday Learning
Yao Li, Jingyi Xie, Ya-Fang Lin, He Zhang, Ge Wang, Gaojian Huang, Rui Yu, Si Chen. (10/2025). arXiv. https://arxiv.org/abs/2510.20123v2
Pedagogy-Driven Evaluation Of Generative Ai-Powered Intelligent Tutoring Systems
Kaushal Kumar Maurya and Ekaterina Kochmar. (10/2025). arXiv. https://arxiv.org/abs/2510.22581v1
Relief Or Displacement? How Teachers Are Negotiating Generative Al'S Role In Their Professional Practice
Aayushi Dangol, Smriti Kotiyal, Robert Wolfe, Alex J. Bowers, Antonio Vigil, Jason Yip, Julie A. Kientz, Suleman Shahid, Tom Yeh, Vincent Cho, Katie Davis. (10/2025). arXiv. http://arxiv.org/abs/2510.18296v1
