Breadcrumb
- Home
- AI Hub For Education
- Research Study Repository
- Outcomes – Numeracy
Outcomes – Numeracy
Research synthesis is AI-generated, human reviewed. Updated 05/2026.
Displaying 121 - 150 of 224
Simulating LLM-to-LLM Tutoring for Multilingual Math Feedback
Junior Cedric Tonga, KV Aditya Srivatsa, Kaushal Kumar Maurya, Fajri Koto, Ekaterina Kochmar. (06/2025). arXiv. http://arxiv.org/pdf/2506.04920v1
Evaluating Vision - Language and Large Language Models for Automated Student Assessment in Indonesian Classrooms
Nurul Aisyah, Muhammad Dehan Al Kautsar, Arif Hidayat, Raqib Chowdhury, Fajri Koto. (06/2025). arXiv. http://arxiv.org/pdf/2506.04822v1
Generating Pedagogically Meaningful Visuals for Math Word Problems: A New Benchmark and Analysis of Text-to-Image Models
Junling Wang, Anna Rutkiewicz, April Yi Wang, Mrinmaya Sachan. (06/2025). arXiv. http://arxiv.org/pdf/2506.03735v1
TestAgent: An Adaptive and Intelligent Expert for Human Assessment
Junhao Yu, Yan Zhuang, YuXuan Sun, Weibo Gao, Qi Liu, Mingyue Cheng, Zhenya Huang, Enhong Chen. (06/2025). arXiv. http://arxiv.org/pdf/2506.03032v1
Towards Generating Controllable and Solvable Geometry Problem by Leveraging Symbolic Deduction Engine
Zhuoxuan Jiang, Tianyang Zhang, Peiyan Peng, Jing Chen, Yinong Xun, Haotian Zhang, Lichi Li, Yong Li, Shaohua Zhang. (06/2025). arXiv. http://arxiv.org/pdf/2506.02565v1
BD at BEA 2025 Shared Task: MPNet Ensembles for Pedagogical Mistake Identification and Localization in AI Tutor Responses
Shadman Rohan, Ishita Sur Apan, Muhtasim Ibteda Shochcho, Md Fahim, Mohammad Ashfaq Ur Rahman, AKM Mahbubur Rahman, Amin Ahsan Ali. (06/2025). arXiv. http://arxiv.org/pdf/2506.01817v1
Evaluating Gemini in an Arena for Learning
LearnLM Team, Google. (05/2025). arXiv. http://arxiv.org/pdf/2505.24477v1
A Structured Unplugged Approach for Foundational AI Literacy in Primary Education
Maria Cristina Carrisi, Mirko Marras, Sara Vergallo. (05/2025). arXiv. http://arxiv.org/pdf/2505.21398v1
LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners
Yu He, Zihan Yao, Chentao Song, Tianyu Qi, Jun Liu, Ming Li, Qing Huang. (05/2025). arXiv. http://arxiv.org/pdf/2505.21239v1
From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Reasoning-Driven Pedagogical Visualization
Haonian Ji, Shi Qiu, Siyang Xin, Siwei Han, Zhaorun Chen, Dake Zhang, Hongyi Wang, Huaxiu Yao. (05/2025). arXiv. http://arxiv.org/pdf/2505.16832v2
From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization
Haonian Ji, Shi Qiu, Siyang Xin, Siwei Han, Zhaorun Chen, Dake Zhang, Hongyi Wang, Huaxiu Yao. (05/2025). arXiv. http://arxiv.org/pdf/2505.16832v1
MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors
Baraa Hikal, Mohamed Basem, Islam Oshallah, Ali Hamdi. (05/2025). arXiv. http://arxiv.org/pdf/2505.18549v1
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue
Jannatun Naim, Jie Cao, Fareen Tasneem, Jennifer Jacobs, Brent Milne, Tamara Sumner, James Martin. (05/2025). arXiv. http://arxiv.org/pdf/2505.07161v1
VTutor for High-Impact Tutoring at Scale: Managing Engagement and Real-Time Multi-Screen Monitoring with P2P Connections
Eason Chen, Xinyi Tang, Aprille Xi, Chenyu Lin, Conrad Borchers, Jionghao Lin, Shivang Gupta, Kenneth R Koedinger. (05/2025). arXiv. http://arxiv.org/pdf/2505.07736v2
Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach
Ruikun Hou, Babette BŸhler, Tim FŸtterer, Efe Bozkir, Peter Gerjets, Ulrich Trautwein, Enkelejda Kasneci. (05/2025). arXiv. http://arxiv.org/pdf/2505.07902v1
From Recall to Reasoning: Automated Question Generation for Deeper Math Learning through Large Language Models
Yongan Yu, Alexandre Krantz, Nikki G. Lobczowski. (05/2025). arXiv. http://arxiv.org/pdf/2505.11899v1
Enhancing Mathematics Learning for Hard-of-Hearing Students Through Real-Time Palestinian Sign Language Recognition: A New Dataset
Fidaa khandaqji, Huthaifa I. Ashqar, Abdelrahem Atawnih. (05/2025). arXiv. http://arxiv.org/pdf/2505.17055v1
Are LLMs Ready for English Standardized Tests? A Benchmarking and Elicitation Perspective
Luoxi Tang, Tharunya Sundar, Shuai Yang, Ankita Patra, Manohar Chippada, Giqi Zhao, Yi Li, Riteng Zhang, Tunan Zhao, Ting Yang, Yuqiao Meng, Weicheng Ma, Zhaohan Xi. (05/2025). arXiv. http://arxiv.org/pdf/2505.17056v1
Pedagogy-R1: Pedagogically-Aligned Reasoning Model with Balanced Educational Benchmark
Unggi Lee, Jaeyong Lee, Jiyeong Bae, Yeil Jeong, Junbo Koh, Gyeonggeon Lee, Gunho Lee, Taekyung Ahn, Hyeoncheol Kim. (05/2025). arXiv. http://arxiv.org/pdf/2505.18467v1
EduPlanner: LLM-Based Multi-Agent Systems for Customized and Intelligent Instructional Design
Xueqiao Zhang, Chao Zhang, Jianwen Sun, Jun Xiao, Yi Yang, Yawei Luo. (04/2025). arXiv. https://arxiv.org/pdf/2504.05370v1
Can Large Language Models Match Tutoring System Adaptivity? A Benchmarking Study
Conrad Borchers, Tianze Shou. (04/2025). arXiv. https://arxiv.org/pdf/2504.05570v1
Inclusive Education with AI: Supporting Special Needs and Tackling Language Barriers
Ricardo Fitas. (04/2025). arXiv. https://arxiv.org/pdf/2504.14120v1
AI for Accessible Education: Personalized Audio-Based Learning for Blind Students
Crystal Yang, Paul Taele. (04/2025). arXiv. https://arxiv.org/pdf/2504.17117v1
How Do Teachers Create Pedagogical Chatbots?: Current Practices and Challenges
Minju Yoo, Hyoungwook Jin, Juho Kim. (03/2025). arXiv. https://arxiv.org/pdf/2503.00967
The Imitation Game for Educational AI
Shashank Sonkar, Naiming Liu, Xinghe Chen, Richard G. Baraniuk. (02/2025). arXiv. https://arxiv.org/pdf/2502.15127
Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
Kaushal Kumar Maurya, KV Aditya Srivatsa, Kseniia Petukhova and Ekaterina Kochmar. (02/2025). arXiv. https://arxiv.org/pdf/2412.09416
The Advancement of Personalized Learning Potentially Accelerated by Generative AI
Yuang Wei, Yuan-Hao Jiang, Jiayi Liu, Changyong Qi, Linzhao Jia, Rui Jia. (02/2025). arXiv. http://arxiv.org/pdf/2412.00691v2
SET-PAIRED: Designing for Parental Involvement in Learning with an AI-Assisted Educational Robot
Hui-Ru Ho, Nitigya Kargeti, Ziqi Liu, Bilge Mutlu. (02/2025). arXiv. https://arxiv.org/pdf/2502.17623
Scaffolding Middle-School Mathematics Curricula With Large Language Models
Rizwaan Malik, Dorna Abdi, Rose Wang, Dorottya Demszky. (02/2025). British Journal of Educational Technology. https://bera-journals.onlinelibrary.wiley.com/doi/10.1111/bjet.13571?af=R
Autograding Mathematical Induction Proofs with Natural Language Processing
Chenyan Zhao, Mariana Silva, Seth Poulsen. (02/2025). arXiv. http://arxiv.org/pdf/2406.10268v2

