Breadcrumb
- Home
- Generative AI For Education Hub
- Research Study Repository
- Outcomes – Numeracy
Search and Filter
Submit a research study
Contribute to the repository:
Outcomes – Numeracy
Research synthesis is AI-generated, human reviewed. Updated 09/2025.
Displaying 1 - 30 of 157
Teachlm: Post-Training Llms For Education Using Authentic Learning Data
Janos Perczel, Jin Chow, Dorottya Demszky. (10/2025). arXiv. https://arxiv.org/pdf/2510.05087
A Framework for Building High-Quality Education Data for R&D in the Age of AI: The EDSI Dataset and Expert Insights
Jing Liu, Brendon Krall, Sarah Montana, Ting-Yu Ariel Chung, Heather Hill. (10/2025). Center for Educational Data Science and Innovation. https://edsi.umd.edu/publications/framework-building-high-quality-education-dat…
Llms4All: A Review On Large Language Models For Research And Applications In Academic Disciplines
Yanfang (Fanny) Ye*, Zheyuan Zhang, Tianyi Ma, Zehong Wang, Yiyang Li, Shifu Hou, Weixiang Sun, Kaiwen Shi, Yijun Ma, Wei Song, Ahmed Abbasi, Ying Cheng, Jane Cleland-Huang, Steven Corcelli, Robert Goulding, Ming Hu, Ting Hua, John Lalor, Fang Liu, Tengfei Luo, Ed Maginn, Nuno Moniz, Jason Rohr, Brett Savoie, Daniel Slate, Tom Stapleford, Matthew Webber, Olaf Wiest, Johnny Zhang, Nitesh V Chawla. (09/2025). arXiv. http://arxiv.org/pdf/2509.19580v3
Evaluating Undergraduate Mathematics Examinations In The Era Of Generative Ai: A Curriculum-Level Case Study
BENJAMIN J. WALKER, NIKOLETA KALAYDZHIEVA, BEATRIZ NAVARRO LAMEDA, RUTH A. REYNOLDS. (09/2025). arXiv. http://arxiv.org/pdf/2509.13359v3
Accurate Predictions In Education With Discrete Variational Inference
Tom Quilter, Anastasia Ilick, Karen Poon, Richard Turner. (09/2025). arXiv. http://arxiv.org/pdf/2509.23484v1
Comparing Rag And Graphrag For Page-Level Retrieval Question Answering On Math Textbook
Eason Chen, Chuangji Li, Shizhuo Li, Zimo Xiao, Jionghao Lin, and Kenneth R. Koedinger. (09/2025). arXiv. http://arxiv.org/pdf/2509.16780v2
Malaysia'S Ai-Driven Education Landscape: Policies, Applications, And Comparative Insights For A Digital Future
Fadhilah Jamaluddin, Ahmad Hakiim Jamaluddin, Faridzah Jamaluddin, Faathirah Jamaluddin. (09/2025). arXiv. http://arxiv.org/pdf/2509.21858v1
Developing Strategies To Increase Capacity In Ai Education
Noah Q. Cowit, Sri Yash Tadimalla, Stephanie T. Jones, Mary Lou Maher, Tracy Camp, Enrico Pontelli. (09/2025). arXiv. http://arxiv.org/pdf/2509.21713v1
A Meta-Analysis Of Llm Effects On Students Across Qualification, Socialisation, And Subjectification
Jiayu Huang, Ruoxin Ritter Wang, Jen-Hao Liu, Boming Xia, Yue Huang, Ruoxi Sun, Jason (Minhui) Xue, Jinan Zou. (09/2025). arXiv. http://arxiv.org/pdf/2509.22725v1
Investigating Bias: A Multilingual Pipeline For Generating, Solving, And Evaluating Math Problems With Llms
Mariam Mahran, Katharina Simbeck. (09/2025). arXiv. http://arxiv.org/pdf/2509.17701v1
Generative Ai Alone May Not Be Enough: Evaluating Ai Support For Learning Mathematical Proof
Eason Chen, Sophia Judicke, Kayla Beigh, Xinyi Tang, Zimo Xiao, Chuangji Li, Shizhuo Li, Reed Luttmer, Shreya Singh, Maria Yampolsky, Naman Parikh, Yi Zhao, Meiyi Chen, Scarlett Huang, Anishka Mohanty, Gregory Johnson, John Mackey, Jionghao Lin, Ken Koedinger. (09/2025). arXiv. http://arxiv.org/pdf/2509.16778v1
Gen Ai In Proof-Based Math Courses: A Pilot Study
Hannah Klawa, Shraddha Rajpal, Cigole Thomas. (09/2025). arXiv. http://arxiv.org/pdf/2509.13570v1
Emerging Patterns Of Genai Use In K-12 Science And Mathematics Education
Lief Esbenshade, Shawon Sarkar, Drew Nucci, Ann Edwards, Sarah Nielsen, Joshua M. Rosenberg, Alex Liu, Zewei (Victor) Tian, Min Sun, Zachary Zhang, Thomas Han, Yulia Lapicus, Kevin He. (09/2025). arXiv. http://arxiv.org/pdf/2509.10747v1
Understanding, Protecting, And Augmenting Human Cognition With Generative Ai: A Synthesis Of The Chi 2025 Tools For Thought Workshop
Lev Tankelevitch, Elena L. Glassman, Jessica He, Aniket Kittur, Mina Lee, Srishti Palani, Advait Sarkar, Gonzalo Ramos, Yvonne Rogers, Hari Subramonyam. (08/2025). arXiv. http://arxiv.org/pdf/2508.21036v1
Mathbuddy: A Multimodal System For Affective Math Tutoring
Debanjana Kar, Leopold Boss, Dacia Braca, Sebastian Maximilian Dennerlein, Nina Christine Hubig, Philipp Wintersberger, Yufang Hou. (08/2025). arXiv. http://arxiv.org/pdf/2508.19993v1
Mab Optimizer For Estimating Math Question Difficulty Via Inverse Cv Without Nlp
Surajit Das, Gourav Roy, Aleksei Eliseev, Ram Kumar Rajendran. (08/2025). arXiv. http://arxiv.org/pdf/2508.19014v1
Exploring Generative Artificial Intelligence (Genai) And Al Agents In Research And Teaching - Concepts And Practical Cases.
Jussi S. Jauhiainen, Aurora Toppari. (08/2025). arXiv. http://arxiv.org/pdf/2508.16701v2
Facet: Teacher-Centred Llm-Based Multi-Agent Systems- Towards Personalized Educational Worksheets
Jana Gonnermann-Muller, Jennifer Haase, Konstantin Fackeldey, Sebastian Pokutta. (08/2025). arXiv. http://arxiv.org/pdf/2508.11401v2
Explainable Ai For Predicting And Understanding Mathematics Achievement: A Cross-National Analysis Of Pisa 2018
Liu Liu, Dai Rui. (08/2025). arXiv. http://arxiv.org/pdf/2508.16747v1
Reliable Generation Of Isomorphic Physics Problems Using Chatgpt With Prompt-Chaining And Tool Use
Zhongzhou Chen. (08/2025). arXiv. http://arxiv.org/pdf/2508.14755v1
Alvorada-Bench: Can Language Models Solve Brazilian University Entrance Exams?
Henrique Godoy. (08/2025). arXiv. http://arxiv.org/pdf/2508.15835v1
Mathematical Computation And Reasoning Errors By Large Language Models
Liang Zhang, Edith Aurora Graf. (08/2025). arXiv. http://arxiv.org/pdf/2508.09932v2
Aryabhata: An Exam-Focused Language Model For Jee Math
Ritvik Rastogi, Sachin Dharashivkar, Sandeep Varma. (08/2025). arXiv. http://arxiv.org/pdf/2508.08665v2
Everything You Need To Know About Cs Education: Open Results From A Survey Of More Than 18,000 Participants
Katsiaryna Dzialets, Aleksandra Makeeva, Ilya Vlasov, Anna Potriasaeva, Aleksei Rostovskii, Yaroslav Golubev, Anastasiia Birillo. (08/2025). arXiv. http://arxiv.org/pdf/2508.05286v1
Sid: Benchmarking Guided Instruction Capabilities In Stem Education With A Socratic Interdisciplinary Dialogues Dataset
Mei Jiang, Houping Yue, Bingdong Li, Hao Hao, Ying Qian, Bo Jiang, and Aimin Zhou. (08/2025). arXiv. http://arxiv.org/pdf/2508.04563v1
Automated Generation Of Curriculum-Aligned Multiple-Choice Questions For Malaysian Secondary Mathematics Using Generative Ai
Rohaizah Abdul Wahid, Muhamad Said Nizamuddin Nadim, Suliana Sulaiman, Syahmi Akmal Shaharudin, Muhammad Danial Jupikil, Iqqwan Jasman Su Azlan Su. (08/2025). arXiv. http://arxiv.org/pdf/2508.04442v1
From Answers To Questions: Eqgbench For Evaluating Llms' Educational Question Generation
Chengliang Zhou, Mei Wang, Ting Zhang, Qiannan Zhu, Jian Li, Hua Huang. (08/2025). arXiv. http://arxiv.org/pdf/2508.10005v1
A Mixed User-Centered Approach To Enable Augmented Intelligence In Intelligent Tutoring Systems: The Case Of Mathalde App
Guilherme Guerino, Luiz Rodrigues, Luana Bianchini, Mariana Alves, Marcelo Marinho, Thomaz Veloso, Valmir Macario, Diego Dermeval, Thales Vieira, Ig Bittencourt, Seiji Isotani. (08/2025). arXiv. http://arxiv.org/pdf/2508.00103v2
Wip: Enhancing Game-Based Learning With Ai-Driven Peer Agents
Chengzhang Zhu, Cecile H. Sam, Yanlai Wu, Ying Tang. (08/2025). arXiv. http://arxiv.org/pdf/2508.01169v1
What Counts As Evidence In Ai & Ed: Towards Science-For-Policy 3.0
Ilkka Tuomi. (08/2025). European Journal of Education Policy and Practice. https://www.aup-online.com/content/journals/10.5117/EJEP2025.1.001.TUOM
