Outcomes – Numeracy
Research synthesis is AI-generated, human reviewed. Updated 09/2025.
Interpretability Framework For LLMs In Undergraduate Calculus
Sagnik Dakshit, Sushmita Sinha Roy. (10/2025). arXiv. http://arxiv.org/abs/2510.17910v1
Enumerate-Conjecture-Prove: Formally Solving Answer-Construction Problems In Math Competitions
Jialiang Sun, Yuzhi Tang, Ao Li, Chris J. Maddison, Kuldeep S. Meel. (10/2025). arXiv. http://arxiv.org/abs/2505.18492v4
AI-Driven Predictive Models For Optimizing Mathematics Education Technology: Enhancing Decision-Making Through Educational Data Mining And Meta-Analysis
Aneng He, Wenwen Yuan, Lai Soon Lee, Tian Tian. (10/2025). Smart Learning Environments. https://link.springer.com/article/10.1186/s40561-025-00415-z
Banglamath: A Bangla Benchmark Dataset For Testing LLM Mathematical Reasoning At Grades 6, 7, And 8
Tabia Tanzin Prama, Christopher M. Danforth, Peter Sheridan Dodds. (10/2025). arXiv. http://arxiv.org/abs/2510.12836v1
From Problem-Solving To Teaching Problem-Solving: Aligning LLMs With Pedagogy Using Reinforcement Learning
David Dinucu-Jianu, Jakub Macina, Nico Daheim, Ido Hakimi, Iryna Gurevych, Mrinmaya Sachan. (10/2025). arXiv. http://arxiv.org/abs/2505.15607v2
Survey Of Natural Language Processing For Education: Taxonomy, Systematic Review, And Future Trends
Yunshi Lan, Xinyuan Li, Hanyue Du, Xuesong Lu, Ming Gao, Weining Qian, Aoying Zhou. (10/2025). arXiv. http://arxiv.org/abs/2401.07518v4
Edumath: Generating Standards-Aligned Educational Math Word Problems
Bryan R. Christ, Penelope Molitz, Jonathan Kropko, Thomas Hartvigsen. (10/2025). arXiv. http://arxiv.org/abs/2510.06965v1
From Handwriting To Feedback: Evaluating VLMs And LLMs For AI-Powered Assessment In Indonesian Classrooms
Nurul Aisyah, Muhammad Dehan Al Kautsar, Arif Hidayat, Raqib Chowdhury, Fajri Koto. (10/2025). arXiv. http://arxiv.org/abs/2506.04822v2
Investigating Students' Preferences For AI Roles In Mathematical Modelling: Evidence From A Randomized Controlled Trial
Wangda Zhu, Guang Chen, Yumeng Zhu, Lei Cai, Xiangen Hu. (10/2025). arXiv. http://arxiv.org/abs/2510.06617v1
Uniedu: Toward Unified And Efficient Large Multimodal Models For Educational Tasks
Zhendong Chu, Jian Xie, Shen Wang, Zichao Wang, Qingsong Wen. (10/2025). arXiv. http://arxiv.org/abs/2503.20701v2
Harnessing LLM For Noise-Robust Cognitive Diagnosis In Web-Based Intelligent Education Systems
Guixian Zhang, Guan Yuan, Yanmei Zhang, Jing Ren, Ziqi Xu, Zhenyun Deng, Debo Cheng. (10/2025). arXiv. http://arxiv.org/abs/2510.04093v2
Seeing The Big Picture: Evaluating Multimodal LLMs' Ability To Interpret And Grade Handwritten Student Work
Owen Henkel, Bill Roberts, Doug Jaffe, Laurence Holt. (10/2025). arXiv. http://arxiv.org/abs/2510.05538v1
Visiomath: Benchmarking Figure-Based Mathematical Reasoning In LMMs
Can Li, Ying Liu, Ting Zhang, Mei Wang, Hua Huang. (10/2025). arXiv. http://arxiv.org/abs/2506.06727v3
LLM-Driven Rubric-Based Assessment Of Algebraic Competence In Multi-Stage Block Coding Tasks With Design And Field Evaluation
Yong Oh Lee, Byeonghun Bang, Sejun Oh. (10/2025). arXiv. http://arxiv.org/abs/2510.06253v1
A Framework for Building High-Quality Education Data for R&D in the Age of AI: The EDSI Dataset and Expert Insights
Jing Liu, Brendon Krall, Sarah Montana, Ting-Yu Ariel Chung, Heather Hill. (10/2025). Center for Educational Data Science and Innovation. https://edsi.umd.edu/publications/framework-building-high-quality-education-dat…
Accurate Predictions In Education With Discrete Variational Inference
Tom Quilter, Anastasija Ilick, Karen Poon, Richard Turner. (09/2025). arXiv. http://arxiv.org/abs/2509.23484v2
Evaluating undergraduate mathematics examinations in the era of generative AI: a curriculum-level case study
Benjamin J. Walker, Nikoleta Kalaydzhieva, Beatriz Navarro Lameda, Ruth A. Reynolds. (09/2025). arXiv. http://arxiv.org/pdf/2509.13359v3
LLMS4ALL: A Review On Large Language Models For Research And Applications In Academic Disciplines
Yanfang (Fanny) Ye, Zheyuan Zhang, Tianyi Ma, Zehong Wang, Yiyang Li, Shifu Hou, Weixiang Sun, Kaiwen Shi, Yijun Ma, Wei Song, Ahmed Abbasi, Ying Cheng, Jane Cleland-Huang, Steven Corcelli, Robert Goulding, Ming Hu, Ting Hua, John Lalor, Fang Liu, Tengfei Luo, Ed Maginn, Nuno Moniz, Jason Rohr, Brett Savoie, Daniel Slate, Tom Stapleford, Matthew Webber, Olaf Wiest, Johnny Zhang, Nitesh V Chawla. (09/2025). arXiv. http://arxiv.org/pdf/2509.19580v3
Comparing RAG and GraphRAG for Page-Level Retrieval Question Answering on Math Textbook
Eason Chen, Chuangji Li, Shizhuo Li, Zimo Xiao, Jionghao Lin, Kenneth R. Koedinger. (09/2025). arXiv. http://arxiv.org/pdf/2509.16780v2
Generative AI alone may not be enough: Evaluating AI Support for Learning Mathematical Proof
Eason Chen, Sophia Judicke, Kayla Beigh, Xinyi Tang, Zimo Xiao, Chuangji Li, Shizhuo Li, Reed Luttmer, Shreya Singh, Maria Yampolsky, Naman Parikh, Yi Zhao, Meiyi Chen, Scarlett Huang, Anishka Mohanty, Gregory Johnson, John Mackey, Jionghao Lin, Ken Koedinger. (09/2025). arXiv. http://arxiv.org/pdf/2509.16778v1
Gen AI In Proof-Based Math Courses: A Pilot Study
Hannah Klawa, Shraddha Rajpal, Cigole Thomas. (09/2025). arXiv. http://arxiv.org/pdf/2509.13570v1
Do Teachers Dream of GenAI Widening Educational (In)equality? Envisioning the Future of K-12 GenAI Education from Global Teachers' Perspectives
Ruiwei Xiao, Qing Xiao, Xinying Hou, Phenyo Phemelo Moletsane, Hanqi Jane Li, Hong Shen, John Stamper. (09/2025). arXiv. http://arxiv.org/pdf/2509.10782v1
MathBuddy: A Multimodal System for Affective Math Tutoring
Debanjana Kar, Leopold Böss, Dacia Braca, Sebastian Maximilian Dennerlein, Nina Christine Hubig, Philipp Wintersberger, Yufang Hou. (08/2025). arXiv. http://arxiv.org/pdf/2508.19993v1
MAB Optimizer for Estimating Math Question Difficulty via Inverse CV without NLP
Surajit Das, Gourav Roy, Aleksei Eliseev, Ram Kumar Rajendran. (08/2025). arXiv. http://arxiv.org/pdf/2508.19014v1
Who Is Lagging Behind: Profiling Student Behaviors with Graph-Level Encoding in Curriculum-Based Online Learning Systems
Qian Xiao, Conn Breathnach, Ioana Ghergulescu, Conor O'Sullivan, Keith Johnston, Vincent Wade. (08/2025). arXiv. http://arxiv.org/pdf/2508.18925v1
Explainable AI for Predicting and Understanding Mathematics Achievement: A Cross-National Analysis of PISA 2018
Liu Liu, Dai Rui. (08/2025). arXiv. http://arxiv.org/pdf/2508.16747v1
FACET: Teacher-Centred LLM-Based Multi-Agent Systems - Towards Personalized Educational Worksheets
Jana Gonnermann-Muller, Jennifer Haase, Konstantin Fackeldey, Sebastian Pokutta. (08/2025). arXiv. http://arxiv.org/pdf/2508.11401v2
Alvorada-Bench: Can Language Models Solve Brazilian University Entrance Exams?
Henrique Godoy. (08/2025). arXiv. http://arxiv.org/pdf/2508.15835v1
Mathematical Computation and Reasoning Errors by Large Language Models
Liang Zhang, Edith Aurora Graf. (08/2025). arXiv. http://arxiv.org/pdf/2508.09932v2
