Breadcrumb
- Home
- AI Hub For Education
- Research Study Repository
- Outcomes – Numeracy
Outcomes – Numeracy
Research synthesis is AI-generated, human reviewed. Updated 05/2026.
Displaying 61 - 90 of 224
A Matter Of Interest: Understanding Interestingness Of Math Problems In Humans And Language Models
Shubhra Mishra, Yuka Machino, Gabriel Poesia, Albert Jiang, Joy Hsu, Adrian Weller, Challenger Mishra, David Broman, Joshua B. Tenenbaum, Mateja Jamnik, Cedegao E. Zhang, Katherine M. Collins. (11/2025). arXiv. https://arxiv.org/abs/2511.08548v1
Eduagentqg: A Multi-Agent Workflow Framework For Personalized Question Generation
Rui Jia, Min Zhang, Fengrui Liu, Bo Jiang, Kun Kuang, Zhongxiang Dai. (11/2025). arXiv. https://arxiv.org/abs/2511.11635v1
Extracting Causal Relations In Deep Knowledge Tracing
Kevin Hong, Kia Karbasi, Gregory Pottie. (11/2025). arXiv. https://arxiv.org/abs/2511.03948v1
Artificial Intelligence In Elementary Stem Education: A Systematic Review Of Current Applications And Future Challenges
Majid Memari, Krista Ruggles. (11/2025). arXiv. https://arxiv.org/abs/2511.00105v2
Physicseval: Inference-Time Techniques To Improve The Reasoning Proficiency Of Large Language Models On Physics Problems
Oshayer Siddique, J. M Areeb Uzair Alam, Md Jobayer Rahman Rafy, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan. (11/2025). arXiv. https://arxiv.org/abs/2508.00079v2
Next Token Knowledge Tracing: Exploiting Pretrained Llm Representations To Decode Student Behaviour.
Max Norris, Kobi Gal and Sahan Bulathwela. (11/2025). arXiv. https://arxiv.org/abs/2511.02599v1
From Superficial Outputs To Superficial Learning: Risks Of Large Language Models In Education
IRIS DELIKOURA, YI R. (MAY) FUNG, PAN HUI. (11/2025). arXiv. https://arxiv.org/abs/2509.21972v2
Improving Human Verification Of Llm Reasoning Through Interactive Explanation Interfaces
Runtao Zhou, Anh Totti Nguyen, Giang Nguyen, Nikita Kharya, Chirag Agarwal. (10/2025). arXiv. https://arxiv.org/abs/2510.22922v2
"Learning Together": Ai-Mediated Support For Parental Involvement In Everyday Learning
Yao Li, Jingyi Xie, Ya-Fang Lin, He Zhang, Ge Wang, Gaojian Huang, Rui Yu, Si Chen. (10/2025). arXiv. https://arxiv.org/abs/2510.20123v2
Measuring Teaching With Llms
Michael Hardy. (10/2025). arXiv. https://arxiv.org/abs/2510.22968v1
Pedagogy-Driven Evaluation Of Generative Ai-Powered Intelligent Tutoring Systems
Kaushal Kumar Maurya and Ekaterina Kochmar. (10/2025). arXiv. https://arxiv.org/abs/2510.22581v1
Relief Or Displacement? How Teachers Are Negotiating Generative Al'S Role In Their Professional Practice
Aayushi Dangol, Smriti Kotiyal, Robert Wolfe, Alex J. Bowers, Antonio Vigil, Jason Yip, Julie A. Kientz, Suleman Shahid, Tom Yeh, Vincent Cho, Katie Davis. (10/2025). arXiv. http://arxiv.org/abs/2510.18296v1
Interpretability Framework For Llms In Undergraduate Calculus
Sagnik Dakshit, Sushmita Sinha Roy. (10/2025). arXiv. http://arxiv.org/abs/2510.17910v1
Enumerate-Conjecture-Prove: Formally Solving Answer-Construction Problems In Math Competitions
Jialiang Sun, Yuzhi Tang, Ao Li, Chris J. Maddison, Kuldeep S. Meel. (10/2025). arXiv. http://arxiv.org/abs/2505.18492v4
Ai-Driven Predictive Models For Optimizing Mathematics Education Technology: Enhancing Decision-Making Through Educational Data Mining And Meta-Analysis
Aneng He, Wenwen Yuan, Lai Soon Lee, Tian Tian. (10/2025). Smart Learning Environments. https://link.springer.com/article/10.1186/s40561-025-00415-z
Banglamath : A Bangla Benchmark Dataset For Testing Llm Mathematical Reasoning At Grades 6, 7, And 8
Tabia Tanzin Prama, Christopher M. Danforth, Peter Sheridan Dodds. (10/2025). arXiv. http://arxiv.org/abs/2510.12836v1
From Problem-Solving To Teaching Problem-Solving: Aligning Llms With Pedagogy Using Reinforcement Learning
David Dinucu-Jianu, Jakub Macina, Nico Daheim, Ido Hakimi, Iryna Gurevych, Mrinmaya Sachan. (10/2025). arXiv. http://arxiv.org/abs/2505.15607v2
Survey Of Natural Language Processing For Education: Taxonomy, Systematic Review, And Future Trends
Yunshi Lan, Xinyuan Li, Hanyue Du, Xuesong Lu, Ming Gao, Weining Qian, Aoying Zhou. (10/2025). arXiv. http://arxiv.org/abs/2401.07518v4
Edumath: Generating Standards-Aligned Educational Math Word Problems
Bryan R. Christ, Penelope Molitz, Jonathan Kropko, Thomas Hartvigsen. (10/2025). arXiv. http://arxiv.org/abs/2510.06965v1
Investigating Students' Preferences For Ai Roles In Mathematical Modelling: Evidence From A Randomized Controlled Trial
Wangda Zhu, Guang CHEN, Yumeng Zhu, Lei Cai, Xiangen Hu. (10/2025). arXiv. http://arxiv.org/abs/2510.06617v1
Uniedu: Toward Unified And Efficient Large Multimodal Models For Educational Tasks
Zhendong Chu, Jian Xie, Shen Wang, Zichao Wang, Qingsong Wen. (10/2025). arXiv. http://arxiv.org/abs/2503.20701v2
From Handwriting To Feedback: Evaluating Vlms And Llms For Ai-Powered Assessment In Indonesian Classrooms
Nurul Aisyah, Muhammad Dehan Al Kautsar, Arif Hidayat, Raqib Chowdhury, Fajri Koto. (10/2025). arXiv. http://arxiv.org/abs/2506.04822v2
Seeing The Big Picture: Evaluating Multimodal Llms º Ability To Interpret And Grade Handwritten Student Work
Owen Henkel, Bill Roberts, Doug Jaffe, Laurence Holt. (10/2025). arXiv. http://arxiv.org/abs/2510.05538v1
Harnessing Llm For Noise-Robust Cognitive Diagnosis In Web-Based Intelligent Education Systems
Guixian Zhang, Guan Yuan, Yanmei Zhang, Jing Ren, Ziqi Xu, Zhenyun Deng, Debo Cheng. (10/2025). arXiv. http://arxiv.org/abs/2510.04093v2
Visiomath: Benchmarking Figure-Based Mathematical Reasoning In Lmms
Can Li, Ying Liu, Ting Zhang, Mei Wang, Hua Huang. (10/2025). arXiv. http://arxiv.org/abs/2506.06727v3
Llm-Driven Rubric-Based Assessment Of Algebraic Competence In Multi-Stage Block Coding Tasks With Design And Field Evaluation
Yong Oh Lee, Byeonghun Bang, Sejun Oh. (10/2025). arXiv. http://arxiv.org/abs/2510.06253v1
A Framework for Building High-Quality Education Data for R&D in the Age of AI: The EDSI Dataset and Expert Insights
Jing Liu, Brendon Krall, Sarah Montana, Ting-Yu Ariel Chung, Heather Hill. (10/2025). Center for Educational Data Science and Innovation. https://edsi.umd.edu/publications/framework-building-high-quality-education-dat…
Accurate Predictions In Education With Discrete Variational Inference
Tom Quilter, Anastasija Ilick, Karen Poon, Richard Turner. (09/2025). arXiv. http://arxiv.org/abs/2509.23484v2
Evaluating undergraduate mathematics examinations in the era of generative AI: a curriculum-level case study
BENJAMIN J. WALKER, NIKOLETA KALAYDZHIEVA, BEATRIZ NAVARRO LAMEDA, RUTH A. REYNOLDS. (09/2025). arXiv. http://arxiv.org/pdf/2509.13359v3

