Personalized Learning Path Planing Through Goal-Driven Learner State Modeling

Authors

Joy Jia Yin Lim,

Ye He,

Jifan Yu,

Xin Cong,

Daniel Zhang-Li,

Zhiyuan Liu,

Huiqin Liu,

Lei Hou,

Juanzi Li,

Bin Xu

Date

10/2025

Publisher

arXiv

Link

http://arxiv.org/abs/2510.13215v1

Personalized Learning Path Planning (PLPP) aims to design adaptive learning paths that align with individual goals. While large language models (LLMs) show potential in personalizing learning experiences, existing approaches often lack mechanisms for goal-aligned planning. We introduce Pxplore, a novel framework for PLPP that integrates a reinforcement-based training paradigm and an LLM-driven educational architecture. We design a structured learner state model and an automated reward function that transforms abstract objectives into computable signals. We train the policy combining supervised fine-tuning (SFT) and Group Relative Policy Optimization (GRPO), and deploy it within a real-world learning platform. Extensive experiments validate Pxplore's effectiveness in producing coherent, personalized, and goal-driven learning paths. We release our code and dataset to facilitate future research.

What is the application?

Teaching – Instructional Materials,

Learning – Student Support

Who is the user?

Student

Who age?

Post-Secondary

Why use AI?

Outcomes – Other Academic,

Outcomes – Differentiation

Study design

Impact – Randomized Controlled Trial,