Optimizing Student Learning Outcomes in Virtual Classrooms Using Hierarchical Reinforcement Learning (HRL)

Authors

  • Dr. Megala Rajendran Vice Rector, Research & Innovation, Turan International University, Namangan, Uzbekistan.
  • N. Nivetha Assistant Professor, Computer Science, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research, Chennai, Tamil Nadu, India.
  • N. Prabhavathy Devi Professor, Nutrition and Dietetics, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research, Chennai, Tamil Nadu, India.
  • Dr.R. Chithra Professor, Department of Information Technology, K.S. Rangasamy College of Technology, Tiruchengode, Tamil Nadu, India.
  • Nigora Saliyeva Chinese Language Lecturer, "Silk Road" International University of Tourism and Cultural Heritage Samarkand, Uzbekistan.
  • Asadbek Eshniyozov Department of Economics, Termez University of Economics and Service, Termez, Uzbekistan.

Keywords:

Hierarchical Reinforcement Learning, Virtual Classrooms, Adaptive Learning, Intelligent Tutoring Systems, Student Learning Outcomes, Deep Reinforcement Learning, Personalized Education.

Abstract

Virtual learning environments have come with unique problems related to personalizing, optimizing, and tailoring learning experiences to the needs of the learner. Traditionally, there have been no methods for personalization because existing electronic learning systems use fixed instructional methods that do not consider the changing behavioral, cognitive, and motivational states of the learner. This research presents a Hierarchical Reinforcement Learning (HRL) framework dubbed HRL-VCO (Hierarchical Reinforcement Learning for Virtual Classroom Optimization) for optimizing the learning experience of students in virtual classroom settings by adapting instruction, difficulty, and engagement levels over different timescales. The HRL-VCO framework is made up of two hierarchical layers: a top-layer meta-controller responsible for deciding on the high-level teaching goals (session goals, topic scheduling), and a bottom-layer sub-policy network for implementing micro-level actions (quiz difficulty, session pacing, hint provision). The HRL-VCO model was developed and tested using the EdNet Dataset, which contained 131+ million interactions collected from 784,309 users. The experiments showed that HRL-VCO achieved 91.4% accuracy in predicting user activity, significantly outperforming other models, such as DQN, with its 83.2%, PPO, which gave 85.7%, and a supervised learning method, which yielded 78.6%. In addition, HRL-VCO showed an F1-score of 89.3%, a precision of 90.1%, and a recall of 88.6%, with a mean reward increase by 34.2% in comparison with flat RL baseline models. It has been proven that the policy hierarchical approach helped improve reward accumulation by 12.7% compared to flat RL methods. The obtained results clearly show that hierarchical reinforcement learning can bring a real revolution in the field of adaptive learning systems.

Downloads

Published

2026-04-15

How to Cite

Rajendran, D. M., Nivetha, N., Devi, N. P., Chithra, D., Saliyeva, N., & Eshniyozov, A. (2026). Optimizing Student Learning Outcomes in Virtual Classrooms Using Hierarchical Reinforcement Learning (HRL). International Journal of Artificial Intelligence and Machine Learning, 6(1s), 292–303. Retrieved from https://www.svedbergopen.com/index.php/ijaiml/article/view/119

Similar Articles

<< < 4 5 6 7 8 9 10 11 12 > >> 

You may also start an advanced similarity search for this article.