Developing Autonomous Feedback Mechanisms in Education with Deep Q-networks (DQN) and NLP

Authors

  • Dr.U. Harita Assistant Professor, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Guntur, Andhra Pradesh, India.
  • Dr. Shanthi Vairavan Professor & Principal, Computer Science, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research, Tamil Nadu, India.
  • Mavlonbek Doniyarov PhD, Lecturer, Jizzakh State Pedagogical University, Jizzakh, Uzbekistan.
  • P.V. Hari Hara Subramanyan Associate Professor, Meenakshi College of Physiotherapy, Meenakshi Academy of Higher Education and Research, Tamil Nadu, India.
  • Abdullayeva Shakhnoza Anvarovna Turan International University, Namangan, Uzbekistan.
  • Nigora Abduraimova Department of Economics, Termez University of Economics and Service, Termez, Uzbekistan.

Keywords:

Deep Q-Networks (DQN), Autonomous Feedback, Natural Language Processing, Reinforcement Learning in Education, Adaptive Learning Systems, Formative Assessment, Intelligent Tutoring Systems.

Abstract

The increasing deployment of digital learning environments has highlighted key shortcomings in existing static and rule-based approaches to formative feedback that do not account for the dynamic nature of learner experience. This paper proposes a novel autonomous system for personalized feedback based on a combination of Deep Q-Networks and Natural Language Processing techniques. In particular, the feedback process is modeled as an agent-based approach to reinforcement learning with a DQN-based agent that learns how to provide optimal formative feedback based on student actions. This is achieved through the construction of state-action-state transition dynamics for the Markov Decision Process using data extracted from the student interaction text using the pipeline of tokenization, sentiment analysis, named entity recognition, and intent classification. The effectiveness of the proposed approach was evaluated on a specially designed dataset of 12,847 student interactions covering three STEM fields (Mathematics, Physics, and Computer Science) collected from an online learning platform over 18 months. The results show that the proposed DQN-NLP method attains 93.7% accuracy, 91.4% precision, 92.8% recall, and 92.1% F1 score, which significantly surpasses the baselines such as rule-based feedback (F1 score = 71.3%), vanilla RL without NLP (F1 score = 82.6%), and transformers alone (F1 score = 88.4%). More importantly, the model increased the learning gain of students by 34.2% and shortened the feedback response time by 61% relative to instructor-driven feedback. The ablation experiment reveals that the contribution from DQN is 9.5%, whereas NLP adds another 7.8%. This research highlights the revolutionary impact of integrating reinforcement learning with natural language processing in education for scalable, adaptive educational feedback.

Downloads

Published

2026-04-15

How to Cite

Harita, D., Vairavan, D. S., Doniyarov, M., Subramanyan, P. H. H., Anvarovna, A. S., & Abduraimova, N. (2026). Developing Autonomous Feedback Mechanisms in Education with Deep Q-networks (DQN) and NLP. International Journal of Artificial Intelligence and Machine Learning, 6(1s), 203–213. Retrieved from https://www.svedbergopen.com/index.php/ijaiml/article/view/112

Similar Articles

<< < 6 7 8 9 10 11 12 > >> 

You may also start an advanced similarity search for this article.