Dynamic Pricing Strategy In Retail Using Deep Q-Learning And Genetic Algorithms

Authors

  • Manjula R Assistant Professor, Department of Commerce, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research, Chennai, Tamil Nadu, India.
  • Ms.V. Hemamalini Assistant Professor, ECE, New Prince Shri Bhavani College of Engineering and Technology, Chennai, Tamil Nadu, India.
  • Dheeraj Kalra Department of Electronics & Communications Engineering, GLA University, Mathura, Uttar Pradesh, India.
  • Dr. Maganti Venkatesh Department of Artificial Intelligence and Machine Learning, Aditya University, Surampalem, Andhra Pradesh, India.
  • Y Lavanya Department of ECE, Ramachandra College of Engineering, Eluru, Andhra Pradesh, India.
  • Ms.R. Kiruthika Asstistant Professor, Computer Science and Engineering Mahendra Engineering College, Namakkal, Tamil Nadu, India.

Keywords:

Dynamic Pricing, Deep Q-Learning, Genetic Algorithms, Reinforcement Learning, Retail Revenue Management, Pricing Optimization, Evolutionary Computation

Abstract

Background: With e-commerce, dynamic pricing is a key strategy for maximizing revenue in retail. Conventional optimization and rule-based techniques are not real-time because the nature of consumer behavior and market conditions is volatile. Objective: The paper presents a hybrid intelligent pricing system based on Deep Q-Learning (DQL) and Genetic Algorithms (GA) to realize adaptive, autonomous, and cost-effective dynamic pricing in a retail environment. Methodology: The proposed model utilizes a Deep Q-Network (DQN) as the main decision-making module that learns the best pricing policies by engaging with a simulated retail market environment. The hyperparameters of the DQN, such as learning rate, discount factor, and the configuration of the network structure, are optimized by using the GA, which helps the algorithm to converge faster and prevent it from getting stuck in local optima. The state space is classified based on price elasticity indices, inventory levels, competitors' price signals, and temporal patterns of demand. The reward function is written to maximize profits and user conversion rate. Results: Electronic product transaction datasets were used for experiments, and the results show that the proposed hybrid DQL-GA model can improve the mean profit by 18.4%, the mean conversion rate by 12.7%, and the mean inventory turnover by 23.0% over the baseline rule-based method. The model also shows a performance better than that of standalone DQL and traditional optimization strategies on the basis of five performance metrics: Precision (98.9%), Accuracy (97.5%), Recall (96.5%), Area Under the Curve (98.0%), and Delay Reduction (4.9%). Conclusion: The proposed DQL-GA hybrid framework is scalable, robust, and interpretable for intelligent retail pricing and is shown to be resilient for stable trading, promotional peak, and overstock clearance scenarios.

Downloads

Published

2026-05-24

How to Cite

R, M., Hemamalini, M., Kalra, D., Venkatesh, D. M., Lavanya, Y., & Kiruthika, M. (2026). Dynamic Pricing Strategy In Retail Using Deep Q-Learning And Genetic Algorithms. International Journal of Artificial Intelligence and Machine Learning, 6(3s), 237–246. Retrieved from https://www.svedbergopen.com/index.php/ijaiml/article/view/313