Deep Convolutional Neural Networks with Attention Mechanisms for Multi-Scale Feature Extraction in Complex Image Classification Tasks

Narendra Mohan; Radha Krishna; Bhavadharani S; Swetha Polisetty; Dr. Ravi Thangjam; Subramanian Karthick; Mohit Aggarwal; D. Akila

Authors

Narendra Mohan, Department of Computer Engineering & Applications, GLA University, Mathura.
Radha Krishna, Professor, Department of CSE (Artificial Intelligence & Machine Learning), Pragati Engineering College, ADB Road, Surampalem, Near Peddapuram, Kakinada District, Andhra Pradesh, India – 533437.
Bhavadharani S, Assistant Professor, Department of Commerce, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research.
Swetha Polisetty, Assistant Professor, Department of Information Technology, Vardhaman College of Engineering, Shamshabad, Hyderabad, India - 501 218.
Dr. Ravi Thangjam, Professor, School of Business, Aditya University, Surampalem, Andhra Pradesh, Pin 533437.
Subramanian Karthick, Professor, Computer Engineering, Vishwakarma Institute of Technology, Pune, Maharashtra, 411037.
Mohit Aggarwal, School of Engineering & Technology, Noida International University, Uttar Pradesh 203201, India.
D. Akila, Professor, Department of Computer Science and Applications, Faculty of Science and Humanities, SRM Institute of Science and Technology, Ramapuram Campus, Chennai, Tamil Nadu, India.

Keywords:

Deep learning, CNN, Attention Mechanism, Multi-Scale Feature Extraction, Image Classification, CBAM, ResNet, Confusion Matrix.

Abstract

Complex image classification has become an urgent research area in computer vision, driven by the growing need for reliable visual recognition in medical imaging, autonomous systems, intelligent surveillance, and industrial inspection. Traditional convolutional neural networks (CNNs) learn features effectively, but they often struggle to capture discriminative multi-scale spatial features and fine-grained contextual features in complex image data. These limitations can reduce classification robustness, especially when object size, texture, illumination, and background complexity vary. To address these issues, this research proposes a deep convolutional neural network model that combines channel attention with multi-scale feature extraction to improve image classification performance. The proposed architecture uses a ResNet50-based backbone with a channel attention module and a multi-scale feature fusion block to dynamically emphasize informative features and suppress redundant feature responses. The model was trained and tested under a standardized protocol on benchmark image classification datasets, including CIFAR-10 and CIFAR-100. Experimental results show that the proposed framework achieves higher classification accuracy, precision, recall, and F1-score than conventional CNN backbones, including VGG16, DenseNet121, and a baseline ResNet. Confusion matrix analysis further confirms improved per-class prediction and lower misclassification rates.
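The reported metrics can all be derived from a confusion matrix. The following is a minimal sketch of that calculation, not the authors' evaluation code. The function name `per_class_metrics` and the small 2x2 matrix are illustrative assumptions, and the numbers are not results from the paper.

```python
def per_class_metrics(cm):
    """Compute overall accuracy and per-class (precision, recall, F1).

    cm[i][j] is the number of samples whose true class is i and whose
    predicted class is j (an assumed convention; some tools transpose it).
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    accuracy = sum(cm[k][k] for k in range(n)) / total

    per_class = []
    for k in range(n):
        tp = cm[k][k]
        fp = sum(cm[i][k] for i in range(n)) - tp  # predicted k, true class differs
        fn = sum(cm[k]) - tp                       # true k, predicted otherwise
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        per_class.append((precision, recall, f1))
    return accuracy, per_class


# Hypothetical two-class confusion matrix, for illustration only.
example_cm = [[8, 2],
              [1, 9]]
accuracy, per_class = per_class_metrics(example_cm)
```

Averaging the per-class values without weighting gives macro-averaged precision, recall, and F1, which is one common way to summarize multi-class results such as those on CIFAR-10 and CIFAR-100. The abstract does not state which averaging the authors used.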
The main contribution of this study is the successful combination of attention-guided feature refinement with hierarchical multi-scale representation learning, which yields more discriminative feature extraction and greater classification robustness in challenging visual recognition tasks.
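To make the idea of channel attention concrete, here is a minimal pure-Python sketch. It assumes a CBAM-style channel attention step, which the keywords suggest but the abstract does not spell out: average-pooled and max-pooled channel descriptors pass through a shared two-layer MLP, are summed, and are squashed by a sigmoid into per-channel weights. The function names and the weight matrices `w1` and `w2` are hypothetical, and a real model would learn these weights and operate on tensors rather than nested lists.

```python
import math


def _shared_mlp(vec, w1, w2):
    """Two-layer MLP: ReLU hidden layer, then a linear output layer."""
    hidden = [max(0.0, sum(w * v for w, v in zip(row, vec))) for row in w1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


def channel_attention(fmap, w1, w2):
    """Reweight the channels of a feature map.

    fmap: list of C channels, each an H x W nested list of floats.
    w1:   hidden x C weight matrix (hypothetical, normally learned).
    w2:   C x hidden weight matrix (hypothetical, normally learned).
    Returns (weights, scaled), where weights holds one value in (0, 1)
    per channel and scaled is fmap with each channel multiplied by it.
    """
    # Squeeze each channel to a scalar with average and max pooling.
    avg = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in fmap]
    mx = [max(max(row) for row in ch) for ch in fmap]

    # Pass both descriptors through the same MLP and add the outputs.
    logits = [a + m for a, m in zip(_shared_mlp(avg, w1, w2),
                                    _shared_mlp(mx, w1, w2))]
    weights = [1.0 / (1.0 + math.exp(-z)) for z in logits]

    # Scale every value in a channel by that channel's weight.
    scaled = [[[wt * v for v in row] for row in ch]
              for wt, ch in zip(weights, fmap)]
    return weights, scaled


# Toy input: channel 0 is active, channel 1 is silent.
toy_fmap = [[[1.0, 1.0], [1.0, 1.0]],
            [[0.0, 0.0], [0.0, 0.0]]]
toy_w1 = [[1.0, 0.0]]      # one hidden unit, two input channels
toy_w2 = [[1.0], [-1.0]]   # two output channels, one hidden unit
weights, scaled = channel_attention(toy_fmap, toy_w1, toy_w2)
```

With these toy weights the active channel receives a weight above 0.5 and the silent channel a weight below 0.5, which illustrates how such a module can emphasize informative channels and suppress redundant ones.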

Published

2026-05-12

How to Cite

Mohan, N., Krishna, R., S, B., Polisetty, S., Thangjam, R., Karthick, S., … Akila, D. (2026). Deep Convolutional Neural Networks with Attention Mechanisms for Multi-Scale Feature Extraction in Complex Image Classification Tasks. International Journal of Artificial Intelligence and Machine Learning, 6(2s), 585–599. Retrieved from https://www.svedbergopen.com/index.php/ijaiml/article/view/240

Issue

Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.
