Hybrid deep segmentation architecture using dual attention U-Net and Mask-RCNN for accurate detection of pests, diseases, and weeds in crops
DOI: https://doi.org/10.58414/SCIENTIFICTEMPER.2025.16.7.04
Keywords: Attention mechanism, Deep learning, Mask-RCNN, Plant-village dataset, Smart agriculture, U-Net model.
License
Copyright (c) 2025 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Abstract
Early and accurate identification of pests, diseases, and weeds is crucial for sustainable crop management and yield optimization in modern agriculture. This research proposes a hybrid deep segmentation framework that integrates a Dual Attention U-Net with Mask-RCNN to improve the precision and reliability of plant disease detection under diverse environmental conditions. The core objective is to improve segmentation accuracy and object localization, particularly in complex field imagery with overlapping foliage, variable lighting, and background noise. The proposed architecture uses the Plant-Village dataset, which includes a diverse collection of annotated crop images representing multiple classes of pests, diseases, and weed species. The Dual Attention U-Net emphasizes salient spatial and channel-wise features, enabling refined pixel-level segmentation of affected regions. A Mask-RCNN module then performs instance-aware segmentation and bounding box localization, so that individual anomalies can be identified in detail even in cluttered scenes. Data augmentation and transfer learning further support generalization across crop types. Experimental evaluation shows that the proposed model achieves a Detection Accuracy (DA) of 96.5%, an F1-Score of 95.2%, an AUC-PR of 97.4%, a Sensitivity of 96.5%, a Scalability of 96.2%, and a Processing Time (PT) of 12 seconds per batch, demonstrating both precision and efficiency. Moreover, the architecture shows a Scalability of 96.8%, ensuring robustness in large-scale deployments. The results are compared against baseline models such as CNN, Faster R-CNN, and CBAM.
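The abstract reports accuracy, F1-Score, and Sensitivity without defining them. As a minimal sketch, assuming the standard confusion-matrix definitions (the abstract does not confirm which definitions the authors used), these three metrics can be computed as follows. The `Counts` class and the example numbers are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Counts:
    """Confusion counts for one binary decision, e.g. 'affected' vs 'healthy'."""
    tp: int  # affected regions correctly flagged
    fp: int  # healthy regions wrongly flagged
    fn: int  # affected regions missed
    tn: int  # healthy regions correctly passed


def accuracy(c: Counts) -> float:
    """Share of all decisions that were correct."""
    total = c.tp + c.fp + c.fn + c.tn
    return (c.tp + c.tn) / total if total else 0.0


def sensitivity(c: Counts) -> float:
    """Recall: share of truly affected cases that were detected."""
    denom = c.tp + c.fn
    return c.tp / denom if denom else 0.0


def precision(c: Counts) -> float:
    """Share of flagged cases that were truly affected."""
    denom = c.tp + c.fp
    return c.tp / denom if denom else 0.0


def f1_score(c: Counts) -> float:
    """Harmonic mean of precision and sensitivity."""
    p, r = precision(c), sensitivity(c)
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Made-up counts for illustration only; these are not results from the paper.
example = Counts(tp=90, fp=10, fn=5, tn=95)
print(f"accuracy={accuracy(example):.3f}")
print(f"sensitivity={sensitivity(example):.3f}")
print(f"f1={f1_score(example):.3f}")
```

Under these definitions accuracy and sensitivity are separate quantities: accuracy counts true negatives, sensitivity does not.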
The novelty lies in the hybrid integration of instance-aware detection and attention-driven segmentation, designed explicitly for agricultural settings. Compared with traditional CNN-YOLO pipelines, the model improves detection quality by capturing fine-grained spatial characteristics and separating overlapping anomalies more thoroughly. The model presents a reliable solution for real-time smart agriculture systems aimed at proactive crop health management.
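To make the idea of combined channel-wise and spatial attention concrete, here is a simplified NumPy sketch. It is not the authors' Dual Attention U-Net block: the abstract gives no layer details, so this version is parameter-free (no learned weights), and the function names and the mean-centered sigmoid gating are assumptions chosen for illustration only.

```python
import numpy as np


def sigmoid(x: np.ndarray) -> np.ndarray:
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(features: np.ndarray) -> np.ndarray:
    """Rescale each channel of a (C, H, W) feature map by a gate in (0, 1).

    Assumption: the gate is derived from the channel's global average,
    centered across channels. A trained block would learn this mapping.
    """
    pooled = features.mean(axis=(1, 2))      # one value per channel, shape (C,)
    gate = sigmoid(pooled - pooled.mean())   # shape (C,)
    return features * gate[:, None, None]


def spatial_attention(features: np.ndarray) -> np.ndarray:
    """Rescale each pixel location of a (C, H, W) map by a gate in (0, 1).

    Assumption: the gate is derived from the mean over channels at each
    location, centered across the image.
    """
    pooled = features.mean(axis=0)           # one value per pixel, shape (H, W)
    gate = sigmoid(pooled - pooled.mean())   # shape (H, W)
    return features * gate[None, :, :]


def dual_attention(features: np.ndarray) -> np.ndarray:
    """Apply channel attention, then spatial attention."""
    return spatial_attention(channel_attention(features))


# Random stand-in for an encoder feature map: 4 channels, 8x8 pixels.
rng = np.random.default_rng(0)
feature_map = rng.normal(size=(4, 8, 8))
attended = dual_attention(feature_map)
print(attended.shape)
```

The output keeps the input shape, so a block of this kind could sit between encoder and decoder stages without changing the tensor layout. In a real network the gates would come from learned convolutions or small fully connected layers instead of fixed averages.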

