SM-SCAM YOLO: Enhancing Object Detection with Multi-Scale Module and Spatial Channel Attention Mechanism
- 1 Department of Computer Science, BVRIT Hyderabad College of Engineering for Women, Hyderabad, India
- 2 Center for Biomedical Engineering and Research, Chennai Institute of Technology, Chennai, India
Abstract
Detecting tiny objects remains a significant hurdle in computer vision, primarily due to scale variation, occlusion, and the loss of detail in low-resolution features. Although YOLO-based detectors are popular for their speed and efficiency in real-time tasks, they often struggle to identify small objects accurately because information is lost during downsampling. This study introduces an improved YOLO-based model that integrates a Multi-Scale Module (MSM) and a Spatial-Channel Attention Mechanism (SCAM) to address these challenges. The MSM, which replaces YOLO's traditional focus layer, captures features at multiple resolutions to improve localization across object sizes. SCAM, in turn, improves detection accuracy by emphasizing informative spatial and channel features, especially in crowded or visually complex scenes. The model was evaluated on the PKLot dataset and showed notable gains in precision, recall, and mean average precision (mAP) over the standard YOLO-v5 while preserving real-time processing capability. This approach offers a practical and scalable solution for tasks such as smart parking, traffic surveillance, and automated vehicle monitoring, where detecting small-scale objects is essential.
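To make the idea of combined channel and spatial attention concrete, the following is a minimal, parameter-free sketch in plain Python. It is not the authors' SCAM: the abstract does not specify the layers involved, so the learned convolutions or MLPs that a real attention block would use are replaced here by simple averages passed through a sigmoid. The function names (`channel_attention`, `spatial_attention`, `spatial_channel_attention`) and the nested-list feature map layout `[channel][row][column]` are assumptions made only for illustration.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(fmap):
    """Scale each channel by a gate derived from its global average.

    Assumption: a real block would pass the pooled vector through
    learned layers; here the pooled value is gated directly.
    """
    out = []
    for channel in fmap:
        count = sum(len(row) for row in channel)
        avg = sum(sum(row) for row in channel) / count
        gate = sigmoid(avg)
        out.append([[v * gate for v in row] for row in channel])
    return out


def spatial_attention(fmap):
    """Scale each pixel by a gate derived from its mean across channels.

    Assumption: a real block would apply a learned convolution to the
    pooled map; here the per-pixel mean is gated directly.
    """
    c, h, w = len(fmap), len(fmap[0]), len(fmap[0][0])
    gates = [
        [sigmoid(sum(fmap[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]
    return [
        [[fmap[k][i][j] * gates[i][j] for j in range(w)] for i in range(h)]
        for k in range(c)
    ]


def spatial_channel_attention(fmap):
    """Apply channel gating first, then spatial gating (order is an assumption)."""
    return spatial_attention(channel_attention(fmap))


# Tiny 2-channel, 2x2 feature map: one silent channel, one active channel.
features = [
    [[0.0, 0.0], [0.0, 0.0]],
    [[2.0, 2.0], [2.0, 2.0]],
]
attended = spatial_channel_attention(features)
```

The output has the same shape as the input. Each value is multiplied by two gates in (0, 1), so activations are reweighted rather than created, which is the general behavior the abstract attributes to attention over spatial and channel features.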
DOI: https://doi.org/10.3844/jcssp.2025.1343.1353
Copyright: © 2025 Shanmuga Sundari Mariyappan, Kayalvizhi Mohan and K. B. K. S. Durga. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords
- Tiny Object Detection
- YOLO-based Framework
- Multi-Scale Module (MSM)
- Spatial-Channel Attention Mechanism (SCAM)
- Real-time Object Detection
- Autonomous Surveillance Systems