TY - JOUR
T1 - Street scenes object detection based on infrared images and improved YOLOv5 network
AU - Tan, Ailing
AU - Li, Xiaohang
AU - Zhao, Yong
AU - Gao, Meijing
N1 - Publisher Copyright:
© 2025 SPIE and IS&T.
PY - 2025/5/1
Y1 - 2025/5/1
N2 - To address the issues of object detection within infrared street scene images, such as low resolution and significant disparities in the feature scale of targets, we propose an MD-YOLOv5 network to improve the detection accuracy of bicycles, cars, and pedestrians. Based on coordinate attention, a multiscale coordinate attention module was designed to simultaneously extract both multiscale spatial features and channel features through pooling of different scales. A dense-C3 structure based on a dense connection approach was designed in the YOLOv5 backbone network to strengthen the transmission of features. Using the internationally available FLIR dataset, the experimental results show that mAP@0.5 and mAP@0.5:0.95 of MD-YOLOv5 reached 80.1% and 41.2%, respectively. Compared with SSD, YOLOv4, YOLOv5, YOLOv8, and YOLOv11, the accuracy of the proposed MD-YOLOv5 methodology has been increased by 16.98%, 13.3%, 2.7%, 2.5%, and 2.3%, respectively. The object detection method based on multiscale coordinate attention and dense-C3 structure proposed in this paper offers a new approach to the detection of infrared images.
AB - To address the issues of object detection within infrared street scene images, such as low resolution and significant disparities in the feature scale of targets, we propose an MD-YOLOv5 network to improve the detection accuracy of bicycles, cars, and pedestrians. Based on coordinate attention, a multiscale coordinate attention module was designed to simultaneously extract both multiscale spatial features and channel features through pooling of different scales. A dense-C3 structure based on a dense connection approach was designed in the YOLOv5 backbone network to strengthen the transmission of features. Using the internationally available FLIR dataset, the experimental results show that mAP@0.5 and mAP@0.5:0.95 of MD-YOLOv5 reached 80.1% and 41.2%, respectively. Compared with SSD, YOLOv4, YOLOv5, YOLOv8, and YOLOv11, the accuracy of the proposed MD-YOLOv5 methodology has been increased by 16.98%, 13.3%, 2.7%, 2.5%, and 2.3%, respectively. The object detection method based on multiscale coordinate attention and dense-C3 structure proposed in this paper offers a new approach to the detection of infrared images.
KW - dense-C3
KW - infrared images
KW - multiscale coordinate attention
KW - object detection
KW - YOLOv5
UR - http://www.scopus.com/pages/publications/105009781185
U2 - 10.1117/1.JEI.34.3.033048
DO - 10.1117/1.JEI.34.3.033048
M3 - Article
AN - SCOPUS:105009781185
SN - 1017-9909
VL - 34
JO - Journal of Electronic Imaging
JF - Journal of Electronic Imaging
IS - 3
M1 - 033048
ER -