TY - JOUR
T1 - IDDNet
T2 - Infrared Object Detection Network Based on Multi-Scale Fusion Dehazing
AU - Sun, Shizun
AU - Han, Shuo
AU - Xu, Junwei
AU - Zhao, Jie
AU - Xu, Ziyu
AU - Li, Lingjie
AU - Han, Zhaoming
AU - Mo, Bo
N1 - Publisher Copyright:
© 2025 by the authors.
PY - 2025/4
Y1 - 2025/4
N2 - In foggy environments, infrared images suffer from reduced contrast, degraded details, and blurred objects, which impair detection accuracy and real-time performance. To tackle these issues, we propose IDDNet, a lightweight infrared object detection network that integrates multi-scale fusion dehazing. IDDNet includes a multi-scale fusion dehazing (MSFD) module, which uses multi-scale feature fusion to eliminate haze interference while preserving key object details. A dedicated dehazing loss function, DhLoss, further improves the dehazing effect. In addition to MSFD, IDDNet incorporates three main components: (1) bidirectional polarized self-attention, (2) a weighted bidirectional feature pyramid network, and (3) multi-scale object detection layers. This architecture ensures high detection accuracy and computational efficiency. A two-stage training strategy optimizes the model’s performance, enhancing its accuracy and robustness in foggy environments. Extensive experiments on public datasets demonstrate that IDDNet achieves 89.4% precision and 83.9% AP, showing its superior accuracy, processing speed, generalization, and robust detection performance.
AB - In foggy environments, infrared images suffer from reduced contrast, degraded details, and blurred objects, which impair detection accuracy and real-time performance. To tackle these issues, we propose IDDNet, a lightweight infrared object detection network that integrates multi-scale fusion dehazing. IDDNet includes a multi-scale fusion dehazing (MSFD) module, which uses multi-scale feature fusion to eliminate haze interference while preserving key object details. A dedicated dehazing loss function, DhLoss, further improves the dehazing effect. In addition to MSFD, IDDNet incorporates three main components: (1) bidirectional polarized self-attention, (2) a weighted bidirectional feature pyramid network, and (3) multi-scale object detection layers. This architecture ensures high detection accuracy and computational efficiency. A two-stage training strategy optimizes the model’s performance, enhancing its accuracy and robustness in foggy environments. Extensive experiments on public datasets demonstrate that IDDNet achieves 89.4% precision and 83.9% AP, showing its superior accuracy, processing speed, generalization, and robust detection performance.
KW - attention mechanism
KW - deep learning
KW - dehazing
KW - feature fusion
KW - infrared object detection
UR - http://www.scopus.com/pages/publications/105002240315
U2 - 10.3390/s25072169
DO - 10.3390/s25072169
M3 - Article
AN - SCOPUS:105002240315
SN - 1424-8220
VL - 25
JO - Sensors
JF - Sensors
IS - 7
M1 - 2169
ER -