IDDNet: Infrared Object Detection Network Based on Multi-Scale Fusion Dehazing

Shizun Sun; Shuo Han; Junwei Xu; Jie Zhao; Ziyu Xu; Lingjie Li; Zhaoming Han; Bo Mo

doi:10.3390/s25072169

Corresponding author: Bo Mo

Research output: Contribution to journal › Article › peer-review

1 citation (Scopus)

Abstract

In foggy environments, infrared images suffer from reduced contrast, degraded details, and blurred objects, which impair detection accuracy and real-time performance. To tackle these issues, we propose IDDNet, a lightweight infrared object detection network that integrates multi-scale fusion dehazing. IDDNet includes a multi-scale fusion dehazing (MSFD) module, which uses multi-scale feature fusion to eliminate haze interference while preserving key object details. A dedicated dehazing loss function, DhLoss, further improves the dehazing effect. In addition to MSFD, IDDNet incorporates three main components: (1) bidirectional polarized self-attention, (2) a weighted bidirectional feature pyramid network, and (3) multi-scale object detection layers. This architecture ensures high detection accuracy and computational efficiency. A two-stage training strategy optimizes the model’s performance, enhancing its accuracy and robustness in foggy environments. Extensive experiments on public datasets demonstrate that IDDNet achieves 89.4% precision and 83.9% AP, showing its superior accuracy, processing speed, generalization, and robust detection performance.

Original language	English
Article number	2169
Journal	Sensors
Volume	25
Issue number	7
DOI	http://doi.org/10.3390/s25072169
Publication status	Published - Apr 2025
Externally published	Yes

Cite this

@article{41ae60cb013448d69ed8d9a3ff66ed9e,

title = "{IDDNet}: Infrared Object Detection Network Based on Multi-Scale Fusion Dehazing",

abstract = "In foggy environments, infrared images suffer from reduced contrast, degraded details, and blurred objects, which impair detection accuracy and real-time performance. To tackle these issues, we propose IDDNet, a lightweight infrared object detection network that integrates multi-scale fusion dehazing. IDDNet includes a multi-scale fusion dehazing (MSFD) module, which uses multi-scale feature fusion to eliminate haze interference while preserving key object details. A dedicated dehazing loss function, DhLoss, further improves the dehazing effect. In addition to MSFD, IDDNet incorporates three main components: (1) bidirectional polarized self-attention, (2) a weighted bidirectional feature pyramid network, and (3) multi-scale object detection layers. This architecture ensures high detection accuracy and computational efficiency. A two-stage training strategy optimizes the model{\textquoteright}s performance, enhancing its accuracy and robustness in foggy environments. Extensive experiments on public datasets demonstrate that IDDNet achieves 89.4\% precision and 83.9\% AP, showing its superior accuracy, processing speed, generalization, and robust detection performance.",

keywords = "attention mechanism, deep learning, dehazing, feature fusion, infrared object detection",

author = "Shizun Sun and Shuo Han and Junwei Xu and Jie Zhao and Ziyu Xu and Lingjie Li and Zhaoming Han and Bo Mo",

note = "Publisher Copyright: {\textcopyright} 2025 by the authors.",

year = "2025",

month = apr,

doi = "10.3390/s25072169",

language = "English",

volume = "25",

journal = "Sensors",

issn = "1424-8220",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "7",

}

TY - JOUR

T1 - IDDNet

T2 - Infrared Object Detection Network Based on Multi-Scale Fusion Dehazing

AU - Sun, Shizun

AU - Han, Shuo

AU - Xu, Junwei

AU - Zhao, Jie

AU - Xu, Ziyu

AU - Li, Lingjie

AU - Han, Zhaoming

AU - Mo, Bo

PY - 2025/4

Y1 - 2025/4

N2 - In foggy environments, infrared images suffer from reduced contrast, degraded details, and blurred objects, which impair detection accuracy and real-time performance. To tackle these issues, we propose IDDNet, a lightweight infrared object detection network that integrates multi-scale fusion dehazing. IDDNet includes a multi-scale fusion dehazing (MSFD) module, which uses multi-scale feature fusion to eliminate haze interference while preserving key object details. A dedicated dehazing loss function, DhLoss, further improves the dehazing effect. In addition to MSFD, IDDNet incorporates three main components: (1) bidirectional polarized self-attention, (2) a weighted bidirectional feature pyramid network, and (3) multi-scale object detection layers. This architecture ensures high detection accuracy and computational efficiency. A two-stage training strategy optimizes the model’s performance, enhancing its accuracy and robustness in foggy environments. Extensive experiments on public datasets demonstrate that IDDNet achieves 89.4% precision and 83.9% AP, showing its superior accuracy, processing speed, generalization, and robust detection performance.

KW - attention mechanism

KW - deep learning

KW - dehazing

KW - feature fusion

KW - infrared object detection

UR - http://www.scopus.com/pages/publications/105002240315

U2 - 10.3390/s25072169

DO - 10.3390/s25072169

M3 - Article

AN - SCOPUS:105002240315

SN - 1424-8220

VL - 25

JO - Sensors

JF - Sensors

IS - 7

M1 - 2169

ER -
