TY - JOUR
T1 - SPTU-Lite
T2 - 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
AU - Shan, Chuxuan
AU - Wang, Xiaohua
AU - Qi, Hang
AU - Meng, Qingxu
AU - Wang, Jiabao
AU - Wang, Weijiang
AU - Shi, Yueting
N1 - Publisher Copyright:
© 2025 Institute of Electrical and Electronics Engineers Inc.. All rights reserved.
PY - 2025
Y1 - 2025
N2 - Electrocardiogram (ECG) is essential for the early prevention and diagnosis of cardiovascular diseases. The rise of wearable ECG devices has facilitated high-precision heart rate detection, enabling early interventions. Advances in Artificial Neural Networks (ANNs), particularly One-Dimensional Convolutional Neural Networks (CNNs) and Transformers, have significantly improved ECG signal classification. However, their computational and parametric demands limit their application, especially in small-scale devices. Spiking Neural Networks (SNNs), which operate on an event-driven principle, offer a solution with their resource efficiency. In this study, we propose a U-shaped hybrid CNN-Transformer architecture combining Spiking Fully Connected (FC) Layers and large kernel extractor (LKE) modules, named Spiking TransU-Lite (SPTU-Lite). The LKE module enhances the network's receptive field and improves parameter efficiency, reducing model size without significant precision loss. Additionally, spiking neurons replace the Fully Connected classification layer, further reducing the model's complexity. Empirical results on the MIT-BIH dataset show that SPTU-Lite reduces parameters by approximately 95%, while maintaining an accuracy of 99.62% and a loss rate below 0.5%, demonstrating its potential for ECG analysis in resourceconstrained environments.
AB - Electrocardiogram (ECG) is essential for the early prevention and diagnosis of cardiovascular diseases. The rise of wearable ECG devices has facilitated high-precision heart rate detection, enabling early interventions. Advances in Artificial Neural Networks (ANNs), particularly One-Dimensional Convolutional Neural Networks (CNNs) and Transformers, have significantly improved ECG signal classification. However, their computational and parametric demands limit their application, especially in small-scale devices. Spiking Neural Networks (SNNs), which operate on an event-driven principle, offer a solution with their resource efficiency. In this study, we propose a U-shaped hybrid CNN-Transformer architecture combining Spiking Fully Connected (FC) Layers and large kernel extractor (LKE) modules, named Spiking TransU-Lite (SPTU-Lite). The LKE module enhances the network's receptive field and improves parameter efficiency, reducing model size without significant precision loss. Additionally, spiking neurons replace the Fully Connected classification layer, further reducing the model's complexity. Empirical results on the MIT-BIH dataset show that SPTU-Lite reduces parameters by approximately 95%, while maintaining an accuracy of 99.62% and a loss rate below 0.5%, demonstrating its potential for ECG analysis in resourceconstrained environments.
KW - Depth-wise Separable Convolutions
KW - ECG Classification
KW - Large Kernel
KW - Lightweighting
KW - Spiking Neural Networks
UR - http://www.scopus.com/pages/publications/105009599471
U2 - 10.1109/ICASSP49660.2025.10889748
DO - 10.1109/ICASSP49660.2025.10889748
M3 - Conference article
AN - SCOPUS:105009599471
SN - 0736-7791
JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Y2 - 6 April 2025 through 11 April 2025
ER -