TY - JOUR
T1 - Scalable Hardware Architecture for High-Throughput Implementation of ESPRIT Algorithm
AU - Huang, Yanjie
AU - Wang, Weijiang
AU - Xue, Chengbo
AU - Jiang, Rongkun
AU - Dang, Hua
AU - Ren, Shiwei
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Estimating signal parameter via rotational invariance technique (ESPRIT) algorithm is a high-performance method for direction of arrival (DOA) estimation. Its most cumbersome aspect is performing eigenvalue decomposition twice, particularly with general square matrices. Current implementations lack flexibility in configuring the number of array elements, signal sources, and snapshots at the same time. To tackle these challenges, we propose a scalable hardware acceleration scheme for the QR decomposition (QRD) algorithm on a field programmable gate array (FPGA). Our QRD module leverages orientation displacement, parallel computing, and logic reuse to enhance speed and generality. Building upon this foundation, a reconfigurable design scheme with a parallel Covariance Matrix Computation (CMC) module for ESPRIT is proposed to increase calculation speed and throughput. Our ESPRIT implementation supports up to 26 array elements, 25 signal sources, and 2048 snapshots. As evidenced by experiments, the root mean squared error (RMSE) of the QRD module and ESPRIT implementation achieve 0.02 and 0.04°, respectively. On average, when the number of array elements, signal sources, and snapshots are set to 4, 1, 64 and 8, 1, 128 individually, our ESPRIT implementation completes DOA estimation in 22.04 and 60.50 μs, with throughput of 45372 and 16529 separately. In comparison with CPU implementations, our FPGA implementations offer up to 88.66% and 89.84% time savings for QRD and ESPRIT, respectively.
AB - Estimating signal parameter via rotational invariance technique (ESPRIT) algorithm is a high-performance method for direction of arrival (DOA) estimation. Its most cumbersome aspect is performing eigenvalue decomposition twice, particularly with general square matrices. Current implementations lack flexibility in configuring the number of array elements, signal sources, and snapshots at the same time. To tackle these challenges, we propose a scalable hardware acceleration scheme for the QR decomposition (QRD) algorithm on a field programmable gate array (FPGA). Our QRD module leverages orientation displacement, parallel computing, and logic reuse to enhance speed and generality. Building upon this foundation, a reconfigurable design scheme with a parallel Covariance Matrix Computation (CMC) module for ESPRIT is proposed to increase calculation speed and throughput. Our ESPRIT implementation supports up to 26 array elements, 25 signal sources, and 2048 snapshots. As evidenced by experiments, the root mean squared error (RMSE) of the QRD module and ESPRIT implementation achieve 0.02 and 0.04°, respectively. On average, when the number of array elements, signal sources, and snapshots are set to 4, 1, 64 and 8, 1, 128 individually, our ESPRIT implementation completes DOA estimation in 22.04 and 60.50 μs, with throughput of 45372 and 16529 separately. In comparison with CPU implementations, our FPGA implementations offer up to 88.66% and 89.84% time savings for QRD and ESPRIT, respectively.
KW - Direction of arrival (DOA) estimation
KW - QR decomposition (QRD) algorithm
KW - estimating signal parameter via rotational invariance technique (ESPRIT) algorithm
KW - field programmable gate array (FPGA)
KW - hardware implementation
KW - uniform linear array (ULA)
UR - http://www.scopus.com/pages/publications/105005998103
U2 - 10.1109/JSEN.2025.3568483
DO - 10.1109/JSEN.2025.3568483
M3 - Article
AN - SCOPUS:105005998103
SN - 1530-437X
VL - 25
SP - 24812
EP - 24828
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 13
ER -