M2PAIR: A High-Quality Acoustic Impulse Response Computation Model

Zhiyu Li, Xinpei Zhao, Jing Wang, Xinyuan Qian, Xiang Xie

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Acoustic Impulse Response (AIR) provides crucial spatial information about the environment, significantly enhancing audio immersion. However, achieving high perceptual quality while computing AIR in real-time for interactive audio-video media (IAVM) presents a challenging problem. This study proposes the Mesh to Parametric AIR (M2PAIR), a method for computing AIR designed for IAVM. M2PAIR integrates neural networks with psychoacoustics. It takes the 3D scene mesh, the listener positions, and the sound source positions as inputs, utilizes perceptual parameters as intermediaries, and computes the desired high-quality AIR signal based on these parameters. Experimental results demonstrate that M2PAIR improves the perceptual quality of AIR output compared to existing methods while reducing the model complexity. Additionally, it meets the requirements of IAVM, including real-time computation, high sampling rates, and flexible duration for the output AIR.

源语言英语
主期刊名2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Proceedings
编辑Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350368741
DOI
出版状态已出版 - 2025
已对外发布
活动2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, 印度
期限: 6 4月 202511 4月 2025

出版系列

姓名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(印刷版)1520-6149

会议

会议2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
国家/地区印度
Hyderabad
时期6/04/2511/04/25

指纹

探究 'M2PAIR: A High-Quality Acoustic Impulse Response Computation Model' 的科研主题。它们共同构成独一无二的指纹。

引用此