Semi-supervised Cross-Lingual Speech Recognition Exploiting Articulatory Features

Xinmei Su, Xiang Xie, Chenguang Hu, Shu Wu*, Jing Wang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The state-of-the-art (SOTA) Automatic Speech Recognition (ASR) systems are mostly based on the data-driven methods. However, low-resource languages may lack data for training. Articulatory Features (AFs) describe the movements of the vocal organ which can be shared across languages. Thus, this paper investigates AFs-based semi-supervised techniques to share data between languages. First, the traditional acoustic features and the AFs are combined as front-end features to provide articulatory information for cross-lingual knowledge transfer. Then, the dropout-based lattice decoded are used as the pseudo-labels for the unsupervised data to address the problem of data deficiency. In addition, the Lattice-free Maximum Mutual Information (LF-MMI) objective is adopted to better adapt to small datasets. Experiments show that our system can obtain a relative improvement of 58.6% on Character Error Rate (CER) comparing to the baseline system. More specifically, the smaller the datasets are, the more obvious the advantages of our system can be.

源语言英语
主期刊名Pattern Recognition - 27th International Conference, ICPR 2024, Proceedings
编辑Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal
出版商Springer Science and Business Media Deutschland GmbH
141-153
页数13
ISBN(印刷版)9783031801358
DOI
出版状态已出版 - 2025
活动27th International Conference on Pattern Recognition, ICPR 2024 - Kolkata, 印度
期限: 1 12月 20245 12月 2024

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
15333 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议27th International Conference on Pattern Recognition, ICPR 2024
国家/地区印度
Kolkata
时期1/12/245/12/24

指纹

探究 'Semi-supervised Cross-Lingual Speech Recognition Exploiting Articulatory Features' 的科研主题。它们共同构成独一无二的指纹。

引用此