MedKit: Multi-level feature distillation with knowledge injection for radiology report generation

Zhaoli Su, Hong Song*, Yucong Lin, You Wu, Xutao Weng, Zhongxuan Mao, Bowen Liu, Hongxia Yin, Jian Yang

*Corresponding author for this work

Research output: Contribution to journal, Article, Peer-reviewed

Abstract

Radiology report generation automates the creation of clinically accurate and coherent paragraphs from medical images, reducing the heavy burden of report writing for radiologists. However, current research in this field still faces limitations and urgently requires breakthroughs in feature extraction of image knowledge and model fusion. In this paper, we propose a radiology report generation framework, MedKit, that integrates high-information-density knowledge fusion with multi-level task feature distillation. We leverage knowledge embedding fusion through a knowledge graph to reduce semantic hallucinations. Additionally, feature extraction techniques within a multi-level task feature distillation architecture provide comprehensive image feature information for the primary task. To handle both 2D and 3D images, we propose a separate visual encoder for each, which addresses the issue of inconsistent shapes in medical images. Finally, a multimodal large model framework enables the generated radiology report to closely approximate medical experts’ fluent expression. Our proposed model significantly outperformed the state-of-the-art model on the MIMIC-CXR dataset, with a 20.1% relative increase in the BLEU-4 score, from 0.134 to 0.161. We also achieved the best result on the private Liver-CT dataset. Our code is available at http://github.com/sujaly/MedKit.
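The 20.1% figure reported in the abstract is a relative increase over the baseline score, not an absolute difference in BLEU-4 points. The short check below reproduces that arithmetic from the two scores quoted above; the function and variable names are illustrative and are not taken from the MedKit code base.

```python
def relative_increase(old: float, new: float) -> float:
    """Return the change from old to new as a percentage of old."""
    return (new - old) / old * 100.0


# Both scores are quoted from the abstract (MIMIC-CXR, BLEU-4).
baseline_bleu4 = 0.134  # previous state-of-the-art model
medkit_bleu4 = 0.161    # MedKit

gain = relative_increase(baseline_bleu4, medkit_bleu4)
print(f"Relative BLEU-4 increase: {gain:.1f}%")
```

The absolute gain is 0.027 BLEU-4 points; dividing by the baseline of 0.134 gives the relative figure.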

Original language: English
Article number: 129003
Journal: Expert Systems with Applications
Volume: 296
DOI
Publication status: Published - 15 Jan 2026
Externally published
