TY - JOUR
T1 - Machine-learning based strategy identifies a robust protein biomarker panel for Alzheimer’s disease in cerebrospinal fluid
AU - Hou, Xiaosen
AU - Qiu, Yunjie
AU - Li, Hui
AU - Yan, Yan
AU - Zhao, Dongxu
AU - Ji, Simei
AU - Ni, Junjun
AU - Zhang, Jun
AU - Liu, Kefu
AU - Qing, Hong
AU - Quan, Zhenzhen
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Background: The complex pathogenesis of Alzheimer’s disease (AD) has resulted in limited current biomarkers for its classification and diagnosis, necessitating further investigation into reliable universal biomarkers or combinations. Methods: In this work, we collect multiple CSF proteomics datasets and build a universal diagnose model by SVM-RFECV method combined with equal sample size and standard normalization design. The model was training in 297_CSF and then test the effect in other datasets. Results: Utilizing machine learning, we identify a 12-protein panel from cerebrospinal fluid proteomic datasets. The universal diagnosis model demonstrated strong diagnostic capability and high accuracy across ten different AD cohorts across different countries and different detection technologies. These proteins involved in various biological processes related to AD and shows a tight correlation with established AD pathogenic biomarkers, including amyloid-β, tau/p-tau, and the Montreal Cognitive Assessment score. The high accuracy in the model may due to multiple protein combination based on comprehensive pathogenesis and different AD progress. Furthermore, it effectively differentiates AD from mild cognitive impairment (MCI) and other neurodegenerative disorders, especially the frontotemporal dementia (FTD), which share similar pathogenesis as AD. Conclusion: This study highlights a high accuracy, robustness and compatibility model of 12-protein panel whose detection is even based on label-free, TMT and DIA mass spectrometry or ELISA technologies, implicating its potential prospect in clinical application.
AB - Background: The complex pathogenesis of Alzheimer’s disease (AD) has resulted in limited current biomarkers for its classification and diagnosis, necessitating further investigation into reliable universal biomarkers or combinations. Methods: In this work, we collect multiple CSF proteomics datasets and build a universal diagnose model by SVM-RFECV method combined with equal sample size and standard normalization design. The model was training in 297_CSF and then test the effect in other datasets. Results: Utilizing machine learning, we identify a 12-protein panel from cerebrospinal fluid proteomic datasets. The universal diagnosis model demonstrated strong diagnostic capability and high accuracy across ten different AD cohorts across different countries and different detection technologies. These proteins involved in various biological processes related to AD and shows a tight correlation with established AD pathogenic biomarkers, including amyloid-β, tau/p-tau, and the Montreal Cognitive Assessment score. The high accuracy in the model may due to multiple protein combination based on comprehensive pathogenesis and different AD progress. Furthermore, it effectively differentiates AD from mild cognitive impairment (MCI) and other neurodegenerative disorders, especially the frontotemporal dementia (FTD), which share similar pathogenesis as AD. Conclusion: This study highlights a high accuracy, robustness and compatibility model of 12-protein panel whose detection is even based on label-free, TMT and DIA mass spectrometry or ELISA technologies, implicating its potential prospect in clinical application.
KW - Alzheimer’s disease
KW - Biomarker
KW - Cerebrospinal fluid
KW - Machine learning
UR - http://www.scopus.com/pages/publications/105010075003
U2 - 10.1186/s13195-025-01789-5
DO - 10.1186/s13195-025-01789-5
M3 - Article
C2 - 40616179
AN - SCOPUS:105010075003
SN - 1758-9193
VL - 17
JO - Alzheimer's Research and Therapy
JF - Alzheimer's Research and Therapy
IS - 1
M1 - 147
ER -