High-precision prediction of fluorescence wavelength of organic based on ensemble automatic machine learning method and online querying

Shao Zhao, Jiadong Li, Lingjun Wu, Xiaoyan Zheng*, Anping Tang*, Wanqiang Liu*

*Corresponding author for this work

Research output: Contribution to journal, Literature review, Peer-reviewed

Abstract

Organic fluorescence is widely applied in biomedical imaging, chemical sensing, and environmental monitoring. However, the traditional trial-and-error approach to determining the emission wavelength of organic fluorescent molecules is both time-consuming and labour-intensive. Ensemble automated machine learning (AutoML) methods provide a convenient way to evaluate the fluorescence properties of organic compounds. In this work, we constructed a comprehensive fluorescence database containing 24798 organic fluorescent compounds, whose maximum emission wavelengths (λem) range from 240 nm to 1200 nm. The database was built from recent peer-reviewed publications; molecular structures were standardized, and duplicate entries were removed. This dataset was used to train machine learning models for predicting λem. Among the prediction models built using AutoGluon, the WeightedEnsemble_L2 model performed best, with a mean absolute error (MAE) of 10 nm on the test set. Shapley additive explanation (SHAP) analysis revealed critical molecular descriptors governing λem, offering actionable insights for molecular engineering. The model was deployed as an open-access web platform (http://predixct-ednk9cynnprgqjbmskl95f.streamlit.app), enabling rapid screening of fluorophores for optoelectronic and sensing applications. This work bridges the gap between data-driven design and experimental synthesis, providing a robust tool to accelerate the development of tailored fluorescent probes for chemical sensing, bioimaging, and optical diagnostics.
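
As a rough illustration of two steps the abstract mentions, duplicate removal and the MAE metric, the following is a minimal sketch only. It is not the authors' pipeline: the helper names, the string keys standing in for standardized structures, and all wavelength values are hypothetical and are not taken from the paper's database.

```python
def drop_duplicates(records):
    """Keep the first (structure_key, wavelength_nm) pair seen for each key.

    Assumption: structure_key is already a standardized identifier
    (for example a canonical structure string); how the authors
    standardized structures is not specified here.
    """
    seen = set()
    unique = []
    for key, wavelength_nm in records:
        if key not in seen:
            seen.add(key)
            unique.append((key, wavelength_nm))
    return unique


def mean_absolute_error(y_true, y_pred):
    """Average of |measured - predicted| over all samples, in nm."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical records: the second "mol_A" entry is a duplicate.
records = [("mol_A", 450.0), ("mol_A", 451.0), ("mol_B", 520.0)]
unique_records = drop_duplicates(records)

# Hypothetical measured vs. predicted emission maxima (nm).
measured = [450.0, 520.0, 610.0, 705.0]
predicted = [462.0, 511.0, 618.0, 694.0]
mae = mean_absolute_error(measured, predicted)
print(f"MAE = {mae:.1f} nm")
```

An MAE of 10 nm means that predictions deviate from the measured λem by 10 nm on average over the evaluated compounds; it does not bound the error for any individual molecule.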

Original language: English
Article number: 113012
Journal: Dyes and Pigments
Volume: 242
DOI
Publication status: Published - Nov 2025
Externally published
