单位:[a]School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China[b]School of Medical Information and Engineering, Southwest Medical University, Luzhou 646000, China[c]Heima Digital Technology Ltd, Luzhou 646000, China[d]Chuanjiang Science and Technology Research Institute Ltd, Luzhou 646000, China[e]Division of International Cooperation, Health Commission of Sichuan Province, Chengdu, 610041, China[f]Beijing Friendship Hospital, Captial Medical University, Beijing 100050, China首都医科大学附属北京友谊医院[g]School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, China[h]Central Nervous System Drug Key Laboratory of Sichuan Province, Luzhou 646000, China
Diabetes is a global epidemic. Long-term exposure to hyperglycemia can cause chronic damage to various tissues. Thus, early diagnosis of diabetes is crucial. In this study, we designed a computational system to predict diabetes risk by fusing multifarious types of physical examination data. We collected 1,507,563 physical examination data of healthy people and diabetes patients, as well as 387,076 physical examination data from the follow-up records from 2011 to 2017 of diabetes patients in Luzhou City in China. Three types of physical examination indexes were statistically analyzed: demographics, vital signs, and laboratory values. To distinguish diabetes patients from healthy people, a model based on eXtreme Gradient Boosting (XGBoost) was developed, which could produce an area under the receiver operating characteristic curve (AUC) of 0.8768. Moreover, to improve the convenience and flexibility of the model in clinical and real-life scenarios, a diabetes risk scorecard was established based on logistic regression, which could evaluate human health. Lastly, we statistically analyzed the data from the follow-up records to identify the key factors influencing patient control of their conditions. To improve the diabetes cascade screening and personal lifestyle management, an online diabetes risk assessment system was established, which can be freely accessed at http://lin-group.cn/server/DRSC/index.html. This system is expected to provide guidance for human health management.
基金:
National Nature Scientific Foundation of ChinaNational Natural Science Foundation of China (NSFC) [61772119, 61702430]; Sichuan Provincial Science Fund for Distinguished Young Scholars [2020JDJQ0012]
语种:
外文
被引次数:
WOS:
中科院(CAS)分区:
出版当年[2020]版:
大类|1 区工程技术
小类|1 区计算机:人工智能1 区计算机:理论方法
最新[2025]版:
大类|1 区计算机科学
小类|1 区计算机:人工智能1 区计算机:理论方法
JCR分区:
出版当年[2019]版:
Q1COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCEQ1COMPUTER SCIENCE, THEORY & METHODS
最新[2023]版:
Q1COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCEQ1COMPUTER SCIENCE, THEORY & METHODS
第一作者单位:[a]School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
通讯作者:
通讯机构:[g]School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, China[h]Central Nervous System Drug Key Laboratory of Sichuan Province, Luzhou 646000, China
推荐引用方式(GB/T 7714):
Yang H,Luo Y,Ren X,et al.Risk Prediction of Diabetes: Big data mining with fusion of multifarious physical examination indicators[J].INFORMATION FUSION.2021,75:140-149.doi:10.1016/j.inffus.2021.02.015.
APA:
Yang, H,Luo, Y,Ren, X,Wu, M,He, X...&Lin, H.(2021).Risk Prediction of Diabetes: Big data mining with fusion of multifarious physical examination indicators.INFORMATION FUSION,75,
MLA:
Yang, H,et al."Risk Prediction of Diabetes: Big data mining with fusion of multifarious physical examination indicators".INFORMATION FUSION 75.(2021):140-149