单位:[1]Institute of Electronics Chinese Academy of Sciences, Beijing, China[2]Yanshan University, Qinhuangdao, China[3]Army General Hospital of PLA, Beijing, China[4]China-Japan Friendship Hospital, China[5]University of Chinese Academy of Sciences, Beijing, China
With the development of biomedical equipment and healthcare level, large amounts of data have been brought out in hospital, especially in Intensive Care Unit (ICU). However, how to better exploit meaningful information from these rich data still remains a challenge. This paper focuses on ICU mortality prediction, which is a typical example of second use of ICU big data. Patient ICU mortality prediction faces challenges in many aspects, such as high dimensionality, imbalance distribution and time asynchronization etc. To solve these challenges, a series of analytical methods and tools, including variables selection, preprocessing, feature extraction & feature selection and predictive modeling, have been utilized and developed. High-dimensional and unbalanced natures of the ICU data badly affect the performance of classifiers. We modified the cost-sensitive principal component analysis (CSPCA), which is denoted by MCSPCA, to handle these problems in feature extraction stage. As for parameter optimization, a variant of standard particle swarm optimization called chaos particle swarm optimization (CPSO) was adopted for its capacity of finding optimal solution. In order to obtain the best prediction model, different algorithms were investigated and their AUC performances were evaluated in a large real world benchmark data. The final results show that our proposed method improved the performance of the traditional machine learning methods, in which the support vector machine (SVM) reach best AUC performance of 0.7718. This study gives a paradigm to handle similar problems in big health data and helps promote healthcare services. (C) 2018 Published by Elsevier B.V.
基金:
National Key Research and Development Project [2016YFC1304302, 2016YFC0206502, 2016YFC1303900]; Key Project of Beijing Municipal Natural Science Foundation [Z16003]; Training Program of the Major Research Plan of the National Natural Science Foundation of ChinaNational Natural Science Foundation of China (NSFC) [91643115]; [012016018300B12204]
第一作者单位:[1]Institute of Electronics Chinese Academy of Sciences, Beijing, China[2]Yanshan University, Qinhuangdao, China
通讯作者:
通讯机构:[1]Institute of Electronics Chinese Academy of Sciences, Beijing, China[2]Yanshan University, Qinhuangdao, China[4]China-Japan Friendship Hospital, China[5]University of Chinese Academy of Sciences, Beijing, China[*1]No. 19 North 4th Ring Road West, Haidian District, Beijing, 100190, China.[*2]No. 438, west section of Hebei Avenue, Qinhuangdao Harbor District, Hebei Province, 066004, China[*3]Yinghua Dongjie, Hepingli, Beijing, 100029, China
推荐引用方式(GB/T 7714):
Liu Jiankang,Chen Xian Xiang,Fang Lipeng,et al.Mortality prediction based on imbalanced high-dimensional ICU big data[J].COMPUTERS in INDUSTRY.2018,98:218-225.doi:10.1016/j.compind.2018.01.017.
APA:
Liu, Jiankang,Chen, Xian Xiang,Fang, Lipeng,Li, Jun Xia,Yang, Ting...&Fang, Zhen.(2018).Mortality prediction based on imbalanced high-dimensional ICU big data.COMPUTERS in INDUSTRY,98,
MLA:
Liu, Jiankang,et al."Mortality prediction based on imbalanced high-dimensional ICU big data".COMPUTERS in INDUSTRY 98.(2018):218-225