Using a classification model for determining the value of liver radiological reports of patients with colorectal cancer

Document details

Resource type:
WOS category:
PubMed category:

Indexed in: SCIE

Affiliations: [1] Department of Radiology, Beijing Friendship Hospital, Capital Medical University, Beijing, China. [2] School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China. [3] School of Biological Science and Medical Engineering, Beihang University, Beijing, China.
Source:
ISSN:

Keywords: natural language processing; colorectal cancer; liver lesion; medical imaging report; classification model

Abstract:
Medical imaging is critical in clinical practice, and high-value radiological reports can positively assist clinicians. However, methods for determining the value of reports are lacking. The purpose of this study was to establish an ensemble learning classification model that applies natural language processing (NLP) to the Chinese free text of radiological reports to determine their value for liver lesion detection in patients with colorectal cancer (CRC).
Radiological reports of upper abdominal computed tomography (CT) and magnetic resonance imaging (MRI) were divided into five categories according to the results of liver lesion detection in patients with CRC. NLP methods, including word segmentation, stop word removal, and n-gram language model construction, were applied to each dataset. A bag-of-words model was then built, high-frequency words were selected as features, and an ensemble learning classification model was constructed. Several machine learning methods were applied, including logistic regression (LR), random forest (RF), and others. We compared the accuracy of a priori selection of pertinent word strings with that of our machine learning methods.
The dataset of 2790 patients included CT without contrast (10.2%), CT with/without contrast (73.3%), MRI without contrast (1.8%), and MRI with/without contrast (14.6%). The ensemble learning classification model determined the value of reports effectively, reaching an accuracy of 95.91% on the CT with/without contrast dataset using XGBoost. Logistic regression, random forest, and support vector machine also achieved good classification accuracy, reaching 95.89%, 95.04%, and 95.00%, respectively. The XGBoost results were visualized using a confusion matrix; the numbers of errors in categories I, II, and V were very small. ELI5 was used to select important words for each category. Words such as "no abnormality", "suggest", "fatty liver", and "transfer" showed a relatively strong positive correlation with classification accuracy. The accuracy of the string pattern search method was lower than that of machine learning.
The NLP-based classification model was an effective tool for determining the value of radiological reports focused on liver lesions. The study made it possible to analyze the value of medical imaging examinations on a large scale.
Copyright © 2022 Liu, Zhang, Lv, Li, Liu, Yang, Weng, Lin, Song and Wang.
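The feature-extraction steps named in the abstract (stop word removal, n-gram construction, a bag-of-words model, and selection of high-frequency terms) can be illustrated with a minimal sketch. This is not the authors' code. It assumes the reports have already been segmented into space-separated tokens, whereas the study applied word segmentation to Chinese free text. The stop-word list, the toy reports, and all function names are hypothetical.

```python
from collections import Counter

# Hypothetical stop-word list; the study's actual list is not given in the abstract.
STOP_WORDS = {"the", "of", "is", "in"}


def tokenize(report):
    """Lowercase, split on whitespace, and drop stop words."""
    return [w for w in report.lower().split() if w not in STOP_WORDS]


def ngrams(tokens, n_max=2):
    """Return all contiguous n-grams for n = 1..n_max, joined with spaces."""
    grams = []
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[i:i + n]))
    return grams


def build_vocabulary(reports, top_k=4, n_max=2):
    """Keep the top_k most frequent n-grams in the corpus as features."""
    counts = Counter()
    for report in reports:
        counts.update(ngrams(tokenize(report), n_max))
    return [gram for gram, _ in counts.most_common(top_k)]


def vectorize(report, vocabulary, n_max=2):
    """Map one report to a vector of n-gram counts over the vocabulary."""
    counts = Counter(ngrams(tokenize(report), n_max))
    return [counts[gram] for gram in vocabulary]


# Toy, pre-segmented reports invented for illustration only.
reports = [
    "no abnormality in the liver",
    "fatty liver no abnormality",
    "liver lesion suggest metastasis",
]
vocabulary = build_vocabulary(reports, top_k=4)
features = [vectorize(r, vocabulary) for r in reports]
```

The resulting count vectors would then be passed, together with the five category labels, to a classifier such as logistic regression, random forest, or XGBoost; that training step is omitted here.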

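Funding:
Language:
WOS:
PubMed ID:
Chinese Academy of Sciences (CAS) journal ranking:
Edition for the year of publication [2021]:
Major category | Tier 3, Medicine
Subcategory | Tier 3, Oncology
Latest edition [2025]:
Major category | Tier 3, Medicine
Subcategory | Tier 4, Oncology
JCR quartile:
Edition for the year of publication [2020]:
Q2 ONCOLOGY
Latest edition [2023]:
Q2 ONCOLOGY

Impact factor: latest [2023 edition]; latest five-year average [2021-2025]; year of publication [2020 edition]; five-year average for the year of publication [2016-2020]; year before publication [2019 edition]; year after publication [2021 edition]

First author:
First author affiliation: [1] Department of Radiology, Beijing Friendship Hospital, Capital Medical University, Beijing, China.
Co-first authors:
Corresponding author:
Recommended citation (GB/T 7714):
APA:
MLA: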
