高级检索
当前位置: 首页 > 详情页

An hybrid Machine Learning method for the de-identification of Un-Structured Narrative Clinical Text in Multi-Center Chinese Electronic Medical Records Data

文献详情

资源类型:
WOS体系:

收录情况: ◇ CPCI(ISTP)

单位: [1]Peking University Medical Informatics Center, Beijing, China [2]National Medical Service Data Center, Beijing, China [3]Peking University Health Science Center, Beijing, China [4]Peking University School of Public Health, Beijing, China [5]Peking University Fifth Clinical Medical College, Beijing, China [6]China-Japan Friendship Hospital, Beijing, China
出处:

关键词: component Chinese electronic medical record Un-structured machine learning corpora multi-center

摘要:
The premise of the full use of unstructured electronic medical records is to maintain the fully protection of a patient's information privacy. Presently, in prior of processing the electronic medical record date, identification and removing of relevant information which can be used to identify a patient is a research hotspot nowadays. There are very few methods in de identification of Chinese electronic medical records and their cross center performance is poor. Therefore, we develop a de-identification method which is a mixture of rule-based methods and machine learning methods. The method was tested on 700 electronic medical records from six hospitals. Five-fold cross test was used to evaluate the results of c5.0, Random Forest, SVM and XGBOOST. Leave-one-out test was used to evaluate CRF. And the F1 Measure of machine learning reached 91.18% in PHI_Names, 98.21% in PHI_MEDICALID, 95.74% in PHI_OTHERNFC, 97.14% in PHI_GEO, 89.19% in PHI_DATES, and 91.49% in PHI_TEL. And the F1 Measure of rule-based methods reached 93.00% in PHI_Names, 97.00% in PHI_MEDICALID, 97.00% in PHI_OTHERNFC, 97.00% in PHI_GEO, 96.00% in PHI_DATES, and 89.00% in PHI_TEL.

语种:
WOS:
第一作者:
第一作者单位: [1]Peking University Medical Informatics Center, Beijing, China [2]National Medical Service Data Center, Beijing, China
通讯作者:
通讯机构: [1]Peking University Medical Informatics Center, Beijing, China [2]National Medical Service Data Center, Beijing, China
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:1320 今日访问量:0 总访问量:816 更新日期:2025-04-01 建议使用谷歌、火狐浏览器 常见问题

版权所有:重庆聚合科技有限公司 渝ICP备12007440号-3 地址:重庆市两江新区泰山大道西段8号坤恩国际商务中心16层(401121)