CN109710670A - 一种将病历文本从自然语言转换为结构化元数据的方法 - Google Patents
一种将病历文本从自然语言转换为结构化元数据的方法 Download PDFInfo
- Publication number
- CN109710670A CN109710670A CN201811511195.0A CN201811511195A CN109710670A CN 109710670 A CN109710670 A CN 109710670A CN 201811511195 A CN201811511195 A CN 201811511195A CN 109710670 A CN109710670 A CN 109710670A
- Authority
- CN
- China
- Prior art keywords
- data
- content
- row
- characteristic value
- dictionary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 45
- 238000013480 data collection Methods 0.000 claims abstract description 12
- 210000000056 organ Anatomy 0.000 claims abstract description 11
- 238000004458 analytical method Methods 0.000 claims abstract description 8
- 230000002688 persistence Effects 0.000 claims abstract description 7
- 239000011159 matrix material Substances 0.000 claims description 35
- 238000007689 inspection Methods 0.000 claims description 29
- 210000001072 colon Anatomy 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 8
- 235000013399 edible fruits Nutrition 0.000 claims description 7
- 238000001514 detection method Methods 0.000 claims description 6
- 238000012217 deletion Methods 0.000 claims description 4
- 230000037430 deletion Effects 0.000 claims description 4
- 239000003814 drug Substances 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- 230000011218 segmentation Effects 0.000 claims description 4
- 239000013589 supplement Substances 0.000 claims description 4
- 238000000605 extraction Methods 0.000 claims description 3
- 230000002980 postoperative effect Effects 0.000 claims description 3
- 238000012360 testing method Methods 0.000 claims description 3
- 238000006243 chemical reaction Methods 0.000 abstract description 3
- 230000003902 lesion Effects 0.000 abstract description 3
- 238000012549 training Methods 0.000 abstract description 3
- 238000011160 research Methods 0.000 abstract description 2
- 238000002052 colonoscopy Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 210000002784 stomach Anatomy 0.000 description 5
- 230000002496 gastric effect Effects 0.000 description 3
- 210000002318 cardia Anatomy 0.000 description 2
- 230000008021 deposition Effects 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 210000001198 duodenum Anatomy 0.000 description 2
- 210000003405 ileum Anatomy 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 210000001187 pylorus Anatomy 0.000 description 2
- 210000000664 rectum Anatomy 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 210000003384 transverse colon Anatomy 0.000 description 2
- 238000010276 construction Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000002183 duodenal effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 210000002409 epiglottis Anatomy 0.000 description 1
- 230000002440 hepatic effect Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 210000001599 sigmoid colon Anatomy 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Landscapes
- Medical Treatment And Welfare Office Work (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811511195.0A CN109710670B (zh) | 2018-12-11 | 2018-12-11 | 一种将病历文本从自然语言转换为结构化元数据的方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811511195.0A CN109710670B (zh) | 2018-12-11 | 2018-12-11 | 一种将病历文本从自然语言转换为结构化元数据的方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109710670A true CN109710670A (zh) | 2019-05-03 |
CN109710670B CN109710670B (zh) | 2020-04-28 |
Family
ID=66256318
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811511195.0A Active CN109710670B (zh) | 2018-12-11 | 2018-12-11 | 一种将病历文本从自然语言转换为结构化元数据的方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109710670B (zh) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110362829A (zh) * | 2019-07-16 | 2019-10-22 | 北京百度网讯科技有限公司 | 结构化病历数据的质量评估方法、装置及设备 |
CN111026799A (zh) * | 2019-12-06 | 2020-04-17 | 安翰科技(武汉)股份有限公司 | 胶囊内窥镜检查报告文本结构化方法、设备及介质 |
CN111259664A (zh) * | 2020-01-14 | 2020-06-09 | 腾讯科技(深圳)有限公司 | 医学文本信息的确定方法、装置、设备及存储介质 |
CN111739599A (zh) * | 2020-06-19 | 2020-10-02 | 北京嘉和海森健康科技有限公司 | 一种教学病历生成方法和装置 |
CN111986754A (zh) * | 2020-08-21 | 2020-11-24 | 南通大学 | 一种基于糖尿病的电子病历管理模型构建方法 |
CN112116968A (zh) * | 2019-06-21 | 2020-12-22 | 上海交通大学医学院附属瑞金医院 | 一种医学检验报告的识别方法、装置、设备及存储介质 |
CN112185572A (zh) * | 2020-09-25 | 2021-01-05 | 志诺维思(北京)基因科技有限公司 | 一种肿瘤专病数据库构建系统、方法、电子设备和介质 |
CN112349367A (zh) * | 2020-11-11 | 2021-02-09 | 北京嘉和海森健康科技有限公司 | 一种生成仿真病历的方法、装置、电子设备及存储介质 |
CN112800763A (zh) * | 2021-04-14 | 2021-05-14 | 北京金山云网络技术有限公司 | 数据处理方法、医学文本数据处理方法、装置及电子设备 |
CN112800759A (zh) * | 2021-04-14 | 2021-05-14 | 北京金山云网络技术有限公司 | 标准化数据的生成方法、医学文本数据的处理方法和装置 |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080243500A1 (en) * | 2007-03-30 | 2008-10-02 | Maximilian Bisani | Automatic Editing Using Probabilistic Word Substitution Models |
CN103530513A (zh) * | 2013-10-10 | 2014-01-22 | 中国中医科学院 | 一种实现电子病历快速录入的输入系统 |
US20140344274A1 (en) * | 2013-05-20 | 2014-11-20 | Hitachi, Ltd. | Information structuring system |
US20150347521A1 (en) * | 2014-05-08 | 2015-12-03 | Koninklijke Philips N.V. | Systems and methods for relation extraction for chinese clinical documents |
CN106095913A (zh) * | 2016-06-08 | 2016-11-09 | 广州同构医疗科技有限公司 | 一种电子病历文本结构化方法 |
CN106126577A (zh) * | 2016-06-17 | 2016-11-16 | 北京理工大学 | 一种基于数据源划分矩阵的加权关联规则挖掘方法 |
CN106776606A (zh) * | 2015-11-20 | 2017-05-31 | 株式会社日立制作所 | 基于电子病历数据库的检索装置和检索方法 |
CN106919793A (zh) * | 2017-02-24 | 2017-07-04 | 黑龙江特士信息技术有限公司 | 一种医疗大数据的数据标准化处理方法及装置 |
CN107341264A (zh) * | 2017-07-19 | 2017-11-10 | 东北大学 | 一种支持自定义实体的电子病历检索系统及方法 |
CN107656952A (zh) * | 2016-12-30 | 2018-02-02 | 青岛中科慧康科技有限公司 | 平行智能病例推荐模型的建模方法 |
CN107833595A (zh) * | 2017-10-12 | 2018-03-23 | 山东大学 | 医疗大数据多中心整合平台及方法 |
CN108538395A (zh) * | 2018-04-02 | 2018-09-14 | 上海市儿童医院 | 一种通用的医疗专病数据系统的构建方法 |
CN108711443A (zh) * | 2018-05-07 | 2018-10-26 | 成都智信电子技术有限公司 | 电子病历的文本数据解析方法和装置 |
-
2018
- 2018-12-11 CN CN201811511195.0A patent/CN109710670B/zh active Active
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080243500A1 (en) * | 2007-03-30 | 2008-10-02 | Maximilian Bisani | Automatic Editing Using Probabilistic Word Substitution Models |
US20140344274A1 (en) * | 2013-05-20 | 2014-11-20 | Hitachi, Ltd. | Information structuring system |
CN103530513A (zh) * | 2013-10-10 | 2014-01-22 | 中国中医科学院 | 一种实现电子病历快速录入的输入系统 |
US20150347521A1 (en) * | 2014-05-08 | 2015-12-03 | Koninklijke Philips N.V. | Systems and methods for relation extraction for chinese clinical documents |
CN106776606A (zh) * | 2015-11-20 | 2017-05-31 | 株式会社日立制作所 | 基于电子病历数据库的检索装置和检索方法 |
CN106095913A (zh) * | 2016-06-08 | 2016-11-09 | 广州同构医疗科技有限公司 | 一种电子病历文本结构化方法 |
CN106126577A (zh) * | 2016-06-17 | 2016-11-16 | 北京理工大学 | 一种基于数据源划分矩阵的加权关联规则挖掘方法 |
CN107656952A (zh) * | 2016-12-30 | 2018-02-02 | 青岛中科慧康科技有限公司 | 平行智能病例推荐模型的建模方法 |
CN106919793A (zh) * | 2017-02-24 | 2017-07-04 | 黑龙江特士信息技术有限公司 | 一种医疗大数据的数据标准化处理方法及装置 |
CN107341264A (zh) * | 2017-07-19 | 2017-11-10 | 东北大学 | 一种支持自定义实体的电子病历检索系统及方法 |
CN107833595A (zh) * | 2017-10-12 | 2018-03-23 | 山东大学 | 医疗大数据多中心整合平台及方法 |
CN108538395A (zh) * | 2018-04-02 | 2018-09-14 | 上海市儿童医院 | 一种通用的医疗专病数据系统的构建方法 |
CN108711443A (zh) * | 2018-05-07 | 2018-10-26 | 成都智信电子技术有限公司 | 电子病历的文本数据解析方法和装置 |
Non-Patent Citations (2)
Title |
---|
张立君: ""电子病历数据的结构化分析与研究"", 《中国优秀硕士学位论文全文数据库 医药卫生科技辑》 * |
陈德华等: ""病理镜检文本数据的结构化处理方法"", 《计算机与现代化》 * |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112116968A (zh) * | 2019-06-21 | 2020-12-22 | 上海交通大学医学院附属瑞金医院 | 一种医学检验报告的识别方法、装置、设备及存储介质 |
CN110362829B (zh) * | 2019-07-16 | 2023-01-03 | 北京百度网讯科技有限公司 | 结构化病历数据的质量评估方法、装置及设备 |
CN110362829A (zh) * | 2019-07-16 | 2019-10-22 | 北京百度网讯科技有限公司 | 结构化病历数据的质量评估方法、装置及设备 |
CN111026799A (zh) * | 2019-12-06 | 2020-04-17 | 安翰科技(武汉)股份有限公司 | 胶囊内窥镜检查报告文本结构化方法、设备及介质 |
CN111259664A (zh) * | 2020-01-14 | 2020-06-09 | 腾讯科技(深圳)有限公司 | 医学文本信息的确定方法、装置、设备及存储介质 |
CN111739599A (zh) * | 2020-06-19 | 2020-10-02 | 北京嘉和海森健康科技有限公司 | 一种教学病历生成方法和装置 |
CN111739599B (zh) * | 2020-06-19 | 2023-08-08 | 北京嘉和海森健康科技有限公司 | 一种教学病历生成方法和装置 |
CN111986754A (zh) * | 2020-08-21 | 2020-11-24 | 南通大学 | 一种基于糖尿病的电子病历管理模型构建方法 |
CN112185572A (zh) * | 2020-09-25 | 2021-01-05 | 志诺维思(北京)基因科技有限公司 | 一种肿瘤专病数据库构建系统、方法、电子设备和介质 |
CN112185572B (zh) * | 2020-09-25 | 2024-03-01 | 志诺维思(北京)基因科技有限公司 | 一种肿瘤专病数据库构建系统、方法、电子设备和介质 |
CN112349367A (zh) * | 2020-11-11 | 2021-02-09 | 北京嘉和海森健康科技有限公司 | 一种生成仿真病历的方法、装置、电子设备及存储介质 |
CN112349367B (zh) * | 2020-11-11 | 2023-08-08 | 北京嘉和海森健康科技有限公司 | 一种生成仿真病历的方法、装置、电子设备及存储介质 |
CN112800763A (zh) * | 2021-04-14 | 2021-05-14 | 北京金山云网络技术有限公司 | 数据处理方法、医学文本数据处理方法、装置及电子设备 |
CN112800759A (zh) * | 2021-04-14 | 2021-05-14 | 北京金山云网络技术有限公司 | 标准化数据的生成方法、医学文本数据的处理方法和装置 |
CN112800763B (zh) * | 2021-04-14 | 2021-08-06 | 北京金山云网络技术有限公司 | 数据处理方法、医学文本数据处理方法、装置及电子设备 |
CN112800759B (zh) * | 2021-04-14 | 2021-08-06 | 北京金山云网络技术有限公司 | 标准化数据的生成方法、医学文本数据的处理方法和装置 |
Also Published As
Publication number | Publication date |
---|---|
CN109710670B (zh) | 2020-04-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109710670A (zh) | 一种将病历文本从自然语言转换为结构化元数据的方法 | |
Jayatilake et al. | Involvement of machine learning tools in healthcare decision making | |
Zhang et al. | Large-scale domain-specific pretraining for biomedical vision-language processing | |
Li et al. | Auxiliary signal-guided knowledge encoder-decoder for medical report generation | |
Ukwuoma et al. | A hybrid explainable ensemble transformer encoder for pneumonia identification from chest X-ray images | |
Zeng et al. | Counterfactual generator: A weakly-supervised method for named entity recognition | |
Naeem et al. | SCDNet: a deep learning-based framework for the multiclassification of skin cancer using dermoscopy images | |
Iftikhar et al. | An evolution based hybrid approach for heart diseases classification and associated risk factors identification | |
Hassan et al. | Developing intelligent medical image modality classification system using deep transfer learning and LDA | |
CN109670179A (zh) | 基于迭代膨胀卷积神经网络的病历文本命名实体识别方法 | |
CN110428907A (zh) | 一种基于非结构化电子病历的文本挖掘方法及系统 | |
EP4026047A1 (en) | Automated information extraction and enrichment in pathology report using natural language processing | |
Yue et al. | Attention-driven cascaded network for diabetic retinopathy grading from fundus images | |
Zakaria et al. | Mining massive archives of mice sounds with symbolized representations | |
Khan et al. | An effective approach for early liver disease prediction and sensitivity analysis | |
Fung et al. | A self-knowledge distillation-driven CNN-LSTM model for predicting disease outcomes using longitudinal microbiome data | |
Thakur et al. | RNN-CNN based cancer prediction model for gene expression | |
Wu et al. | AGNet: Automatic generation network for skin imaging reports | |
Rajput et al. | Automated detection of colon cancer using deep learning | |
Maheswari et al. | SENTIMENT ANALYSIS IN MELANOMA CANCER DETECTION USING ENSEMBLE LEARNING MODEL. | |
Kapadia et al. | Content based medical image retrieval system for accurate disease diagnoses using modified multi feature fused Xception model | |
Fei et al. | Adversarial shared-private model for cross-domain clinical text entailment recognition | |
Khedikar et al. | Identification of Disease by Using SVM Classifier | |
Tu et al. | Deep Multi-Dictionary Learning for Survival Prediction With Multi-Zoom Histopathological Whole Slide Images | |
Jansi Rani et al. | Microarray Data Classification and Gene Selection Using Convolutional Neural Network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20190902 Address after: Room 102, 104, 108, 110, 112, 114, 116, 122, Building 4, 220 Huashan Road, Zhongyuan District, Zhengzhou City, Henan Province, 450000 Applicant after: Xuan Yun (Henan) Academy of Life Sciences Co.,Ltd. Address before: 450007 No. 1305, Block B, Shengyin Thailand International Center, Zhongyuan District, Zhengzhou City, Henan Province Applicant before: HENAN TONGYU MEDICAL TECHNOLOGY Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: 450000 rooms 102, 104, 108, 110, 112, 114, 116, 122, 1st floor, building 4, 220 Huashan Road, Zhongyuan District, Zhengzhou City, Henan Province Patentee after: Xuanwei (Henan) Life Science Co.,Ltd. Country or region after: China Address before: 450000 rooms 102, 104, 108, 110, 112, 114, 116, 122, 1st floor, building 4, 220 Huashan Road, Zhongyuan District, Zhengzhou City, Henan Province Patentee before: Xuan Yun (Henan) Academy of Life Sciences Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240524 Address after: 450000 rooms 109 and 113, 1st floor, building 4, No. 220 Huashan Road, Zhongyuan District, Zhengzhou City, Henan Province Patentee after: Henan Xuanwei Digital Medical Technology Co.,Ltd. Country or region after: China Address before: 450000 rooms 102, 104, 108, 110, 112, 114, 116, 122, 1st floor, building 4, 220 Huashan Road, Zhongyuan District, Zhengzhou City, Henan Province Patentee before: Xuanwei (Henan) Life Science Co.,Ltd. Country or region before: China |