CN112597774B - Chinese medical named entity recognition method, system, storage medium and device - Google Patents
- Publication number
- CN112597774B (application CN202011468199.2A)
- Authority
- CN
- China
- Prior art keywords
- named entity
- dictionary
- embedded
- medical
- chinese
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/242—Dictionaries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Machine Translation (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011468199.2A (CN112597774B, zh) | 2020-12-14 | 2020-12-14 | Chinese medical named entity recognition method, system, storage medium and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112597774A (zh) | 2021-04-02 |
CN112597774B (zh) | 2023-06-23 |
Family
ID=75195221
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011468199.2A (Active; CN112597774B, zh) | Chinese medical named entity recognition method, system, storage medium and device | 2020-12-14 | 2020-12-14 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112597774B (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
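CN113343694B (zh) * | 2021-04-29 | 2023-04-07 | Shandong Normal University | Medical named entity recognition method and system |
CN113204968A (zh) * | 2021-05-28 | 2021-08-03 | Ping An Technology (Shenzhen) Co., Ltd. | Concept recognition method, apparatus, device and storage medium for medical entities |
CN113420557B (zh) * | 2021-06-09 | 2024-03-08 | Shandong Normal University | Chinese named entity recognition method, system, device and storage medium |
CN113779993B (zh) * | 2021-06-09 | 2023-02-28 | Beijing Institute of Technology | Medical entity recognition method based on multi-granularity text embedding |
CN113487024A (zh) * | 2021-06-29 | 2021-10-08 | 任立椋 | Training method for an alternating sequence generation model, and method for extracting a graph from text |
CN113420561B (zh) * | 2021-07-14 | 2022-12-13 | Shanghai Pudong Development Bank Co., Ltd. | Named entity recognition method, apparatus, device and storage medium |
CN113536799B (zh) * | 2021-08-10 | 2023-04-07 | Southwest Jiaotong University | Medical named entity recognition modeling method based on fused attention |
CN114564959A (zh) * | 2022-01-14 | 2022-05-31 | Beijing Jiaotong University | Fine-grained named entity recognition method and system for Chinese clinical phenotypes |
CN114580414A (zh) * | 2022-02-24 | 2022-06-03 | 医渡云(北京)技术有限公司 | Entity recognition method, apparatus and electronic device based on an AC automaton |
CN116894436B (zh) * | 2023-09-06 | 2023-12-15 | 神州医疗科技股份有限公司 | Data augmentation method and system based on medical named entity recognition |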
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
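CN110110061A (zh) * | 2019-04-26 | 2019-08-09 | Tongji University | Entity extraction method for low-resource languages based on bilingual word vectors |
CN111738003A (zh) * | 2020-06-15 | 2020-10-02 | Institute of Computing Technology, Chinese Academy of Sciences | Named entity recognition model training method, named entity recognition method and medium |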
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
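CN107977361B (zh) * | 2017-12-06 | 2021-05-18 | Harbin Institute of Technology Shenzhen Graduate School | Chinese clinical medical entity recognition method based on deep semantic information representation |
CN111460804B (zh) * | 2019-01-02 | 2023-05-02 | Alibaba Group Holding Limited | Text processing method, apparatus and system |
CN111274829B (zh) * | 2020-02-07 | 2023-06-16 | University of Science and Technology of China | Sequence labeling method using cross-lingual information |
CN112001177A (zh) * | 2020-08-24 | 2020-11-27 | Inspur Cloud Information Technology Co., Ltd. | Named entity recognition method and system for electronic medical records combining deep learning and rules |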
Also Published As
Publication number | Publication date |
---|---|
CN112597774A (zh) | 2021-04-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112597774B (zh) | | Chinese medical named entity recognition method, system, storage medium and device |
Dalianis | | Clinical text mining: Secondary use of electronic patient records |
US11093688B2 (en) | | Enhancing reading accuracy, efficiency and retention |
He et al. | | PathVQA: 30000+ questions for medical visual question answering |
US10929420B2 (en) | | Structured report data from a medical text report |
Banerjee et al. | | Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort |
Catelli et al. | | Crosslingual named entity recognition for clinical de-identification applied to a COVID-19 Italian data set |
CN109192255B (zh) | | Medical record structuring method |
CN110705293A (zh) | | Named entity recognition method for electronic medical record text based on a pre-trained language model |
Wang | | Annotating and recognising named entities in clinical notes |
Soysal et al. | | Design and evaluation of an ontology based information extraction system for radiological reports |
Wang et al. | | Chinese medical named entity recognition based on multi-granularity semantic dictionary and multimodal tree |
CN112241457A (zh) | | Event detection method for an event logic knowledge graph fusing extended features |
Viani et al. | | Supervised methods to extract clinical events from cardiology reports in Italian |
Dynomant et al. | | Word embedding for the French natural language in health care: comparative study |
Liu et al. | | Effectiveness of lexico-syntactic pattern matching for ontology enrichment with clinical documents |
Adduru et al. | | Towards Dataset Creation And Establishing Baselines for Sentence-level Neural Clinical Paraphrase Generation and Simplification. |
Ke et al. | | Medical entity recognition and knowledge map relationship analysis of Chinese EMRs based on improved BiLSTM-CRF |
Yu et al. | | BIOS: An algorithmically generated biomedical knowledge graph |
Goenaga et al. | | A section identification tool: towards HL7 CDA/CCR standardization in Spanish discharge summaries |
Wang et al. | | Research on named entity recognition of doctor-patient question answering community based on BiLSTM-CRF model |
Satti et al. | | A semantic sequence similarity based approach for extracting medical entities from clinical conversations |
Chen et al. | | Named entity recognition of Chinese electronic medical records based on cascaded conditional random field |
Nair et al. | | Automated clinical concept-value pair extraction from discharge summary of pituitary adenoma patients |
Zhang et al. | | Disease-pertinent knowledge extraction in online health communities using GRU based on a double attention mechanism |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2024-04-28. Patentee before: Shandong Normal University, No. 88 Wenhua East Road, Lixia District, Jinan, Shandong 250014, China. Patentee after: Hefei keyiguo Information Technology Co., Ltd., Room 1414, Building D, Yinhe happiness Plaza, intersection of Luzhou Avenue and Fuzhou Road, Baohe District, Hefei City, Anhui Province 230000, China. |
TR01 | Transfer of patent right | Effective date of registration: 2024-05-10. Patentee before: Hefei keyiguo Information Technology Co., Ltd., Room 1414, Building D, Yinhe happiness Plaza, intersection of Luzhou Avenue and Fuzhou Road, Baohe District, Hefei City, Anhui Province 230000, China. Patentee after: Micro Test Cloud (Anhui) Medical Information Co., Ltd., Room 401, Building E3A, Phase II, Innovation Industrial Park, No. 2800 Innovation Avenue, High tech Zone, Hefei Area, China (Anhui) Free Trade Pilot Zone, Hefei City, Anhui Province 230000, China. |