CN115545021A - Clinical term identification method and device based on deep learning - Google Patents

Clinical term identification method and device based on deep learning Download PDF

Info

Publication number
CN115545021A
CN115545021A CN202210802174.4A CN202210802174A CN115545021A CN 115545021 A CN115545021 A CN 115545021A CN 202210802174 A CN202210802174 A CN 202210802174A CN 115545021 A CN115545021 A CN 115545021A
Authority
CN
China
Prior art keywords
clinical
entity
context
text
term
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210802174.4A
Other languages
Chinese (zh)
Inventor
杨鹏
谢亮亮
李文军
胡皓楠
解然
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Huaxun Technology Co ltd
Original Assignee
Zhejiang Huaxun Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Huaxun Technology Co ltd filed Critical Zhejiang Huaxun Technology Co ltd
Priority to CN202210802174.4A priority Critical patent/CN115545021A/en
Publication of CN115545021A publication Critical patent/CN115545021A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a clinical term recognition method and device based on deep learning, wherein the method comprises the following steps: fine adjustment of a pre-training model, construction of a clinical entity library, context awareness network and term recognition. Firstly, fine tuning is carried out on a clinical data set by using a pre-training model so as to learn a text representation method in the clinical field; then capturing clinical entity words in a professional clinical medical entity dictionary and an online medical database PubMed by means of a crawler program and storing the clinical entity words into a clinical entity database; matching and marking the clinical text by using a clinical entity library to obtain a clinical entity set, vectorizing and characterizing the clinical text and the clinical entity set by using a pre-training model after pre-fine adjustment, and constructing a feature vector for term recognition by associating information of context-aware network modeling terms and context entities through an attention mechanism; and finally, improving the term recognition accuracy through the dependency relationship among the conditional random field CRF learning labels and outputting the recognition result of the clinical terms.

Description

Clinical term identification method and device based on deep learning
Technical Field
The invention relates to a clinical term identification method and device based on deep learning, and belongs to the technical field of Internet and artificial intelligence.
Background
The clinical term Recognition task belongs to a Named Entity Recognition task (NER) in nature, which is the basis of many natural language processing tasks, such as disease reasoning, co-disease detection, clinical diagnosis and other tasks, and accurate Recognition and marking of related entities can help subsequent tasks to be performed efficiently. The full identification of target entities by means of human beings is undoubtedly inefficient and expensive, requiring a large number of professionals, and the exploration of automatic identification of target entities by means of artificial intelligence is therefore of great research and practical interest. Today's named entity recognition task is usually based on complex multi-layer neural networks in the deep learning field, and needs training learning on a large number of labeled data sets to obtain a model which can be used for entity recognition. However, the annotated resources of the clinical domain are quite limited, especially lacking in labeled corpora for term entities. When the traditional neural network model is used for training and learning on small-scale training data, overfitting or underfitting is easy to happen, and semantic information and feature association of context are difficult to capture. Additionally, named entity recognition tasks are performed on clinical records, usually as unstructured free text, and existing models usually perform context vector representation and feature extraction on tagged entities. However, most of the existing labeled data are labeled based on sentences, the context information of the sentences is relatively limited, and it is difficult for the model training to accurately express words when training on such data, and meanwhile, the interaction between entity contexts is also ignored.
The clinical term entities studied in the present invention include, but are not limited to, the following three classes: disease (protem), treatment (treatment), examination item (test) three types of term entities. For example, clopidogrel is a drug effective in heart diseases and can be taken for cardiac examination. There are three types of entities in the sentence, and the information interaction between the treatment entity (Clopidogrel) for the disease entity (heart disease) and the detection entity (heart examination) is helpful to provide effective information support for the model to identify the three types of entities, but the existing named entity identification model lacks the attention to the information interaction between the entities. Information interaction and association among modeling entities can help the model to better understand data. In recent years, with the continuous supplement of medical data, new entities are appearing, and professional dictionaries for explaining and summarizing terms appear. The original clinical record containing the term, the context of which has limited information representative of the term, may assist the model in understanding the meaning of the term by supplementing the meaning attributes of the term with knowledge of the entity dictionary and provide information for the identification of the relevant entity. Therefore, the existing named entity recognition model based on the hierarchical neural network needs a large amount of labeled data sets, but the labeling data for the specific entity recognition task is very limited; the machine learning method based on the rules and the dictionaries is completed by means of artificial feature extraction, and time and labor are consumed when large-scale feature engineering is performed in a targeted mode. The existence of the above situation limits the model performance of the traditional model on the task of clinical term recognition. Meanwhile, the information association of the terms and the context entities can provide help for identifying the terms, and mining implicit characteristics of the terms and the context entities can assist the model in better understanding data, so that better model expression is obtained. The traditional term recognition model lacks the attention to information association between terms and context entities, and limits the improvement of the accuracy of the traditional model on the term recognition task.
Disclosure of Invention
Aiming at the problem that the existing term recognition method is difficult to extract context information under the condition of limited marked resources and influences the improvement of the recognition effect, the invention combines the interaction of learning terms such as a pre-training model and an attention mechanism and the like with the implicit characteristics and information of clinical text context to improve the efficiency of model training and learning. First, the input to the model includes a clinical record and a set of clinical entities; for clinical records, processing the clinical records by using a pre-training model finely adjusted in the clinical field to obtain word-level vector representation, and then fusing information about context by using a bidirectional GRU to obtain a corresponding context feature vector; for the term entity set, an entity library is used for matching entities in clinical texts to obtain a corresponding entity set, and then a word set of each entity in the entity set is used for obtaining vector characterization related to the entity set through the pre-training model and the bidirectional GRU. And then carrying out attention weighted fusion on the context feature vector of the clinical text and the word vector of the entity set through an attention mechanism, splicing weighted fusion information to each context feature vector, and associating the context vector information of the clinical text with the information of the context entity to obtain a feature vector representation for term recognition. And finally, inputting the obtained feature vector representation into a conditional random field CRF, and improving the accuracy of model output and finally outputting a term recognition result by training and learning the front-back dependency relationship among output labels.
In order to achieve the purpose, the invention provides the following technical scheme:
a clinical term identification method based on deep learning comprises the following steps:
step 1: pre-training model trimming
Training and fine-tuning are carried out in a clinical resource database by using a pre-training model, and the text structure knowledge in the clinical field is learned, so that the vectorization representation of the clinical text can be more accurately represented by pre-training;
step 2: construction of clinical entity library
And capturing clinical entity words in a professional clinical medical entity dictionary and an online medical database PubMed by means of a crawler program and storing the clinical entity words into a clinical entity database. Then, identifying each clinical text by means of an entity library to obtain a corresponding entity set, and further segmenting the processed data to form a training set, a verification set and a test set;
and step 3: context aware network
Firstly, vectorizing representation is carried out on each clinical text and the entity set obtained in the step 2 by using the pre-training model after fine tuning in the step 1 on data in a training set to obtain word vector representation of the clinical text and the entity set, and the word vector representation of the clinical text is input into a bidirectional GRU neural network for feature embedding training to obtain context semantic representation of the clinical text. And simultaneously, carrying out the same processing on the entity set to obtain the word context semantic representation of the entity set. Then, performing weighted calculation on the context semantic representation of the clinical text and the context semantic representation of the entity by using an attention mechanism, modeling information correlation between clinical information and context entity information, paying attention to entity information which has key influence on term identification, and finally obtaining a prediction result for term identification through matrix change and an activation function;
step 4 term identification
Inputting the preliminary prediction result obtained in the step 3 into a conditional random field CRF, learning the front-back dependency relationship among output labels, and outputting the most probable term label prediction result.
The invention also provides a deep learning-based clinical term recognition device, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the computer program realizes the deep learning-based clinical term recognition method when being loaded to the processor.
Compared with the prior art, the invention has the following advantages and beneficial effects:
1. the invention trains the word vector characterization model in the clinical field by using abundant un-labeled clinical data, learns the data in the related field to make up the defect that the traditional word vector model cannot be effectively adapted in the professional field with insufficient resources, and ensures the accuracy and the interpretability of the clinical term identification method by supplementing the characteristics of the word vector.
2. The method breaks through the mode of only paying attention to the source clinical text in the past, designs the clinical entity library for identifying the registered entities in the clinical text, effectively pays attention to and extracts the characteristic association between the terms and the context entities through information interaction between the attention mechanism modeling context entities, and further improves the accuracy of identifying the clinical terms.
Drawings
FIG. 1 is a flow chart of a deep learning-based clinical term recognition method provided by the invention.
FIG. 2 is an overall model diagram of the present invention.
FIG. 3 is a context awareness module based on an attention mechanism.
Detailed Description
The technical solutions provided by the present invention will be described in detail with reference to specific examples, which should be understood that the following specific embodiments are only illustrative and not limiting the scope of the present invention.
Example 1: referring to fig. 1 to 3, the invention provides a deep learning-based clinical term recognition method, which includes the following steps: the flow is shown in fig. 1, firstly, a pre-training model is used for fine adjustment on a clinical data set so as to learn a text representation method in the clinical field; then capturing clinical entity words in a professional clinical medical entity dictionary and an online medical database PubMed by means of a crawler program and storing the clinical entity words into a clinical entity database; matching and marking the clinical text by using a clinical entity library to obtain a clinical entity set, vectorizing and characterizing the clinical text and the clinical entity set by using a pre-training model, and constructing a context-aware network modeling term and information association of entities by an attention mechanism to obtain a predicted output label; and finally, learning the dependency relationship among the labels through the conditional random field CRF to improve the accuracy of term identification. Referring to fig. 2 and 3, the detailed implementation steps of the method are as follows:
step 1, fine tuning of a pre-training model, wherein the Bert pre-training model is trained and learned based on a generalized corpus, and the knowledge of a general text structure is learned, so that a conventional word vector expression mode is obtained, but the learning of text expression in a specific field is lacked, the pre-training model can be continuously fine tuned by means of resources in the specific field to obtain the pre-training model with fusion field knowledge, and the vector representation capability in the specific field is improved, which is proved to be effective in a plurality of natural language processing field tasks. The pre-training model for training and learning on the clinical corpus can enable the initial parameters of the model to have knowledge information in the field, and is more beneficial to fine-tuning training and convergence of the target model on a training data set. The invention uses MIMIC-III Clinical database, uses a large amount of unstructured Clinical Records as unlabeled Clinical record data set to carry out fine tuning training on a pre-training model, and obtains a language characterization model (CRBERT) fusing Clinical field knowledge. CRBERTs are mainly used in place of the original berts in the subsequent word vectorization processing as the initial word vectorization tool.
And 2, constructing a clinical entity library. There are many medical entities in clinical records and the necessity of statistics and norms for medical entities has been noted. Such as english-chinese abbreviation dictionary written by zhang xi chen et al and on-line medical knowledge base, can be used to standardize and record commonly used professional vocabulary and clinical terms. The clinical entity library for identifying the target term can be constructed by acquiring registered medical entities in such dictionaries and knowledge bases by means of a crawler program, and the dictionaries and the knowledge bases can provide information such as attributes and interpretations of words, and provide implicit information support for model understanding data and identification of clinical terms. Then, each clinical text is identified by means of an entity library to obtain a corresponding entity set, and finally, all obtained data are processed according to the following steps of 8:1:1, carrying out segmentation to form a training set, a verification set and a test set.
And 3, context awareness network. Firstly, the CRBERT model obtained in step 1 is used for converting clinical text words into serialized vectors H CRBERT =<h 1 ,h 2 ,…,h m >It is then input into the context information of the learned text in the bidirectional GRU neural network to extract the implicit features of the context. After passing through the bidirectional GRU, a context-serialized representation vector H of clinical records is obtained gru . Therefore, the calculation process of the above-mentioned clinical text context information merged by the bidirectional GRU is as shown in formulas (1) to (3):
Figure BDA0003738195290000041
Figure BDA0003738195290000042
Figure BDA0003738195290000043
for each input text x using the entity library in the aforementioned step 2 i =<x i1 ,x i2 ,...,x in >Matching is performed, and the obtained entity set is denoted as a in the step i =<a i1 ,a i2 ,...,a iL >,a i Indicating a certain entity matched with L being a i The number of words matched to the entities, one entity set for each clinical text. To fuse the information in this entity set in the context of clinical records, it is also necessary to input x i And vectorizing the corresponding entity set for representation. From the perspective of less model parameters and faster training speed, the invention uses bidirectional GRUs to complete the task of entity set vectorization. The module combines the word set a of each entity information in the entity set i Learning hidden word vector representation of words through a bidirectional GRU network, learning and embedding entity word context in two directions by the bidirectional GRU network, and finally combining the word embedded representations in the two directions to obtain final entity set vectorization representation H attr P is x i The total number of terms matched across the entity pool. The calculation process is shown in formulas (4) to (7):
Figure BDA0003738195290000051
Figure BDA0003738195290000052
Figure BDA0003738195290000053
H attr =[H 1 ,H 2 ,H 3 ,…,H p ]#(7)
contextually vectorizing a representation of obtained clinical text H gru And vectorized representation of a set of entities H attr The invention performs weighted fusion on the information of the two based on the idea of attention mechanism. Combining information association relation between clinical text context and entity set, a context perception module based on an attention mechanism is provided, attention is generated on information interaction between context entities, feature association between the context entities is extracted, and feature vector representation H used for term recognition is obtained text-dict . The calculation of the attention mechanism in the module is shown in formulas (8) to (9):
H dict =softmax[W q H gru (W k H attr ) T ]W v H attr #(8)
H text-dict =concat(H gru ,H dict )#(9)
and 4, identifying terms. The feature vector representation H for term identification obtained from step 3 text-dict Performing pooling operation on the output sequence to obtain a predicted output sequence after matrix shape conversion and activation function
Figure BDA0003738195290000054
Figure BDA0003738195290000055
The predicted score corresponding to each category label of a word in the clinical record text directly takes the category with the highest score as the predicted category result of the word in the conventional method. However, the predicted output label sequence is obtained based on the mapping association of the sequence of the input text to the result label, and the relation between the result output labels is not learned, which may cause some wrong results. And (3) expressing the constraint relation among the label sequences through the state transition matrix of the sequences, and continuously optimizing parameters to maximize the score of the correct state transition sequence and further improve the term identification accuracy. The above-mentioned meterThe calculation process is shown in formulas (10) to (11):
Figure BDA0003738195290000056
Figure BDA0003738195290000057
x denotes each training sample, score (X, y) denotes the correct annotated sequence score,
Figure BDA0003738195290000058
indicating the probability of a transition between the tags,
Figure BDA0003738195290000059
indicates the ith tag gets y i Probability of the label. The scores for all possible marker sequences were then subjected to softmax normalization. Y represents all possible tag sequences, Y represents already labeled sequences, and the predicted tag sequence with the highest score is output through the CRF, thereby outputting the final term recognition result.
Example 2: based on the same inventive concept, the invention provides a deep learning-based clinical term recognition device, which comprises a memory, a processor and a computer program stored in the memory and capable of running on the processor, wherein the computer program can realize the above deep learning-based clinical term recognition method when being loaded into the processor.
The technical means disclosed in the invention scheme are not limited to the technical means disclosed in the above embodiments, but also include the technical scheme formed by any combination of the above technical features. It should be noted that those skilled in the art can make various improvements and modifications without departing from the principle of the present invention, and such improvements and modifications are also considered to be within the scope of the present invention.

Claims (8)

1. A clinical term recognition method based on deep learning is characterized by comprising the following steps:
step 1, fine adjustment of a pre-training model;
step 2, constructing a clinical entity library;
step 3, context awareness network;
and 4, identifying terms.
2. The deep learning-based clinical term identification method according to claim 1, comprising the steps of:
step 1: fine adjustment of a pre-training model, which is specifically as follows: using a MIMIC-III Clinical database, using a large number of unstructured Clinical Records contained therein as an unlabeled Clinical record data set to perform fine tuning training on a pre-training model to obtain a language characterization model (CRBERT) fusing Clinical domain knowledge, wherein the CRBERT is mainly used for replacing the original BERT as an initial word vectorization tool in the subsequent word vectorization processing.
3. The deep learning based clinical term recognition method according to claim 1, comprising the steps of: and 2, constructing a clinical entity library, specifically, capturing clinical entity words in a professional clinical medical entity dictionary and an online medical database PubMed by means of a crawler program, storing the clinical entity words into the clinical entity library, then, identifying each clinical text by means of the entity library to obtain a corresponding entity set, and further dividing the processed data to form a training set, a verification set and a test set.
4. The deep learning based clinical term recognition method according to claim 1, comprising the steps of: and step 3: the context awareness network comprises the following steps of firstly carrying out vectorization representation on each clinical text and an entity set obtained in step 2 by using a pre-training model subjected to fine tuning in step 1 on data in a training set to obtain word vector representations of the clinical texts and the entity set, inputting the word vector representations of the clinical texts into a bidirectional GRU neural network for feature embedding training to obtain context semantic representations of the clinical texts, simultaneously carrying out same processing on the entity set to obtain word context semantic representations of the entity set, then carrying out weighted calculation on the context semantic representations of the clinical texts and the context semantic representations of the entities by using an attention mechanism, modeling information correlation of the clinical information and the context entity information, paying attention to the entity information which generates key influence in term identification, and finally obtaining a prediction result for the term identification through matrix change and an activation function.
5. The deep learning based clinical term recognition method according to claim 1, comprising the steps of: step 4, term identification, which is specifically as follows: inputting the preliminary prediction result obtained in the step 3 into a conditional random field CRF, learning the front-back dependency relationship among output labels, and outputting the most probable term label prediction result.
6. The deep learning based clinical term recognition method according to claim 3, comprising the steps of: and step 3: the context-aware network specifically comprises the following components: step 3, the context-aware network firstly uses the CRBERT model obtained in step 1 to convert the clinical text words into serialized vectors H CRBERT =<h 1 ,h 2 ,…,h m >Then inputting the context information into the context information of the learning text in the bidirectional GRU neural network to extract the implicit characteristics of the context, and obtaining a context serialization expression vector H of the clinical record after passing through the bidirectional GRU gru Therefore, the above calculation process of merging the context information of the clinical text by the bidirectional GRU is as shown in formulas (1) to (3):
Figure RE-FDA0003941829520000021
Figure RE-FDA0003941829520000022
Figure RE-FDA0003941829520000023
for each input text x using the entity library in the aforementioned step 2 i =<x i1 ,x i2 ,...,x in >Matching is performed, and the obtained entity set is denoted as a in the step i =<a i1 ,a i2 ,...,a iL >,a i Indicating a certain entity matched with L being a i The number of words matched to the entity, one entity set for each clinical text, and the requirement for input x i Vectorizing the corresponding entity set, completing the vectorization task of the entity set by using a bidirectional GRU (generalized regression Unit), and enabling the module to carry out word set a of each entity information in the entity set i Learning hidden word vector representation of words through a bidirectional GRU network, learning and embedding entity word context in two directions by the bidirectional GRU network, and finally combining the word embedded representations in the two directions to obtain final entity set vectorization representation H attr P is x i The total number of terms matched by the entity library is calculated as shown in formulas (4) to (7):
Figure RE-FDA0003941829520000024
Figure RE-FDA0003941829520000025
Figure RE-FDA0003941829520000026
H attr =[H 1 ,H 2 ,H 3 ,…,H p ]#(7)
context vectorized representation of acquired clinical text H gru And entity setsResultant vectorized representation H attr The invention carries out weighted fusion on the information of the two based on the thought of the attention mechanism, combines the information association relation between the clinical text context and the entity set, provides a context perception module based on the attention mechanism, generates attention on the information interaction between the context entities, extracts the feature association between the context entities, and obtains the feature vector representation H for term recognition text-dict The calculation of the attention mechanism in the module is shown in formulas (8) to (9):
H dict =softmax[W q H gru (W k H attr ) T ]W v H attr #(8)
H text-dict =concat(H gru ,H dict )(9)。
7. the deep learning based clinical term recognition method according to claim 5, comprising the steps of: step 4, identifying terms, specifically as follows: the feature vector for term recognition obtained according to step 3 represents H text-dict Performing pooling operation on the output sequence to obtain a predicted output sequence after matrix shape conversion and activation function
Figure RE-FDA0003941829520000031
Figure RE-FDA0003941829520000032
Corresponding to the prediction score of each category label of a word in a clinical record text, the category with the highest score is directly taken as the prediction category result of the word in the conventional method, the problem is solved by means of a conditional random field CRF, the constraint relation among label sequences is expressed by a state transition matrix of the sequences, parameters are continuously optimized, the score of a correct state transition sequence is maximized, and the term recognition accuracy is further improved, wherein the calculation process is shown as formulas (10) to (11):
Figure RE-FDA0003941829520000033
Figure RE-FDA0003941829520000034
x denotes each training sample, score (X, y) denotes the correct annotated sequence score,
Figure RE-FDA0003941829520000035
indicating the probability of a transition between the labels,
Figure RE-FDA0003941829520000036
indicates the ith tag gets y i And (3) carrying out softmax normalization on scores of all possible label sequences, wherein Y represents all possible label sequences, Y represents sequences which are already labeled, and outputting a predicted label sequence with the highest score through a CRF (cross-domain similarity), thereby outputting a final term recognition result.
8. An identification device for implementing the deep learning based clinical term identification method according to any one of claims 1 to 7, wherein the identification device comprises a memory, a processor and a computer program stored in the memory and executable on the processor, and when the computer program is loaded into the processor, the identification device implements the deep learning based clinical term identification method, and the identification method comprises the following steps: step 1, fine adjustment of a pre-training model; step 2, constructing a clinical entity library; step 3, context awareness network; and 4, identifying terms.
CN202210802174.4A 2022-07-08 2022-07-08 Clinical term identification method and device based on deep learning Pending CN115545021A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210802174.4A CN115545021A (en) 2022-07-08 2022-07-08 Clinical term identification method and device based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210802174.4A CN115545021A (en) 2022-07-08 2022-07-08 Clinical term identification method and device based on deep learning

Publications (1)

Publication Number Publication Date
CN115545021A true CN115545021A (en) 2022-12-30

Family

ID=84723565

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210802174.4A Pending CN115545021A (en) 2022-07-08 2022-07-08 Clinical term identification method and device based on deep learning

Country Status (1)

Country Link
CN (1) CN115545021A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116646078A (en) * 2023-07-19 2023-08-25 中国人民解放军总医院 Cardiovascular critical clinical decision support system and device based on artificial intelligence
CN117807956A (en) * 2023-12-29 2024-04-02 兰州理工大学 ICD automatic coding method based on clinical text tree structure
CN118194867A (en) * 2023-12-05 2024-06-14 武汉大学 Method and system for establishing enterprise digital term library based on semi-supervised learning

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116646078A (en) * 2023-07-19 2023-08-25 中国人民解放军总医院 Cardiovascular critical clinical decision support system and device based on artificial intelligence
CN116646078B (en) * 2023-07-19 2023-11-24 中国人民解放军总医院 Cardiovascular critical clinical decision support system and device based on artificial intelligence
CN118194867A (en) * 2023-12-05 2024-06-14 武汉大学 Method and system for establishing enterprise digital term library based on semi-supervised learning
CN117807956A (en) * 2023-12-29 2024-04-02 兰州理工大学 ICD automatic coding method based on clinical text tree structure

Similar Documents

Publication Publication Date Title
CN110502749B (en) Text relation extraction method based on double-layer attention mechanism and bidirectional GRU
WO2018028077A1 (en) Deep learning based method and device for chinese semantics analysis
CN112001177A (en) Electronic medical record named entity identification method and system integrating deep learning and rules
CN111062217B (en) Language information processing method and device, storage medium and electronic equipment
CN111046179B (en) Text classification method for open network question in specific field
CN112542223A (en) Semi-supervised learning method for constructing medical knowledge graph from Chinese electronic medical record
CN109508459B (en) Method for extracting theme and key information from news
CN110059185A (en) A kind of medical files specialized vocabulary automation mask method
CN106980608A (en) A kind of Chinese electronic health record participle and name entity recognition method and system
CN115545021A (en) Clinical term identification method and device based on deep learning
US20230069935A1 (en) Dialog system answering method based on sentence paraphrase recognition
CN111401058B (en) Attribute value extraction method and device based on named entity recognition tool
CN114818717B (en) Chinese named entity recognition method and system integrating vocabulary and syntax information
CN113360667B (en) Biomedical trigger word detection and named entity identification method based on multi-task learning
CN112800184B (en) Short text comment emotion analysis method based on Target-Aspect-Opinion joint extraction
CN112749277B (en) Medical data processing method, device and storage medium
CN116341546A (en) Medical natural language processing method based on pre-training model
CN115510230A (en) Mongolian emotion analysis method based on multi-dimensional feature fusion and comparative reinforcement learning mechanism
CN114882488A (en) Multi-source remote sensing image information processing method based on deep learning and attention mechanism
CN116227594A (en) Construction method of high-credibility knowledge graph of medical industry facing multi-source data
CN114970536A (en) Combined lexical analysis method for word segmentation, part of speech tagging and named entity recognition
CN111125378A (en) Closed-loop entity extraction method based on automatic sample labeling
Wang et al. Aspect-based sentiment analysis with graph convolutional networks over dependency awareness
CN114388108A (en) User feedback analysis method based on multi-task learning
CN116757195A (en) Implicit emotion recognition method based on prompt learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination