WO2020244534A1 - 医疗问答方法、医疗问答系统、电子设备和计算机可读存储介质 - Google Patents
医疗问答方法、医疗问答系统、电子设备和计算机可读存储介质 Download PDFInfo
- Publication number
- WO2020244534A1 WO2020244534A1 PCT/CN2020/094068 CN2020094068W WO2020244534A1 WO 2020244534 A1 WO2020244534 A1 WO 2020244534A1 CN 2020094068 W CN2020094068 W CN 2020094068W WO 2020244534 A1 WO2020244534 A1 WO 2020244534A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- semantic
- patient
- medical
- intention
- answer
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 46
- 238000003860 storage Methods 0.000 title claims abstract description 19
- 230000014509 gene expression Effects 0.000 claims abstract description 68
- 238000013507 mapping Methods 0.000 claims abstract description 42
- 208000024891 symptom Diseases 0.000 claims description 64
- 238000004458 analytical method Methods 0.000 claims description 57
- 238000002372 labelling Methods 0.000 claims description 17
- 238000004590 computer program Methods 0.000 claims description 13
- 239000000284 extract Substances 0.000 claims description 10
- 230000015654 memory Effects 0.000 claims description 9
- 238000004364 calculation method Methods 0.000 claims description 6
- 201000010099 disease Diseases 0.000 description 25
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 25
- 239000003814 drug Substances 0.000 description 20
- 229940079593 drug Drugs 0.000 description 9
- 238000005516 engineering process Methods 0.000 description 9
- 238000010586 diagram Methods 0.000 description 8
- 206010013789 Dry throat Diseases 0.000 description 7
- 238000003745 diagnosis Methods 0.000 description 7
- 206010037660 Pyrexia Diseases 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000012549 training Methods 0.000 description 5
- 206010019233 Headaches Diseases 0.000 description 4
- 230000036528 appetite Effects 0.000 description 4
- 235000019789 appetite Nutrition 0.000 description 4
- 230000002457 bidirectional effect Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 231100000869 headache Toxicity 0.000 description 4
- 230000000306 recurrent effect Effects 0.000 description 4
- 206010012735 Diarrhoea Diseases 0.000 description 3
- 230000002155 anti-virotic effect Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 208000008035 Back Pain Diseases 0.000 description 2
- 206010028735 Nasal congestion Diseases 0.000 description 2
- 208000002193 Pain Diseases 0.000 description 2
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 2
- 206010038776 Retching Diseases 0.000 description 2
- 208000036071 Rhinorrhea Diseases 0.000 description 2
- 206010039101 Rhinorrhoea Diseases 0.000 description 2
- 206010047700 Vomiting Diseases 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 239000004053 dental implant Substances 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 230000007787 long-term memory Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000003058 natural language processing Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000006403 short-term memory Effects 0.000 description 2
- 230000035900 sweating Effects 0.000 description 2
- 230000008673 vomiting Effects 0.000 description 2
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 208000010201 Exanthema Diseases 0.000 description 1
- 208000008454 Hyperhidrosis Diseases 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 206010050031 Muscle strain Diseases 0.000 description 1
- 206010033645 Pancreatitis Diseases 0.000 description 1
- 208000013738 Sleep Initiation and Maintenance disease Diseases 0.000 description 1
- 208000022531 anorexia Diseases 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 239000010231 banlangen Substances 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 201000005884 exanthem Diseases 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 206010022437 insomnia Diseases 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000005906 menstruation Effects 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 208000025661 ovarian cyst Diseases 0.000 description 1
- 230000016087 ovulation Effects 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 229960003387 progesterone Drugs 0.000 description 1
- 239000000186 progesterone Substances 0.000 description 1
- 206010037844 rash Diseases 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 229940072651 tylenol Drugs 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/247—Thesauruses; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/20—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H80/00—ICT specially adapted for facilitating communication between medical practitioners or patients, e.g. for collaborative diagnosis, therapy or health monitoring
Definitions
- the present disclosure relates to the field of Internet technology, and in particular to a medical question answering method, a medical question answering system, an electronic device and a computer-readable storage medium.
- the embodiments of the present disclosure provide a medical question answering method, a medical question answering system, an electronic device, and a non-transitory computer-readable storage medium.
- the first aspect of the present disclosure provides a medical question and answer method, including:
- the synonym mapping table includes a mapping relationship between a plurality of standard expression words and respective corresponding entity words ;
- the corresponding answer is output according to the semantic analysis result.
- the recognizing the patient's intention according to the medical consultation sentence input by the patient includes:
- the patient's intention is determined.
- the extracting at least one entity word corresponding to the condition feature from the medical consultation sentence according to the patient's intention includes:
- the semantic slot template including a plurality of semantic slots for characterizing the characteristics of the condition
- the extracting entity words corresponding to the multiple semantic slots in the semantic slot template from the medical consultation sentence includes:
- a sequence labeling model is used to sequence the medical consultation sentences, and the entity words corresponding to the multiple semantic slots in the semantic slot template are obtained according to the sequence labeling results.
- the generating a semantic analysis result according to the patient's intention and the standard expression word includes:
- the semantic analysis result is generated according to the patient's intention, each semantic slot and its filling value.
- the output of the corresponding answer according to the semantic analysis result includes:
- each of the sample groups including question samples and corresponding answer samples
- the calculating the degree of matching between the semantic analysis result and each sample group in the doctor-patient question and answer knowledge base includes:
- the matching degree is generated according to the similarity degree and the first weighting coefficient, and the correlation degree and the second weighting coefficient.
- the disease characteristics include: at least one of onset symptoms, symptom onset time, symptom duration, accompanying symptoms, medical history, treatment history, and patient age.
- the medical question and answer method before recognizing the patient's intention according to the medical consultation sentence input by the patient, the medical question and answer method further includes:
- the synonym mapping table is generated.
- the second aspect of the present disclosure provides a medical question answering system, including:
- the intention recognizer is used to recognize the patient’s intention according to the medical consultation sentence entered by the patient;
- An entity word extractor which is used to extract at least one entity word corresponding to a feature of the condition from the medical consultation sentence according to the patient's intention
- the standard word acquisition unit is configured to acquire standard expression words that are synonymous with each of the at least one entity word according to a preset synonym mapping table; wherein, the synonym mapping table includes a plurality of standard expression words and their respective corresponding The mapping relationship between entity words;
- a parser configured to generate a semantic analysis result according to the patient's intention and the standard expression
- the output unit outputs the corresponding answer according to the semantic analysis result.
- the intention recognizer is also used for:
- the patient's intention is determined.
- the entity word extractor includes:
- a template obtaining unit configured to obtain a semantic slot template corresponding to the patient's intention, the semantic slot template including a plurality of semantic slots for characterizing the characteristics of the condition;
- the recognition unit is configured to extract entity words corresponding to the multiple semantic slots in the semantic slot template from the medical consultation sentence.
- the identification unit is further configured to:
- a sequence labeling model is used to sequence the medical consultation sentences, and the entity words corresponding to the multiple semantic slots in the semantic slot template are obtained according to the sequence labeling results.
- the parser includes:
- the filling unit is used to fill the standard expression words corresponding to the patient's medical consultation sentence into the corresponding semantic slots of the plurality of semantic slots;
- a judging unit for judging whether there is an unfilled semantic slot in the semantic slot template
- the inquiry unit is used to generate an inquiry question corresponding to the unfilled semantic slot when there is an unfilled semantic slot in the semantic slot template, and respond to the unfilled semantic slot according to the answer sentence entered by the patient for the inquiry question.
- the semantic slot is filled until all the semantic slots of the current semantic slot template are filled;
- the parsing unit is used to generate the semantic parsing result according to the patient's intention, each semantic slot and its filling value.
- the output unit includes:
- the matching degree calculator is used to calculate the matching degree between the semantic analysis result and each sample group in the doctor-patient question and answer knowledge base, each sample group includes a question sample and its corresponding answer sample;
- the output unit is used to output the answer sample corresponding to the maximum matching degree.
- the matching degree calculator includes:
- the generating subunit is configured to generate the matching degree according to the similarity and the first weighting coefficient, and the correlation and the second weighting coefficient.
- the medical question answering system further includes:
- Standard word database generator used to generate a standard word database, in which there are multiple standard expression word samples
- Synonym collector for collecting at least one synonym corresponding to each standard expression word sample
- the filter is used to calculate the similarity between each standard expression word sample and its corresponding synonym, keep the synonym corresponding to the similarity greater than the preset value, and replace the synonym corresponding to the similarity less than or equal to the preset value Remove;
- the mapping table generator is used to generate the synonym mapping table according to each synonym and its corresponding and currently reserved synonyms.
- a third aspect of the present disclosure provides an electronic device including a memory and a processor, and a computer program is stored on the memory, wherein the computer program is executed by the processor to implement the method according to the first aspect of the present disclosure.
- the fourth aspect of the present disclosure provides a non-transitory computer-readable storage medium on which a computer program is stored, wherein, when the computer program is executed by a processor, the implementation in each embodiment of the first aspect of the present disclosure Any one of the mentioned medical question and answer methods.
- FIG. 1 is a flowchart of a medical question-and-answer method provided by an embodiment of the disclosure
- FIG. 3 is a schematic diagram of the process of generating a synonym mapping table provided by an embodiment of the disclosure
- FIG. 4 is a schematic structural diagram of a medical question answering system provided by an embodiment of the disclosure.
- FIG. 5 is a schematic structural diagram of a medical question answering system provided by an embodiment of the disclosure.
- the inventor of the present disclosure found that when a patient (or user) consults a related online disease question and answer website, the question input by the patient is generally colloquial and the description is diverse, resulting in the relevant online disease question and answer website (or related Question answering system) cannot answer the questions entered by patients well.
- Fig. 1 is a flowchart of a medical question-and-answer method provided by an embodiment of the disclosure.
- the medical question answering method may be executed by a medical question answering system, the system may be implemented by software and/or hardware, and the system may be integrated in an electronic device.
- the medical question-and-answer method may include the following steps S11 to S15.
- Step S11 Identify the patient's intention according to the medical consultation sentence input by the patient (or user).
- the types of the patient's intentions may include “disease diagnosis”, “treatment”, “medicine”, “medicine effect consultation”, “pathogenesis consultation”, “surgical consultation”, and the like.
- step S11 may use a preset intention recognition model to determine the specific type of patient intention.
- Step S12 according to the patient's intention, extract at least one entity word corresponding to the condition feature from the medical consultation sentence.
- the patient's condition characteristics include at least one of onset symptoms, symptom onset time, symptom duration, accompanying symptoms, medical history, treatment history, and patient age.
- Each intention can correspond to one or more preset disease characteristics.
- the entity word may be a word corresponding to at least one of the patient's onset symptoms, symptom occurrence time, symptom duration, accompanying symptoms, medical history, treatment history, and age.
- the patient's intention is identified as “treatment”, and based on the intention of "treatment”, extract: corresponding to "patient age”
- Step S13 Obtain a standard expression word that is synonymous with each of the at least one entity word according to a preset synonym mapping table.
- the synonym mapping table may include a mapping relationship between a plurality of standard expression words and their respective corresponding synonyms (ie, the entity words).
- the entity words extracted in step S12 can be colloquialized words, such as “diarrhea”, “cannot eat”, “bad appetite”, “bad appetite”; according to the synonym mapping table, it can be obtained with “diarrhea”
- the corresponding standard expression is “diarrhea”, and the corresponding standard expressions of "can't eat,” “bad appetite,” and “bad appetite” are all “anorexia”.
- Step S14 Generate a semantic analysis result according to the patient's intention and standard expression words.
- Step S15 Output a corresponding answer according to the semantic analysis result.
- FIG. 2 is a flowchart of another medical question and answer method provided by an embodiment of the disclosure. As shown in FIG. 2, the medical question and answer method may include the following steps S21 to S25.
- Step S21 Identify the patient's intention according to the medical consultation sentence input by the patient.
- this step S21 may include the following steps S211 to S213.
- Step S211 Obtain the document subject information of the medical consultation sentence input by the patient; and convert the medical consultation sentence input by the patient from text data into vector data.
- a document topic generation (also called Latent Dirichlet Allocation, referred to as LDA) model may be used to generate document topic information of a medical consultation sentence, and the word2vec model may be used to convert the medical consultation sentence into an embedding vector.
- LDA Latent Dirichlet Allocation
- Step S212 According to the document subject information and vector data corresponding to the medical consultation sentence, a score corresponding to each preset intention of the medical consultation sentence is obtained.
- document subject information and vector data corresponding to medical consultation sentences can be spliced to obtain a vector matrix containing word information and subject information, and the vector matrix can be input to the bidirectional gated recurrent unit (BiGRU) to obtain the medical consultation sentence correspondence
- BiGRU bidirectional gated recurrent unit
- Step S213 Determine the patient's intention according to the score of the medical consultation sentence corresponding to each preset intention.
- a softmax classifier is used to map the score corresponding to each intent to a probability between (0, 1), so that the patient's intent is determined according to the maximum probability.
- the softmax classifier is only for illustration, and other classifiers, such as svm classifier, can also be applied.
- step S21 may be performed using a preset intent recognition model, which may include a word2vec model, a document topic generation (LDA) model, a bidirectional gated recurrent unit (BiGRU), and a softmax classifier.
- a preset intent recognition model which may include a word2vec model, a document topic generation (LDA) model, a bidirectional gated recurrent unit (BiGRU), and a softmax classifier.
- the intent recognition model of the required function can be obtained through training.
- training use samples from professional medical websites or apps (such as Haodafu (see www.haodf.com), Dingxiang Doctor (see www.dxy.com), Ping An Good Doctor (see www.jk.cn) Etc.) or medical inquiry records (patients and doctors’ inquiry records) to collect doctor-patient question and answer data, extract the patient’s medical consultation sentences, and perform data cleaning on the text of the medical consultation sentences (that is, remove non-keywords from the text) , Such as "Hello” etc.).
- clustering algorithm is used to cluster the text data, and the types of intentions commonly asked by patients are determined by sampling; and the specific types of each type of intentions are determined by professionals (doctors or professionals with medical knowledge). And train the intention recognition model according to each medical consultation sentence and its corresponding intention type.
- the types of intentions may include: “disease diagnosis”, “treatment”, “medicine consultation”, “medicine effect consultation”, “inquiry about the cause of disease”, “surgical consultation” and “other”.
- the recognition model determines that the patient's intention is “other", it can directly prompt the user that it cannot answer such questions.
- Step S22 According to the patient's intention, extract at least one entity word corresponding to the condition feature from the medical consultation sentence.
- the disease characteristics include: at least one of onset symptoms, symptom onset time, symptom duration, accompanying symptoms, medical history, treatment history, and patient age.
- this step S22 may include the following steps S221 and S222.
- Step S221 Obtain a semantic slot template corresponding to the patient's intention, and each semantic slot template includes a plurality of semantic slots for characterizing the characteristics of the condition.
- the semantic slot template corresponding to each intent can be preset.
- the semantic slot template corresponding to "medicine consultation” includes multiple semantic slots for representing "symptoms", “time when symptoms occur”, “accompanying symptoms”, “medical history”, and "treatment history”.
- Step S222 Extract entity words corresponding to the semantic slot in the semantic slot template from the medical consultation sentence.
- a named entity recognition method may be used to extract entity words corresponding to semantic slots in the semantic slot template from the medical consultation sentence.
- step S222 may include: sequence labeling the medical consultation sentence using a sequence labeling model, and obtaining the entity word corresponding to the semantic slot in the semantic slot template according to the sequence labeling result.
- the sequence annotation model may be the BiLSTM-CRF model, which uses the BIO annotation set to perform named entity recognition based on the name of the semantic slot.
- the semantic slot template includes two semantic slots: "disease" and "symptom name”.
- B-DIS represents the first word of the disease
- I-DIS represents the non-first word of the disease
- B-SYM represents the first word of the symptom
- I-SYM symptoms are not the first word
- O means that the word is not part of the named entity.
- B1-DIS can represent the first word of the disease
- I1-DIS can represent the disease Not the first word
- B1-SYM represents the first word of symptoms
- I1-SYM is the first word of symptoms
- B2-DIS represents the first word of onset time
- I2-DIS represents the first word of onset time
- B1-SYM represents the first word of medication history
- I2- SYM symptoms are not the first word
- O means that the word is not part of the named entity.
- the BiLSTM-CRF model can be obtained through training. During training, set multiple sample sequences and their corresponding label sequences, each sample sequence and its corresponding label sequence have the same length; use the sample sequence as the input of the initial BiLSTM-CRF model, and set the sample sequence corresponding to the label sequence As the output of the initial BiLSTM-CRF model, the BiLSTM-CRF model with the required function is obtained through multiple training.
- Step S23 Obtain a standard expression word that is synonymous with each of the at least one entity word according to a preset synonym mapping table.
- the synonym mapping table includes a mapping relationship between a plurality of standard expression words and their corresponding synonyms (ie, the entity words).
- the synonym mapping table may be provided before step S21.
- FIG. 3 is a schematic diagram of a process of generating a synonym mapping table provided by an embodiment of the disclosure. As shown in FIG. 3, the process of generating the synonym mapping table may include the following steps S301 to S304.
- Step S301 Generate a standard vocabulary.
- the standard vocabulary stores multiple standard expression word samples (ie, multiple standard expression word samples).
- Step S302 Collect at least one synonym corresponding to each standard expression word sample.
- synonyms corresponding to a sample of standard expressions mean that they have the same or basically the same meaning as the standard expressions.
- Synonyms corresponding to standard expression word samples can be collected from major medical websites, forums, Baidu Encyclopedia (see baike.baidu.com) and other websites. The synonyms collected in this step can be colloquial non-standard expression words.
- standard expressions can be obtained from authoritative medical textbooks, dictionaries, manuals, etc., such as the diagnosis and treatment guidelines for various diseases issued by the medical and health management department, the clinical diagnosis and treatment guidelines issued by the medical industry association, and the doctor's desk reference (PDR, Physician's Desk Reference) ), Pharmacopoeia, etc.
- Step S303 Calculate the similarity (such as cosine similarity) between each standard expression word sample and its corresponding synonym; keep the synonyms corresponding to the similarity greater than the preset value, and compare the similarity less than or equal to the preset value The synonyms corresponding to the degree are removed.
- the similarity such as cosine similarity
- an existing synonym recognition model for example, word2vec can be used to calculate the semantic similarity between words
- the similarity is too small, it indicates that the corresponding standard expression word sample does not have the same meaning as the collected synonyms, and the synonyms can be removed.
- the similarity may be a value between 0 and 1.
- the preset value for determining the similarity between each standard expression word sample and its corresponding synonym can be set according to actual needs.
- Synonyms toolkit In the field of natural language processing technology, a variety of models for identifying synonyms have been developed. For example, Synonyms toolkit, LRWE model, etc. The embodiments of the present disclosure may also use these known models to identify synonyms.
- Step S304 Generate the synonym mapping table according to each standard expression word sample (ie, a standard expression word sample) and its corresponding and currently reserved synonyms.
- Table 2 exemplarily shows a part of the synonym mapping table.
- step S23 the standard expression words that are synonymous with the entity words can be directly searched from the synonym mapping table.
- Step S24 Generate a semantic analysis result according to the patient's intention and standard expression words.
- step S24 specifically includes the following steps S241 to S244.
- Step S241 Fill the standard expression words corresponding to the patient's medical consultation sentence into corresponding semantic slots among the multiple semantic slots of the current semantic slot template.
- the medical consultation sentence entered by the patient is "a cold, dry throat, what kind of medicine do you need to take", and the patient's intention can be identified as “medicine consultation” according to the medical consultation sentence.
- the semantic slot template corresponding to this intention is multiple Semantic slots include: “symptoms”, “time when symptoms occurred”, “accompanying symptoms”, “medical history” and “treatment history”.
- the entity words related to "symptoms” are: “cold, dry throat”; using the synonym mapping table to get the standard expression word “dry throat”, Then, fill "cold, dry throat” into the semantic slot of "symptoms".
- Step S242 Determine whether there is an unfilled semantic slot in the current semantic slot template.
- step S243 If the result of the judgment is that there is no unfilled semantic slot in the current semantic slot template, then proceed to step S243, as described below.
- step S244 If the result of the judgment is that there is an unfilled semantic slot in the current semantic slot template, proceed to step S244.
- step S244 an inquiry question corresponding to the unfilled semantic slot is generated, and the unfilled semantic slot is filled according to the answer sentence input by the patient for the inquiry question until all the semantic slots are filled.
- the medical consultation sentence entered by the patient for the first time may contain only a few disease characteristics, for example, only the symptoms and onset time; and in most cases, the time, characteristics, The state and accompanying symptoms directly determine the possibility that the patient may develop a certain disease.
- vomiting is a common symptom. It may be a symptom caused by a cold, or it may be a symptom caused by other reasons. The time of vomiting is different, and the result of the diagnosis may be different.
- the medical question and answer method breaks through the traditional single-round question and answer method, and realizes multiple rounds of interaction.
- Step S243 Generate a semantic analysis result according to the patient's intention, each semantic slot and its filling value.
- the intent is "medicine consultation”
- the semantic slot includes “symptoms”, “time when symptoms occur”, “accompanying symptoms”, “medical history” and “treatment history”
- the semantic slot "symptoms” has a slot value of "headache”
- semantics The slot "time of symptom occurrence” has a slot value of "one day ago”
- the semantic slot "accompanying symptoms” has a slot value of "retching”
- the semantic slot “medical history” has a slot value of "three positives”
- the slot value of "history” is "anti-virus”
- Step S25 Output a corresponding answer according to the semantic analysis result.
- this step S25 may include the following steps S251 and S252.
- Step S251 Calculate the matching degree between the semantic analysis result and each sample group in the doctor-patient question and answer knowledge base, each sample group includes a question sample (that is, a sample of the question) and its corresponding answer sample (that is, a sample of the answer).
- step S251 may specifically include the following steps S251a and S251b.
- Step S251a Calculate the similarity between the semantic analysis result and the question sample, and the correlation between the semantic analysis result and the answer sample.
- the similarity between the semantic analysis result and the question sample and the correlation between the semantic analysis result and the answer sample can all be calculated using existing correlation calculation methods, such as the BM25 algorithm.
- Step S251b Generate a matching degree according to the similarity degree and the first weighting coefficient, and the correlation degree and the second weighting coefficient. That is, the matching degree is the sum of the product of the similarity degree and the first weighting coefficient and the product of the correlation degree and the second weighting coefficient.
- the first weighting coefficient and the second weighting coefficient can be set according to actual needs, each of the first weighting coefficient and the second weighting coefficient is between 0 and 1, and the sum of the first weighting coefficient and the second weighting coefficient Equal to 1.
- Step S252 Output the answer sample corresponding to the maximum matching degree.
- the matching degree is the similarity between the semantic analysis result and the question sample and the weighted sum of the semantic analysis result and the answer sample, in the sample group with the largest matching degree, the similarity between the question sample and the semantic analysis result and the answer sample and the semantic analysis result The relevance is relatively high.
- Table 3 lists the similarity between the question corresponding to a semantic analysis result and each question sample and the correlation with each answer sample.
- the question samples and answer samples in the same row in Table 3 are the same sample group.
- the semantic analysis results have the highest similarity with the first question sample and the correlation with the first answer sample.
- the semantic analysis result has the highest matching degree with the first sample group, so the first answer sample is output.
- the following example introduces the medical question answering system method.
- the patient entered the medical consultation sentence as "cold, nasal congestion, headache, dry throat, sore back, pain in the temples, I started to get sick yesterday morning, probably a runny nose, a little pain in the head in the afternoon, and dry throat last night.
- the stomach turned a bit, and then I had insomnia until more than two o’clock. I woke up this morning with a clear nose in one nostril, and a yellow nose in one nostril. It turned into a clear nose after rubbing it three or four times. This afternoon, I had a fever and a fever. Sweating. What medicine do I need to take?".
- FIG. 4 is a schematic structural diagram of a medical question answering system provided by an embodiment of the disclosure.
- the medical question answering system can be used to execute the above medical question answering method.
- the medical question answering system may include: an intention recognizer 10, an entity word extractor 20, a standard word acquisition unit 30, a parser 40, and an output unit 50.
- the intention recognizer 10 is used to recognize the patient's intention based on a medical consultation sentence input by the patient.
- the intention recognizer 10 can be used to convert the medical consultation sentence input by the patient from text data into vector data; input the vector data into a preset intention recognition model to recognize the patient's intention.
- the intent recognition model is a classification model based on a document topic generation model and a two-way gated recurrent unit.
- the intent recognizer 10 is further used to: obtain document subject information of the medical consultation sentence input by the patient; convert the medical consultation sentence input by the patient from text data into vector data; The document subject information corresponding to the medical consultation sentence and the vector data, obtaining the score of the medical consultation sentence corresponding to each preset intention; and according to the score of the medical consultation sentence corresponding to each preset intention, Determine the patient's intentions.
- the entity word extractor 20 is used for extracting at least one entity word corresponding to the condition feature from the medical consultation sentence according to the patient's intention.
- the characteristics of the condition include: at least one of onset symptoms, symptom onset time, symptom duration, accompanying symptoms, medical history, treatment history, and patient age.
- the standard word acquiring unit 30 is configured to acquire standard expression words that are synonymous with the entity word according to a preset synonym mapping table.
- the synonym mapping table includes a mapping relationship between a plurality of standard expression words and their corresponding synonyms (ie, the entity words).
- the parser 40 is used to generate a semantic analysis result according to the patient's intention and standard expression words.
- the output unit 50 is configured to output a corresponding answer according to the semantic analysis result.
- FIG. 5 is a schematic structural diagram of a medical question answering system provided by an embodiment of the disclosure.
- the medical question answering system in addition to the above-mentioned intention recognizer 10, the entity word extractor 20, the standard word acquisition unit 30, the parser 40, and the output unit 50, the medical question answering system also includes: a standard dictionary generator 60, Synonym collector 70, filter 80, and mapping table generator 90.
- the entity word extractor 20 includes a template acquisition unit 21 and a recognition unit 22.
- the template obtaining unit 21 is used to obtain a semantic slot template corresponding to the patient's intention, and each semantic slot template includes a plurality of semantic slots for characterizing disease characteristics.
- the recognition unit 22 is used to extract entity words corresponding to the semantic slot in the semantic slot template from the medical consultation sentence.
- the recognition unit 22 may be configured to use the sequence labeling model to sequence the medical consultation sentences, and obtain the entity words corresponding to the semantic slot in the semantic slot template according to the sequence labeling result.
- the recognition unit may also be used to perform sequence labeling on the medical consultation sentence using a sequence labeling model, and obtain entity words corresponding to the plurality of semantic slots in the semantic slot template according to the sequence labeling result.
- the parser 40 includes: a filling unit 41, a judgment unit 42, an inquiry unit 43 and a parsing unit 44.
- the filling unit 41 is used to fill the standard expression words corresponding to the patient's medical consultation sentence into the corresponding semantic slots of the multiple semantic slots of the current semantic slot template.
- the judging unit 42 is used to judge whether there is an unfilled semantic slot in the current semantic slot template.
- the inquiring unit 43 is used to generate an inquiry question corresponding to the unfilled semantic slot when there is an unfilled semantic slot in the current semantic slot template, and respond to the unfilled semantic slot according to the answer sentence entered by the patient for the inquiry question. Filling is performed until all semantic slots in the current semantic slot template are filled.
- the parsing unit 44 is used to generate a semantic parsing result according to the patient's intention, each semantic slot and its filling value.
- the output unit 50 includes: a matching degree calculator 51 and an output unit 52.
- the matching degree calculation unit 51 is used to calculate the matching degree between the semantic analysis result and each sample group in the doctor-patient question and answer knowledge base.
- Each sample group includes question samples and their corresponding answer samples.
- the matching degree calculator 51 includes: a calculation subunit 511 and a generation subunit 512.
- the calculation subunit 511 is configured to calculate the similarity between the semantic analysis result and the question sample, and the correlation between the semantic analysis result and the answer sample.
- the generating subunit 512 is configured to generate the matching degree according to the similarity degree and the first weighting coefficient, and the correlation degree and the second weighting coefficient.
- the output unit 52 is configured to output the answer sample corresponding to the maximum matching degree.
- the standard vocabulary generator 60 is used to generate a standard vocabulary, and a plurality of standard expression word samples are stored in the standard vocabulary.
- the synonym collector 70 is used to collect at least one synonym corresponding to each standard expression word sample.
- the filter 80 is used to calculate the similarity between each standard expression word sample and its corresponding synonym; retain the synonyms corresponding to the similarity greater than the preset value, and remove the synonyms corresponding to the similarity less than or equal to the preset value .
- the preset value may be 0.5, 0.6, 0.7, 0.8, 0.9, etc.
- the mapping table generator 90 is used to generate a synonym mapping table according to each synonym and its corresponding and currently reserved synonyms.
- the medical question answering system shown in FIG. 4 or FIG. 5 may be a single computer or a single computing device, or multiple computers or multiple computing devices connected through a wired network and/or a wireless network.
- the various components of the medical question-and-answer system shown in FIG. 4 or FIG. 5 may be implemented in hardware, or in a combination of hardware and software.
- the various components of the medical question answering system shown in FIG. 4 or FIG. 5 can be implemented through a central processing unit (CPU), an application processor (AP), and a digital signal processor (DSP) having the corresponding functions described in the embodiments of the present disclosure.
- CPU central processing unit
- AP application processor
- DSP digital signal processor
- the various components of the medical question answering system shown in FIG. 4 or FIG. 5 can be implemented by a combination of a processor, a memory, and a computer program.
- the computer program is stored in the memory, and the processor receives The computer program is read and executed in the memory, so as to be used as each component of the medical question answering system shown in FIG. 4 or FIG. 5.
- the embodiments of the present disclosure also provide an electronic device, the electronic device includes: one or more processors and a storage device; wherein, one or more programs are stored on the storage device, and when the one or more programs are When executed by or multiple processors, the above one or more processors implement the medical question and answer method provided in the foregoing embodiments.
- the embodiments of the present disclosure also provide a non-transitory computer-readable storage medium on which a computer program is stored, wherein the computer program is executed by a processor to implement the medical question and answer method provided in the foregoing embodiments.
- word Embedding technology word2vec model, softmax classifier, LDA model, Bi-directional Long-Term Memory (Bi-directional Long-Term Memory) model, two-way gated cyclic unit (BiGRU), BiLSTM-CRF model, BIO annotation set , BM25 algorithm, etc. are all known technologies in the field of artificial intelligence and natural language processing.
- word Embedding technology and word2vec model please refer to Mikolov T, Chen K, Corrado G S, et al. Effective Evaluation of Word Representations in Vector Space[C].
- BiGRU bidirectional gated recurrent unit
- Cho K Van Merrienboer B
- Gulcehre C et al. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation[J].
- arXiv Computation and Language, 2014.
- BiLSTM-CRF model please refer to Huang Z, Xu W, Yu K, et al. Bidirectional LSTM-CRF Models for Sequence Tagging. [J].arXiv: Computation and Language, 2015.
- BIO annotation set please refer to Sang E F, De Meulder F.
- Some physical components or all physical components can be implemented as software executed by a processor, such as a central processing unit (CPU), a digital signal processor (DSP), a field programmable logic circuit (FPGA), or a microprocessor (MCU) , Either implemented as hardware, or implemented as an integrated circuit, such as an application specific integrated circuit (ASIC).
- a processor such as a central processing unit (CPU), a digital signal processor (DSP), a field programmable logic circuit (FPGA), or a microprocessor (MCU) , Either implemented as hardware, or implemented as an integrated circuit, such as an application specific integrated circuit (ASIC).
- a processor such as a central processing unit (CPU), a digital signal processor (DSP), a field programmable logic circuit (FPGA), or a microprocessor (MCU)
- CPU central processing unit
- DSP digital signal processor
- FPGA field programmable logic circuit
- MCU microprocessor
- ASIC application specific integrated circuit
- computer storage medium includes volatile and non-volatile data implemented in any method or technology for storing information (such as computer-readable instructions, data structures, program modules, or other data).
- Computer storage media include but are not limited to RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cassette, tape, magnetic disk storage or other magnetic storage device, or Any other medium used to store desired information and that can be accessed by a computer.
- communication media usually contain computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as carrier waves or other transmission mechanisms, and may include any information delivery media .
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Medical Informatics (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Public Health (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Pathology (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- Machine Translation (AREA)
Abstract
Description
医疗咨询语句 | 意图的种类 |
重症胰腺炎炎症没好转怎么办? | 治疗 |
多囊吃了来曲,想确定有无排卵 | 用药效果咨询 |
身上各个地方先后出现红色疹子 | 疾病诊断 |
腰肌劳损治愈的时间和用药情况 | 用药咨询 |
这是怎么回事,什么引起的发烧? | 发生原因询问 |
挤眼,张嘴,点头症状严重吗? | 疾病诊断 |
多囊,吃黄体酮七天后没来月经 | 用药效果咨询 |
多卵性卵巢囊肿需要小孩怎么办? | 治疗 |
2岁半宝宝摩擦性苔藓,如何用药 | 用药咨询 |
多颗牙齿缺失能不能做种植牙? | 手术咨询 |
Claims (19)
- 一种医疗问答方法,包括:根据患者输入的医疗咨询语句识别患者的意图;根据所述患者的意图,从所述医疗咨询语句中抽取与病情特征对应的至少一个实体词;根据预设的同义词映射表获取与所述至少一个实体词中的每一个同义的标准表述词;其中,所述同义词映射表包括多个标准表述词与各自对应的实体词之间的映射关系;根据所述患者的意图和所述标准表述词生成语义解析结果;以及根据所述语义解析结果输出相应的答案。
- 根据权利要求1所述的医疗问答方法,其中,所述根据患者输入的医疗咨询语句识别患者的意图,包括:获取所述患者输入的医疗咨询语句的文档主题信息;将所述患者输入的医疗咨询语句由文本数据转换为向量数据;根据所述医疗咨询语句所对应的文档主题信息和所述向量数据,获取所述医疗咨询语句对应于每种预设的意图的分数;以及根据所述医疗咨询语句对应于每种预设的意图的分数,确定所述患者的意图。
- 根据权利要求1或2所述的医疗问答方法,其中,所述根据患者的意图,从所述医疗咨询语句中抽取与病情特征对应的至少一个实体词,包括:获取与所述患者的意图相对应的语义槽模板,所述语义槽模板包括用于表征病情特征的多个语义槽;以及从所述医疗咨询语句中抽取与所述语义槽模板中的所述多个语义槽对应的实体词。
- 根据权利要求3所述的医疗问答方法,其中,所述从所述医疗咨询语句中抽取与所述语义槽模板中的所述多个语义槽对应的实体词,包括:利用序列标注模型对所述医疗咨询语句进行序列标注,并根据序列标注结果获得与所述语义槽模板中的所述多个语义槽对应的实体词。
- 根据权利要求3所述的医疗问答方法,其中,所述根据所述患者的意图和所述标准表述词生成语义解析结果,包括:将所述患者的医疗咨询语句所对应的标准表述词填充至所述多个语义槽中相应的语义槽中;判断当前的语义槽模板中是否存在未被填充的语义槽;若判断的结果是当前的语义槽模板中存在未被填充的语义槽,则生成与未填充的语义槽对应的询问问题,并根据患者针对所述询问问题所输入的回答语句,对未填充的语义槽进行填充,直至当前的语义槽模板的所有的语义槽均被填充为止;以及根据所述患者的意图、每个语义槽及其填充值生成所述语义解析结果。
- 根据权利要求1至5中任意一项所述的医疗问答方法,其中,所述根据所述语义解析结果输出相应的答案包括:计算所述语义解析结果与医患问答知识库中各样本组的匹配度,每个所述样本组包括问题样本及其对应的答案样本;以及将最大匹配度所对应的答案样本进行输出。
- 根据权利要求6所述的医疗问答方法,其中,所述计算所述语义解析结果与医患问答知识库中各样本组的匹配度,包括:计算所述语义解析结果与所述问题样本的相似度、以及所述语义解析结果与所述答案样本的相关度;以及根据所述相似度和第一加权系数、以及所述相关度和第二加权 系数,生成所述匹配度。
- 根据权利要求1至7中任意一项所述的医疗问答方法,其中,所述病情特征包括:发病症状、症状发生时间、症状持续时间、伴随症状、病史、治疗史和患者年龄中的至少一者。
- 根据权利要求1至7中任意一项所述的医疗问答方法,其中,在所述根据患者输入的医疗咨询语句识别患者的意图之前,所述医疗问答方法还包括:生成标准词库,该标准词库中存储有多个标准表述词样本;采集与每个标准表述词样本对应的至少一个同义词;计算每个标准表述词样本与其对应的同义词的相似度;将大于预设值的相似度所对应的同义词保留,并将小于或等于所述预设值的相似度所对应的同义词去除;以及根据每个同义词及其对应的、且当前保留的同义词,生成所述同义词映射表。
- 一种医疗问答系统,包括:意图识别器,用于根据患者输入的医疗咨询语句识别患者的意图;实体词抽取器,用于根据所述患者的意图,从所述医疗咨询语句中抽取与病情特征对应的至少一个实体词;标准词获取单元,用于根据预设的同义词映射表获取与所述至少一个实体词中的每一个同义的标准表述词;其中,所述同义词映射表包括多个标准表述词与各自对应的实体词之间的映射关系;解析器,用于根据所述患者的意图和所述标准表述词生成语义解析结果;以及输出单元,根据所述语义解析结果输出相应的答案。
- 根据权利要求10所述的医疗问答系统,其中,所述意图识 别器还用于:获取所述患者输入的医疗咨询语句的文档主题信息;将所述患者输入的医疗咨询语句由文本数据转换为向量数据;根据所述医疗咨询语句所对应的文档主题信息和所述向量数据,获取所述医疗咨询语句对应于每种预设的意图的分数;以及根据所述医疗咨询语句对应于每种预设的意图的分数,确定所述患者的意图。
- 根据权利要求10或11所述的医疗问答系统,其中,所述实体词抽取器包括:模板获取单元,用于获取与所述患者的意图相对应的语义槽模板,所述语义槽模板包括用于表征病情特征的多个语义槽;以及识别单元,用于从所述医疗咨询语句中抽取与所述语义槽模板中的所述多个语义槽对应的实体词。
- 根据权利要求12所述的医疗问答系统,其中,所述识别单元还用于:利用序列标注模型对所述医疗咨询语句进行序列标注,并根据序列标注结果获得与所述语义槽模板中的所述多个语义槽对应的实体词。
- 根据权利要求12所述的医疗问答系统,其中,所述解析器包括:填充单元,用于将所述患者的医疗咨询语句所对应的标准表述词填充至所述多个语义槽中相应的语义槽中;判断单元,用于判断所述语义槽模板中是否存在未被填充的语义槽;询问单元,用于当所述语义槽模板中存在未填充的语义槽时,生成与未填充的语义槽对应的询问问题,并根据患者针对所述询问问题所输入的回答语句,对未填充的语义槽进行填充,直至当前的语义 槽模板的所有的语义槽均被填充为止;以及解析单元,用于根据所述患者的意图、每个语义槽及其填充值生成所述语义解析结果。
- 根据权利要求10至14中任一项所述的医疗问答系统,其中,所述输出单元包括:匹配度计算器,用于计算所述语义解析结果与医患问答知识库中各样本组的匹配度,每个所述样本组包括问题样本及其对应的答案样本;以及输出单元,用于将最大匹配度所对应的答案样本进行输出。
- 根据权利要求15所述的医疗问答系统,其中,所述匹配度计算器包括:计算子单元,用于计算所述语义解析结果与所述问题样本的相似度、以及所述语义解析结果与所述答案样本的相关度;以及生成子单元,用于根据所述相似度和第一加权系数、以及所述相关度和第二加权系数,生成所述匹配度。
- 根据权利要求10至16中任意一项所述的医疗问答系统,还包括:标准词库生成器,用于生成标准词库,该标准词库中存储有多个标准表述词样本;同义词采集器,用于采集与每个标准表述词样本对应的至少一个同义词;筛选器,用于计算每个标准表述词样本与其对应的同义词的相似度,将大于预设值的相似度所对应的同义词保留,将小于或等于所述预设值的相似度所对应的同义词去除;以及映射表生成器,用于根据每个同义词及其对应的、且当前保留的同义词,生成所述同义词映射表。
- 一种电子设备,包括存储器和处理器,所述存储器上存储有计算机程序,其中,所述计算机程序被所述处理器执行时实现根据权利要求1至9中任意一项所述的医疗问答方法。
- 一种非暂时性计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现根据权利要求1至9中任意一项所述的医疗问答方法。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/282,035 US20210375404A1 (en) | 2019-06-05 | 2020-06-03 | Medical question-answering method, medical question-answering system, electronic device, and computer readable storage medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910484808.4A CN110176315B (zh) | 2019-06-05 | 2019-06-05 | 医疗问答方法及系统、电子设备、计算机可读介质 |
CN201910484808.4 | 2019-06-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020244534A1 true WO2020244534A1 (zh) | 2020-12-10 |
Family
ID=67697060
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/094068 WO2020244534A1 (zh) | 2019-06-05 | 2020-06-03 | 医疗问答方法、医疗问答系统、电子设备和计算机可读存储介质 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20210375404A1 (zh) |
CN (1) | CN110176315B (zh) |
WO (1) | WO2020244534A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116662374A (zh) * | 2023-07-31 | 2023-08-29 | 天津市扬天环保科技有限公司 | 基于相关性分析的信息技术咨询服务系统 |
CN117476163A (zh) * | 2023-12-27 | 2024-01-30 | 万里云医疗信息科技(北京)有限公司 | 用于确定疾病结论的方法、装置以及存储介质 |
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110176315B (zh) * | 2019-06-05 | 2022-06-28 | 京东方科技集团股份有限公司 | 医疗问答方法及系统、电子设备、计算机可读介质 |
CN110781677B (zh) * | 2019-10-12 | 2023-02-07 | 深圳平安医疗健康科技服务有限公司 | 药品信息匹配处理方法、装置、计算机设备和存储介质 |
CN111063429A (zh) * | 2019-10-25 | 2020-04-24 | 中国科学院自动化研究所 | 一种医疗咨询方法、装置、设备和计算机可读存储介质 |
CN111180077A (zh) * | 2019-11-29 | 2020-05-19 | 厦门快商通科技股份有限公司 | 医美主题识别方法、装置、设备及存储介质 |
CN111177334A (zh) * | 2019-11-29 | 2020-05-19 | 厦门快商通科技股份有限公司 | 医美主题切换方法、装置、设备及存储介质 |
CN111339252B (zh) * | 2020-02-25 | 2021-05-11 | 腾讯科技(深圳)有限公司 | 一种搜索方法、装置及存储介质 |
CN111694942A (zh) * | 2020-05-29 | 2020-09-22 | 平安科技(深圳)有限公司 | 问答方法、装置、设备及计算机可读存储介质 |
CN113742480A (zh) * | 2020-06-18 | 2021-12-03 | 北京汇钧科技有限公司 | 客服应答方法和装置 |
CN111966781B (zh) * | 2020-06-28 | 2024-02-20 | 北京百度网讯科技有限公司 | 数据查询的交互方法及装置、电子设备和存储介质 |
CN112037903B (zh) * | 2020-08-31 | 2023-08-15 | 康键信息技术(深圳)有限公司 | 在线问诊开药系统 |
CN112102840B (zh) * | 2020-09-09 | 2024-05-03 | 中移(杭州)信息技术有限公司 | 语义识别方法、装置、终端及存储介质 |
CN113076403A (zh) * | 2021-04-21 | 2021-07-06 | 深圳追一科技有限公司 | 一种用户消息处理方法及相关设备 |
US20220375626A1 (en) * | 2021-05-21 | 2022-11-24 | Nuance Communications, Inc. | Telehealth System and Method |
CN113449089B (zh) * | 2021-06-11 | 2023-12-01 | 车智互联(北京)科技有限公司 | 一种查询语句的意图识别方法、问答方法及计算设备 |
CN113707303A (zh) * | 2021-08-30 | 2021-11-26 | 康键信息技术(深圳)有限公司 | 基于知识图谱的医疗问题解答方法、装置、设备及介质 |
CN113764112A (zh) * | 2021-09-16 | 2021-12-07 | 山东大学第二医院 | 一种在线医疗问答方法 |
CN113642327A (zh) * | 2021-10-14 | 2021-11-12 | 中国光大银行股份有限公司 | 一种标准知识库的构建方法及装置 |
CN114357144B (zh) * | 2022-03-09 | 2022-08-09 | 北京大学 | 基于小样本的医疗数值抽取和理解方法及装置 |
CN115878790B (zh) * | 2022-04-08 | 2023-08-25 | 北京中关村科金技术有限公司 | 一种智能问答方法、装置、存储介质及电子设备 |
CN115080751B (zh) * | 2022-08-16 | 2022-11-11 | 之江实验室 | 一种基于通用模型的医学标准术语管理系统及方法 |
CN115840808B (zh) * | 2022-12-27 | 2023-08-11 | 广州汉申科技中介服务有限公司 | 科技项目咨询方法、装置、服务器及计算机可读存储介质 |
CN116578682B (zh) * | 2023-05-22 | 2024-02-13 | 浙江法之道信息技术有限公司 | 一种法务服务的智能咨询方法及系统 |
CN116756579B (zh) * | 2023-08-22 | 2023-12-12 | 腾讯科技(深圳)有限公司 | 大语言模型的训练方法及基于大语言模型的文本处理方法 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102663129A (zh) * | 2012-04-25 | 2012-09-12 | 中国科学院计算技术研究所 | 医疗领域深度问答方法及医学检索系统 |
US9367608B1 (en) * | 2009-01-07 | 2016-06-14 | Guangsheng Zhang | System and methods for searching objects and providing answers to queries using association data |
CN106897559A (zh) * | 2017-02-24 | 2017-06-27 | 黑龙江特士信息技术有限公司 | 一种面向多数据源的症状体征类实体识别方法及装置 |
CN109637674A (zh) * | 2018-10-30 | 2019-04-16 | 北京健康有益科技有限公司 | 自动获取健康医疗问题答案的方法、系统、介质和设备 |
CN109840275A (zh) * | 2019-01-31 | 2019-06-04 | 北京嘉和美康信息技术有限公司 | 一种医疗搜索语句的处理方法、装置和设备 |
CN110176315A (zh) * | 2019-06-05 | 2019-08-27 | 京东方科技集团股份有限公司 | 医疗问答方法及系统、电子设备、计算机可读介质 |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004126981A (ja) * | 2002-10-03 | 2004-04-22 | Medifocus:Kk | 診断装置、診断方法およびプログラム |
US8200501B2 (en) * | 2006-01-26 | 2012-06-12 | International Business Machines Corporation | Methods, systems and computer program products for synthesizing medical procedure information in healthcare databases |
US20120141962A1 (en) * | 2010-06-07 | 2012-06-07 | Williamson Gabrielle R | Systems and methods for matching a patient with a mental health care provider |
US20130218596A1 (en) * | 2012-02-16 | 2013-08-22 | dbMotion Ltd. | Method And System For Facilitating User Navigation Through A Computerized Medical Information System |
CN103810218B (zh) * | 2012-11-14 | 2018-06-08 | 北京百度网讯科技有限公司 | 一种基于问题簇的自动问答方法和装置 |
CN103400054A (zh) * | 2013-08-27 | 2013-11-20 | 哈尔滨工业大学 | 计算机辅助心理咨询自动问答机器人系统 |
US10311206B2 (en) * | 2014-06-19 | 2019-06-04 | International Business Machines Corporation | Electronic medical record summary and presentation |
CN104573028B (zh) * | 2015-01-14 | 2019-01-25 | 百度在线网络技术(北京)有限公司 | 实现智能问答的方法和系统 |
US20180005123A1 (en) * | 2016-07-01 | 2018-01-04 | Xerox Corporation | Combining semantic and business process modeling in a multi-layer framework |
US11294942B2 (en) * | 2016-09-29 | 2022-04-05 | Koninklijk Ephilips N.V. | Question generation |
CN107247868B (zh) * | 2017-05-18 | 2020-05-12 | 深思考人工智能机器人科技(北京)有限公司 | 一种人工智能辅助问诊系统 |
US20180365742A1 (en) * | 2017-06-20 | 2018-12-20 | Crystal A. Thomas | System for Data Analysis and Payment Expedition |
CN107516110B (zh) * | 2017-08-22 | 2020-02-18 | 华南理工大学 | 一种基于集成卷积编码的医疗问答语义聚类方法 |
US11430347B2 (en) * | 2017-09-18 | 2022-08-30 | Microsoft Technology Licensing, Llc | Providing diet assistance in a session |
CN108090127B (zh) * | 2017-11-15 | 2021-02-12 | 北京百度网讯科技有限公司 | 建立问答文本评价模型与评价问答文本的方法、装置 |
CN108170684B (zh) * | 2018-01-22 | 2020-06-05 | 京东方科技集团股份有限公司 | 文本相似度计算方法及系统、数据查询系统和计算机产品 |
EP3788632A1 (en) * | 2018-04-30 | 2021-03-10 | Koninklijke Philips N.V. | Visual question answering using on-image annotations |
CN108804532B (zh) * | 2018-05-03 | 2020-06-26 | 腾讯科技(深圳)有限公司 | 一种查询意图的挖掘和查询意图的识别方法、装置 |
CN108733837B (zh) * | 2018-05-28 | 2021-04-27 | 上海依智医疗技术有限公司 | 一种病历文本的自然语言结构化方法及装置 |
CN109117474B (zh) * | 2018-06-25 | 2022-05-03 | 广州多益网络股份有限公司 | 语句相似度的计算方法、装置及存储介质 |
CN109378077B (zh) * | 2018-08-13 | 2022-07-29 | 北京左医科技有限公司 | 诊前病史采集方法及执行该方法的机器可读存储介质 |
CN109684445B (zh) * | 2018-11-13 | 2021-05-28 | 中国科学院自动化研究所 | 口语化医疗问答方法及系统 |
CN109741824B (zh) * | 2018-12-21 | 2023-08-04 | 质直(上海)教育科技有限公司 | 一种基于机器学习的医疗问诊方法 |
CN109817329B (zh) * | 2019-01-21 | 2021-06-29 | 暗物智能科技(广州)有限公司 | 一种医疗问诊对话系统以及应用于该系统的强化学习方法 |
-
2019
- 2019-06-05 CN CN201910484808.4A patent/CN110176315B/zh active Active
-
2020
- 2020-06-03 WO PCT/CN2020/094068 patent/WO2020244534A1/zh active Application Filing
- 2020-06-03 US US17/282,035 patent/US20210375404A1/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9367608B1 (en) * | 2009-01-07 | 2016-06-14 | Guangsheng Zhang | System and methods for searching objects and providing answers to queries using association data |
CN102663129A (zh) * | 2012-04-25 | 2012-09-12 | 中国科学院计算技术研究所 | 医疗领域深度问答方法及医学检索系统 |
CN106897559A (zh) * | 2017-02-24 | 2017-06-27 | 黑龙江特士信息技术有限公司 | 一种面向多数据源的症状体征类实体识别方法及装置 |
CN109637674A (zh) * | 2018-10-30 | 2019-04-16 | 北京健康有益科技有限公司 | 自动获取健康医疗问题答案的方法、系统、介质和设备 |
CN109840275A (zh) * | 2019-01-31 | 2019-06-04 | 北京嘉和美康信息技术有限公司 | 一种医疗搜索语句的处理方法、装置和设备 |
CN110176315A (zh) * | 2019-06-05 | 2019-08-27 | 京东方科技集团股份有限公司 | 医疗问答方法及系统、电子设备、计算机可读介质 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116662374A (zh) * | 2023-07-31 | 2023-08-29 | 天津市扬天环保科技有限公司 | 基于相关性分析的信息技术咨询服务系统 |
CN116662374B (zh) * | 2023-07-31 | 2023-10-20 | 天津市扬天环保科技有限公司 | 基于相关性分析的信息技术咨询服务系统 |
CN117476163A (zh) * | 2023-12-27 | 2024-01-30 | 万里云医疗信息科技(北京)有限公司 | 用于确定疾病结论的方法、装置以及存储介质 |
CN117476163B (zh) * | 2023-12-27 | 2024-03-08 | 万里云医疗信息科技(北京)有限公司 | 用于确定疾病结论的方法、装置以及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
US20210375404A1 (en) | 2021-12-02 |
CN110176315A (zh) | 2019-08-27 |
CN110176315B (zh) | 2022-06-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020244534A1 (zh) | 医疗问答方法、医疗问答系统、电子设备和计算机可读存储介质 | |
Abacha et al. | Recognizing question entailment for medical question answering | |
US10331659B2 (en) | Automatic detection and cleansing of erroneous concepts in an aggregated knowledge base | |
WO2023029506A1 (zh) | 病情分析方法、装置、电子设备及存储介质 | |
US10380251B2 (en) | Mining new negation triggers dynamically based on structured and unstructured knowledge | |
EP3376400A1 (en) | Dynamic context adjustment in language models | |
JP7357614B2 (ja) | 機械支援対話システム、ならびに病状問診装置およびその方法 | |
US20200211709A1 (en) | Method and system to provide medical advice to a user in real time based on medical triage conversation | |
Li et al. | MLEC-QA: A Chinese multi-choice biomedical question answering dataset | |
US20160098456A1 (en) | Implicit Durations Calculation and Similarity Comparison in Question Answering Systems | |
Miftahutdinov et al. | KFU at CLEF eHealth 2017 Task 1: ICD-10 Coding of English Death Certificates with Recurrent Neural Networks. | |
US20190198137A1 (en) | Automatic Summarization of Patient Data Using Medically Relevant Summarization Templates | |
JP2022537759A (ja) | コンピューティング・スパンに対するディープ・ラーニング・アプローチ | |
CN116992002A (zh) | 一种智能护理方案应答方法及系统 | |
Alghoson | Medical document classification based on MeSH | |
Liu et al. | Integrated cTAKES for Concept Mention Detection and Normalization. | |
Zhou et al. | Converting semi-structured clinical medical records into information and knowledge | |
Montenegro et al. | The HoPE model architecture: A novel approach to pregnancy information retrieval based on conversational agents | |
Hu et al. | Label-indicator morpheme growth on LSTM for Chinese healthcare question department classification | |
Roberts et al. | Annotating Question Decomposition on Complex Medical Questions. | |
Park et al. | A deep learning-based depression trend analysis of korean on social media | |
Li et al. | Extrinsic factors affecting the accuracy of biomedical NER | |
Liu | Research on question classification methods in the medical field | |
Yip et al. | Construction of UMLS Metathesaurus with Knowledge-Infused Deep Learning. | |
Li et al. | Medical text entity recognition based on CRF and joint entity |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20818741 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20818741 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20818741 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20818741 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 21/07/2022) |