CN113488152A

CN113488152A - Semantic triage method and system

Info

Publication number: CN113488152A
Application number: CN202110793215.3A
Authority: CN
Inventors: 张后今; 曾培基; 周金龙; 章昊; 肖航; 吴珂仪
Original assignee: Huazhong University of Science and Technology
Current assignee: Huazhong University of Science and Technology
Priority date: 2021-07-14
Filing date: 2021-07-14
Publication date: 2021-10-08
Anticipated expiration: 2041-07-14
Also published as: CN113488152B

Abstract

The invention discloses a semantic triage method and a semantic triage system, wherein the method comprises the following steps: acquiring actual inquiry data; inputting actual inquiry data into a department classification model to obtain a target department; the establishment method of the department classification model comprises the following steps: acquiring historical inquiry data; the historical inquiry data comprises historical disease description and corresponding historical departments; and training the long-term and short-term memory network model according to historical inquiry data to obtain a department classification model. The traditional triage method needs a diagnostician to manually judge the corresponding department according to the disease description, and the manual speed is limited, so that the target department can be automatically obtained by only inputting the disease description by utilizing a department classification model, and the triage efficiency is improved; and the diagnostician can conduct division diagnosis of departments completely by means of experience, and mistakes are easy to occur.

Description

Semantic triage method and system

Technical Field

The invention relates to the technical field of department consultation, in particular to a semantic triage method and a semantic triage system.

Background

In a hospital, it is time consuming and labor intensive to introduce a patient to the correct department, requiring a care giver to analyze the patient's profile and then assign a department to the patient. However, because of the small number of hospital instructors, the patient needs to wait a long time to obtain triage results. Moreover, since the patients are of various kinds, the doctor guide is not always able to give the patient the correct department, depending on the field of expertise.

Disclosure of Invention

The invention aims to provide a semantic triage method and a semantic triage system so as to improve efficiency and accuracy of triage.

In order to achieve the purpose, the invention provides the following scheme:

a method of semantic triage, the method comprising:

acquiring actual inquiry data;

inputting the actual inquiry data into a department classification model to obtain a target department;

the establishment method of the department classification model comprises the following steps:

acquiring historical inquiry data; the historical interrogation data comprises historical disease description and corresponding historical departments;

and training a long-term and short-term memory network model according to the historical inquiry data to obtain the department classification model.

Optionally, the training of the long-term and short-term memory network model according to the historical inquiry data to obtain the department classification model specifically includes:

inputting the historical disease descriptions into the long-short term memory network model to obtain the probability of each department corresponding to each historical disease description;

judging whether to stop training according to the department with the highest probability and the corresponding historical department;

if so, taking the long-term and short-term memory network model under the current training times as the department classification model;

if not, updating the parameters of the long-term and short-term memory network model, and carrying out next training.

Optionally, before the step of inputting the historical disease descriptions into the long-term and short-term memory network model to obtain the probability of each department corresponding to each historical disease description, the method further includes:

preprocessing the historical disease description.

Optionally, the preprocessing the historical disease description specifically includes:

performing data cleaning on the historical disease description;

performing word segmentation processing on the historical disease description after data cleaning to obtain a plurality of words;

and mapping the words to a vector space to obtain a plurality of numerical data.

Optionally, the data cleaning of the historical disease description specifically includes:

performing at least one deletion operation on the historical disease description; the deleting operation comprises character deletion, letter conversion, prototype conversion, space deletion and information deletion;

the character deletion is to delete irrelevant characters, non-English characters, non-Chinese characters, non-numeric characters and non-Chinese and English punctuation coincidence and hyperlinks in the historical disease description; the extraneous characters include: html tags, messy codes, special characters and tags;

the letters are converted into capital letters of English letters in the historical disease description and are converted into lowercase letters;

converting the prototype into a prototype for converting English letters into the English letters;

the space deletion is to delete redundant spaces;

the information deletion is to delete at least one of text information with personal information; the text information includes: name, contact address, and personal address.

Optionally, the determining, according to the department with the highest probability and the corresponding historical department, whether to stop training specifically includes:

for any historical disease description, judging whether the department with the maximum output probability is consistent with the corresponding historical department;

counting the number of historical disease descriptions of departments with the highest probability and corresponding historical departments to obtain the target number;

calculating the accuracy of the long-term and short-term memory network model under the current training times according to the target number and the total number of the historical disease descriptions;

and if the accuracy is greater than a set threshold, stopping training.

A semantic triage system, the system comprising:

the actual inquiry data acquisition module is used for acquiring actual inquiry data;

the target department determining module is used for inputting the actual inquiry data into a department classification model to obtain a target department;

Optionally, the system further includes: a department classification model establishing module;

the department classification model establishing module is used for:

acquiring historical inquiry data; the historical interrogation data comprises historical disease description and corresponding historical departments; training a long-term and short-term memory network model according to the historical inquiry data to obtain the department classification model;

the department classification model establishing module specifically comprises:

the probability determining unit is used for inputting the historical disease descriptions into the long-term and short-term memory network model to obtain the probability of each department corresponding to each historical disease description;

the stopping judgment unit is used for judging whether to stop training according to the department with the highest probability and the corresponding historical department;

a department classification model generation unit, configured to, when the stop determination unit determines that training is stopped, take a long-term and short-term memory network model under the current training frequency as the department classification model;

and the continuous training unit is used for updating the parameters of the long-term and short-term memory network model and carrying out the next training when the stopping judgment unit judges that the training is continued.

Optionally, the department classification model establishing module further includes: and the preprocessing unit is used for preprocessing the historical disease descriptions before the historical disease descriptions are input into the long-short term memory network model and the probabilities of departments corresponding to the historical disease descriptions are obtained.

Optionally, the stop determining unit specifically includes:

the consistency judgment subunit is used for judging whether the department with the highest output probability is consistent with the corresponding historical department or not for any historical disease description;

the statistical subunit is used for counting the number of historical disease descriptions of departments with the maximum probability and corresponding historical departments to obtain the target number;

the accuracy calculation subunit is used for calculating the accuracy of the long-term and short-term memory network model under the current training times according to the target number and the total number of the historical disease descriptions;

and the training stopping subunit is used for stopping training when the accuracy is greater than a set threshold.

According to the specific embodiment provided by the invention, the invention discloses the following technical effects:

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without inventive exercise.

FIG. 1 is a flow chart of a semantic triage method provided by an embodiment of the present invention;

FIG. 2 is a flow chart of a modified LSTM model according to an embodiment of the present invention;

fig. 3 is a structural diagram of a semantic triage system according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The invention aims to provide a semantic triage method and a semantic triage system, aims to improve efficiency and accuracy of triage, and can be applied to the technical field of department diagnosis guidance.

In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.

Fig. 1 is a flow chart of a semantic triage method provided by the embodiment of the invention. As shown in fig. 1, the semantic triage method in this embodiment includes:

step 101: actual interrogation data is acquired.

Specifically, historical inquiry data of 12 departments are collected from various large-open-source websites (such as github and the like), the total historical inquiry data is 12845, and in practical application, more historical inquiry data can be collected according to actual needs.

Step 102: and inputting the actual inquiry data into a department classification model to obtain a target department.

acquiring historical inquiry data; the historical interrogation data includes historical disease descriptions and corresponding historical departments.

And training the long-term and short-term memory network model according to historical inquiry data to obtain a department classification model.

Specifically, the long-term and short-term memory network model is a modified long-term and short-term memory network model, and researches show that higher features can be extracted by a deeper network. However, a pure stacked LSTM (long short term memory network model) is likely to result in an overfitting, and thus the trained network, although performing well on the training set, will have a much reduced effect on the new data set. Here, a HighwayNetwork (high speed network) structure is used to reform the LSTM model. The HighwayNetwork establishes two nonlinear gates, one of which is a transform Gate and a Carry Gate. The transfer gate can control the proportion of the currently flowing information that is transferred, and the carrying gate can control the proportion of the currently flowing information that is carried. The following is a detailed procedure for reconstructing LSTM using HighWay.

The conventional LSTM is simplified to an input-output function model, as shown below,

y＝H(x,W_H)。

where x is an output matrix that maps the processed text to a vector space using a Robustly optimized BERT pretraining method (RoBERTa), which is also an input matrix of the LSTM network, rows of the matrix are lengths of text, and columns of the matrix are lengths of word vectors. W_HAre parameters of the LSTM network. y is the output of the LSTM network, and is the information for further extracting the semantic information of the words and sufficiently learning the disease description of the patient.

Now, two further non-linear transformations are defined:

α＝T(x,W_T) And β ═ C (x, W)_C)。

Where T and C are non-linear transformation functions used to calculate the proportion of the current input that is diverted and carrying information. The nonlinear transformations T and C here can take many forms, for example: sigmoid activation functions, etc. Alpha and beta are both values between 0 and 1, alpha represents the proportion of the currently input transferred information, and beta represents the proportion of the currently input carried information. W_TIs a parameter in the transfer gate that calculates the transferred proportion of the LSTM network input x, W_CIs a parameter in the carry gate that calculates the ratio of carried LSTM network input x. The modified LSTM structure is:

y'＝α*y+β*x。

where y' is the output of the modified LSTM model. To simplify the calculation, let β be 1- α. A flow chart of the reconstructed LSTM model is shown in fig. 3.

The network structure can effectively control information flow among different LSTM networks, and the risk of overfitting is greatly reduced. This idea is applied today in many ways to prevent overfitting. The network modified in this way can not only obtain more advanced features but also effectively prevent the occurrence of overfitting when the number of layers is increased. The modified LSTM network may be used as a generic model.

Step 104: and inputting the actual inquiry data into a department classification model to obtain a target department.

As an optional implementation manner, step 103 specifically includes:

and inputting the historical disease descriptions into the long-term and short-term memory network model to obtain the probability of each department corresponding to each historical disease description.

And judging whether to stop training according to the department with the highest probability and the corresponding historical department.

If so, taking the long-term and short-term memory network model under the current training times as a department classification model.

If not, updating the parameters of the long-term and short-term memory network model and carrying out the next training.

As an optional implementation, before inputting the historical disease descriptions into the long-short term memory network model, obtaining the probability of each department corresponding to each historical disease description, the method further includes:

the historical disease description is preprocessed.

As an alternative embodiment, the historical disease description is preprocessed, which specifically includes:

and (5) performing data cleaning on the historical disease description.

And performing word segmentation processing on the historical disease description after data cleaning to obtain a plurality of words.

Specifically, the method comprises the steps of utilizing a jieba (Chinese word segmentation tool) to segment the text of the disease description after data cleaning, and segmenting the text into word sets with actual meanings. For example, for a description of the condition: to ask about the question about blood sugar, is diabetic? How well you should diabetes be treated? ", will be divided into: [ "ask questions", "blood sugar", "question", ",", "blood sugar", "too high", "is", "suffer from", "diabetes", "woolen", "is? "," you good "," diabetes "," should "," what "," treatment "," woolen ","? "].

Specifically, since the neural network model cannot directly process text-type character string data, it can only process numerical-type data. It is necessary to convert a text type character string into a numeric type.

A pre-training model constructed by RoBERTA is selected to map a plurality of words to a vector space, so that the question text data is converted into numerical data which can be used for neural network training. Because the pre-training model is trained in a chinese corpus, the obtained numeric-type vectors can have rich chinese semantic information, including word similarity and text similarity information, i.e., the vector spaces generated by similar words or texts are similar, for example: the vector space of headache and headache has a high similarity. Can be better applied to the task.

As an optional implementation, the data washing is performed on the historical disease description, which specifically includes:

performing at least one deletion operation on the historical disease description; the deleting operation comprises character deletion, letter conversion, prototype conversion, space deletion and information deletion.

The character deletion is to delete irrelevant characters, non-English characters, non-Chinese characters, non-numeric characters and punctuation coincidence and hyperlinks of non-Chinese and English in historical disease description; the extraneous characters include: html tags, messy codes, special characters, and tags.

The letters are converted from capital letters to lowercase letters of the English letters in the historical disease description.

The prototype is converted into a prototype in which english letters are converted into english letters.

Space deletion is the deletion of excess spaces.

Specifically, the directly collected data may include many noises, such as hypertext Markup Language (html) tags, messy codes, special expressions, and the like, and because of the sensitivity of the inquiry data, before the model training, the data needs to be subjected to data cleaning processing, which specifically includes:

1) removing irrelevant characters by using a computer programming technology, such as: html tags, messy codes, special characters and tags, and the like.

2) English characters are processed using a Natural Language Toolkit (NLTK). In english processing, converting capital letters into lowercase letters, and converting an english word into a stem of the word itself by extracting the stem, for example: like, turning to like, this can prevent the same semantic word from having different forms, lead to the sparseness of the word. The vocabulary correction is also needed, and when English is input, a certain letter in a word is likely to be wrong, so that the word cannot be recognized in the vectorization of the word. When judging that an English word does not exist, finding out words with the same prefix in WordNet of NLTK, and finding out candidate words for replacement by using the minimum editing distance (adding, deleting or replacing characters of the current word, and changing into target characters for the minimum times). For example: an applet is an error word, and candidate words include apps, applets, applications, and the like. However, since the edit distance from an applet to an applet is minimum, the semantic meaning is restored by replacing the applet. If the situation with the same edit distance occurs, further confirmation is required according to surrounding semantics or the probability of a daily occurrence of a word.

3) And processing the Chinese words. Punctuation marks, special characters, hyperlinks, messy codes and the like of non-English, non-Chinese, non-numeric and non-Chinese are deleted, and the semantic meaning of the text is prevented from being influenced. And deleting redundant spaces in the Chinese word sequence.

4) To ensure that the training data does not relate to privacy, text information with personal identity is deleted, such as: name, contact phone, personal address, etc.

As an optional implementation manner, judging whether to stop training according to the department with the highest probability and the corresponding historical department specifically includes:

and judging whether the department with the highest output probability is consistent with the corresponding historical department or not for any historical disease description.

And counting the number of historical disease descriptions of departments with the highest probability and corresponding historical departments to obtain the target number.

And calculating the accuracy of the long-term and short-term memory network model under the current training times according to the target number and the total number of the historical disease descriptions.

And if the accuracy is greater than the set threshold, stopping training.

In actual use, it is desirable to construct an application that can be embedded into a portal site of any hospital. Therefore, a simple and easy-to-use web application is designed, and the prediction service can be provided.

In the Web front end, a simple webpage is designed by using HTML, CSS, JavaScript and bootstrap, and comprises a description page, a symptom triage page and a semantic triage page of a department. In the description page of the departments, each hospital can modify the description of the departments according to the condition of the hospital. In the symptom triage page, a symptom function according to the disease incidence part and the selected disease incidence is provided, and the user can obtain the feedback of the department prediction only by selecting the condition of the user. In the semantic triage, any information of the patient is not required to be provided, and feedback of department prediction can be obtained only by inputting description of the illness state, and privacy of the user is not required to be involved.

Django (Python's Web framework) is used to design the Web backend. The Web back end processes the input of the user, converts the text character string of the disease description into a numerical value, inputs the numerical value into the model, converts the numerical value into distribution of departments, and finally returns the departments with the first five probabilities to the webpage, so that the user can clearly and directly go to which department to achieve the effect of triage.

Fig. 3 is a structural diagram of a semantic triage system according to an embodiment of the present invention. As shown in fig. 3, the semantic triage system in this embodiment includes:

and an actual inquiry data acquiring module 201, configured to acquire actual inquiry data.

And the target department determining module 202 is used for inputting the actual inquiry data into the department classification model to obtain the target department.

As an optional implementation, the system further comprises: and a department classification model building module.

The department classification model building module is used for:

acquiring historical inquiry data; the historical inquiry data comprises historical disease description and corresponding historical departments; and training the long-term and short-term memory network model according to historical inquiry data to obtain a department classification model.

The department classification model establishing module specifically comprises:

and the probability determining unit is used for inputting the historical disease description into the long-term and short-term memory network model to obtain the probability of each department corresponding to each historical disease description.

And the stopping judgment unit is used for judging whether to stop training according to the department with the highest probability and the corresponding historical department.

And the department classification model generating unit is used for taking the long-term and short-term memory network model under the current training times as the department classification model when the stopping judging unit judges that the training is stopped.

As an optional implementation manner, the department classification model building module 203 further includes: and the preprocessing unit is used for preprocessing the historical disease description before inputting the historical disease description into the long-short term memory network model and obtaining the probability of each department corresponding to each historical disease description.

As an optional implementation, the method specifically includes:

and the consistency judgment subunit is used for judging whether the department with the maximum output probability is consistent with the corresponding historical department or not for any historical disease description.

And the counting subunit is used for counting the number of historical disease descriptions of departments with the maximum probability and corresponding historical departments to obtain the target number.

And the accuracy calculation subunit is used for calculating the accuracy of the long-term and short-term memory network model under the current training times according to the target number and the total number of the historical disease descriptions.

The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. For the system disclosed by the embodiment, the description is relatively simple because the system corresponds to the method disclosed by the embodiment, and the relevant points can be referred to the method part for description.

The principles and embodiments of the present invention have been described herein using specific examples, which are presented solely to aid in the understanding of the apparatus and its core concepts; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed. In view of the above, the present disclosure should not be construed as limiting the invention.

Claims

1. A method of semantic triage, the method comprising:

acquiring actual inquiry data;

2. The semantic triage method according to claim 1, wherein the training of the long-term and short-term memory network model according to the historical inquiry data to obtain the department classification model specifically comprises:

3. The semantic triage method according to claim 2, further comprising, before the inputting the historical disease descriptions into the long-short term memory network model to obtain the probability of each department corresponding to each historical disease description:

preprocessing the historical disease description.

4. The semantic triage method according to claim 3, wherein the preprocessing the historical disease description specifically comprises:

performing data cleaning on the historical disease description;

5. The semantic triage method according to claim 4, wherein the data cleansing of the historical disease description specifically comprises:

the space deletion is to delete redundant spaces;

the information deletion is text information deletion; the text information includes: name, contact address, and personal address.

6. The semantic triage method according to claim 2, wherein the step of judging whether to stop training according to the department with the highest probability and the corresponding historical department specifically comprises:

and if the accuracy is greater than a set threshold, stopping training.

7. A semantic triage system, the system comprising:

8. The semantic triage system according to claim 7, further comprising: a department classification model establishing module;

the department classification model establishing module is used for:

the department classification model establishing module specifically comprises:

9. The semantic triage system of claim 8, wherein the department classification model building module further comprises: and the preprocessing unit is used for preprocessing the historical disease descriptions before the historical disease descriptions are input into the long-short term memory network model and the probabilities of departments corresponding to the historical disease descriptions are obtained.

10. The semantic triage system according to claim 8, wherein the stopping judgment unit specifically includes: