CN112216379A

CN112216379A - Disease diagnosis system based on intelligent joint learning

Info

Publication number: CN112216379A
Application number: CN201910630056.8A
Authority: CN
Inventors: 刘璐
Original assignee: Individual
Current assignee: Individual
Priority date: 2019-07-12
Filing date: 2019-07-12
Publication date: 2021-01-12

Abstract

The invention relates to a disease diagnosis system based on intelligent joint learning. The system comprises: the system comprises a context feature embedded representation layer, a medical image embedded representation layer, a medical physical examination experiment data embedded representation layer, a medical knowledge coding layer, a neural network hiding layer and a joint learning output layer. The system adopts a combined learning model of a traditional linear model and a deep learning model for effective disease diagnosis, can realize automatic extraction of high-level information of medical images, electronic medical records and physical examination experiment results by utilizing deep learning, avoids complex characteristic engineering, and can also integrate the knowledge and doctor experience of medical diagnostics by utilizing rules. The invention can comprehensively cover various modal information of patients, and effectively combine the field knowledge of medical diagnostics while keeping the strong generalization ability of the deep learning algorithm, thereby enhancing the interpretability and the accuracy of the whole disease diagnosis algorithm model.

Description

Disease diagnosis system based on intelligent joint learning

Technical Field

The invention belongs to the field of artificial intelligence, and relates to a disease diagnosis system based on intelligent joint learning, which is realized by effectively fusing medical knowledge and a machine learning algorithm by utilizing the joint learning of a traditional linear model and a deep learning model.

Background

Machine learning is the fundamental way for computers to have intelligence, and is the core of artificial intelligence. The essence of machine learning is the use of algorithms to mine from large amounts of data the rules underlying them and to use them for prediction or classification. The combination of emerging artificial intelligence technology and traditional medical treatment is remodeling diagnosis and treatment modes and industrial modes in the medical health field. With the increasing accumulation of medical data and the continuous breakthrough of artificial intelligence technology, machine learning algorithms represented by deep learning are increasingly applied in the medical field in recent years, and the applications mainly focus on disease screening and prediction based on the deep learning.

The existing deep learning-based disease screening and prediction applications can be mainly divided into the following three major categories. (1) Image analysis class based on deep learning. (2) And analyzing the class based on the deep learning sign data. (3) And E-medical record analysis class based on deep learning.

These methods suffer from the following disadvantages: the adopted deep learning algorithm only depends on data of a single source mode, and the training result of the model has one-sidedness. Due to the design limitation of the deep learning algorithm on the neural network structure, medical diagnosis knowledge accumulated and verified through long-term medical practice cannot be used in the neural network. The deep learning algorithm lacking medical knowledge guidance is completely limited by training data of the algorithm, and the result of algorithm analysis is not interpretable and cannot be effectively popularized.

A technical solution is needed to solve the above problems, which can process heterogeneous data of multiple modalities and enrich the expression dimension of patient information. Meanwhile, the domain knowledge of medical diagnostics can be effectively combined, the strong generalization capability of the deep learning algorithm is kept, and the interpretability and the accuracy of the whole disease diagnosis algorithm model are enhanced.

Disclosure of Invention

Aiming at the problems, the invention aims to design and realize a disease diagnosis system based on intelligent combined learning, adopts a combined learning model of a traditional linear model and a deep learning model for effective disease diagnosis, can realize automatic extraction of high-level information of medical images, electronic medical records and physical examination experiment results by utilizing deep learning, avoids complex feature engineering, and can also integrate the knowledge and doctor experience of medical diagnostics by utilizing rules. The combined learning model can comprehensively cover various modal information of a patient, and can effectively combine the field knowledge of medical diagnostics while keeping the strong generalization capability of a deep learning algorithm, thereby enhancing the interpretability and the accuracy of the whole disease diagnosis algorithm model.

In order to achieve the purpose, the invention adopts the following technical scheme:

a disease diagnostic system based on intelligent joint learning, comprising (one module for each of the following layers):

the context characteristic embedding layer is used for automatically extracting context characteristics in the electronic medical record of the text type and coding semantic information of the context;

the medical image embedding layer is used for automatically extracting the characteristics of the medical image of the patient;

the medical physical examination experiment data embedding presentation layer is used for automatically extracting the characteristics of result information generated by a patient physical examination experiment;

the medical knowledge coding layer is connected with the context feature embedding representation layer, the medical image embedding representation layer and the medical physical examination experiment data embedding representation layer and is used for coding the multi-mode feature data of the patient in a linear feature vector form according to medical diagnosis knowledge by adopting a rule-based extraction method;

the neural network hiding layer is connected with the context characteristic embedding representation layer, the medical image embedding representation layer and the medical physical examination experiment data embedding representation layer and is used for training the embedded representation of the text type electronic medical record data, the medical image data and the physical examination experiment data according to the type difference of the input patient multi-model data;

and the joint learning output layer is used for realizing the joint learning of the training result of the neural network hidden layer and the linear model of the medical knowledge coding layer and outputting the final disease diagnosis result.

Further, the context feature embedding representation layer adopts a bidirectional long and short term memory network (BLSTM) to automatically extract the context features in the electronic medical record of the text class. In addition, the context feature embedding representation layer can also adopt a unidirectional LSTM and a recurrent neural network RNN to replace a bidirectional LSTM, and the same implementation effect can be obtained; to enhance the local learning ability of the neural network, an LSTM model with attention mechanism may also be used.

Further, the medical image embedding representation layer automatically extracts the features of the medical image of the patient by using a Convolutional Neural Network (CNN).

Further, the medical physical examination experiment data embedding representation layer adopts a Convolutional Neural Network (CNN) to automatically extract the characteristics of the result information generated by the patient physical examination experiment.

Further, the encoding in the form of linear feature vectors performed by the medical knowledge encoding layer is one-hot encoding.

Further, the neural network hidden layer comprises a BLSTM network and a CNN network which are respectively used for training the embedded representation of the electronic medical record data, the medical image data and the physical examination experiment data of the text type.

Compared with the prior art, the invention has the following positive effects:

the invention designs and realizes a combined learning model of a traditional linear model and a deep learning model, which is used for effective disease diagnosis. The method utilizes the advantage of deep learning for automatically extracting the features, avoids manual feature engineering which has high time cost and high labor cost and is difficult to expand to a large data set, and realizes real data driving; meanwhile, medical professional knowledge and experience are fully exerted. The combined learning model can comprehensively cover various modal information of a patient, and can effectively combine the field knowledge of medical diagnostics while keeping the strong generalization capability of a deep learning algorithm, thereby enhancing the interpretability and the accuracy of the whole disease diagnosis algorithm model. The invention has the advantages of low overhead, high expression and multiple applications.

Drawings

Fig. 1 is a block diagram of a disease diagnosis system based on intelligent joint learning.

Fig. 2 is a workflow diagram of an intelligent joint learning based disease diagnosis system.

Detailed Description

The present invention will be described in detail below with reference to specific embodiments and accompanying drawings.

Fig. 1 is a block diagram of a disease diagnosis system based on intelligent joint learning according to an embodiment of the present invention. The functions and implementations of the modules shown in fig. 1 are described as follows:

(1) the context feature embedding presentation layer facing the bidirectional long and short term memory network (BLSTM) automatically extracts the context features in the electronic medical record of text type and encodes the semantic information of the context.

The electronic medical record is used as a text representation form, and input coding from text to a neural network training model is realized by embedding a representation layer. The invention adopts the prior Skip-Gram model to obtain word vectors based on a large amount of unmarked Chinese corpus training as the embedded expression of the electronic medical record. Meanwhile, labels need to be prepared for the electronic medical record in order to train various parameters of the algorithm. The label preparation has two means, and for the electronic medical record with the diagnosis conclusion, the diagnosis conclusion is directly extracted from the text to be used as the label of the current electronic medical record. For an electronic medical record containing only patient complaints, a professional doctor needs to be asked in advance to fill out the label. The training of the general case algorithm selects the former electronic medical record data.

(2) Embedding the medical image into a presentation layer facing a Convolutional Neural Network (CNN), and automatically extracting the characteristics of the medical image of the patient.

Since the medical image data is two-dimensional image data, which includes information of focus, background, occlusion, illumination, etc., the data needs to be preprocessed. Firstly, an automatic extraction method in the radiology is adopted, and each medical image is labeled by a pathological label. The image also needs to be normalized for mean and standard deviation before being input to the convolutional neural network. Meanwhile, in order to meet the requirement of convolutional neural network training on a large amount of data, a training data set is enhanced through changes such as rotation, translation and deformation.

(3) The Convolution Neural Network (CNN) -oriented medical physical examination experiment data is embedded into a presentation layer, and the characteristics of result information generated by a patient physical examination experiment are automatically extracted.

The medical physical examination experiment data embedding presentation layer is mainly used for processing laboratory detection data of various numerical types and simultaneously comprises sequence data received by sensor equipment such as electrocardiogram monitoring equipment. The data is represented in the form of a matrix as input to a convolutional neural network.

(4) And the medical knowledge coding layer is used for performing one-hot coding in a linear characteristic vector form on the multi-modal data of the patient according to the medical diagnostics knowledge by adopting a rule-based extraction method.

According to medical diagnostics knowledge, multi-modal feature data of a patient is defined in the form of a set of linear feature vectors, a set of features X ═<x_1,x₂,……,x_n>And the number of the features in the feature set is n. The feature set comprises both the original medical features and the cross-transformation features. And carrying out one-hot coding on the value of the original medical characteristic. For example, the patient is eligible for x_iInformation on the item characteristics, then x_iIs 1, otherwise is 0. In order to add description on nonlinear characteristics on the basis of the original linear model, cross conversion characteristics are added. The cross-over feature is defined as follows:

wherein c is_kiIs a Boolean variable, x_iAnd d represents the number of the features, if the ith original medical feature is one part of the kth cross-over feature, the value is 1, and if not, the value is 0. For example, the value of the cross-over feature for two features (genetic feature dominant and weight feature obese) is 1, and if and only if the constituent features (genetic feature dominant and weight feature obese) are both 1, otherwise it is 0.

(5) The neural network hiding layer comprises an LSTM network and a CNN network according to the type difference of input patient multi-model data, and is used for training the embedded representation of text electronic medical record data, medical image data and physical examination experiment data respectively.

The recurrent neural network preserves the state of the context by periodically connecting hidden layer nodes. The cyclic structure part of this patent adopts two-way long-and-short-term memory network (BLSTM), and BLSTM has powerful ability to long-range dependence modeling, and it has inherited the structural feature of most recurrent neural network model to the gradient disappearance problem that probably appears in the recurrent neural network has been avoided. BLSTM includes both forward and backward LSTM structures. Wherein the forward LSTM and backward LSTM are used to capture past and future different feature information, respectively. The BLSTM is more suitable for processing longer texts, can extract more characteristic information, and is very suitable for analyzing long text data such as electronic medical records.

And performing algorithm training on the medical image training data set which is subjected to class labeling and preprocessing by adopting a convolutional neural network. Because the convolutional neural network needs a huge amount of training data, the medical image training data set often cannot meet the requirements of CNN parameter training. According to the method, a pre-training mode is adopted, firstly, large-scale image data of ImageNet is utilized to pre-train model parameters of the CNN, and the obtained parameters of the CNN are used as initial values of the model. And then, taking the medical image training data as the input of the CNN model, and adjusting and optimizing the parameters of the model.

(6) And the joint learning output layer realizes joint learning of the neural network hidden layer training result and the linear model of the medical knowledge coding layer and inputs a final disease diagnosis result.

A set of linear feature vectors obtained via the medical knowledge coding layer, by means of a linear model y ═ w^TX + b constitutes the linear part of the joint learning model, where X is the feature vector, w is the weight vector, and b is the offset. And the neural network model constitutes the deep part of the joint learning model. The outputs of the linear part model and the depth part model are combined through a weighted sum form, and the joint learning training of the models is completed through a logistic regression mode. The logistic regression model is defined as follows:

wherein Y represents a label of the classification and x is texThe eigenvector, σ (·), represents the sigmod function, and θ (x) represents those cross-conversion features. w is a₁Is a weight vector of the linear model, w_deepIs the final activation function value

The vector of (2).

The training of the combined model adopts a back propagation method to simultaneously train a linear part and a depth part and adopts small-batch random optimization.

Fig. 2 is a flowchart of the work flow of the disease diagnosis system based on intelligent joint learning according to the embodiment of the present invention. The steps are specifically described as follows:

step 1.1 prepare the data.

Preparing marked data, segmenting the marked data into a training data set, developing the data set and a testing data set, wherein the training data set and the developing data set are used in a training stage, and the testing data set is used in a testing stage.

Step 1.2 batch sample input

Training of the training data set is input into the system in batches according to a small batch principle and the set size of the batch size.

Step 2.1 electronic medical record text participle

And segmenting each sentence according to the sentence unit by using the text in the electronic medical record.

Step 2.2 word vector embedding representation

For each character of each input sample, word embedding (English is a word, Chinese is a character level) is carried out, and the word is converted into a 300-dimensional vector according to a character index table and a linear layer.

Step 2.3 bidirectional LSTM network

The input sample sequence represented by the word vector is sent to a bidirectional LSTM network to extract deep feature information of the context.

Step 3.1 medical knowledge feature extraction

And constructing a one-dimensional original feature set by using medical diagnostics knowledge, and extracting one-hot codes of original features from multi-modal data of the patient.

Step 3.2 Cross-conversion feature Generation

And calculating the code of the cross conversion characteristics from the original characteristic code according to the definition of the cross conversion characteristics.

Step 4.1 image data normalization

And normalizing the mean value and the standard deviation of the image.

Step 4.2 dataset enhancement

The training data set is enhanced by changes in rotation, translation, and deformation.

Step 4.3CNN model pretraining

And pre-training model parameters of the CNN by using the large-scale image data of ImageNet, and taking the obtained parameters of the CNN as initial values of the model.

Step 5.1 numerical data serialization

And expressing the numerical data in a matrix form as an input of the convolutional neural network.

Step 5.2CNN model

And inputting the numerical data matrix into the CNN network, and training the hidden layer characteristics of the network.

Step 6 of connection

And splicing the calculation results of the linear model and the depth model to establish a target function.

Step 7 calculating a cost function

In the training process, the objective function is to maximize the log-likelihood of the correct label sequence of the training set.

The cost function is the negative of the objective function.

Step 8 back propagation algorithm

And training the model by using a back propagation gradient descent algorithm, and adaptively adjusting the learning rate according to the training speed. If the effect of the model on the test set is reduced, the overfitting is indicated, the training is stopped immediately, and otherwise, the training is continued.

The context feature embedding expression layer can also adopt a unidirectional LSTM and a recurrent neural network RNN to replace a bidirectional LSTM, and the same realization effect can be obtained. To enhance the local learning ability of the neural network, an LSTM model with attention mechanism may also be used.

Another embodiment of the present invention provides a computer/server, which includes the above disease diagnosis system based on intelligent joint learning, that is, the above disease diagnosis system based on intelligent joint learning is deployed as a software product on the computer/server.

The above embodiments are only intended to illustrate the technical solution of the present invention and not to limit the same, and a person skilled in the art can modify the technical solution of the present invention or substitute the same without departing from the principle and scope of the present invention, and the scope of the present invention should be determined by the claims.

Claims

1. An intelligent joint learning system for disease diagnosis, comprising:

the neural network hiding layer is connected with the context characteristic embedding representation layer, the medical image embedding representation layer and the medical physical examination experiment data embedding representation layer and is used for training the embedding representation of the electronic medical record data, the medical image data and the physical examination experiment data of the text type according to the type difference of the input multi-model data of the patient;

and the joint learning output layer is used for performing joint learning on the training result of the neural network hidden layer and the linear model of the medical knowledge coding layer and outputting a final disease diagnosis result.

2. The system of claim 1, wherein the contextual feature embedding representation layer automatically extracts contextual features in the electronic medical record of the textual class using one of the following neural network models: a bidirectional long and short term memory network BLSTM, a unidirectional long and short term memory network LSTM, a recurrent neural network RNN, and an LSTM with attention mechanism.

3. The system of claim 1, wherein the medical image embedded representation layer automatically extracts features of the patient medical image using a convolutional neural network.

4. The system of claim 3, wherein the medical image embedded representation layer firstly labels each medical image with a pathology label using an automatic extraction method in radiology; before the image is input into the convolutional neural network, the image is normalized by mean value and standard deviation, and the training data set is enhanced by rotation, translation, deformation and other changes, so as to meet the requirement of convolutional neural network training on a large amount of data.

5. The system of claim 1, wherein the medical physical examination experiment data embedding representation layer automatically extracts features of result information generated by the patient physical examination experiment using a convolutional neural network.

6. The system according to claim 1, wherein the encoding in the form of linear feature vectors by the medical knowledge encoding layer is a one-hot encoding.

7. The system according to claim 6, wherein the medical knowledge encoding layer defines the multi-modal feature data of the patient as a set of linear feature vectors according to the medical diagnostics knowledge, the feature set including both the original medical features and the cross-transformation features; carrying out one-hot coding on the value of the original medical characteristic; the cross-over feature is defined as follows:

wherein c is_kiIs a Boolean variable, x_iAnd d represents the number of the features, if the ith original medical feature is one part of the kth cross-over feature, the value is 1, and if not, the value is 0.

8. The system of claim 1, wherein the neural network hidden layer comprises a BLSTM network and a CNN network for training embedded representations of electronic medical record data, medical image data, and physical examination data of text type, respectively; the neural network hidden layer adopts a pre-training mode, firstly pre-trains model parameters of the CNN by using the existing large-scale image data, the obtained CNN parameters are used as initial values of the model, then medical image training data are used as input of the CNN model, and the parameters of the model are adjusted and optimized.

9. The system of claim 1, wherein the joint learning model of the joint learning output layer comprises a linear portion and a depth portion; a set of linear feature vectors obtained via a medical knowledge coding layer, by a linear model y ═ w^TX + b constitutes the linear part, where X is a feature vector, w is a weight vector, and b is an offset; a neural network model constitutes the depth component; the outputs of the linear part and the depth part are combined in a weighted sum mode, and the joint learning training of the model is completed through a logistic regression model; the training adopts a back propagation method to train a linear part and a depth part simultaneously and adopts a small batchAnd (4) random optimization.

10. The system of claim 9, wherein the logistic regression model is defined as follows:

where Y represents the label of the classification, x is the feature vector, σ (-) represents the sigmod function, θ (x) represents the cross-transformation feature, w₁Is a weight vector of the linear model, w_deepIs the final activation function value

The vector of (2).