CN116913459A

CN116913459A - Medicine recommendation method and system based on deep convolution network control gate model

Info

Publication number: CN116913459A
Application number: CN202311171207.0A
Authority: CN
Inventors: 刘硕; 白焜太; 宋佳祥; 杨雅婷; 许娟; 史文钊
Original assignee: Digital Health China Technologies Co Ltd
Current assignee: Digital Health China Technologies Co Ltd
Priority date: 2023-09-12
Filing date: 2023-09-12
Publication date: 2023-10-20
Anticipated expiration: 2043-09-12
Also published as: CN116913459B

Abstract

The application provides a medication recommendation method and a system based on a deep convolutional network control gate model, wherein the method comprises the following steps: s1: acquiring the existing diagnosis and treatment examination information of the patient and corresponding final medication data; s2: according to the corresponding relation between the patient diagnosis and treatment examination information and the final medication data, converting the digital ID mapping of each piece of patient diagnosis and treatment examination information and the final medication data; s3: taking the diagnosis and treatment information of the patient converted into the digital ID as input, taking final medication data as a label, and inputting a control gate model for training; s4: the actual patient diagnosis and treatment information is input into a control gate model, and the recommended medication probability is output. According to the application, the existing user diagnosis and inspection data are learned through the deep convolution network control gate model, after training is finished, the trained model can be used for recommending the medication of the patient based on new diagnosis and inspection indexes of the patient, and finally the optimal medication of the patient is recommended.

Description

Medicine recommendation method and system based on deep convolution network control gate model

Technical Field

The application relates to the technical field of medical medication recommendation, in particular to a medication recommendation method and system based on a deep convolutional network control gate model.

Background

With the development of information technology and the wide application in the medical field, a great amount of patient diagnosis and treatment information and medication data are accumulated in the patient diagnosis and treatment process. Based on the diagnosis and treatment information of the patient and the medication data, intelligent medication recommendation of the patient is realized, and the workload of medical staff can be reduced.

However, in the conventional medication recommendation method, the medication recommendation is generally performed by classifying patients according to the categories, however, in actual clinic, the patient diagnosis and treatment information often has specificity, and if the medication recommendation is performed according to the patient diagnosis and treatment information, the patient diagnosis and treatment information needs to be explicitly classified in advance.

Disclosure of Invention

According to the medication recommendation method and system based on the deep convolutional network control gate model, the characteristics of the existing user diagnosis and inspection data are learned through the deep convolutional network control gate model, after training is finished, the trained model can be used for recommending the medication of the patient based on new patient diagnosis and inspection indexes, the optimal medication of the patient is finally recommended, automation of patient medication recommendation to a certain extent is achieved, manual workload is reduced to a certain extent, and therefore the technical problem in the process can be solved.

The technical scheme for solving the technical problems is as follows:

in a first aspect, the present application provides a medication recommendation method based on a deep convolutional network control gate model, comprising the steps of:

s1: acquiring the existing diagnosis and treatment examination information of the patient and corresponding final medication data;

s2: according to the corresponding relation between the patient diagnosis and treatment examination information and the final medication data, converting the digital ID mapping of each piece of patient diagnosis and treatment examination information and the final medication data;

s3: taking the diagnosis and treatment information of the patient converted into the digital ID as input, taking final medication data as a label, and inputting a StackGatedCNN control gate model for training;

s4: the actual patient diagnosis and treatment information is input into a StarkGatedCNN control gate model, and the recommended medication probability is output.

In some embodiments, the S3 comprises:

s31: inputting the patient diagnosis and treatment information converted into the digital ID into an enabling layer, and outputting sentence vector representation x of the patient diagnosis and treatment information;

s32: inputting sentence vector representation x into a control gate model layer, and obtaining an intra-sentence character vector set through calculation;

s33: and carrying out average pooling, tanh function activation and linear layer calculation on the intra-sentence character vector set to obtain a final medication probability value.

In some embodiments, the S31:

s311: matrix multiplication is carried out on the patient diagnosis and treatment information converted into the digital ID and the weighting matrix of the ebedding, so that the ebedding matrix representation of the input data is obtained;

s312: and outputting sentence vector representation x of the diagnosis and treatment information of the patient through an ebedding layer.

In some embodiments, the S31:

In some embodiments, the specific calculation process of S33 is:

wherein ,representing linear layer weights, ++>The dimensions are the same and are d×d, d represents the last vector dimension of the output of the emmbedding layer; />Representing a current sentence vector representation; h represents an intra-sentence character vector set; />Representing the bias weight; j represents the total number of characters in the sentence currently calculated, i represents the ith character in the sentence, and t represents the time of accumulation calculation from the current t-th character; />The weight matrix representing the final linear layer, the dimension is L multiplied by d, L is the number of medicines, and p is the probability value of the final output target medicine; />Representing the result of the calculation through the first linear layer; />Representing the result of the calculation through the second linear layer.

In a second aspect, the present application provides a medication recommendation system based on a deep convolutional network control gate model, comprising:

the data acquisition module is used for acquiring the existing diagnosis and treatment examination information of the patient and corresponding final medication data;

the data mapping conversion module is used for carrying out digital ID mapping conversion on each piece of patient diagnosis and treatment examination information and the final medication data according to the corresponding relation between the patient diagnosis and treatment examination information and the final medication data;

the model training module is used for inputting the diagnosis and treatment information of the patient converted into the digital ID, using the final medication data as a label, and inputting a StackGatedCNN control gate model for training;

the medication prediction module is used for inputting the actual patient diagnosis and treatment information into the StarkGatedCNN control gate model and outputting the recommended medication probability.

In some embodiments, the model training module comprises:

the vector conversion sub-module is used for inputting the patient diagnosis and treatment information converted into the digital ID into the ebedding layer and outputting sentence vector representation x of the patient diagnosis and treatment information;

the control gate submodule is used for inputting the sentence vector representation x into the control gate model layer and obtaining an intra-sentence character vector set through calculation;

and the probability calculation sub-module is used for carrying out average pooling, tanh function activation and linear layer calculation on the intra-sentence character vector set to obtain a final medication probability value.

In some embodiments, the vector conversion submodule includes:

the matrix multiplication unit is used for carrying out matrix multiplication on the patient diagnosis and treatment information converted into the digital ID and the ebedding weight matrix to obtain an ebedding matrix representation of the input data;

and the data output unit is used for outputting sentence vector representation x of the diagnosis and treatment information of the patient through the ebedding layer.

In some embodiments, the control gate submodule includes:

the feature acquisition unit is used for dividing windows of sentence vector representation x by utilizing sliding of the filter, convolving each divided window and activating by utilizing a tanh function to obtain a feature set of sentence vector representation x corresponding to the current filter;

the feature extraction unit is used for extracting the feature with the highest value in the feature set corresponding to each filter in the Maxpooling layer;

and the intra-sentence character vector set output unit is used for performing sigmoid function activation and fusion residual connection on the feature set with the maximum value, and outputting the intra-sentence character vector set through LayerNorm layer conversion.

In some embodiments, the specific calculation process of the probability calculation sub-module is:

The beneficial effects of the application are as follows:

according to the medication recommendation method and system based on the deep convolutional network control gate model, the existing user diagnosis and inspection data are learned through the deep convolutional network model, after training is finished, the trained model can be used for recommending the medication of the patient based on new patient diagnosis and inspection indexes, the optimal medication of the patient is finally recommended, automation of patient medication recommendation to a certain extent is achieved, and manual workload is reduced to a certain extent.

Drawings

FIG. 1 is a flow chart of a medication recommendation method based on a deep convolutional network control gate model of the present application;

FIG. 2 is a sub-flowchart of step S3 of the present application;

FIG. 3 is a sub-flowchart of step S31 of the present application;

fig. 4 is a sub-flowchart of step S32 of the present application.

Detailed Description

The principles and features of the present application are described below with reference to the drawings, the examples are illustrated for the purpose of illustrating the application and are not to be construed as limiting the scope of the application.

In order that the above-recited objects, features and advantages of the present application can be more clearly understood, a more particular description of the application will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings. It is to be understood that the depicted embodiments are some, but not all, embodiments of the present application. The specific embodiments described herein are to be considered in an illustrative rather than a restrictive sense. All other embodiments, which are obtained by a person skilled in the art based on the described embodiments of the application, fall within the scope of protection of the application.

It should be noted that in this document, relational terms such as "first" and "second" and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions.

FIG. 1 is a flow chart of a medication recommendation method based on a deep convolutional network control gate model of the present application.

The medicine recommendation method based on the deep convolution network control gate model, combined with fig. 1, comprises the following steps:

specifically, the data adopted in the scheme are the existing diagnosis and treatment examination information of the patient and the corresponding final medication data of the patient, and the acquired data can be stored through excel. Firstly, mapping conversion is carried out on the diagnosis and treatment examination information of each patient and the final medication data of the patient with respect to the digital ID, and the diagnosis and treatment examination information of each patient and the final medication data of the patient are converted into digital ID to be represented and used as training data, so that the model can be conveniently input for training.

in some embodiments, in conjunction with the sub-flowchart of fig. 2, i.e., S3, the S3 includes:

in some embodiments, in conjunction with the sub-flowchart of fig. 3, i.e., S31, the S31:

Specifically, the model adopted in the scheme is a StackGatedCNN control gate model, the model firstly comprises an ebedding layer, matrix multiplication is carried out on an ebedding weight matrix and an ID mapped by input data in the ebedding layer, an ebedding word vector is obtained to be used as an ebedding matrix representation of the input data, the vector dimension is 256 dimensions, and the original ID mapped data is output as sentence vector representation x through ebedding.

in some embodiments, in conjunction with the sub-flowchart of fig. 4, i.e., S32, the S32 includes:

s321: window division is carried out on the sentence vector representation x by utilizing sliding of the filter, convolution is carried out on each divided window, and a feature set of the sentence vector representation x corresponding to the current filter is obtained by utilizing tanh function activation;

s322: extracting the feature with the highest value in the feature set corresponding to each filter in the Maxpooling layer;

s323: and performing sigmoid function activation and fusion residual connection on the feature set with the maximum value, and outputting an intra-sentence character vector set through LayerNorm layer conversion.

Specifically, the stackGatedCNN control gate model further comprises a control gate model layer, wherein the calculation process in the control gate model layer is that firstly, a single-layer cnn network is calculated, and sentences can be formed by sliding a filter with a window length of hIn the form of a collection of windows, i.e. +.>And is about the window therein>The convolution is carried out and then is activated by a tanh function, and the formula is as follows:

where n is the number of characters in the sentence vector representing x, b represents the bias weight, W represents the bias weight,representing the results after passing through the linear layer and the activation function layer.

The sliding convolution window is re-activated, and can be obtained:

wherein ,representing the feature set corresponding to the i-th filter.

Thus, the m filters are:

next, a Maxpooling layer is provided, which can receive the output after the convolution kernel is activatedThe objective of the maximum pooling to obtain the features corresponding to a particular filter is to capture the most important features for each feature map, i.e. one with the highest value:

wherein ,representing the feature with the highest value in the feature set corresponding to the ith filter.

Further, the output of the pooling layer is:

calculating the output result of the pooling layer by imitating the idea of a gate unit, defining z_gate=z, and subjecting z_gate to a sigmoid activation function:

then, matrix operation is carried out on z_gated and z activated by the sigmoid activation function, and output of the model layer is obtained:

where z_gated represents the calculation of the pooling layer according to the gate unit idea and z represents the pooling layer output.

Further, using a residual connection mechanism, adding the original vector representation H before the control gate model layer calculation and z output through the control gate model layer, and outputting information z after fusion residual connection:

the data form for z was then transformed by LayerNorm to a mean of 0 and variance of 1:

where f represents LayerNorm conversion, H represents the original vector representation before control gate model layer computation,the calculated result of z through the LayerNorm layer is shown.

Furthermore, in the feed_forward layer, a network structure is defined as two full-connection layers, a relu activation function is added into a similar bert model between the two full-connection layers, then the full-connection layers are added with LayerNorm for residual connection, and the LayerNorm is normalized, and the two full-connection layers are input into a formula to express:

wherein ,representing the result of the calculation through the linear layer and the fedforward layer, < >>Representing weights representing the linear layers inside Relu, < ->Weights representing the linear layers outside Relu, < ->Represents the bias of the linear layer inside Relu, < >>Representing the bias of the linear layer outside the Relu.

It should be noted that, here, the calculation of the stack gateway nn is already completed, and because in this solution, a model mode is adopted in the stack gateway nn, num_layers are set to 2, and the output vector set x of the unbedding network layer is transmitted to the next network for calculation through the above calculation process twice as the finally output character vector representation.

Specifically, let theFor the sentence character vector set which is calculated and output by the StackGatedCNN +.>We pass throughOveraverage pooling and tanh function activation, and linear layer calculation as vector representation of the current sentence, respectively +.>The calculation formula is as follows:

wherein the linear layer weightsD x d, where d is the last vector dimension of the output of the emmbedding layer; />Representing a current sentence vector representation; h represents an intra-sentence character vector set; />Representing the bias weight; j represents the total number of characters in the currently calculated sentence, i represents the ith character in the sentence, and t represents the time of accumulation calculation starting from the current t-th character.

Finally, willThe drug probability p value of the recommended drug of the user is obtained by a full connection layer followed by a softmax, and the formula is as follows:

wherein The weight matrix representing the final linear layer, the dimension is L multiplied by d, L is the number of medicines, and p is the probability value of the final output target medicine; />Representing the result of the calculation through the first linear layer; />Representing the result of the calculation through the second linear layer.

Specifically, after model training is completed, the trained model can be used for recommending the patient medication based on new patient diagnosis, examination and detection indexes, and finally the optimal medication of the patient is recommended, so that the patient medication recommendation automation to a certain extent is realized.

The second aspect of the present application also provides a medication recommendation system based on a deep convolutional network control gate model, comprising:

In some embodiments, the model training module comprises:

In some embodiments, the vector conversion submodule includes:

In some embodiments, the control gate submodule includes:

Those skilled in the art will appreciate that while some embodiments described herein include some features but not others included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the application and form different embodiments.

Those skilled in the art will appreciate that the descriptions of the various embodiments are each focused on, and that portions of one embodiment that are not described in detail may be referred to as related descriptions of other embodiments.

Although the embodiments of the present application have been described with reference to the accompanying drawings, those skilled in the art may make various modifications and alterations without departing from the spirit and scope of the present application, and such modifications and alterations fall within the scope of the appended claims, which are to be construed as merely illustrative of the present application, but the scope of the application is not limited thereto, and various equivalent modifications and substitutions will be readily apparent to those skilled in the art within the scope of the present application, and are intended to be included within the scope of the present application. Therefore, the protection scope of the application is subject to the protection scope of the claims.

The present application is not limited to the above embodiments, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the present application, and these modifications and substitutions are intended to be included in the scope of the present application. Therefore, the protection scope of the application is subject to the protection scope of the claims.

Claims

1. The medicine recommendation method based on the deep convolution network control gate model is characterized by comprising the following steps of:

2. The medication recommendation method based on a deep convolutional network control gate model of claim 1, wherein S3 comprises:

3. The medication recommendation method based on a deep convolutional network control gate model of claim 2, wherein S31:

4. The medication recommendation method based on a deep convolutional network control gate model of claim 2, wherein S32 comprises:

5. The medication recommendation method based on the deep convolutional network control gate model of claim 2, wherein the specific calculation process of S33 is as follows:

wherein ,/>representing linear layer weights, ++>The dimensions are the same and are d×d, d represents the last vector dimension of the output of the emmbedding layer; />Representing a current sentence vector representation; h represents an intra-sentence character vector set; />Representing the bias weight; j represents the total number of characters in the sentence currently calculated, i represents the ith character in the sentence, and t represents the time of accumulation calculation from the current t-th character; />The weight matrix representing the final linear layer, the dimension is L multiplied by d, L is the number of medicines, and p is the probability value of the final output target medicine; />Representing the result of the calculation through the first linear layer; />Representing the result of the calculation through the second linear layer.

6. Drug recommendation system based on deep convolutional network control gate model, characterized by comprising:

7. The medication recommendation system based on a deep convolutional network control gate model of claim 6, wherein the model training module comprises:

8. The medication recommendation system based on a deep convolutional network control gate model of claim 7, wherein the vector conversion submodule comprises:

9. The medication recommendation system based on a deep convolutional network control gate model of claim 7, wherein said control gate submodule comprises:

10. The medication recommendation system based on a deep convolutional network control gate model of claim 7, wherein the probability calculation sub-module comprises the following specific calculation processes:

wherein ,/>Representing linear layer weights, ++>The dimensions are the same and are d×d, d represents the last vector dimension of the output of the emmbedding layer; />Representing a current sentence vector representation; h represents an intra-sentence character vector set; />Representing the bias weight; j represents the total number of characters in the sentence currently calculated, i represents the ith character in the sentence, and t represents the current t th character when the accumulation calculation is performedStarting the character; />The weight matrix representing the final linear layer, the dimension is L multiplied by d, L is the number of medicines, and p is the probability value of the final output target medicine; />Representing the result of the calculation through the first linear layer; />Representing the result of the calculation through the second linear layer.