CN110880362B

CN110880362B - Large-scale medical data knowledge mining and treatment scheme recommending system

Info

Publication number: CN110880362B
Application number: CN201911117826.5A
Authority: CN
Inventors: 张立言; 黄兆孟
Original assignee: Nanjing University of Aeronautics and Astronautics
Current assignee: Nanjing University of Aeronautics and Astronautics
Priority date: 2019-11-12
Filing date: 2019-11-12
Publication date: 2022-10-11
Anticipated expiration: 2039-11-12
Also published as: CN110880362A

Abstract

The invention discloses a large-scale medical data knowledge mining and treatment scheme recommendation system comprising: a data set preprocessing module for acquiring real electronic medical record data and preprocessing electronic medical record data composed of multiple heterogeneous data sources; a disease severity prediction module for obtaining a disease severity score for each patient during treatment; a treatment effectiveness measurement module for obtaining effective treatment measure information; a patient similarity measurement module for constructing a similarity measure between patients; and a drug treatment scheme recommendation module for recommending the next-stage drug treatment scheme. The invention predicts the severity of a patient's condition with a multitask bidirectional heterogeneous LSTM and defines a measure of treatment effectiveness. It then computes fine-grained patient similarity and recommends the next-stage treatment scheme from the patient's own treatment history and from the effective treatment schemes of other patients with high pathological similarity.

Description

Large-scale medical data knowledge mining and treatment scheme recommending system

Technical Field

The invention discloses a system for realizing discovery and recommendation of an effective drug treatment scheme by applying deep learning and knowledge introduction, belonging to the field of medical data mining.

Background

Electronic health record (EHR) data from millions of patients is now routinely collected and stored at medical institutions. EHR data consists of heterogeneous data elements, typically including demographics, diagnoses, physical examinations, sensor measurements, laboratory test results, prescribed or administered medications, and clinical records. With the rapid development of information technology and the rapid spread of electronic medical record (EMR) systems, the amount of digital information stored in electronic medical records in China has increased dramatically over the last decade. It is widely believed that this massive data contains a great deal of hidden knowledge, and that the various types of data in an EMR system offer a way to acquire medical knowledge and thus a basis for improving the quality and efficiency of care. In particular, EMR data has played an important role in many medical applications, especially in providing effective medication recommendations for physicians and patients, increasing cure rates, reducing the risk of death for clinical patients, lowering the cost of decision making during treatment, and avoiding the additional medical costs caused by ineffective or harmful treatments.

Although there is tremendous interest in using EMR data to improve medical performance, the gains obtained from analyzing EMR data so far fall well short of what the data could provide. One reason is that a patient's prognosis is influenced by many factors, such as age, sex, the severity of the disease, and the treatment being administered. Although EMR data contains comprehensive information about patients, diagnoses, and treatments, there is no unified framework that integrates all relevant factors for advanced data modeling. Furthermore, EMR data is heterogeneous and longitudinal in nature. For example, a treatment record is a series of orders, where each order typically consists of a medication name, a route of administration, a dose, a start time, and an end time. In general, analyzing large-scale complex EMR data, extracting medical knowledge from it, and using that knowledge to support decision making in treatment practice is a considerable challenge.

Researchers have made many useful explorations in mining electronic medical record data in order to analyze large-scale complex EMR data. According to survey papers on data mining applied to EMR [1] [2], the recurrent neural network (RNN) and its variants (LSTM, GRU), which are designed for sequence modeling, can capture the complex temporal dynamics in longitudinal EMR data and are the first choice for EMR modeling tasks. Chen, W., Wang, S. et al. [3] dynamically predicted the severity of the condition of intensive care unit (ICU) patients using a multitask RNN that integrates laboratory test results for different organs of the patient. However, the method in [3] does not make full use of the heterogeneous data in EMR; for example, the diagnosis results and the description of the disease are also meaningful for the task. Jin B. et al. [4] developed a treatment engine based on historical EMR data to provide patients with next-stage prescriptions based on their condition, laboratory results, treatment records, and demographic information. Three different LSTM variants were proposed in [4], primarily to address the problem of data heterogeneity, but no overall framework for treatment recommendation was proposed. Because the prescription for the next stage is derived from historical treatment, the "cold start" problem, i.e. the treatment recommendation for a newly hospitalized patient, is not addressed, whereas the present invention recognizes that the first 24 hours of treatment are critical for critically ill patients. Leilei Sun, Chuanren Liu et al. [5] proposed a data-driven method for automatically developing and recommending treatment regimens, mainly using the key information in medical orders; the clustering used by that method yields only a few types of drug treatment combinations, which cannot satisfy the need for more refined treatment recommendations.
Meanwhile, none of the above schemes takes into account reactions between drugs or the patient's history of drug allergy.

References:

[1] Shickel B, Tighe P J, Bihorac A, et al. Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis [J]. IEEE Journal of Biomedical and Health Informatics, 2017: 1-1.

[2] Cao X, Edward C, Jimeng S. Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review [J]. Journal of the American Medical Informatics Association, 2018.

[3] Chen W, Wang S, Long G, Yao L, Sheng Q Z, Li X. Dynamic illness severity prediction via multi-task RNNs for intensive care unit. In: ICDM (2018).

[4] Jin B, Yang H, Sun L, Liu C, Qu Y, Tong J. A treatment engine by predicting next-period prescriptions. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, ACM (2018), pp. 1608-1616.

[5] Leilei Sun, Chuanren Liu, Chonghui Guo, Hui Xiong, and Yanming Xie. 2016. Data-driven Automatic Treatment Regimen Development and Recommendation. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, USA, 1865–1874.

Disclosure of Invention

The invention aims to provide a large-scale medical data knowledge mining and treatment scheme recommendation system that applies a heterogeneous recurrent neural network and knowledge introduction to discover effective treatment segments in large-scale electronic medical records, and that recommends the patient's next-stage drug treatment in an interpretable way based on fine-grained patient similarity, so as to meet modeling requirements and achieve good results.

In order to achieve the purpose, the invention adopts the technical scheme that:

a large-scale medical data knowledge mining and treatment scheme recommendation system comprises: the system comprises a data set preprocessing module, a disease severity prediction module, a treatment effectiveness measurement module, a patient similarity measurement module and a drug treatment scheme recommendation module, wherein:

the data set preprocessing module is used for acquiring real electronic medical record data and preprocessing the electronic medical record data consisting of a plurality of heterogeneous data sources, and the preprocessed electronic medical record comprises five types of patient information, namely demographic information, diagnosis description information, laboratory indexes, medicine prescriptions and discharge results;

the disease severity prediction module is used for training a bidirectional heterogeneous LSTM network through demographic information, diagnosis description information and laboratory index data obtained by the data set preprocessing module to obtain a disease severity score of each patient in the treatment process;

the treatment effectiveness measurement module is used for obtaining effective treatment measurement information through the disease severity grade obtained by the disease severity prediction module, the influence degree of the current treatment on the next stage and the discharge result obtained by the data set preprocessing module;

the patient similarity measurement module is used for constructing a similarity measurement relation of patients and calculating the similarity between the patients through information deposited in the bidirectional heterogeneous LSTM network and static demographic information of the patients;

the drug treatment scheme recommendation module is used for introducing the time sequence of the drug prescription information, based on the effective treatment measure information obtained by the treatment effectiveness measurement module and the patient similarity measure obtained by the patient similarity measurement module, to obtain the next-stage drug treatment scheme recommendation; for filtering out treatment combinations that involve adverse drug reactions or contain drugs to which the current patient is allergic; and for providing the doctor with the top s effective treatments and the treatment examples of ineffective or negative-effect treatments that have high similarity to the patient.

The electronic medical record data comes from the critical care database MIMIC-III v1.4.

The disease severity prediction module is a bidirectional heterogeneous LSTM network. Its input comprises the laboratory physiological characteristic indexes, the demographic information, and the diagnosis description information, and its output is the disease severity score;

the bidirectional heterogeneous LSTM at each time step t is defined as follows (forward network on the left; backward network, marked with a prime, on the right):

$$
\begin{aligned}
f_t &= \sigma(W_f[\mathrm{Chechup}_t, h_{t-1}] + b_f), & f'_t &= \sigma(W'_f[\mathrm{Chechup}_t, h'_{t+1}] + b'_f) \\
i_t &= \sigma(W_i[\mathrm{Chechup}_t, h_{t-1}] + b_i), & i'_t &= \sigma(W'_i[\mathrm{Chechup}_t, h'_{t+1}] + b'_i) \\
o_t &= \sigma(W_o[\mathrm{Chechup}_t, h_{t-1}] + b_o), & o'_t &= \sigma(W'_o[\mathrm{Chechup}_t, h'_{t+1}] + b'_o) \\
d_t &= \sigma(W_d C_{t-1} + b_d), & d'_t &= \sigma(W'_d C'_{t+1} + b'_d) \\
h_t &= o_t \tanh(C_t), & h'_t &= o'_t \tanh(C'_t)
\end{aligned}
$$

$$
D = \mathrm{relu}(W_{dense}[h_t, h'_t] + W_{static} P^{Static} + b_{dense})
$$

where $\sigma$ is the sigmoid function $\sigma(x) = 1/(1 + e^{-x})$, tanh is the hyperbolic tangent function $\tanh(x) = (e^{x} - e^{-x})/(e^{x} + e^{-x})$, and ReLU is the function $f(x) = \max(0, x)$. W denotes the weight matrices and b the bias terms; both are parameters to be learned by the network. $Diagnosis_t$ and $Chechup_t$ are the diagnosis description information and the laboratory indexes at time t, respectively. i, f, o, C, and h denote the input gate, forget gate, output gate, memory cell, and hidden state, respectively. The decomposition gate $d_t$ is constructed from the cell state $C_{t-1}$ and controls the amount of added information. Under the control of the forget gate $f_t$, the additional candidate value and the cell state $C_{t-1}$ of the previous time step are added to the current cell state $C_t$; the input gate $i_t$ controls the degree to which the new state information is added to the current cell state $C_t$.

The forward LSTM and the backward LSTM have the same structure; symbols without a prime denote the forward LSTM network, and symbols with a prime denote the backward LSTM network. A fully connected layer D is added to incorporate the static demographic information $P^{Static}$: $W_{dense}$ is the weight of the dense connection to the outputs of the forward and backward LSTMs, $W_{static}$ is the weight of the static information, and $b_{dense}$ is the bias term of this layer. The result is then fed into a sigmoid layer, where the subscript out marks the output layer, to obtain the predicted disease severity score.

The model is trained with the SOFA score as the true value y of the cross-entropy loss; minimizing the cross-entropy yields a disease severity score curve for each patient. The structure and parameters of the trained bidirectional heterogeneous LSTM network are then frozen, and when a new patient is admitted, the network provides that patient's real-time disease severity score.
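As a minimal sketch of the fully connected layer D and the cross-entropy objective described above, the following Python snippet evaluates both on toy values. The single sigmoid output unit (`w_out`, `b_out`), the scaling of the SOFA score by its maximum of 24 to obtain a target in [0, 1], and all numeric values are assumptions made for illustration; the text does not specify them.

```python
import math

def relu(x):
    return max(0.0, x)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def fuse(h_fwd, h_bwd, p_static, W_dense, W_static, b_dense):
    """D = relu(W_dense [h_t, h'_t] + W_static P^Static + b_dense)."""
    dyn = matvec(W_dense, h_fwd + h_bwd)   # list concatenation gives [h_t, h'_t]
    sta = matvec(W_static, p_static)
    return [relu(a + s + b) for a, s, b in zip(dyn, sta, b_dense)]

def predict(D, w_out, b_out):
    """Assumed output layer: one sigmoid unit on top of D."""
    return sigmoid(sum(w * d for w, d in zip(w_out, D)) + b_out)

def cross_entropy(y_true, y_pred, eps=1e-12):
    """Binary cross-entropy for a target in [0, 1]."""
    y_pred = min(max(y_pred, eps), 1.0 - eps)
    return -(y_true * math.log(y_pred) + (1.0 - y_true) * math.log(1.0 - y_pred))

# Toy values, chosen only for illustration.
h_fwd, h_bwd = [0.2, -0.1], [0.4, 0.3]
p_static = [0.65, 1.0]                      # e.g. scaled age and a gender flag
W_dense = [[0.5, -0.2, 0.1, 0.3], [0.0, 0.4, -0.3, 0.2]]
W_static = [[0.1, 0.2], [-0.1, 0.05]]
b_dense = [0.0, 0.1]

D = fuse(h_fwd, h_bwd, p_static, W_dense, W_static, b_dense)
y_hat = predict(D, w_out=[0.7, -0.4], b_out=0.0)
y = 8 / 24                                  # SOFA 8 scaled by the maximum of 24 (assumption)
loss = cross_entropy(y, y_hat)
print(D, y_hat, loss)
```

In training, the loss would be averaged over patients and time steps and minimized with respect to all weights; the sketch only evaluates it once.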

The treatment effectiveness measurement module obtains effective treatment measure information from three aspects: the disease severity score, the degree of influence K of the current treatment on the next stage, and the discharge result R ∈ {0001, 0010, 0100, 1000};

wherein the degree of effect K of the current treatment on the next stage is represented using the slope of the disease severity score curve, K being defined as:

where T is the length of the time window and $y_T$ is the disease severity score within the t-th time window;

the effective treatment measure information is $M = Q[y_T; K; R]$.

The patient similarity measurement relationship constructed by the patient similarity measurement module is as follows:

the patient z is represented by a vector $P_z$ composed of two state vectors from the forward LSTM network, two state vectors from the backward LSTM network, and the static demographic information;

inter-patient similarity is defined as the 2-norm of the difference between two patient representations:

$$
\mathrm{Similar}\langle P_z, P_j \rangle = \lVert P_z - P_j \rVert_2
$$

where j denotes the j-th patient.

The drug treatment scheme recommendation module takes the effective treatment measure information from the treatment effectiveness measurement module and the inter-patient similarity from the patient similarity measurement module, introduces the time sequence of the drug prescription information, and constructs a tensor table indexed by similarity measure, treatment effectiveness measure, prescription, and time.

Compared with the prior art, the technical scheme adopted by the invention has the following beneficial effects:

(1) The system of the present invention discovers effective treatment patterns from large-scale real electronic medical records. These patterns are fine-grained and short-term, unlike existing treatment recommendation engines, which offer only coarse-grained treatment regimens, and can therefore guide doctors toward more precise treatment.

(2) The system recommends medication individually according to the patient's physiological condition, treatment history, medication history, and so on, and updates the recommendation dynamically.

(3) The invention introduces knowledge of reactions between drugs and the patient's allergy history, which reduces adverse drug-drug reactions and allergic reactions and can increase the reliability and effectiveness of treatment. Through fine-grained patient similarity comparison, complete treatment courses with both positive and negative outcomes are extracted and presented to the doctor, which improves the interpretability and reliability of the drug recommendation. The doctor can judge the expected effectiveness of the recommended treatment scheme from the treatment cases of similar patients and the different outcomes of different schemes, and then decide whether to adopt it, reject it, or use an improved scheme of their own.

Drawings

FIG. 1 is a schematic diagram of a large-scale medical data knowledge mining and treatment planning recommendation system according to the present invention.

Detailed Description

The present invention is further explained below.

Fig. 1 shows a large-scale medical data knowledge mining and treatment scheme recommendation system according to the present invention, which includes a data set preprocessing module, a disease severity prediction module, a treatment effectiveness measurement module, a patient similarity measurement module, and a medication scheme recommendation module, wherein:

the data set preprocessing module is used for acquiring real electronic medical record data and preprocessing the electronic medical record data consisting of a plurality of heterogeneous data sources, wherein the preprocessed electronic medical record comprises five types of patient information which are demographic information, diagnosis description information, laboratory indexes, medicine prescriptions and discharge results respectively;

the disease severity prediction module is used for training the bidirectional heterogeneous LSTM network through the demographic information, the diagnosis description information and the laboratory index data which are obtained by the data set preprocessing module to obtain a disease severity score of each patient in the treatment process;

The realization process of the large-scale medical data knowledge mining and treatment scheme recommendation system provided by the invention is as follows:

Step 1: The data set preprocessing module preprocesses the electronic medical record (EMR) data. EMR databases are typically composed of a variety of heterogeneous data sources, and data retrieved from them is diverse, incomplete, and redundant, which would greatly affect the final mining results. The EMR data must therefore be preprocessed to ensure that it is accurate, complete, and consistent. First, the EMR data is improved by filling in missing values, smoothing noise, and correcting inconsistencies. Second, EMR data may come from multiple EMR systems, and different data sources naturally lead to heterogeneity, which mainly shows up as inconsistent data attributes such as attribute names and measurement units. For example, urine specific gravity may be recorded as SG or as specific gravity, and triglyceride may be measured in mmol/L or in mg/dl. Third, redundant data is removed; redundancy mainly takes the form of repeated records of a data attribute or inconsistent ways of expressing the same attribute.
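As a minimal sketch of the harmonization described above, the following Python snippet maps attribute aliases to one canonical name, converts triglyceride values to a single unit, and drops repeated records. The alias table, the record layout, and the helper names are hypothetical. The conversion factor (1 mmol/L of triglycerides is roughly 88.57 mg/dl) is a commonly used approximation and is not taken from the text. Missing values are simply skipped here rather than imputed.

```python
# Hypothetical alias table: several spellings map to one canonical attribute name.
ALIASES = {
    "sg": "urine_specific_gravity",
    "specific gravity": "urine_specific_gravity",
    "tg": "triglyceride",
    "triglyceride": "triglyceride",
}

# Commonly used approximate factor for triglycerides (assumption, not from the text).
TG_MG_DL_PER_MMOL_L = 88.57

def normalize(record):
    """Return a copy of one lab record with a canonical name and unit."""
    key = record["name"].strip().lower()
    name = ALIASES.get(key, record["name"])
    value, unit = record["value"], record["unit"].lower()
    if name == "triglyceride" and unit == "mg/dl":
        value, unit = value / TG_MG_DL_PER_MMOL_L, "mmol/l"
    return {"patient": record["patient"], "time": record["time"],
            "name": name, "value": round(value, 3), "unit": unit}

def preprocess(records):
    """Normalize records, skip missing values, and drop exact duplicates."""
    seen, cleaned = set(), []
    for rec in records:
        if rec["value"] is None:          # missing measurement, not imputed here
            continue
        norm = normalize(rec)
        fingerprint = tuple(sorted(norm.items()))
        if fingerprint not in seen:
            seen.add(fingerprint)
            cleaned.append(norm)
    return cleaned

raw = [
    {"patient": 1, "time": 0, "name": "TG", "value": 150.0, "unit": "mg/dl"},
    {"patient": 1, "time": 0, "name": "Triglyceride", "value": 1.694, "unit": "mmol/L"},
    {"patient": 1, "time": 0, "name": "SG", "value": 1.02, "unit": ""},
    {"patient": 1, "time": 1, "name": "SG", "value": None, "unit": ""},
]
cleaned = preprocess(raw)
for row in cleaned:
    print(row)
```

The first two raw rows describe the same measurement under different names and units, so only one of them survives normalization.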

The preprocessed electronic medical record typically contains five categories of patient information: demographic information, diagnosis description information, laboratory indexes (physical examination results), drug prescriptions (medical orders), and discharge results (mortality).

Demographic information includes the patient's age, gender, home address, educational background, religion, race, marital status, weight, height, and other information. This information is important in clinical decision making; for example, it influences the design of the overall treatment regimen and the drug dosage. Demographic information can be considered static during the patient's hospitalization and is represented by $P^{Static}$. It is formalized as:

P ^Static ＝{P ^Age ,P ^Gender ,P ^Site ,P ^Education ,...}

The diagnosis description information is given by the doctor and comprises the type of disease, a qualitative description of its severity, complications, and the like. Patients may suffer from several diseases, and during treatment a disease may gradually heal or may progress, with new diseases or additional complications appearing. Diagnosis is therefore viewed as a dynamic process, and $Diagnosis_t$ denotes the diagnosis description information at time t. The diagnosis description information is formalized as:

Laboratory physiological characteristic indexes (physical examination results): to assess the efficacy of treatment accurately, multiple examinations are performed during the patient's hospitalization. The invention uses $Chechup_t$ to denote the physical examination result at time t, whose j-th component is the value of the j-th laboratory physiological characteristic index at time t.

The drug prescription (medical order) includes the drug name, route of administration, daily dosage, start time, and end time. The invention uses $Treatment_t$ to denote a drug prescription, which is a combination of a series of drugs. The drug prescription is formalized as:

where the fields of each drug are, in order: the name of the drug used; the route of administration, such as intravenous (IV), intramuscular (IM), or oral (per os, PO); the dose per administration; the number of administrations per day; and the time of administration on day d. dr indicates that this prescription contains dr different drugs in total. In the present invention, a time window of a specific size is regarded as one complete treatment, so the drug prescription is rewritten as:
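To make the order fields and the time-window view of a treatment concrete, here is a small Python sketch. The `Order` field names, the drug names, and the 24-hour window size are hypothetical choices for illustration; the text only states that a time window of a specific size is treated as one complete treatment.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass(frozen=True)
class Order:
    drug: str            # name of the drug used
    route: str           # route of administration, e.g. "IV", "IM", "PO"
    dose: float          # dose per administration
    times_per_day: int   # number of administrations per day
    start_hour: float    # start time, in hours since admission
    end_hour: float      # end time, in hours since admission

def treatments_by_window(orders, window_hours=24.0):
    """Map each window index to the sorted list of drugs active in that window."""
    windows = defaultdict(set)
    for o in orders:
        first = int(o.start_hour // window_hours)
        # An order that ends exactly on a window boundary does not count
        # toward the following window.
        last = int(max(o.start_hour, o.end_hour - 1e-9) // window_hours)
        for w in range(first, last + 1):
            windows[w].add(o.drug)
    return {w: sorted(drugs) for w, drugs in sorted(windows.items())}

orders = [
    Order("drug_a", "IV", 500.0, 2, start_hour=0.0, end_hour=30.0),
    Order("drug_b", "PO", 20.0, 1, start_hour=24.0, end_hour=48.0),
]
windows = treatments_by_window(orders)
print(windows)
```

Each window's drug list then plays the role of one treatment in the later effectiveness and recommendation steps.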

Discharge result (mortality): when a patient is discharged, the doctor gives a discharge evaluation according to the patient's actual condition. The result can be cure, improvement, ineffective, or death, and these four results are represented by a one-hot code R ∈ {0001, 0010, 0100, 1000}.

Step 2: The disease severity prediction module densely predicts the criticality of ICU patients by building a bidirectional heterogeneous LSTM network W1.

In the ICU, the SOFA scoring system reflects the severity of a patient's condition. However, SOFA assessments are performed at long intervals, such as every 24 hours, which gives a slow response for critically ill patients. Predicting the disease severity score at denser intervals is therefore an effective way to monitor ICU patients rapidly.

The bidirectional heterogeneous LSTM network W1 takes as input the laboratory physiological characteristic indexes (physical examination results), the demographic information, and the diagnosis description information; the heterogeneous LSTM can use all three types of heterogeneous data as input. Its output is the predicted disease severity score.

At each time step t, the LSTM comprises i, f, o, C, and h, which are the input gate, forget gate, output gate, memory cell, and hidden state, respectively. The forget gate controls how much of the memory is forgotten, the input gate controls the update of each unit, and the output gate controls the exposure of the unit state. If the laboratory physiological characteristic indexes (physical examination results), the demographic information, the diagnosis description information, and so on were all fed in as separate sequences, with sequential hidden states constructed for each time series, the fully connected hidden neurons of the different time series could confound the intrinsic dynamics of each series. To allow flexible interaction among the multi-faceted time series, the invention retains only the memory related to the physiological characteristic indexes. Under the control of the previous memory, the additional diagnosis description time series affects the cell state only through a dedicated structure called the decomposition gate. The decomposition gate $d_t$ is constructed from the cell state $C_{t-1}$ and controls the amount of added information; under its control, the additional candidate value is added to the cell state $C_t$.

The forward LSTM and the backward LSTM have the same structure; symbols without a prime denote the forward LSTM network, and symbols with a prime denote the backward LSTM network. A fully connected layer D is added to incorporate the static demographic information $P^{Static}$: $W_{dense}$ is the weight of the dense connection to the outputs of the forward and backward LSTMs, $W_{static}$ is the weight of the static information, and $b_{dense}$ is the bias term of this layer. The result is then fed into a sigmoid layer, where the subscript out marks the output layer, to obtain the predicted disease severity score.

The model is trained with the SOFA score as the true value y of the cross-entropy loss; minimizing the cross-entropy yields a disease severity score curve for each patient. The structure and parameters of the trained bidirectional heterogeneous LSTM network are then frozen, and when a new patient is admitted, the network provides that patient's real-time disease severity score.

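The bidirectional heterogeneous LSTM is defined as follows, first for the forward network and then for the backward network (marked with a prime):

$$
\begin{aligned}
f_t &= \sigma(W_f[\mathrm{Chechup}_t, h_{t-1}] + b_f) \\
i_t &= \sigma(W_i[\mathrm{Chechup}_t, h_{t-1}] + b_i) \\
o_t &= \sigma(W_o[\mathrm{Chechup}_t, h_{t-1}] + b_o) \\
d_t &= \sigma(W_d C_{t-1} + b_d) \\
h_t &= o_t \tanh(C_t)
\end{aligned}
$$

$$
\begin{aligned}
f'_t &= \sigma(W'_f[\mathrm{Chechup}_t, h'_{t+1}] + b'_f) \\
i'_t &= \sigma(W'_i[\mathrm{Chechup}_t, h'_{t+1}] + b'_i) \\
o'_t &= \sigma(W'_o[\mathrm{Chechup}_t, h'_{t+1}] + b'_o) \\
d'_t &= \sigma(W'_d C'_{t+1} + b'_d) \\
h'_t &= o'_t \tanh(C'_t)
\end{aligned}
$$

$$
D = \mathrm{relu}(W_{dense}[h_t, h'_t] + W_{static} P^{Static} + b_{dense})
$$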

where $\sigma$ is the sigmoid function $\sigma(x) = 1/(1 + e^{-x})$, tanh is the hyperbolic tangent function $\tanh(x) = (e^{x} - e^{-x})/(e^{x} + e^{-x})$, and ReLU is the function $f(x) = \max(0, x)$. W denotes the weight matrices and b the bias terms; both are parameters to be learned by the network. $Diagnosis_t$ and $Chechup_t$ are the diagnosis description information and the laboratory indexes at time t, respectively (abbreviated as Di and Ch). i, f, o, C, and h denote the input gate, forget gate, output gate, memory cell, and hidden state, respectively. The decomposition gate $d_t$ is constructed from the cell state $C_{t-1}$ and controls the amount of added information. Under the control of the forget gate $f_t$, the additional candidate value and the cell state $C_{t-1}$ of the previous time step are added to the current cell state $C_t$; the input gate $i_t$ controls the degree to which the new state information is added to the current cell state $C_t$.
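The gate equations above can be sketched in plain Python as follows. The gates f, i, o, and d follow the stated formulas. The cell-state update is described only in words, so the form used here, $C_t = f_t \odot (C_{t-1} + d_t \odot \tilde{C}^{Di}_t) + i_t \odot \tilde{C}_t$ with tanh candidates, is an assumption, as are the parameter names `c` and `g`, the random initialization, and the toy inputs.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def affine(W, b, x):
    """Compute W x + b for a matrix given as a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) + bk for row, bk in zip(W, b)]

def make_params(n_check, n_diag, n_hidden, seed):
    """Random (W, b) pairs per gate; 'c' and 'g' are assumed candidate layers."""
    rng = random.Random(seed)
    n_in = {"f": n_check + n_hidden, "i": n_check + n_hidden,
            "o": n_check + n_hidden, "c": n_check + n_hidden,
            "d": n_hidden, "g": n_diag}
    return {k: ([[rng.uniform(-0.5, 0.5) for _ in range(n)] for _ in range(n_hidden)],
                [0.0] * n_hidden)
            for k, n in n_in.items()}

def step(p, checkup_t, diagnosis_t, h_prev, c_prev):
    """One time step; h_prev/c_prev are h_{t-1}, C_{t-1} (or t+1 when run backward)."""
    xh = checkup_t + h_prev                                # [Chechup_t, h_{t-1}]
    f = [sigmoid(z) for z in affine(*p["f"], xh)]          # forget gate
    i = [sigmoid(z) for z in affine(*p["i"], xh)]          # input gate
    o = [sigmoid(z) for z in affine(*p["o"], xh)]          # output gate
    d = [sigmoid(z) for z in affine(*p["d"], c_prev)]      # decomposition gate
    cand = [math.tanh(z) for z in affine(*p["c"], xh)]              # assumed candidate
    cand_di = [math.tanh(z) for z in affine(*p["g"], diagnosis_t)]  # assumed Di candidate
    # Assumed cell update: forget gate scales old state plus gated diagnosis term.
    c = [fk * (ck + dk * qk) + ik * gk
         for fk, ck, dk, qk, ik, gk in zip(f, c_prev, d, cand_di, i, cand)]
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]       # h_t = o_t tanh(C_t)
    return h, c

def run(p, checkups, diagnoses, n_hidden, reverse=False):
    """Return the hidden state at every time step, forward or backward in time."""
    order = range(len(checkups))
    if reverse:
        order = reversed(order)
    h, c = [0.0] * n_hidden, [0.0] * n_hidden
    out = [None] * len(checkups)
    for t in order:
        h, c = step(p, checkups[t], diagnoses[t], h, c)
        out[t] = h
    return out

checkups = [[0.1, 0.5], [0.2, 0.4], [0.3, 0.2]]   # toy laboratory indexes
diagnoses = [[1, 0, 0], [1, 1, 0], [0, 1, 0]]     # toy multi-hot diagnosis codes
fwd = run(make_params(2, 3, 4, seed=0), checkups, diagnoses, 4)
bwd = run(make_params(2, 3, 4, seed=1), checkups, diagnoses, 4, reverse=True)
states = [hf + hb for hf, hb in zip(fwd, bwd)]    # [h_t, h'_t] per time step
print(states[0])
```

The concatenated states are what the fully connected layer D would consume together with the static demographic vector.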

Step 3: The treatment effectiveness measurement module defines which treatments are effective, using the disease severity score of each patient during treatment obtained in Step 2. The structure and parameters of the trained network W1 are frozen, and when a new patient is admitted, a real-time disease severity score can be obtained by inputting the patient's current laboratory physiological characteristic indexes (physical examination results), demographic information, and diagnosis description information.

For the treatments in the EMR data, a measure of treatment effectiveness is defined. The evaluation is based on three considerations: the patient's current disease severity, the degree of influence of the current treatment on the next stage (time window), and the outcome at final discharge. The degree of influence K of the current treatment on the next stage is represented by the slope of the disease severity score curve. For ease of calculation, and because the score curve is not smooth, K is defined as:

where T is the length of the time window and $y_T$ is the disease severity score within the t-th time window.

The treatment effectiveness measure is $M = Q[y_T; K; R]$. In this embodiment, after $y_T$, K, and R are normalized, $Q = [1, 2, 1]$.
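The weighted combination can be illustrated with a short Python sketch. Only the weights Q = [1, 2, 1] come from the embodiment. The finite-difference slope used as a stand-in for K, the mapping of outcomes to one-hot codes, the scalar value assigned to each outcome, and the convention that larger M means a more effective treatment are all assumptions made for illustration.

```python
# Assumed assignment of the four one-hot codes to outcomes (order not given in the text).
OUTCOME_CODE = {"cure": "0001", "improvement": "0010",
                "ineffective": "0100", "death": "1000"}
# Assumed scalar value of each outcome after normalization (larger is better).
OUTCOME_VALUE = {"0001": 1.0, "0010": 2 / 3, "0100": 1 / 3, "1000": 0.0}

Q = (1.0, 2.0, 1.0)   # weights from the embodiment

def slope(scores):
    """Stand-in for K: mean change in severity per step across the window."""
    steps = len(scores) - 1
    return (scores[-1] - scores[0]) / steps if steps > 0 else 0.0

def effectiveness(scores, outcome):
    """M = Q . [y_T, K, R] on normalized terms; scores are assumed to lie in [0, 1]."""
    y_term = 1.0 - scores[-1]               # low severity at the window end is good
    k_term = (1.0 - slope(scores)) / 2.0    # slope in [-1, 1] mapped to [0, 1]
    r_term = OUTCOME_VALUE[OUTCOME_CODE[outcome]]
    return Q[0] * y_term + Q[1] * k_term + Q[2] * r_term

m_better = effectiveness([0.8, 0.6, 0.4], "cure")     # severity falling
m_worse = effectiveness([0.4, 0.6, 0.8], "death")     # severity rising
print(m_better, m_worse)
```

With these conventions a window in which severity falls and the patient is cured scores higher than one in which severity rises and the patient dies.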

Step 4: The patient similarity measurement module constructs the similarity measure between patients. Information such as a patient's laboratory physiological characteristic indexes, demographic information, diagnosis description information, and disease severity is important for constructing a similarity measure between patients. When the network W1 is used to measure the disease severity score, the patient's information is already deposited in the network.

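Each patient z is represented by a vector $P_z$ composed of two state vectors from the forward LSTM network, two state vectors from the backward LSTM network, and the static demographic information. The similarity between patients is the 2-norm of the difference between two patient representations:

$$
\mathrm{Similar}\langle P_z, P_j \rangle = \lVert P_z - P_j \rVert_2
$$

where j denotes the j-th patient.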
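The 2-norm measure is a Euclidean distance, so a smaller value means two patients are more alike. A minimal Python sketch, with hypothetical patient identifiers and toy two-dimensional representations:

```python
import math

def similarity(p_z, p_j):
    """Similar<P_z, P_j> = ||P_z - P_j||_2; smaller means more alike."""
    return math.dist(p_z, p_j)

def most_similar(query, cohort, s):
    """Return the ids of the s patients whose representation is closest to the query."""
    ranked = sorted(cohort, key=lambda pid: similarity(query, cohort[pid]))
    return ranked[:s]

cohort = {"a": [0.0, 0.0], "b": [3.0, 4.0], "c": [1.0, 1.0]}   # toy representations
nearest = most_similar([0.0, 0.0], cohort, s=2)
print(similarity([0.0, 0.0], [3.0, 4.0]), nearest)
```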

Step 5: The drug treatment scheme recommendation module searches for and recommends the next-stage drug treatment scheme, and provides interpretability through similar treatment samples with positive and negative outcomes. It uses the effective treatment measure information obtained by the treatment effectiveness measurement module and the patient similarity measure obtained by the patient similarity measurement module, introduces the time sequence of the drug prescription information, and constructs a tensor table indexed by similarity measure, treatment effectiveness measure, prescription, and time. When a new patient is hospitalized, the prescription with the highest treatment effectiveness at the current stage among the patients most similar to the new patient is recommended. Note that as the recommended treatment proceeds, the patient's state changes, and so does the similarity between the current patient and the patients in the EMR data; the recommendation therefore changes dynamically with the patient's state.

To account for adverse reactions between drugs and the patient's allergy history, the invention filters out, at recommendation time, combinations with strong adverse reactions and prescriptions containing drugs to which the current patient is allergic, and selects the next-best option instead, which makes the recommendation more reliable for the current patient. In addition, the top s effective treatments and the top s ineffective or negative treatments from patients highly similar to the current patient are provided to help the doctor make a better decision.
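The filtering and ranking just described might look like the following Python sketch. The record layout, the drug names, the effectiveness threshold of 0.5 that separates effective from ineffective examples, and the rule of picking the most effective of the s nearest safe treatments are assumptions for illustration, not details given in the text.

```python
import math

def is_safe(drugs, allergies, adverse_pairs):
    """Reject prescriptions containing an allergen or a known adverse drug pair."""
    if drugs & allergies:
        return False
    return not any(pair <= drugs for pair in adverse_pairs)

def recommend(query, allergies, adverse_pairs, history, s=2, threshold=0.5):
    """Return (best treatment, top-s effective examples, top-s negative examples)."""
    ranked = sorted(history, key=lambda r: math.dist(query, r["vec"]))
    effective = [r for r in ranked
                 if r["m"] >= threshold and is_safe(r["drugs"], allergies, adverse_pairs)]
    negative = [r for r in ranked if r["m"] < threshold]
    top_eff, top_neg = effective[:s], negative[:s]
    best = max(top_eff, key=lambda r: r["m"]) if top_eff else None
    return best, top_eff, top_neg

# Toy history: patient representation, prescribed drugs, effectiveness measure M.
history = [
    {"id": "p1", "vec": [0.0, 0.0], "drugs": {"drug_a", "drug_b"}, "m": 0.9},
    {"id": "p2", "vec": [0.1, 0.0], "drugs": {"drug_c"}, "m": 0.8},
    {"id": "p3", "vec": [0.2, 0.0], "drugs": {"drug_p"}, "m": 0.95},
    {"id": "p4", "vec": [0.3, 0.0], "drugs": {"drug_d"}, "m": 0.2},
    {"id": "p5", "vec": [1.0, 1.0], "drugs": {"drug_e"}, "m": 0.7},
]
best, top_eff, top_neg = recommend(
    query=[0.0, 0.0],
    allergies={"drug_p"},
    adverse_pairs={frozenset({"drug_a", "drug_b"})},
    history=history,
)
print(best["id"], [r["id"] for r in top_eff], [r["id"] for r in top_neg])
```

Here p1 is dropped because its two drugs form an adverse pair and p3 because of the allergy, so the next-best safe option p2 is recommended, while p4 is shown as a negative example.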

In this embodiment, the electronic medical record data comes from the critical care database MIMIC-III. The MIMIC-III database is a real clinical database containing health data for more than 40,000 patients admitted to the ICU of the Beth Israel Deaconess Medical Center over an 11-year period. The invention uses the latest version, MIMIC-III v1.4, which includes 50206 medical treatment records covering 6695 different diseases and 4127 drugs. This embodiment excludes patients under 15 years of age and patients who stayed in the ICU for less than 48 hours. Children are excluded because the normal ranges of medical indicators differ between adults and children, and the 48-hour ICU requirement ensures sufficient data for analysis. Patients with large amounts of missing data are also excluded, because estimating too many missing values may introduce harmful bias. In the end, 3255 patients were selected for modeling and analysis.
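The cohort selection rules can be written as a simple filter. In this sketch the field names are hypothetical and are not MIMIC-III column names, and the 0.5 cut-off for the fraction of missing data is an assumption, since the text does not quantify "large amounts".

```python
def eligible(stay, min_age=15, min_icu_hours=48, max_missing=0.5):
    """Keep stays of patients aged 15 or over, with 48 h or more in the ICU,
    and without too much missing data (threshold is an assumption)."""
    return (stay["age"] >= min_age
            and stay["icu_hours"] >= min_icu_hours
            and stay["missing_fraction"] <= max_missing)

stays = [
    {"id": 1, "age": 63, "icu_hours": 120, "missing_fraction": 0.1},
    {"id": 2, "age": 9, "icu_hours": 72, "missing_fraction": 0.0},    # too young
    {"id": 3, "age": 45, "icu_hours": 20, "missing_fraction": 0.0},   # stay too short
    {"id": 4, "age": 70, "icu_hours": 96, "missing_fraction": 0.8},   # too much missing
    {"id": 5, "age": 15, "icu_hours": 48, "missing_fraction": 0.5},   # boundary case
]
cohort = [s["id"] for s in stays if eligible(s)]
print(cohort)
```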

The foregoing is only a preferred embodiment of the present invention. It should be noted that those skilled in the art can make various modifications and refinements without departing from the principle of the present invention, and these modifications and refinements shall also fall within the protection scope of the present invention.

Claims

1. A large-scale medical data knowledge mining and treatment scheme recommendation system is characterized in that: the system comprises a data set preprocessing module, a disease severity predicting module, a treatment effectiveness measuring module, a patient similarity measuring module and a drug treatment scheme recommending module, wherein:

the data set preprocessing module is used for acquiring real electronic medical record data and preprocessing the electronic medical record data consisting of various heterogeneous data sources, and the preprocessed electronic medical record comprises five types of patient information, namely static demographic information $P^{Static}$, diagnosis description information Diagnosis, laboratory indexes Chechup, drug prescriptions Treatment, and discharge result R;

the disease severity prediction module is used for training the bidirectional heterogeneous LSTM network through the demographic information, the diagnosis description information and the laboratory index data which are obtained by the data set preprocessing module to obtain a disease severity score of each patient in the treatment process; the disease severity prediction module is a bidirectional heterogeneous LSTM network, and the overall structure of the bidirectional heterogeneous LSTM network is as follows:

scoring for disease severity;

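$$
\begin{aligned}
f_t &= \sigma\left(W_f\,[\mathrm{Chechup}_t, h_{t-1}] + b_f\right), & f'_t &= \sigma\left(W'_f\,[\mathrm{Chechup}_t, h'_{t+1}] + b'_f\right) \\
i_t &= \sigma\left(W_i\,[\mathrm{Chechup}_t, h_{t-1}] + b_i\right), & i'_t &= \sigma\left(W'_i\,[\mathrm{Chechup}_t, h'_{t+1}] + b'_i\right) \\
o_t &= \sigma\left(W_o\,[\mathrm{Chechup}_t, h_{t-1}] + b_o\right), & o'_t &= \sigma\left(W'_o\,[\mathrm{Chechup}_t, h'_{t+1}] + b'_o\right) \\
d_t &= \sigma\left(W_d\,C_{t-1} + b_d\right), & d'_t &= \sigma\left(W'_d\,C'_{t+1} + b'_d\right) \\
h_t &= o_t \tanh(C_t), & h'_t &= o'_t \tanh(C'_t)
\end{aligned}
$$

$$
D = \mathrm{relu}\left(W_{dense}\,[h_t, h'_t] + W_{static}\,P^{Static} + b_{dense}\right)
$$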

wherein $\sigma$ is the sigmoid function $\sigma(x) = 1/(1 + e^{-x})$; $\tanh$ is the hyperbolic tangent function $\tanh(x) = (e^{x} - e^{-x})/(e^{x} + e^{-x})$; relu is the ReLU function $f(x) = \max(0, x)$; W denotes the weight matrices and b denotes the bias terms, and W and b are the parameters to be learned by the network; $\mathrm{Diagnosis}_t$ and $\mathrm{Chechup}_t$ are respectively the diagnosis description information and the laboratory indexes at time t; i, f, o, C and h are respectively the input gate, forget gate, output gate, memory cell and hidden state; the cell state $C_{t-1}$ is used to construct the decomposition gate $d_t$, which controls the amount of added information; under the control of the forget gate $f_t$, an additional candidate value is added, and its degree of update is added to the current cell state $C_t$;

the forward and backward LSTMs have the same structure; the forward LSTM network is denoted by symbols without a prime, and the backward LSTM network is denoted by symbols with a prime; a fully connected layer D is added to combine the static demographic information $P^{Static}$ with the outputs of the forward and backward LSTMs, where $W_{dense}$ is the weight of the dense connection to the outputs of the forward and backward LSTMs, $W_{static}$ is the weight for the static information, and $b_{dense}$ is the bias term of this layer; the result is then input into a sigmoid layer, where the subscript out indicates that the layer is the output layer, finally obtaining the predicted disease severity score;

the model uses the SOFA score as the ground-truth value y in the cross-entropy loss and trains the bidirectional heterogeneous LSTM model by minimizing the cross entropy, finally obtaining a disease severity score curve for each patient; the structure and parameters of the trained bidirectional heterogeneous LSTM network are then fixed, and when a new patient is admitted, the network is used to obtain the real-time disease severity score of the new patient;

the treatment effectiveness measurement module is used for obtaining effective treatment measurement information through the disease severity score obtained by the disease severity prediction module, the influence degree of the current treatment on the next stage and the discharge result obtained by the data set preprocessing module;
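The gate and fusion-layer equations recited in claim 1 can be illustrated with a small pure-Python sketch. It covers only the formulas that are written out explicitly (a sigmoid gate over the concatenation of Chechup_t and the previous hidden state, and the layer D); the candidate value and cell-state update are not reproduced here. The helper names `affine`, `gate` and `fusion_layer`, and all numeric values, are hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


def affine(W, v, b):
    """Compute W v + b for a list-of-rows matrix W and list vectors v, b."""
    return [sum(w * x for w, x in zip(row, v)) + b_k for row, b_k in zip(W, b)]


def gate(W, chechup_t, h_prev, b):
    """sigma(W [Chechup_t, h_prev] + b); [., .] is vector concatenation.

    For the backward network, pass h'_{t+1} as h_prev.
    """
    return [sigmoid(z) for z in affine(W, chechup_t + h_prev, b)]


def fusion_layer(W_dense, h_fwd, h_bwd, W_static, p_static, b_dense):
    """D = relu(W_dense [h_t, h'_t] + W_static P^Static + b_dense)."""
    dense = affine(W_dense, h_fwd + h_bwd, b_dense)
    static = affine(W_static, p_static, [0.0] * len(W_static))
    return [max(0.0, d + s) for d, s in zip(dense, static)]


# Toy values chosen so the arithmetic is easy to follow.
f_t = gate([[0.0, 0.0]], chechup_t=[1.0], h_prev=[2.0], b=[0.0])
D = fusion_layer(
    W_dense=[[1.0, 1.0], [-1.0, -1.0]],
    h_fwd=[0.5],
    h_bwd=[0.25],
    W_static=[[1.0], [0.0]],
    p_static=[1.0],
    b_dense=[0.0, 0.0],
)
print(f_t, D)
```

With zero weights the gate evaluates to 0.5, and the second unit of D is clipped to zero by the ReLU.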

2. The large-scale medical data knowledge mining and treatment scheme recommendation system of claim 1, wherein: the electronic medical record data is from the intensive care medical database MIMIC-III.

3. The large-scale medical data knowledge mining and treatment scheme recommendation system of claim 1, wherein: the treatment effectiveness measurement module obtains the effective treatment measure information from three aspects: the disease severity score, the degree of influence K of the current treatment on the next stage, and the discharge result $R \in \{0001, 0010, 0100, 1000\}$;

the effective treatment measure information is $M = [y_T; K; R]$.
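As a rough illustration of how the three components could be assembled into one vector, the sketch below concatenates a severity score, an influence degree K, and a four-bit one-hot discharge code. The function name, the flat-list layout, and the sample numbers are assumptions; the claim specifies only the three components and the four admissible codes for R.

```python
VALID_DISCHARGE_CODES = {"0001", "0010", "0100", "1000"}  # codes listed in the claim


def effectiveness_measure(severity_score, influence_k, discharge_code):
    """Concatenate the three components into one flat vector.

    severity_score: disease severity score (hypothetical sample value below).
    influence_k:    degree of influence of the current treatment on the next stage.
    discharge_code: one of the four one-hot strings for the discharge result R.
    """
    if discharge_code not in VALID_DISCHARGE_CODES:
        raise ValueError(f"unknown discharge code: {discharge_code!r}")
    one_hot = [int(bit) for bit in discharge_code]
    return [severity_score, influence_k] + one_hot


M = effectiveness_measure(0.3, 0.7, "0010")
print(M)
```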

4. The large-scale medical data knowledge mining and treatment scheme recommendation system of claim 1, wherein: the patient similarity measurement relationship constructed by the patient similarity measurement module is as follows:

patient z is expressed as a vector $P_z$ composed of quantities taken from the forward LSTM network, the corresponding quantities taken from the backward LSTM network, and the static demographic information $P^{Static}$;

$$
\mathrm{Similar}\langle P_z, P_j \rangle = \lVert P_z - P_j \rVert_2 ;
$$

wherein j denotes the j-th patient.
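The similarity relation in claim 4 is a Euclidean (L2) distance between patient vectors, so a smaller value means a more similar patient. A minimal sketch, assuming each patient is already represented as a plain list of floats; the function names, the dictionary layout and the sample vectors are hypothetical.

```python
import math


def similar(p_z, p_j):
    """Similar<P_z, P_j> = ||P_z - P_j||_2 (smaller means more similar)."""
    return math.dist(p_z, p_j)


def top_s_similar(p_z, others, s):
    """Return the ids of the s patients closest to p_z.

    others: dict mapping a patient id to that patient's vector (assumed layout).
    """
    return sorted(others, key=lambda j: similar(p_z, others[j]))[:s]


# Made-up two-dimensional patient vectors.
p_z = [0.0, 0.0]
others = {"j1": [3.0, 4.0], "j2": [1.0, 0.0], "j3": [0.0, 2.0]}
nearest = top_s_similar(p_z, others, s=2)
print(nearest)
```

Here the distances are 5, 1 and 2, so the two most similar patients are `j2` and `j3`.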

5. The large-scale medical data knowledge mining and treatment scheme recommendation system of claim 1, wherein: the drug treatment scheme recommendation module takes the effective treatment measure information obtained by the treatment effectiveness measurement module and the similarity among patients obtained by the patient similarity measurement module, introduces the time sequence of the drug prescription information, and constructs a similarity measure-treatment effectiveness measure-drug-time tensor table.