WO2021073277A1 - Personalized precise medication recommendation method and apparatus

Info

Publication number: WO2021073277A1
Application number: PCT/CN2020/112186
Authority: WO
Inventors: 李艳圃; 黄思皖; 陈天歌; 李响; 谢国彤
Original assignee: 平安科技（深圳）有限公司
Priority date: 2019-10-16
Filing date: 2020-08-28
Publication date: 2021-04-22
Also published as: CN110880361A; CN110880361B

Abstract

Disclosed are a personalized precise medication recommendation method and apparatus, relating to the technical field of artificial intelligence. The method comprises: acquiring medical record data of a plurality of patients suffering from the same disease, wherein the medical record data comprises structured data, text data and image data (S01); obtaining medication information of each patient from the text data (S02); screening medication information of a plurality of historical patients to obtain a first medicine recommendation result for a target patient (S03); performing merging processing on the medical record data of each patient to obtain illness state feature information of the patient (S04); screening out, from the plurality of historical patients, at least one similar patient whose illness state feature information is similar to the current illness state feature information of the target patient (S05); generating a second medicine recommendation result according to medication information of the similar patient (S06); and obtaining a personalized medicine recommendation result for the target patient according to the first medicine recommendation result and the second medicine recommendation result (S07). The method and apparatus can solve the problem in the prior art of low medication precision for patients.

Description

Method and device for personalized and accurate medication recommendation

This application claims priority to Chinese patent application No. 201910983855.3, filed with the Chinese Patent Office on October 16, 2019 and entitled "A method and device for personalized and precise medication recommendation", the entire content of which is incorporated herein by reference.

Technical field

This application relates to the field of artificial intelligence technology, and in particular to a method and device for personalized and accurate medication recommendation.

Background art

Precise medication is a core component of precision medicine. During disease diagnosis and treatment, an accurate medication plan and timely, appropriate administration play a decisive role in improving the patient's therapeutic effect. Prescribing the right medicine requires treating each person and each disease differently, and comprehensively considering the patient's current condition, past medical history, medication history, and family medical history.

The inventors realized that existing intelligent systems generally use only the numerical, structured part of a patient's EHR (Electronic Health Record) data. For an individual patient, especially one suffering from a chronic disease, the diagnosis and treatment records span a long period. When only structured numerical EHR data is used to recommend drugs, medication accuracy is low and the patient's personalized medication needs are difficult to meet.

Summary of the invention

In view of this, the embodiments of the present application provide a personalized accurate medication recommendation method and device to solve the problems in the prior art that medication accuracy for patients is low and that patients' personalized medication needs are difficult to meet.

In order to achieve the above objective, according to one aspect of the present application, a personalized and accurate medication recommendation method is provided, the method includes:

Obtaining medical record data of multiple patients suffering from the same disease, the medical record data including structured data, text data, and image data, wherein the patients include historical patients and a target patient who currently needs a medication recommendation; using a named entity recognition algorithm to obtain each patient's medication information from that patient's text data; using a drug-based collaborative filtering algorithm to screen the medication information of the multiple historical patients to obtain a first drug recommendation result for the target patient; merging each patient's medical record data to obtain the patient's condition feature information; using a patient-based collaborative filtering algorithm to screen out, from the multiple historical patients, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient; generating a second drug recommendation result according to the medication information of the similar patient; and fusing the first drug recommendation result and the second drug recommendation result to obtain a personalized drug recommendation result for the target patient.

In order to achieve the above objective, according to one aspect of the present application, there is provided a personalized accurate medication recommendation device, the device comprising:

An acquisition unit, used to acquire medical record data of multiple patients suffering from the same disease, the medical record data including structured data, text data, and image data, wherein the patients include historical patients and a target patient who currently needs a medication recommendation; an identification unit, used to obtain each patient's medication information from that patient's text data by using a named entity recognition algorithm; a first screening unit, used to screen the medication information of the multiple historical patients with a drug-based collaborative filtering algorithm to obtain a first drug recommendation result for the target patient; a processing unit, used to merge each patient's medical record data to obtain the patient's condition feature information; a second screening unit, used to screen out, from the multiple historical patients and with a patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient; a generating unit, used to generate a second drug recommendation result according to the medication information of the similar patient; and a fusion unit, used to fuse the first drug recommendation result and the second drug recommendation result to obtain a personalized drug recommendation result for the target patient.

In order to achieve the foregoing objective, according to one aspect of the present application, a computer non-volatile storage medium is provided, the storage medium includes a stored program, and when the program runs, the device where the storage medium is located is controlled to perform the following steps:

In order to achieve the foregoing objective, according to one aspect of the present application, a computer device is provided, including a memory, a processor, and a computer program stored in the memory and runnable on the processor, and when the processor executes the computer program, the following steps are implemented:

This solution considers the dynamic changes in the patient's condition more comprehensively, improves medication accuracy, and meets the patient's personalized medication needs.

Description of the drawings

In order to explain the technical solutions of the embodiments of the present application more clearly, the drawings needed in the embodiments are briefly introduced below. The drawings described below illustrate only some embodiments of the present application; those of ordinary skill in the art can obtain other drawings from them without creative effort.

FIG. 1 is a flowchart of an optional personalized and precise medication recommendation method provided by an embodiment of the present application;

Fig. 2 is a schematic diagram of an optional personalized precise medication recommendation device provided by an embodiment of the present application;

Fig. 3 is a schematic diagram of an optional computer device provided by an embodiment of the present application.

Detailed description

In order to better understand the technical solutions of the present application, the embodiments of the present application will be described in detail below with reference to the accompanying drawings.

The technical solution of this application can be applied to the fields of artificial intelligence, big data and/or digital medical technology, and the data involved can be stored in a database, or can be distributed storage through a blockchain, which is not limited by this application.

The terms used in the embodiments of the present application are only for the purpose of describing specific embodiments, and are not intended to limit the present application. The singular forms of "a", "said" and "the" used in the embodiments of the present application and the appended claims are also intended to include plural forms, unless the context clearly indicates other meanings.

It should be understood that the term "and/or" used in this text only describes an association relationship between the associated objects and indicates that three relationships are possible. For example, "A and/or B" can mean three cases: A exists alone, both A and B exist, or B exists alone. In addition, the character "/" in this text generally indicates that the associated objects before and after it are in an "or" relationship.

Fig. 1 is a flowchart of a method for personalized accurate medication recommendation according to an embodiment of the present application. As shown in Fig. 1, the method includes:

Step S01: Obtain medical record data of multiple patients suffering from the same disease. The medical record data includes structured data, text data, and image data, and the patients include historical patients and a target patient who currently needs a medication recommendation;

Step S02: Use a named entity recognition algorithm to obtain each patient's medication information from that patient's text data;

Step S03: Use a drug-based collaborative filtering algorithm to screen the medication information of multiple historical patients to obtain the first drug recommendation result for the target patient;

Step S04: Merge each patient's medical record data to obtain the patient's condition feature information;

Step S05: Use a patient-based collaborative filtering algorithm to screen out, from the multiple historical patients, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient;

Step S06: Generate a second drug recommendation result based on the medication information of the similar patients;

Step S07: Fuse the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result for the target patient.

In this solution, the medical record data of patients with the same disease is merged, historical patients whose condition features are similar to the target patient's current condition are screened out, and a second drug recommendation result is obtained from the medication records of those similar patients. Finally, the first drug recommendation result and the second drug recommendation result are fused. Compared with existing approaches that recommend medication based on the first drug recommendation result alone, this solution considers the dynamic changes in the patient's condition more comprehensively, improves medication accuracy, and meets patients' personalized medication needs.

The specific technical solution of the personalized accurate medication recommendation method provided in this embodiment is described in detail below.

Step S01: Obtain medical record data of multiple patients suffering from the same disease. The medical record data includes structured data, text data, and image data. The patients include historical patients and a target patient who currently needs a medication recommendation.

Historical patients are patients who have previously been treated for the above disease; the target patient is a patient currently suffering from the disease and is the object of the medication recommendation. Specifically, the structured data is the numerical data in the patient's Electronic Health Record (EHR), such as heart rate, blood pressure, blood sugar, urine volume, and other test data, stored in the EHR in numerical form. Text data, such as medication records, discharge summaries, nursing records, and ward round records, is stored in the EHR in text form, and image data, such as CT, MRI, and X-ray images, is stored in the EHR as pictures.

In step S02, a named entity recognition algorithm is used to obtain the patient's medication information from the text data of each patient.

The text data may be, for example, a medication record, which may be a scanned image file or a text file. Specifically, the text data of each patient is segmented into multiple words, and the named entity recognition algorithm identifies each patient's medication information from those words. In this embodiment, the medication information of each patient is expressed in the form of a patient-drug coding matrix. Named Entity Recognition (hereinafter NER) refers to recognizing entities with specific meanings in text, mainly names of persons, names of diseases, names of drugs, and/or other proper nouns. NER can also be used to identify the patient's social security card number, for example "social security card number: 6123456". Because the social security card number is the patient's unique identification code, it is used to represent the patient. The drug code is obtained by querying the drug knowledge base with the identified drug name, based on the mapping between drugs and codes recorded in the knowledge base.

For example, if the patient's social security card number is 6123456 and the identified drugs are drug a (code D120) and drug b (code D130), the integrated medication information of the patient is {6123456, D120, D130}.

In other embodiments, the medication information can also be obtained from text data by means such as keyword recognition, which is not limited here.
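The keyword-recognition alternative mentioned above can be sketched as a dictionary lookup against a drug knowledge base. This is only an illustration, not the NER model of the embodiment: the table `DRUG_KNOWLEDGE_BASE`, the function `extract_medication_info`, and the sample sentence are assumptions introduced here, and only the two name-to-code mappings are taken from the description.

```python
import re

# Hypothetical drug knowledge base: drug name -> drug code.
# The two mappings below follow the examples in this description;
# a real knowledge base would be far larger.
DRUG_KNOWLEDGE_BASE = {
    "nifedipine": "D130",
    "levoamlodipine besylate": "D131",
}


def extract_medication_info(patient_id: str, text: str) -> dict:
    """Find known drug names in free text and return their codes."""
    lowered = text.lower()
    codes = []
    for name, code in DRUG_KNOWLEDGE_BASE.items():
        # Whole-word match so that a name is not found inside a longer word.
        if re.search(r"\b" + re.escape(name) + r"\b", lowered):
            codes.append(code)
    return {"patient": patient_id, "drugs": sorted(codes)}


# Illustrative input; the text is invented for this sketch.
record = extract_medication_info(
    "6123456",
    "Continue Nifedipine once daily; add levoamlodipine besylate.",
)
print(record)
```

The returned record has the same shape as the {patient identifier, drug codes} example above, so it could feed the later steps unchanged.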

In step S03, a drug-based collaborative filtering algorithm is used to screen the medication information of multiple historical patients to obtain the first drug recommendation result for the target patient.

The drug-based collaborative filtering algorithm determines how historical patients score certain drugs by looking for similarity between drugs, and then recommends the several similar drugs with the highest scores to the target patient. The first drug recommendation result includes at least one drug. In one embodiment, the drugs in the first drug recommendation result are those whose score is higher than a preset value, filtered from the medication information of multiple historical patients suffering from the same disease. Historical patients are patients who have already been treated for the same disease as the target patient.

Understandably, collaborative filtering is a method of predicting the medications of target patients by collecting medication information from many patients. For example, if drug a is used to treat a certain disease with a high score, the system will consider that drug a is suitable for the treatment of the disease.
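A minimal sketch of drug-based (item-based) collaborative filtering is given below. The description does not say how drug scores are derived or which similarity measure is used, so the score table `history`, the cosine similarity, the weighted-average prediction, and the `min_score` cut-off are all assumptions made for illustration.

```python
import math

# Hypothetical scores (0-5) that historical patients' records assign to drugs.
history = {
    "p1": {"D120": 5.0, "D130": 4.0, "D140": 1.0},
    "p2": {"D120": 4.0, "D130": 5.0},
    "p3": {"D130": 2.0, "D140": 5.0},
}


def drug_vector(drug):
    """Scores of one drug, keyed by the patients who have a score for it."""
    return {p: scores[drug] for p, scores in history.items() if drug in scores}


def cosine(a, b):
    """Cosine similarity between two sparse score vectors."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[p] * b[p] for p in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def recommend(target_scores, min_score=3.0):
    """Predict scores for drugs the target has not used; keep the high ones."""
    all_drugs = {d for scores in history.values() for d in scores}
    result = {}
    for cand in sorted(all_drugs - set(target_scores)):
        sims = {d: cosine(drug_vector(cand), drug_vector(d)) for d in target_scores}
        total = sum(sims.values())
        if total == 0:
            continue  # no similar drug in the target's history
        predicted = sum(sims[d] * target_scores[d] for d in target_scores) / total
        if predicted >= min_score:
            result[cand] = round(predicted, 2)
    return result


# The target patient's own known scores are also invented for this sketch.
first_result = recommend({"D120": 5.0, "D140": 1.0})
print(first_result)
```

Here D130 is recommended because it is scored much like D120, which the target patient's record rates highly; the preset value plays the role of `min_score`.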

In step S04, the patient's medical record data is merged to obtain the patient's condition characteristic information.

Among them, the patient's condition feature information includes first feature information, second feature information, and third feature information. The condition feature information is a piece of multi-dimensional record data, which is obtained by combining the first feature information, the second feature information, and the third feature information of each patient based on the time dimension information. The first characteristic information is extracted from the text data of the medical record data, the second characteristic information is extracted from the structured data of the medical record data, and the third characteristic information is extracted from the image data of the medical record data.

Further, step S04 specifically includes:

In step S041, the medication information of each identified patient is converted into corresponding numerical data according to the preset mapping table. Understandably, the drug is mapped to a value to facilitate subsequent calculations, such as mapping "nifedipine" to "D130", mapping "levoamlodipine besylate" to "D131" and so on.

In step S042, the structured data in each patient's medical record data is arranged into a sparse matrix in time order. Specifically, a Variational Autoencoder (VAE) is used to process the structured data and compress it into a sparse matrix according to the timing information. A VAE is a kind of autoencoder, and its encoding reduces the dimensionality of high-dimensional data. The structured data of each patient is written as, for example, Xvae = {x1, x2, ..., xi, ..., xn}, where each xi is an input vector whose elements are patient-related factors such as heart rate, blood pressure, and laboratory values for creatinine, blood sugar, and urea, and X corresponds to one disease.
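The time-ordered sparse matrix can be pictured with the sketch below. It covers only the arrangement of measurements by date and feature, not the VAE encoding itself; the feature list, the sample values, and the choice to fill missing entries with 0.0 when densifying are assumptions for illustration.

```python
from datetime import date

# Hypothetical column order for the structured features.
FEATURES = ["heart_rate", "systolic_bp", "creatinine", "blood_glucose", "urea"]

# Hypothetical structured records: (visit date, feature name, value).
records = [
    (date(2019, 10, 2), "heart_rate", 76.0),
    (date(2019, 9, 30), "heart_rate", 82.0),
    (date(2019, 9, 30), "blood_glucose", 6.1),
    (date(2019, 10, 2), "creatinine", 88.0),
]


def to_sparse_matrix(records):
    """Build a dictionary-of-keys matrix: rows are visits in time order."""
    dates = sorted({d for d, _, _ in records})
    row_of = {d: i for i, d in enumerate(dates)}
    col_of = {f: j for j, f in enumerate(FEATURES)}
    entries = {(row_of[d], col_of[f]): v for d, f, v in records}
    return dates, entries


def to_dense(entries, n_rows):
    """Expand the sparse entries; unmeasured values become 0.0 (an assumption)."""
    return [
        [entries.get((i, j), 0.0) for j in range(len(FEATURES))]
        for i in range(n_rows)
    ]


dates, entries = to_sparse_matrix(records)
dense = to_dense(entries, len(dates))
for visit, row in zip(dates, dense):
    print(visit, row)
```

Only the measured values are stored, which is what makes the matrix sparse: most patients have only a few of the possible measurements at any one visit.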

Step S043: Use a variational autoencoder to compress and encode the numerical data and the sparse matrix to obtain the first code and the second code of the patient. The first code includes first feature information and time dimension information derived from text data. Similarly, the second code includes second feature information and time dimension information derived from structured data.

Further, the time dimension information here is not merely the date of treatment; it indicates the point in the course of the disease, such as the first visit, the first follow-up visit, and the second follow-up visit, so that the stage of the patient's condition can be determined.

In step S044, the image data is pooled using a preset convolutional neural network to obtain a third code of the patient.

In the pooling process, the three primary color (RGB) values of each pixel in the image data are obtained first; the preset convolutional neural network then extracts the characteristic regions from the RGB values of all pixels to form the third code of the patient.

Specifically, the convolutional neural network pools the image to highlight key information, such as the area affected by pneumonia. The third code output by the convolutional neural network includes the third feature information and time dimension information derived from the image data. For example, the patchy blurred area in the lower right lung on patient a's lung X-ray of 2019-9-30 measures 1 cm × 2 cm, and the same area on the lung X-ray of 2019-10-02 measures 0.5 cm × 0.5 cm.
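As a rough illustration of the pooling operation alone, the sketch below averages each pixel's three primary color values into one intensity and applies 2×2 max pooling. It is not the preset convolutional neural network of the embodiment (there are no learned convolution kernels), and the 4×4 image is invented for the example.

```python
def to_intensity(rgb_image):
    """Average the three primary color values of every pixel."""
    return [[sum(pixel) / 3 for pixel in row] for row in rgb_image]


def max_pool_2x2(grid):
    """Keep the largest value in each non-overlapping 2x2 window."""
    pooled = []
    for i in range(0, len(grid) - 1, 2):
        row = []
        for j in range(0, len(grid[0]) - 1, 2):
            window = (grid[i][j], grid[i][j + 1], grid[i + 1][j], grid[i + 1][j + 1])
            row.append(max(window))
        pooled.append(row)
    return pooled


BLACK = (0, 0, 0)
# Hypothetical 4x4 image; brighter pixels stand in for a region of interest.
image = [
    [BLACK, (30, 30, 30), BLACK, BLACK],
    [BLACK, (90, 90, 90), BLACK, (60, 60, 60)],
    [BLACK, BLACK, BLACK, BLACK],
    [BLACK, BLACK, (240, 240, 240), BLACK],
]

pooled = max_pool_2x2(to_intensity(image))
print(pooled)
```

Pooling shrinks the image while preserving the strongest response in each window, which is why it tends to highlight salient regions.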

In step S045, the first code, the second code, and the third code are combined to obtain the condition feature information of each patient. Specifically, the first feature information, the second feature information, and the third feature information of each patient are combined based on the time dimension information to obtain the condition feature information of each patient. The condition feature information is a piece of multi-dimensional record data.
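One way to picture the time-based merge is to key each code by disease-course stage and concatenate the three feature vectors stage by stage. The stage names, the vector values, and the rule of skipping stages that are missing from any code are assumptions of this sketch, not details given in the description.

```python
# Hypothetical ordering of disease-course stages (the time dimension).
STAGES = ["first visit", "first follow-up", "second follow-up"]

# Hypothetical codes, each mapping a stage to a feature vector.
first_code = {"first visit": [0.2, 0.7], "first follow-up": [0.1, 0.9]}   # from text
second_code = {"first visit": [0.5], "first follow-up": [0.4]}            # from structured data
third_code = {"first visit": [1.0, 2.0], "first follow-up": [0.5, 0.5]}   # from images


def merge_codes(*codes):
    """Concatenate the feature vectors of all codes, one record per stage."""
    merged = []
    for stage in STAGES:
        # Only merge stages for which every code has features.
        if all(stage in code for code in codes):
            features = [value for code in codes for value in code[stage]]
            merged.append({"stage": stage, "features": features})
    return merged


condition_features = merge_codes(first_code, second_code, third_code)
for entry in condition_features:
    print(entry)
```

Each resulting entry is one multi-dimensional record for one stage, so a historical patient's record at a given stage can later be compared with the target patient's current stage.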

In step S05, a patient-based collaborative filtering algorithm is used to screen out, from multiple historical patients, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient.

The patient-based collaborative filtering algorithm computes the similarity of condition feature information between patients, and then confirms the patients with the highest similarity as similar patients. For example, if the similarity between one historical patient's condition feature information at a certain stage and the target patient's current condition feature information is 92%, which is greater than a preset threshold of 90%, that patient is a similar patient.

Screening similar patients highlights the characteristics of the target patient's current condition and takes the dynamic changes in the patient's condition into account more comprehensively, which makes the medication recommendation more accurate and better suited to the patient's current condition.

The specific steps of step S05 include:

Step S051: Input the condition feature information of each patient into a preset word vector representation model to obtain the feature information vector of each patient. In this embodiment, the preset word vector representation model can be, for example, the Word2vec model. In other embodiments, other word vector representation models can also be used, as long as the multi-dimensional record data is converted into word vectors to facilitate the subsequent similarity calculation.

Step S052: Calculate the Euclidean distance between the feature information vector of each historical patient and the feature information vector of the target patient. Calculating the similarity of each patient's condition makes it possible to find patients whose condition resembles the target patient's current condition. Because disease course, physical constitution, and other factors differ from person to person, individual differences should be fully considered when recommending medication.

In step S053, the Euclidean distance is confirmed as the degree of similarity between the historical patient and the target patient.

The specific calculation formula is:

$$d(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2}$$

Here, x represents the condition feature information vector of the historical patient; y represents the condition feature information vector of the target patient; x_i and y_i are the i-th components of x and y; d(x, y) represents the Euclidean distance between the vector x and the vector y; and n represents the total number of dimensions of the vectors.

In other embodiments, other similarity calculation methods can also be used to calculate the similarity between the historical patient and the target patient, such as cosine distance or edit distance, which is not limited here.

In step S054, at least one similar patient is screened from multiple historical patients according to the similarity of the condition, where the similarity of the condition between the similar patient and the target patient is greater than a preset threshold.

For example, the preset threshold is 90%. If the condition similarity between the historical patient Jia and the target patient is 92%, then the historical patient Jia can be confirmed as a similar patient to the target patient.
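Steps S052 to S054 can be sketched as follows. The description treats the Euclidean distance as the similarity and compares it with a percentage threshold without saying how a distance becomes a percentage, so the mapping 1 / (1 + d) used here is purely an assumption, as are the vectors and patient labels.

```python
import math


def euclidean(x, y):
    """Euclidean distance between two vectors of equal length."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same number of dimensions")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def similarity(x, y):
    """Assumed mapping from distance to a score in (0, 1]; 1 means identical."""
    return 1.0 / (1.0 + euclidean(x, y))


def similar_patients(target, historical, threshold=0.9):
    """Return the historical patients whose similarity exceeds the threshold."""
    return sorted(
        pid for pid, vec in historical.items() if similarity(vec, target) > threshold
    )


# Hypothetical feature information vectors.
target_vector = [0.2, 0.7, 0.5]
historical_vectors = {
    "A": [0.2, 0.75, 0.5],
    "B": [0.9, 0.1, 0.3],
    "C": [0.25, 0.7, 0.45],
}

selected = similar_patients(target_vector, historical_vectors)
print(selected)
```

Any monotonically decreasing mapping from distance to similarity would preserve the ranking of patients; only the meaning of the threshold value changes with the mapping chosen.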

Step S06: Generate a second drug recommendation result based on the medication information of similar patients.

In one embodiment, the medication records of similar patients are first obtained from a preset database and segmented into multiple words; the named entity recognition algorithm then identifies the medication information of the similar patients from those words. The database here can be, for example, a hospital's case database. For example, if the medication information of similar patient B is {6123457, D120, D130}, the second drug recommendation result is D120 and D130.

Step S06 specifically includes:

Step S061: find diseases associated with the disease from a preset disease-drug directed connection graph; step S062: obtain the medication information of the associated diseases and of the disease from the directed connection graph; step S063: generate the second drug recommendation result according to the medication information of the associated diseases and of the disease, together with the medication information of the similar patients.

In this embodiment, the disease-drug directed connection graph is built by using NER to mine and mark the associations between diseases and the corresponding drug preparations (for example, statins can be used to lower blood lipids), forming vertices and edges, which are then fused with a disease-disease graph network. The result is a directed connection graph with diseases and their corresponding drugs as vertices and the associations as edges, which also records associations between diseases, such as complications.
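Steps S061 to S063 can be sketched with the directed graph stored as an adjacency list. The graph contents, the code `D200`, and the rule of taking the union of the similar patients' drugs with the graph-derived drugs are illustrative assumptions; the description does not specify how the three sources are combined.

```python
# Hypothetical directed graph: typed vertex -> list of typed successor vertices.
# The disease-to-disease edge and all drug codes here are illustrative only.
GRAPH = {
    ("disease", "hypertension"): [
        ("disease", "hyperlipidemia"),  # associated disease
        ("drug", "D130"),               # nifedipine, per the mapping example
    ],
    ("disease", "hyperlipidemia"): [
        ("drug", "D200"),               # hypothetical code for a statin
    ],
}


def related_diseases(disease):
    """S061: diseases reachable by one edge from the given disease."""
    edges = GRAPH.get(("disease", disease), [])
    return [name for kind, name in edges if kind == "disease"]


def drugs_for(disease):
    """S062: drugs connected to the given disease."""
    edges = GRAPH.get(("disease", disease), [])
    return [name for kind, name in edges if kind == "drug"]


def second_recommendation(disease, similar_patient_drugs):
    """S063: combine similar patients' drugs with graph-derived drugs."""
    result = list(similar_patient_drugs)
    for current in [disease] + related_diseases(disease):
        for drug in drugs_for(current):
            if drug not in result:
                result.append(drug)
    return result


second_result = second_recommendation("hypertension", ["D120", "D130"])
print(second_result)
```

Keeping diseases and drugs as typed vertices in one structure mirrors the single fused graph described above, while still letting each step filter by vertex type.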

In one embodiment, the first drug recommendation result is first merged with the second drug recommendation result, and then duplicate drugs are deleted from the merged multiple drugs to obtain a personalized drug recommendation result.

For example, if the first drug recommendation result is (x1, x2) and the second drug recommendation result is (x2, y1, z1), merging gives (x1, x2, x2, y1, z1); after one duplicate x2 is deleted, the final personalized drug recommendation result is (x1, x2, y1, z1).

The specific steps of step S07 include:

Step S071: fuse the first drug recommendation result and the second drug recommendation result to obtain a fused drug recommendation result; step S072: compare the fused drug recommendation result with preset mutually exclusive drug groups to determine whether the fused result contains a mutually exclusive drug group; step S073: if it does, adjust the fused drug recommendation result with a same-class drug substitution strategy to eliminate the mutually exclusive drug group; step S074: generate the personalized drug recommendation result for the target patient based on the adjusted fused drug recommendation result.

For example, combining the antibiotic chloramphenicol with sulfonylurea hypoglycemic agents can cause hypoglycemia, so they form a mutually exclusive drug group and cannot be taken at the same time; aspirin and indomethacin are also a mutually exclusive drug group.

Further, step S075 follows step S072: if the fused result contains no mutually exclusive drug group, the personalized drug recommendation result for the target patient is generated directly from the fused drug recommendation result.
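Steps S071 to S075 can be sketched as below, reusing the placeholder drug names from the merging example. The substitute table, the rule of keeping the first drug of a conflicting group, and the rule of dropping a drug that has no known substitute are assumptions; the description names a substitution strategy without detailing it.

```python
def fuse(first, second):
    """S071: merge two recommendation lists and drop duplicates, keeping order."""
    fused = []
    for drug in list(first) + list(second):
        if drug not in fused:
            fused.append(drug)
    return fused


def resolve_conflicts(drugs, exclusive_groups, substitutes):
    """S072-S075: replace drugs that clash within a mutually exclusive group."""
    result = list(drugs)
    for group in exclusive_groups:
        present = [d for d in result if d in group]
        # Keep the first member found; substitute or drop the others.
        for drug in present[1:]:
            index = result.index(drug)
            replacement = substitutes.get(drug)
            if replacement is not None and replacement not in result:
                result[index] = replacement
            else:
                result.pop(index)
    return result


# Hypothetical mutually exclusive group and same-class substitute.
EXCLUSIVE_GROUPS = [frozenset({"x2", "z1"})]
SUBSTITUTES = {"z1": "z2"}

fused = fuse(["x1", "x2"], ["x2", "y1", "z1"])
personalized = resolve_conflicts(fused, EXCLUSIVE_GROUPS, SUBSTITUTES)
print(fused)
print(personalized)
```

This sketch does not re-check whether a substitute introduces a new conflict with another group; a fuller version would repeat the check until the list is stable.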

The embodiment of the present application provides a personalized accurate medication recommendation device, which is used to implement the above-mentioned personalized accurate medication recommendation method. As shown in FIG. 2, the device includes an acquisition unit 10, an identification unit 20, a first screening unit 30, a processing unit 40, a second screening unit 50, a generating unit 60, and a fusion unit 70.

The acquisition unit 10 is used to acquire medical record data of multiple patients suffering from the same disease, the medical record data including structured data, text data, and image data, wherein the patients include historical patients and a target patient who currently needs a medication recommendation.

The identification unit 20 is configured to obtain each patient's medication information from that patient's text data by using a named entity recognition algorithm.

The first screening unit 30 is configured to screen the medication information of the multiple historical patients with a drug-based collaborative filtering algorithm to obtain the first drug recommendation result for the target patient. The drug-based collaborative filtering algorithm determines how historical patients score certain drugs by looking for similarity between drugs, and then recommends the several similar drugs with the highest scores to the target patient. The first drug recommendation result includes at least one drug. In one embodiment, the drugs in the first drug recommendation result are those whose score is higher than a preset value, filtered from the medication information of multiple historical patients suffering from the same disease. Historical patients are patients who have already been treated for the same disease as the target patient.

Understandably, collaborative filtering is a method of predicting the medications of target patients by collecting medication information from many patients. For example, if drug a is used to treat a certain disease with a high score, the system will consider that drug a is suitable for the treatment of the disease. For high blood pressure, for instance, the suitable drugs would be the antihypertensive drugs with better efficacy.

The processing unit 40 is configured to merge the medical record data of the patient to obtain the characteristic information of the patient's condition. Among them, the patient's condition feature information includes first feature information, second feature information, and third feature information. The condition feature information is a piece of multi-dimensional record data, which is obtained by combining the first feature information, the second feature information, and the third feature information of each patient based on the time dimension information. The first characteristic information is extracted from the text data of the medical record data, the second characteristic information is extracted from the structured data of the medical record data, and the third characteristic information is extracted from the image data of the medical record data.

The second screening unit 50 is configured to screen out, from the multiple historical patients and with a patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient. The patient-based collaborative filtering algorithm computes the similarity of condition feature information between patients, and then confirms the patients with the highest similarity as similar patients. For example, if the similarity between one historical patient's condition feature information at a certain stage and the target patient's current condition feature information is 92%, which is greater than a preset threshold of 90%, that patient is a similar patient.

The generating unit 60 is configured to generate a second drug recommendation result according to the medication information of the similar patients.

The fusion unit 70 is configured to perform fusion processing on the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient.

Optionally, the processing unit 40 includes a conversion sub-unit, a processing sub-unit, a compression sub-unit, a pooling sub-unit, and a merging sub-unit.

The transformation subunit is used to transform the medication information of each identified patient into corresponding numerical data according to the preset mapping table. Understandably, the drug is mapped to a value to facilitate subsequent calculations, such as mapping "nifedipine" to "D130", mapping "levoamlodipine besylate" to "D131" and so on.

The processing subunit is used to form a sparse matrix from the structured data in the medical record data of each patient according to its time sequence. Specifically, a variational autoencoder (VAE) is used to process the structured data and compress it into a sparse matrix according to the timing information. A VAE is a kind of autoencoder whose encoding reduces the dimensionality of high-dimensional data. The structured data of each patient is represented as Xvae = {x1, x2, ..., xi, ..., xn}, where each xi is an input vector whose elements are factors related to the patient, such as heart rate, blood pressure, and laboratory test values for creatinine, blood sugar, and urea, and X represents a disease.
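As an illustration of the time-ordered layout only, the following Python sketch arranges structured measurements into a matrix with one row per time point. It does not implement the variational autoencoder; the feature names in `FEATURES`, the `build_time_matrix` helper, the use of ISO date strings, and zero-filling of missing measurements are all assumptions.

```python
from typing import Dict, List, Tuple

# Hypothetical column order; the description lists these factors as examples.
FEATURES = ["heart_rate", "blood_pressure", "creatinine", "blood_sugar", "urea"]

Record = Tuple[str, Dict[str, float]]  # (ISO date, {feature name: value})


def build_time_matrix(records: List[Record]) -> Tuple[List[str], List[List[float]]]:
    """Return (dates, matrix) with one row per record, sorted by date.

    Measurements that were not taken at a time point stay 0.0, which is
    what makes the resulting matrix sparse.
    """
    ordered = sorted(records, key=lambda record: record[0])
    matrix = []
    for _, measurements in ordered:
        row = [0.0] * len(FEATURES)
        for name, value in measurements.items():
            row[FEATURES.index(name)] = value
        matrix.append(row)
    return [date for date, _ in ordered], matrix


dates, matrix = build_time_matrix([
    ("2019-03-02", {"creatinine": 88.0}),
    ("2019-03-01", {"heart_rate": 72.0, "blood_pressure": 135.0}),
])
```

Each row of `matrix` plays the role of one input vector xi; the dates are returned separately so that the time dimension can be carried into later steps.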

The compression subunit is used to compress and encode the numerical data and the sparse matrix by using the variational autoencoder to obtain the first code and the second code of the patient. The first code includes first feature information and time dimension information derived from text data. Similarly, the second code includes second feature information and time dimension information derived from structured data.

The pooling subunit is used to pool the image data using a preset convolutional neural network to obtain the third code of the patient.
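The description does not specify the architecture of the preset convolutional neural network. The Python sketch below shows only the pooling operation itself, as non-overlapping 2x2 max pooling over a single-channel feature map; the window size and the function name `max_pool_2x2` are assumptions.

```python
from typing import List

Matrix = List[List[float]]


def max_pool_2x2(feature_map: Matrix) -> Matrix:
    """Downsample by taking the maximum of each non-overlapping 2x2 window.

    Trailing rows or columns that do not fill a whole window are dropped.
    """
    rows = len(feature_map) // 2
    cols = len(feature_map[0]) // 2 if feature_map else 0
    return [
        [
            max(
                feature_map[2 * r][2 * c],
                feature_map[2 * r][2 * c + 1],
                feature_map[2 * r + 1][2 * c],
                feature_map[2 * r + 1][2 * c + 1],
            )
            for c in range(cols)
        ]
        for r in range(rows)
    ]


pooled = max_pool_2x2([
    [1, 3, 2, 0],
    [4, 2, 1, 1],
    [0, 0, 5, 6],
    [1, 2, 7, 8],
])
```

Pooling halves each spatial dimension while keeping the strongest response in every window, which is how the image data is reduced to a compact code.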

The merging subunit is used to merge the first code, the second code, and the third code to obtain the condition feature information of each patient. Specifically, the first feature information, the second feature information, and the third feature information of each patient are merged based on the time dimension information to obtain the condition feature information of each patient. The condition feature information is a piece of multi-dimensional record data.
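One way to read "merged based on the time dimension information" is to align the three codes on their timestamps and concatenate them. The Python sketch below follows that reading; the dictionary layout, the fixed vector lengths in `dims`, and zero-filling a modality that has no entry at a timestamp are assumptions rather than details given in the description.

```python
from typing import Dict, List, Tuple

Code = Dict[str, List[float]]  # timestamp -> feature vector


def merge_codes(
    first: Code, second: Code, third: Code, dims: Tuple[int, int, int] = (2, 2, 2)
) -> List[Tuple[str, List[float]]]:
    """Concatenate the three codes per timestamp, in time order.

    A modality with no entry at a timestamp is zero-filled (an assumption).
    """
    timestamps = sorted(set(first) | set(second) | set(third))
    merged = []
    for t in timestamps:
        row: List[float] = []
        for code, dim in zip((first, second, third), dims):
            row.extend(code.get(t, [0.0] * dim))
        merged.append((t, row))
    return merged


record = merge_codes(
    {"2019-03-01": [0.1, 0.2]},
    {"2019-03-01": [0.3, 0.4], "2019-03-02": [0.5, 0.6]},
    {"2019-03-02": [0.7, 0.8]},
)
```

The result is a time-ordered list of rows, each combining text, structured, and image features, which matches the description of a multi-dimensional record.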

Optionally, the second screening unit 50 includes a preprocessing subunit, a first calculation subunit, a confirmation subunit, and a screening subunit.

The preprocessing subunit is used to input the condition feature information of each patient into a preset word vector representation model to obtain the feature information vector of each patient. In this embodiment, the preset word vector representation model can be, for example, the Word2vec model. In other embodiments, other word vector representation models can also be used, so that the multi-dimensional record data is converted into word vectors to facilitate subsequent similarity calculations.

The first calculation subunit is used to calculate the Euclidean distance between the feature information vector of each historical patient and the feature information vector of the target patient. By calculating the similarity of each patient's condition, patients whose condition is similar to the target patient's current condition can be found. Because disease course, physical constitution, and other factors differ from person to person, individual differences should be fully considered when recommending medication.

The confirmation subunit is used to confirm the Euclidean distance as the similarity between the historical patient and the target patient.

The specific calculation formula is:

The screening subunit is used to screen out at least one similar patient from the multiple historical patients according to the condition similarity, where the condition similarity between the similar patient and the target patient is greater than a preset threshold.
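The calculation and screening steps can be sketched as follows. The Euclidean distance is the standard one; because a distance shrinks as patients become more alike while the screening keeps patients whose similarity exceeds a threshold, this sketch converts distance to a score with 1 / (1 + d). That conversion, the function names, and the example vectors are assumptions and are not taken from the description.

```python
import math
from typing import Dict, List


def euclidean_distance(a: List[float], b: List[float]) -> float:
    """Standard Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def similarity(a: List[float], b: List[float]) -> float:
    """Score in (0, 1]; identical vectors score 1.0.

    The 1 / (1 + d) conversion is an assumption, not taken from the text.
    """
    return 1.0 / (1.0 + euclidean_distance(a, b))


def screen_similar_patients(
    target: List[float], historical: Dict[str, List[float]], threshold: float
) -> List[str]:
    """Return IDs of historical patients whose similarity exceeds the threshold."""
    return [pid for pid, vec in historical.items() if similarity(vec, target) > threshold]


target_vector = [1.0, 2.0]
historical_vectors = {"p1": [1.0, 2.0], "p2": [4.0, 6.0], "p3": [1.0, 2.05]}
similar = screen_similar_patients(target_vector, historical_vectors, threshold=0.9)
```

With a threshold of 0.9, in line with the 90% example given for the second screening unit, only patients whose vectors lie very close to the target are kept.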

Optionally, the generation unit 60 includes a search subunit, an acquisition subunit, and a first generation subunit.

The search subunit is used to find the associated diseases of the disease from a preset disease-drug directed connection graph; the acquisition subunit is used to obtain the associated diseases and the medication information of the disease according to the directed connection graph; and the first generation subunit is used to generate the second drug recommendation result based on the associated diseases, the medication information of the disease, and the medication information of the similar patients.
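The description does not state how the graph information and the similar patients' medication are combined. The Python sketch below shows one possible reading: drugs reachable in the graph from the disease and its associated diseases form the candidate set, and drugs used by similar patients are ranked by frequency within that set. The graph contents, the placeholder disease names, the codes "D200" and "D999", and the ranking rule are all hypothetical.

```python
from collections import Counter
from typing import Dict, List, Set

# Hypothetical directed graph: disease -> associated diseases, disease -> drugs.
ASSOCIATED: Dict[str, List[str]] = {"disease_A": ["disease_B"]}
DRUGS_FOR: Dict[str, List[str]] = {
    "disease_A": ["D130", "D131"],
    "disease_B": ["D200"],
}


def second_recommendation(disease: str, similar_patient_drugs: List[List[str]]) -> List[str]:
    """Rank drugs used by similar patients, keeping only graph-supported ones."""
    candidates: Set[str] = set(DRUGS_FOR.get(disease, []))
    for related in ASSOCIATED.get(disease, []):
        candidates.update(DRUGS_FOR.get(related, []))
    counts = Counter(
        drug for drugs in similar_patient_drugs for drug in drugs if drug in candidates
    )
    # Most frequently used first; ties broken by code for a stable order.
    return sorted(counts, key=lambda d: (-counts[d], d))


result = second_recommendation(
    "disease_A", [["D130", "D200"], ["D130", "D999"], ["D131"]]
)
```

Here "D999" is dropped because it is not connected to the disease or its associated disease in the graph, while "D130" ranks first because two similar patients used it.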

Optionally, the fusion unit 70 includes a fusion subunit, a comparison subunit, an adjustment subunit, and a second generation subunit.

The fusion subunit is used to merge the first drug recommendation result with the second drug recommendation result to obtain a fused drug recommendation result; the comparison subunit is used to compare the fused drug recommendation result with preset mutually exclusive drug groups to determine whether the fused drug recommendation result contains a mutually exclusive drug group; the adjustment subunit is used to, if such a group exists, adjust the fused drug recommendation result by using a same-class drug replacement strategy so as to eliminate the mutually exclusive drug group; and the second generation subunit is used to generate the personalized drug recommendation result of the target patient based on the adjusted fused drug recommendation result.

The fusion unit 70 also includes a third generation subunit, which is configured to generate the personalized drug recommendation result of the target patient directly from the fused drug recommendation result if no mutually exclusive drug group exists.
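The fusion, comparison, and adjustment steps can be sketched in Python as follows. The exclusive group, the substitute table, the codes "D300" and "D301", the choice to replace the group member listed last, and the fallback of dropping a drug that has no safe substitute are all assumptions made for illustration; the description only says that a replacement strategy is applied when a mutually exclusive group is found.

```python
from typing import Dict, FrozenSet, List, Optional

# Hypothetical data: one mutually exclusive pair and a table of substitutes.
EXCLUSIVE_GROUPS: List[FrozenSet[str]] = [frozenset({"D130", "D300"})]
SUBSTITUTES: Dict[str, List[str]] = {"D300": ["D301"]}


def fuse(first: List[str], second: List[str]) -> List[str]:
    """Merge both recommendation lists, dropping duplicates but keeping order."""
    return list(dict.fromkeys(first + second))


def find_conflict(drugs: List[str]) -> Optional[FrozenSet[str]]:
    """Return the first exclusive group fully contained in `drugs`, else None."""
    present = set(drugs)
    return next((g for g in EXCLUSIVE_GROUPS if g <= present), None)


def creates_conflict(drug: str, others: List[str]) -> bool:
    """True if adding `drug` to `others` completes some exclusive group."""
    present = set(others) | {drug}
    return any(drug in g and g <= present for g in EXCLUSIVE_GROUPS)


def resolve(drugs: List[str]) -> List[str]:
    """Replace conflicting drugs with safe substitutes until no conflict remains."""
    drugs = list(drugs)
    group = find_conflict(drugs)
    while group is not None:
        victim = max(group, key=drugs.index)  # group member listed last
        rest = [d for d in drugs if d != victim]
        substitute = next(
            (s for s in SUBSTITUTES.get(victim, [])
             if s not in rest and not creates_conflict(s, rest)),
            None,
        )
        position = drugs.index(victim)
        if substitute is None:
            drugs.pop(position)  # no safe substitute: drop the drug (assumption)
        else:
            drugs[position] = substitute
        group = find_conflict(drugs)
    return drugs


fused = fuse(["D130", "D131"], ["D300", "D130"])
final = resolve(fused)
```

Each pass removes one conflict without introducing a new one, so the loop terminates; when `find_conflict` returns `None` on the first check, the fused list is returned unchanged, which corresponds to the case where no mutually exclusive group exists.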

The embodiment of the present application provides a storage medium that includes a stored program. The storage medium may be a computer-readable storage medium, which may be non-volatile or volatile.

When the program is running, the device where the storage medium is located is controlled to perform the following steps:

Acquire medical record data of multiple patients suffering from the same disease, where the medical record data includes structured data, text data, and image data, and the patients include historical patients and a target patient who currently needs a medication recommendation; obtain the medication information of each patient from the text data of that patient by using a named entity recognition algorithm; screen the medication information of the multiple historical patients based on a drug-based collaborative filtering algorithm to obtain the first drug recommendation result of the target patient; merge the medical record data of the patient to obtain the condition feature information of the patient; screen out, from the multiple historical patients and based on a patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient; generate a second drug recommendation result according to the medication information of the similar patient; and fuse the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient.

Optionally, when the program is running, the step of controlling the device where the storage medium is located to merge the patient's medical record data to obtain the patient's condition feature information includes:

Convert the medication information identified for each patient into corresponding numerical data according to the preset mapping table; form a sparse matrix from the structured data in the medical record data of each patient according to its time sequence; use a variational autoencoder to compress and encode the numerical data and the sparse matrix to obtain the first code and the second code of the patient; use a preset convolutional neural network to pool the image data to obtain the third code of the patient; and merge the first code, the second code, and the third code to obtain the condition feature information of each patient.

Optionally, the first code includes first feature information derived from the text data and time dimension information, the second code includes second feature information derived from the structured data and the time dimension information, and the third code includes third feature information derived from the image data and the time dimension information; when the program is running, the step of controlling the device where the storage medium is located to merge the first code, the second code, and the third code to obtain the condition feature information of each patient includes:

Based on the time dimension information, the first feature information, the second feature information, and the third feature information of each patient are combined and processed to obtain the condition feature information of each patient. The condition feature information is a piece of multi-dimensional record data.

Optionally, when the program is running, the step of controlling the device where the storage medium is located to screen out, from the multiple historical patients and based on the patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient includes:

Input the condition feature information of each patient into the preset word vector representation model to obtain the feature information vector of each patient; calculate the Euclidean distance between the feature information vector of each historical patient and the feature information vector of the target patient; confirm the Euclidean distance as the condition similarity between the historical patient and the target patient; and screen out at least one similar patient from the multiple historical patients according to the condition similarity, where the condition similarity between the similar patient and the target patient is greater than the preset threshold.

Optionally, when the program is running, controlling the device where the storage medium is located to execute the step of generating the second drug recommendation result based on the medication information of similar patients includes:

Find the associated diseases of the disease from the preset disease-drug directed connection graph; obtain the associated diseases and the medication information of the disease according to the directed connection graph; and generate the second drug recommendation result based on the associated diseases, the medication information of the disease, and the medication information of the similar patients.

Fig. 3 is a schematic diagram of a computer device provided by an embodiment of the present application. As shown in FIG. 3, the computer device 100 of this embodiment includes a processor 101, a memory 102, and a computer program 103 that is stored in the memory 102 and can run on the processor 101. When the processor 101 executes the computer program 103, the personalized and accurate medication recommendation method of the embodiment is implemented; to avoid repetition, it is not described again here. Alternatively, when the computer program is executed by the processor 101, the functions of each module/unit of the personalized and accurate medication recommendation device of the embodiment are implemented; to avoid repetition, they are not described again here.

The computer device 100 may be a computing device such as a desktop computer, a notebook, a palmtop computer, or a cloud server. The computer device may include, but is not limited to, a processor 101 and a memory 102. Those skilled in the art can understand that FIG. 3 is only an example of the computer device 100 and does not constitute a limitation on the computer device 100. It may include more or fewer components than those shown in the figure, or combine certain components, or use different components. For example, the computer device may also include input and output devices, network access devices, buses, and so on.

The processor 101 may be a central processing unit (CPU), another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or another programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, etc. The general-purpose processor may be a microprocessor or any conventional processor.

The memory 102 may be an internal storage unit of the computer device 100, such as a hard disk or a memory of the computer device 100. The memory 102 may also be an external storage device of the computer device 100, such as a plug-in hard disk equipped on the computer device 100, a smart media card (SMC), a secure digital (SD) card, a flash memory card (Flash Card), and so on. Further, the memory 102 may include both an internal storage unit of the computer device 100 and an external storage device. The memory 102 is used to store the computer program and other programs and data required by the computer device. The memory 102 can also be used to temporarily store data that has been output or will be output.

Those skilled in the art can clearly understand that, for the convenience and conciseness of the description, the specific working process of the above-described system, device, and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here.

In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. For example, the device embodiments described above are only illustrative: the division of the units is only a logical function division, and there may be other divisions in actual implementation. For instance, multiple units or components may be combined or integrated into another system, or some features may be ignored or not implemented. In addition, the displayed or discussed mutual coupling, direct coupling, or communication connection may be indirect coupling or communication connection through some interfaces, devices, or units, and may be in electrical, mechanical, or other forms.

The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.

In addition, the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit may be implemented in the form of hardware, or may be implemented in the form of hardware plus software functional units.

The above-mentioned integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes several instructions to make a computer device (which may be a personal computer, a server, a network device, etc.) or a processor execute some of the steps of the methods described in the embodiments of the present application. The aforementioned storage media include a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disk, and other media that can store program code.

The above descriptions are only preferred embodiments of this application and are not intended to limit this application. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of this application shall be included within the scope of protection of this application.

Claims

1. A personalized and accurate medication recommendation method, wherein the method includes:

Acquiring medical record data of multiple patients suffering from the same disease, the medical record data including structured data, text data, and image data, wherein the patients include historical patients and a target patient who currently needs a medication recommendation;

Obtaining the medication information of the patient from the text data of each patient by using a named entity recognition algorithm;

Screening the medication information of the multiple historical patients based on a drug-based collaborative filtering algorithm to obtain a first drug recommendation result of the target patient;

Merging the medical record data of the patient to obtain the condition feature information of the patient;

Screening out, from the multiple historical patients and based on a patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient;

Generating a second drug recommendation result according to the medication information of the similar patient;

Fusing the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient.
2. The method according to claim 1, wherein the step of merging the medical record data of the patient to obtain the condition feature information of the patient comprises:

Converting the medication information of each identified patient into corresponding numerical data according to a preset mapping table;

Forming a sparse matrix according to the time sequence of the structured data in the medical record data of each patient;

Using a variational autoencoder to compress and encode the numerical data and the sparse matrix to obtain the first code and the second code of the patient;

Pooling the image data by using a preset convolutional neural network to obtain the third code of the patient;

Merging the first code, the second code, and the third code to obtain the condition feature information of each patient.
3. The method according to claim 2, wherein the first code includes first feature information derived from the text data and time dimension information, the second code includes second feature information derived from the structured data and the time dimension information, and the third code includes third feature information derived from the image data and the time dimension information; the step of merging the first code, the second code, and the third code to obtain the condition feature information of each patient includes:

Merging the first feature information, the second feature information, and the third feature information of each patient based on the time dimension information to obtain the condition feature information of each patient, the condition feature information being a piece of multi-dimensional record data.
4. The method according to claim 1, wherein the step of screening out, from the multiple historical patients and based on the patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient includes:

Inputting the condition feature information of each patient into a preset word vector representation model to obtain the feature information vector of each patient;

Calculating the Euclidean distance between the feature information vector of each historical patient and the feature information vector of the target patient;

Confirming the Euclidean distance as the condition similarity between the historical patient and the target patient;

Screening out at least one similar patient from the multiple historical patients according to the condition similarity, wherein the condition similarity between the similar patient and the target patient is greater than a preset threshold.
5. The method according to claim 1, wherein the step of generating a second drug recommendation result according to the medication information of the similar patient comprises:

Finding the associated diseases of the disease from a preset disease-drug directed connection graph;

Acquiring the associated diseases and the medication information of the disease according to the directed connection graph;

Generating the second drug recommendation result according to the associated diseases, the medication information of the disease, and the medication information of the similar patient.
6. The method according to claim 1, wherein the step of fusing the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient comprises:

Fusing the first drug recommendation result and the second drug recommendation result to obtain a fused drug recommendation result;

Comparing the fused drug recommendation result with preset mutually exclusive drug groups to determine whether the fused drug recommendation result contains a mutually exclusive drug group;

If such a group exists, adjusting the fused drug recommendation result by using a same-class drug replacement strategy to eliminate the mutually exclusive drug group;

Generating the personalized drug recommendation result of the target patient according to the adjusted fused drug recommendation result.
7. The method according to claim 2, wherein the step of pooling the image data by using a preset convolutional neural network to obtain the third code of the patient comprises:

Acquiring the three primary color values of each pixel in the image data;

Extracting characteristic parts according to the three primary color values of all the pixels by using the preset convolutional neural network to form the third code of the patient.
8. A personalized and accurate medication recommendation device, wherein the device includes:

An acquiring unit, configured to acquire medical record data of multiple patients suffering from the same disease, the medical record data including structured data, text data, and image data, wherein the patients include historical patients and a target patient who currently needs a medication recommendation;

A recognition unit, configured to obtain the medication information of the patient from the text data of each patient by using a named entity recognition algorithm;

A first screening unit, configured to screen the medication information of the multiple historical patients based on a drug-based collaborative filtering algorithm to obtain a first drug recommendation result of the target patient;

A processing unit, configured to merge the medical record data of the patient to obtain the condition feature information of the patient;

A second screening unit, configured to screen out, from the multiple historical patients and based on a patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient;

A generating unit, configured to generate a second drug recommendation result according to the medication information of the similar patient;

A fusion unit, configured to fuse the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient.
9. A storage medium, the storage medium including a stored program, wherein, when the program is running, the device where the storage medium is located is controlled to perform the following steps:

Acquiring medical record data of multiple patients suffering from the same disease, the medical record data including structured data, text data, and image data, wherein the patients include historical patients and a target patient who currently needs a medication recommendation;

Obtaining the medication information of the patient from the text data of each patient by using a named entity recognition algorithm;

Screening the medication information of the multiple historical patients based on a drug-based collaborative filtering algorithm to obtain a first drug recommendation result of the target patient;

Merging the medical record data of the patient to obtain the condition feature information of the patient;

Screening out, from the multiple historical patients and based on a patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient;

Generating a second drug recommendation result according to the medication information of the similar patient;

Fusing the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient.
10. The storage medium according to claim 9, wherein the step of merging the medical record data of the patient to obtain the condition feature information of the patient comprises:

Converting the medication information of each identified patient into corresponding numerical data according to a preset mapping table;

Forming a sparse matrix according to the time sequence of the structured data in the medical record data of each patient;

Using a variational autoencoder to compress and encode the numerical data and the sparse matrix to obtain the first code and the second code of the patient;

Pooling the image data by using a preset convolutional neural network to obtain the third code of the patient;

Merging the first code, the second code, and the third code to obtain the condition feature information of each patient.
11. The storage medium according to claim 10, wherein the first code includes first feature information derived from the text data and time dimension information, the second code includes second feature information derived from the structured data and the time dimension information, and the third code includes third feature information derived from the image data and the time dimension information; the step of merging the first code, the second code, and the third code to obtain the condition feature information of each patient includes:

Merging the first feature information, the second feature information, and the third feature information of each patient based on the time dimension information to obtain the condition feature information of each patient, the condition feature information being a piece of multi-dimensional record data.
12. The storage medium according to claim 9, wherein the step of screening out, from the multiple historical patients and based on the patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient includes:

Inputting the condition feature information of each patient into a preset word vector representation model to obtain the feature information vector of each patient;

Calculating the Euclidean distance between the feature information vector of each historical patient and the feature information vector of the target patient;

Confirming the Euclidean distance as the condition similarity between the historical patient and the target patient;

Screening out at least one similar patient from the multiple historical patients according to the condition similarity, wherein the condition similarity between the similar patient and the target patient is greater than a preset threshold.
13. The storage medium according to claim 9, wherein the step of generating a second drug recommendation result according to the medication information of the similar patient comprises:

Finding the associated diseases of the disease from a preset disease-drug directed connection graph;

Acquiring the associated diseases and the medication information of the disease according to the directed connection graph;

Generating the second drug recommendation result according to the associated diseases, the medication information of the disease, and the medication information of the similar patient.
14. The storage medium according to claim 9, wherein the step of fusing the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient comprises:

Fusing the first drug recommendation result and the second drug recommendation result to obtain a fused drug recommendation result;

Comparing the fused drug recommendation result with preset mutually exclusive drug groups to determine whether the fused drug recommendation result contains a mutually exclusive drug group;

If such a group exists, adjusting the fused drug recommendation result by using a same-class drug replacement strategy to eliminate the mutually exclusive drug group;

Generating the personalized drug recommendation result of the target patient according to the adjusted fused drug recommendation result.
15. A computer device, including a memory, a processor, and a computer program that is stored in the memory and can run on the processor, wherein the processor implements the following steps when executing the computer program:

Acquiring medical record data of multiple patients suffering from the same disease, the medical record data including structured data, text data, and image data, wherein the patients include historical patients and a target patient who currently needs a medication recommendation;

Obtaining the medication information of the patient from the text data of each patient by using a named entity recognition algorithm;

Screening the medication information of the multiple historical patients based on a drug-based collaborative filtering algorithm to obtain a first drug recommendation result of the target patient;

Merging the medical record data of the patient to obtain the condition feature information of the patient;

Screening out, from the multiple historical patients and based on a patient-based collaborative filtering algorithm, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient;

Generating a second drug recommendation result according to the medication information of the similar patient;

Fusing the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient.
16. The computer device according to claim 15, wherein the step of merging the medical record data of the patient to obtain the condition feature information of the patient comprises:

Converting the medication information of each identified patient into corresponding numerical data according to a preset mapping table;

Forming a sparse matrix according to the time sequence of the structured data in the medical record data of each patient;

Using a variational autoencoder to compress and encode the numerical data and the sparse matrix to obtain the first code and the second code of the patient;

Pooling the image data by using a preset convolutional neural network to obtain the third code of the patient;

Merging the first code, the second code, and the third code to obtain the condition feature information of each patient.
The computer device according to claim 16, wherein the first code includes first feature information and time dimension information derived from the text data, and the second code includes a second feature information derived from the structured data. Feature information and the time dimension information, the third code includes third feature information derived from the image data and the time dimension information; the first code, the second code, and the The step of combining the third code to obtain the characteristic information of each patient's condition includes:

Combining the first feature information, the second feature information, and the third feature information of each patient on the basis of the time dimension information to obtain the condition feature information of each patient, wherein the condition feature information is a multi-dimensional data record.
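One way to picture the combination on the time dimension is the sketch below. It assumes, purely for illustration, that each code is available as a dictionary from a time key to a feature list; the function name and the record field names are hypothetical.

```python
from typing import Dict, List, Optional

Features = Dict[int, List[float]]  # time key -> feature values


def combine_by_time(
    first: Features, second: Features, third: Features
) -> List[Dict[str, Optional[object]]]:
    """Align text, structured, and image features on their shared time keys.

    Returns one record per time point, in ascending time order. A modality
    with no data at a given time point is recorded as None.
    """
    times = sorted(set(first) | set(second) | set(third))
    return [
        {
            "time": t,
            "text": first.get(t),
            "structured": second.get(t),
            "image": third.get(t),
        }
        for t in times
    ]
```

Each returned record carries all three feature groups for one time point, which is one possible reading of "multi-dimensional data record".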
The computer device according to claim 15, wherein the step of selecting, by means of the patient-based collaborative filtering algorithm, from the plurality of historical patients, at least one similar patient whose condition feature information is similar to the current condition feature information of the target patient comprises:

Inputting the disease feature information of each patient into a preset word vector representation model to obtain the feature information vector of each patient;

Calculating the Euclidean distance between the feature information vector of each historical patient and the feature information vector of the target patient;

Taking the Euclidean distance as the condition similarity between the historical patient and the target patient;

Selecting at least one similar patient from the plurality of historical patients according to the condition similarity, wherein the condition similarity between the similar patient and the target patient is greater than a preset threshold.
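The selection steps above can be sketched as follows. The claim takes the Euclidean distance itself as the similarity measure; because a smaller distance means a closer match, this sketch assumes a conversion `1 / (1 + distance)` so that "greater than a preset threshold" selects the nearest patients. That conversion, the function names, and the example vectors are assumptions and are not stated in the application.

```python
import math
from typing import Dict, List, Sequence


def euclidean_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_similar_patients(
    target: Sequence[float],
    historical: Dict[str, Sequence[float]],
    threshold: float,
) -> List[str]:
    """Return IDs of historical patients whose similarity exceeds the threshold.

    Similarity is ASSUMED here to be 1 / (1 + distance), which lies in (0, 1]
    and grows as the distance shrinks.
    """
    similar = []
    for patient_id, vector in historical.items():
        similarity = 1.0 / (1.0 + euclidean_distance(target, vector))
        if similarity > threshold:
            similar.append(patient_id)
    return similar
```

For example, with a target vector `[0, 0]` and a threshold of `0.4`, a historical patient at `[1, 0]` (similarity 0.5) is kept, while one at `[3, 4]` (distance 5, similarity about 0.17) is not.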
The computer device according to claim 15, wherein the step of generating a second drug recommendation result according to the medication information of the similar patients comprises:

Finding diseases associated with the disease in a preset disease-drug directed connection graph;

Acquiring the associated disease and the medication information of the disease according to the directed connection graph;

Generating the second drug recommendation result according to the associated disease, the medication information of the disease, and the medication information of the similar patients.
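A minimal sketch of these steps is given below. The application does not specify how the graph is stored or how the three inputs are weighed, so everything here is an assumption: the directed connection graph is modeled as two adjacency dictionaries (disease to associated diseases, disease to drugs), and candidate drugs from the graph are ranked by how many similar patients used them. All names are hypothetical placeholders.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical directed edges: disease -> associated diseases, disease -> drugs.
RELATED: Dict[str, List[str]] = {"disease_x": ["disease_y"]}
DRUGS: Dict[str, List[str]] = {
    "disease_x": ["drug_a", "drug_b"],
    "disease_y": ["drug_c"],
}


def second_recommendation(
    disease: str,
    related: Dict[str, List[str]],
    drugs: Dict[str, List[str]],
    similar_patient_meds: List[List[str]],
    top_k: int = 3,
) -> List[str]:
    """Rank graph-derived candidate drugs by usage among similar patients."""
    # Follow outgoing edges to collect the disease and its associated diseases.
    diseases = [disease] + related.get(disease, [])
    # Candidate drugs are those linked to any of these diseases in the graph.
    candidates = {d for dis in diseases for d in drugs.get(dis, [])}
    # Count each candidate at most once per similar patient.
    counts = Counter(
        d for meds in similar_patient_meds for d in set(meds) if d in candidates
    )
    # Most frequently used first; ties broken alphabetically for determinism.
    ranked = sorted(counts, key=lambda d: (-counts[d], d))
    return ranked[:top_k]
```

Restricting the ranking to graph-derived candidates means a drug used by similar patients for an unrelated condition does not enter the result; a different weighting would be equally consistent with the claim wording.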
The computer device according to claim 15, wherein the step of fusing the first drug recommendation result and the second drug recommendation result to obtain the personalized drug recommendation result of the target patient comprises:

Performing fusion processing on the first drug recommendation result and the second drug recommendation result to obtain a fused drug recommendation result;

Comparing the fused drug recommendation result with preset mutually exclusive drug groups to determine whether the fused drug recommendation result contains a mutually exclusive drug group;

If so, adjusting the fused drug recommendation result by using the same-drug replacement strategy to eliminate the mutually exclusive drug group;

Generating the personalized drug recommendation result of the target patient according to the adjusted fused drug recommendation result.
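The fusion and conflict-elimination steps might look like the sketch below. The application does not define the fusion operation or the replacement strategy in this section, so the sketch assumes an order-preserving union for fusion and a hypothetical substitute table for replacement; when no substitute is available, the conflicting drug is simply dropped. All names are illustrative.

```python
from typing import Dict, FrozenSet, List


def fuse(first: List[str], second: List[str]) -> List[str]:
    """Order-preserving union of two recommendation lists (an assumed fusion rule)."""
    return list(dict.fromkeys(first + second))


def resolve_conflicts(
    drugs: List[str],
    exclusive_groups: List[FrozenSet[str]],
    substitutes: Dict[str, str],
) -> List[str]:
    """Eliminate mutually exclusive drug groups from a recommendation list.

    For each group, the first listed member is kept. Every other member is
    replaced by its substitute if one exists and is not already in the list,
    and is removed otherwise.
    """
    result = list(drugs)
    for group in exclusive_groups:
        present = [d for d in result if d in group]
        for drug in present[1:]:
            idx = result.index(drug)
            replacement = substitutes.get(drug)
            if replacement is not None and replacement not in result:
                result[idx] = replacement
            else:
                result.pop(idx)
    return result
```

This single pass does not check whether a substitute itself belongs to another mutually exclusive group; a fuller version would repeat the comparison until no group remains.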