CN113012770A - Medicine-medicine interaction event prediction method, system, terminal and readable storage medium based on multi-modal deep neural network - Google Patents

Medicine-medicine interaction event prediction method, system, terminal and readable storage medium based on multi-modal deep neural network Download PDF

Info

Publication number
CN113012770A
CN113012770A CN202110287239.1A CN202110287239A CN113012770A CN 113012770 A CN113012770 A CN 113012770A CN 202110287239 A CN202110287239 A CN 202110287239A CN 113012770 A CN113012770 A CN 113012770A
Authority
CN
China
Prior art keywords
drug
medicine
interaction event
knowledge graph
embedded representation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110287239.1A
Other languages
Chinese (zh)
Other versions
CN113012770B (en
Inventor
高建良
吕腾飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN202110287239.1A priority Critical patent/CN113012770B/en
Publication of CN113012770A publication Critical patent/CN113012770A/en
Application granted granted Critical
Publication of CN113012770B publication Critical patent/CN113012770B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/30Prediction of properties of chemical compounds, compositions or mixtures
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/50Molecular design, e.g. of drugs
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/70Machine learning, data mining or chemometrics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/40ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage

Landscapes

  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Medical Informatics (AREA)
  • Toxicology (AREA)
  • Public Health (AREA)
  • Primary Health Care (AREA)
  • Epidemiology (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention discloses a method, a system, a terminal and a readable storage medium for predicting a drug-drug interaction event based on a multi-modal deep neural network, wherein the method comprises the following steps: acquiring drug-drug interaction events, drug heterogeneous characteristics and drug knowledge maps; acquiring topological embedded representation of the medicine in the knowledge graph and similar embedded representation of the medicine; splicing and fusing the topological embedded representation of the medicine and the similar embedded representation of the medicine to construct a medicine-medicine interaction event prediction model, and performing model training by using the medicine-medicine interaction event in the sample; then, predicting the drug-drug interaction event by using a prediction model; the topological embedded representation of the medicine to be predicted in the knowledge graph and the similar embedded representation of the medicine are input into a prediction model to obtain a medicine-medicine interaction event prediction result. The method utilizes the relevance and complementarity among different modal characteristics to further improve the accuracy of the prediction result.

Description

Medicine-medicine interaction event prediction method, system, terminal and readable storage medium based on multi-modal deep neural network
Technical Field
The invention belongs to the technical field of drug design and medicine, and particularly relates to a method, a system, a terminal and a readable storage medium for predicting drug-drug interaction events based on a multi-modal deep neural network.
Background
With the rapid increase in the number of kinds of medicines, when a plurality of medicines are taken for the treatment of diseases, it is important to ensure safety between the medicines. The problem of Drug-Drug Interaction (DDI) prediction means that when a plurality of drugs are taken simultaneously, adverse Interaction between the drugs may occur due to the influence of molecules of compounds constituting the drugs, targeting proteins of Drug action, targeting pathways, and the action of different enzymes, thereby causing adverse harm to patients or enormous medical expenses. In addition, DDI may also lead to different biological consequences and events. Therefore, accurate prediction of DDI events has become a clinically important task that can help clinicians make effective decisions and establish appropriate treatment regimens. In addition, proper use of multiple drugs can minimize the risk of treatment for the patient and bring about the synergistic benefits of the drugs.
Drug-drug interaction events come from a variety of sources, including the effects of the molecules of the compounds that make up the drug, the targeting proteins for drug action, the targeting pathways, and the action of enzymes, among others, which affect the interaction events between drugs. Each factor and the drug can be combined into an independent feature matrix, and the information aggregation of a plurality of feature matrices can provide great help for the drug-drug interaction event prediction task. However, the task of predicting drug-drug interaction events is a serious challenge due to the lack of adequate clinical data and knowledge. Therefore, research programs that find potential drug-drug interaction events have a profound impact on improving healthcare and increasing drug alertness.
However, most of the existing methods utilize the multi-source characteristics of the drugs for independent analysis, so that the reliability of the prediction result of drug-drug interaction in the existing methods still needs to be further improved.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a medicine-medicine interaction event prediction method based on a multi-mode deep neural network, which analyzes the relevance and complementarity among different modal characteristics to improve the accuracy of DDI event prediction.
In one aspect, the invention provides a method for predicting a drug-drug interaction event based on a multi-modal deep neural network, which comprises the following steps:
step S1: acquiring drug data as a sample, wherein the drug data comprises drug-drug interaction events, drug heterogeneous characteristics and a drug knowledge graph constructed based on drug attributes;
step S2: obtaining a topological embedded representation of the drug in the knowledge graph based on the drug knowledge graph; similarity calculation is carried out on the basis of the heterogeneous characteristics of the medicines to obtain similar embedded representation of the medicines;
step S3: performing splicing and fusion on the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine to construct a medicine-medicine interaction event prediction model, and performing model training by using the medicine-medicine interaction event in the sample in the step S1 to obtain a medicine-medicine interaction event prediction model;
step S4: predicting the drug-drug interaction event by using the drug-drug interaction event prediction model obtained in the step S3;
and inputting the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine in the medicine pair to be predicted into the medicine-medicine interaction event prediction model to obtain a medicine-medicine interaction event prediction result.
The technical idea of the invention is as follows: by constructing a medicine knowledge graph, a design graph neural network obtains topological embedded representation of the medicine by utilizing the topological structure and semantic information of the medicine; calculating the similarity of the heterogeneous characteristics of the medicines as similar embedded representation of the medicines by collecting the heterogeneous characteristics of the medicines; the topological embedded representation and the similar embedded representation of the medicine are spliced, multi-modal characteristic analysis is carried out through a plurality of layers of full connection layers, and the prediction of the medicine-medicine interaction event is carried out.
Certain complementarity often exists among the multi-modal characteristics, and the correlation and the complementarity among different modal characteristics can be analyzed to improve the accuracy of DDI event prediction. In addition, the medicine knowledge graph contains rich information of the medicine, including topological structure information and semantic information of the medicine, and the importance of the information is not negligible. Therefore, the invention applies the above two characteristics to the scheme of the invention to improve the reliability of the prediction result.
Optionally, the drug-drug interaction event prediction model is as follows:
Figure BDA0002980987120000021
wherein the content of the first and second substances,
Figure BDA0002980987120000022
denotes the drug di,djThe predicted outcome of the drug-drug interaction event of (a),
Figure BDA0002980987120000023
are respectively medicine di,djIs then represented by the final embedded representation of (c),
Figure BDA0002980987120000024
denotes the stitching operation, σ is a non-linear activation function softmax, W3,b3Are trainable weights and biases.
Optionally, if the drug heterogeneity characteristics comprise: substructure FsTarget FtAnd enzyme FeIn step S2, similarity calculation is performed based on the heterogeneous characteristics of the drugs to obtain a similar embedding expression of the drugs as follows:
Figure BDA0002980987120000025
wherein, the drug diIs embedded in
Figure BDA0002980987120000026
Target-like embedding
Figure BDA0002980987120000027
Intercalation similar to enzymes
Figure BDA0002980987120000028
Corresponds to Es、Et、EmRespectively a substructure similarity matrix, a target similarity matrix and an enzyme similarity matrix;
the elements in the substructure similarity matrix, the target similarity matrix and the enzyme similarity matrix are obtained by calculation by using a medicament heterogeneous characteristic and a similarity value calculation formula of a medicament pair.
Optionally, the elements in the substructure similarity matrix, the target similarity matrix, and the enzyme similarity matrix are obtained by using a Jaccard similarity calculation, where the Jaccard similarity calculation formula is as follows:
Figure BDA0002980987120000031
wherein, J (F)i,Fj) Representing a value of similarity between the heterogeneous characteristics of two drugs, Fi,FjAre the heterogeneous characteristics of the two drugs, respectively.
Optionally, the process of obtaining the topologically embedded representation of the drug in the knowledge graph based on the drug knowledge graph in step S2 is as follows:
firstly, initializing embedded representation of each node in a medicine knowledge graph, wherein the nodes comprise medicine nodes and attribute nodes;
then, calculating the weight scores of the edges of the drug nodes and the attribute nodes in the drug knowledge graph, wherein the formula is as follows:
Figure BDA0002980987120000032
wherein the content of the first and second substances,
Figure BDA0002980987120000033
is node d of the drugiAnd the weight scores of the edges corresponding to the attribute nodes,
Figure BDA0002980987120000034
respectively represent the drugs diAnd relation rinIs embedded in the representation, said relation rinFunction description indicating an attribute node, <' > indicating a Hamada product, W1And b1Respectively representing trainable weights and biases;
second, the neighbor-embedded representation of the drug node is calculated according to the different weight scores of each neighbor node:
Figure BDA0002980987120000035
wherein N iss(di) Denotes the drug diOf the sampling of the set of neighboring nodes,
Figure BDA0002980987120000036
indicating drug node diIs embedded in a representation of the neighborhood of (c),
Figure BDA0002980987120000037
denotes the drug diNeighbor node t ofnThe initial embedded representation of (a);
finally, the drug diTopological embedded representation in a knowledge graph:
Figure BDA0002980987120000038
wherein the content of the first and second substances,
Figure BDA0002980987120000039
indicating a splicing operation. Likewise, the drug d can be obtainedjTopology embedding of
Figure BDA00029809871200000310
Optionally, the drug knowledge graph constructed based on the drug attributes in step S1 is represented as follows:
Figure BDA00029809871200000311
wherein D represents a drug set, R represents a relationship set, T represents an attribute node set of the drug, D, RdtAnd t represents the drug, relationship and attribute nodes, respectively.
In another aspect, the present invention further provides a system based on the foregoing method, including:
a drug data acquisition module: for collecting drug-drug interaction events, drug heterogeneity characteristics;
the medicine knowledge graph building module is used for building a medicine knowledge graph based on the medicine attribute;
the embedded representation acquisition module of the medicine is used for acquiring topological embedded representation of the medicine in the knowledge graph based on the medicine knowledge graph; similarity calculation is carried out on the basis of the heterogeneous characteristics of the medicines to obtain similar embedded representation of the medicines;
the prediction model construction and training module is used for splicing and fusing the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine to construct a medicine-medicine interaction event prediction model, and performing model training by using the collected medicine-medicine interaction event to obtain a medicine-medicine interaction event prediction model; and a prediction module, configured to perform drug-drug interaction event prediction by using the drug-drug interaction event prediction model obtained in step S3.
In another aspect, the present invention provides a terminal, including a processor and a memory, where the memory stores a computer program, and the processor calls the computer program to execute: a method for predicting drug-drug interaction events based on a multi-modal deep neural network.
In a fourth aspect, the present invention also provides a readable storage medium storing a computer program, the computer program being invoked by a processor to perform: a method for predicting drug-drug interaction events based on a multi-modal deep neural network.
Advantageous effects
1. The invention provides a medicine-medicine interaction event prediction method based on a multi-modal deep neural network, which splices multi-modal embedded representations of medicines, efficiently analyzes the relevance and complementarity of the medicine embedded representations in different modes and brings better medicine-medicine interaction event prediction effect.
2. The invention effectively constructs the medicine knowledge graph, designs the graph neural network, obtains the embedded expression of the medicine by using the topological structure information and the semantic information in the medicine knowledge graph, and improves the accuracy of the prediction of the medicine-medicine interaction event.
3. The invention considers the heterogeneous characteristics of different medicines, can analyze the relation of medicine-medicine interaction from multiple angles, and improves the accuracy of medicine embedding expression.
Drawings
FIG. 1 is an overall framework diagram of a drug-drug interaction event prediction method based on a multi-modal deep neural network provided by the invention.
Fig. 2 is a schematic flow chart of a drug-drug interaction event prediction method based on a multi-modal deep neural network provided by the invention.
Figure 3 is a schematic of drug-drug interaction events.
Detailed Description
The invention provides a medicine-medicine interaction event prediction method based on a multi-modal deep neural network. The present invention will be further described with reference to the following examples.
Fig. 1 and 2 show the overall framework of the method of the invention: the invention provides a medicine-medicine interaction event prediction method based on a multi-mode deep neural network, which comprises the following steps:
step S1: drug data including drug-drug interaction events, drug heterogeneous characteristics, and drug knowledge maps constructed based on drug attributes are obtained as samples.
The specific flow is as follows.
FIG. 3 is a schematic representation of drug interaction events. The data set used by the invention is mainly obtained from a drug bank, and mainly comprises three parts: drug-drug interaction events, drug heterogeneous characteristics, and drug knowledge profiles.
A. Drug-drug interaction events
Drug-drug interaction event matrix
Figure BDA0002980987120000051
NdIndicates the amount of the drug, yijRepresenting the event of drug i interacting with drug j. Wherein events of interactions between pairs of drugs are encoded, such as: when the drug i Bomacillin and the drug j dabrafenib are taken together, the serum concentration of a patient may be reduced, namely yijLabels with elevated serum concentrations are numbered.
B. Heterogeneous character of the drug
The heterogeneous characteristics of drugs mainly include substructure (F)sTarget FtAnd enzyme (enzyme) Fe
Figure BDA0002980987120000052
Respectively represent the drugs diThe heterostructure, target and enzyme of (a). The above-mentioned heterogeneous characteristics are all (0, 1) binary sequences, i.e. s0,…,sn,t0,…,tm,m0,…,mkEpsilon (0, 1), 1 represents drug diComprising this feature, 0 denotes the drug diDoes not contain the characteristic; where n +1, m +1 and k +1 represent the total number of features of the substructure, target and enzyme, respectively.
C. Knowledge map of medicine
The invention obtains a medicine knowledge graph G ═ (D, R, T) on a drug Bank, and the medicine knowledge graph G ═ D, R, T is specifically expressed as:
Figure BDA0002980987120000053
wherein D represents a drug set, R represents a relationship set, and T represents an attribute node set of the drug. And (5) constructing a biological knowledge map by combining a DrugBank database. The knowledge-graph can be visualized as a triplet, i.e., < head entity, tail entity, relationship >. Detailed attribute information of each drug, including information on the substructures, transporters, pathways, targets, etc. of the drug is contained in the drug bank database. The drug is taken as a head entity, an attribute node contained in the drug is taken as a tail entity, and the relationship is the functional description of the tail entity. For example, the drug DB00130 contains a target with number P17812, which functions as: ctp synthase activity, then the triplet is denoted < DB00130, P17812, Ctp synthase activity >. By collecting attributes of each drug, a knowledge graph of the drug can be constructed, thereby providing topological structure information for subsequent use of the graph neural network to obtain drug-embedded representations.
S2, the design chart neural network module takes the medicine knowledge graph obtained in the step S1 as input information to obtain topological embedded representation of the medicine in the knowledge graph; and performing similarity calculation based on the heterogeneous characteristics of the medicines to obtain a similar embedded representation of the medicines.
A: the acquisition procedure for the topologically embedded representation of the drug in the knowledge graph is as follows:
based on the knowledge graph of the drug obtained in step S1, the present invention designs a graph neural network to obtain the topology embedded representation of the drug by using the topology structure information and semantic information in the knowledge graph of the drug.
An embedded representation of each node in the drug knowledge-graph is first initialized, the embedded representation being randomly generated numerical information, such as a 128-dimensional row vector. The nodes comprise medicine nodes and attribute nodes; then, 6 neighbor nodes are randomly selected for each node to update the embedded representation of the central node. For example, for a drug node i, all nodes connected with edges of the drug node i are collectively referred to as neighbor nodes of the drug node i;
calculating the weight fraction of the edges of the drug nodes and the attribute nodes in the drug knowledge graph by utilizing the semantic information:
Figure BDA0002980987120000061
wherein the content of the first and second substances,
Figure BDA0002980987120000062
respectively represent the drugs diAnd relation rinIs initialized with an embedded representation of the relationship rinInitialized embedded representation of
Figure BDA0002980987120000063
Also based on its relation rinCode formation, <' > indicates a Hamada product, W1And b1Trainable weights and biases are represented separately, sum being a summation function.
Calculating a neighbor-embedded representation of the drug node according to the different weight scores of each neighbor node:
Figure BDA0002980987120000064
wherein N iss(di) Denotes the drug diOf the sampling of the set of neighboring nodes,
Figure BDA0002980987120000065
indicating drug node diIs embedded in a representation of the neighborhood of (c),
Figure BDA0002980987120000066
denotes the drug diNeighbor node t ofnInitial embedding of (1).
Calculating drug diIs topologically embedded in the representation
Figure BDA0002980987120000067
Figure BDA0002980987120000068
Wherein the content of the first and second substances,
Figure BDA0002980987120000069
representing a splicing operation, ReLU being an activation function, W2And b2Trainable weights and biases, respectively. Likewise, the drug d can be obtainedjTopology embedding of
Figure BDA00029809871200000610
From the knowledge map, the drug diIs topologically embedded in the representation
Figure BDA00029809871200000611
The information of the drug node and the relationship between the drug node and the attribute node are covered.
B: the heterogeneous characteristic similarity calculation of the medicine comprises the following specific processes:
the invention uses the heterogeneous characteristics of the Jaccard similarity calculation medicine to obtain the similarity embedding of the medicine, and the Jaccard similarity calculation can be expressed as:
Figure BDA00029809871200000612
wherein, J (F)i,Fj) Representing a value of similarity between two drug heterogeneous characteristics, Fi,FjAre respectively the heterogeneous characteristics of two medicaments.
By using the Jaccard similarity method for calculation, a substructure similarity matrix can be obtained
Figure BDA0002980987120000071
Target similarity matrix
Figure BDA0002980987120000072
Enzyme similarity matrix
Figure BDA0002980987120000073
Thereby obtaining a drug diIs embedded in
Figure BDA0002980987120000074
Target-like embedding
Figure BDA0002980987120000075
Intercalation similar to enzymes
Figure BDA0002980987120000076
Wherein the sub-structure-like intercalation, target-like intercalation and enzyme-like intercalation are performed by corresponding drugs in a sub-structure-like matrix EsTarget similarity matrix EtAnd enzyme similarity matrix EmThe corresponding data. Medicine diThe similar embedded representation of (c) can be expressed as:
Figure BDA0002980987120000077
in the same way, the drug d can be obtainediIs similar to the embedded representation
Figure BDA0002980987120000078
S3: the multi-modal representation fusion analysis module specifically comprises the steps of splicing and fusing the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine to construct a medicine-medicine interaction event prediction model, and performing model training by using the medicine-medicine interaction event in the sample in the step S1 to obtain the medicine-medicine interaction event prediction model.
According to the drug topology embedded representation and the drug similarity embedded representation obtained in step S2, the invention combines the two embedded parts as the drug diAnd finally, performing multi-mode embedding analysis on the embedded representation. Is represented as follows:
Figure BDA0002980987120000079
likewise, the drug d can be obtainedjFinal embedded representation
Figure BDA00029809871200000710
The resulting inserts of drug pairs were predictively classified as follows:
Figure BDA00029809871200000711
where σ is a non-linear activation function softmax, W3,b3In order to train the weights and biases,
Figure BDA00029809871200000712
is a medicine djAnd a drug diThe predicted outcome of the drug-drug interaction event between.
In the training and optimizing process, updating weight parameters are optimized through back propagation gradient descent, training is carried out by using a 5-fold cross validation method, and the accuracy of the super-reference validation model is adjusted. The model loss function samples a multi-class cross entropy loss function. Since this process is a prior art implementation process, it is not specifically described. It should be understood that, in the present invention, the drug-drug interaction event is collected and used as sample data in step S1, and then, the topological embedded representation of the drug in the knowledge graph and the similar embedded representation of the drug can be obtained by using the sample and according to step S2, and then the two are substituted into the prediction model, and the prediction model is obtained by performing model training with the drug-drug interaction event of the sample. Wherein drug-drug interaction events between pairs of drugs can be predicted using the predictive model.
In some implementations, the present invention also provides a system for a method for drug-drug interaction event prediction based on a multimodal deep neural network, comprising: the system comprises a medicine data acquisition module, a medicine knowledge map construction module, an embedded expression acquisition module of medicines, a prediction model construction and training module and a prediction module.
Wherein, the medicine data acquisition module: for collecting drug-drug interaction events, drug heterogeneity characteristics;
the medicine knowledge graph building module is used for building a medicine knowledge graph based on the medicine attribute;
the embedded representation acquisition module of the medicine is used for acquiring topological embedded representation of the medicine in the knowledge graph based on the medicine knowledge graph; similarity calculation is carried out on the basis of the heterogeneous characteristics of the medicines to obtain similar embedded representation of the medicines;
the prediction model construction and training module is used for splicing and fusing the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine to construct a medicine-medicine interaction event prediction model, and performing model training by using the collected medicine-medicine interaction event to obtain a medicine-medicine interaction event prediction model;
and a prediction module, configured to perform drug-drug interaction event prediction by using the drug-drug interaction event prediction model obtained in step S3.
For the specific implementation process of each unit module, refer to the corresponding process of the foregoing method. It should be understood that, the specific implementation process of the above unit module refers to the method content, and the present invention is not described herein in detail, and the division of the above functional module unit is only a division of a logic function, and there may be another division manner in the actual implementation, for example, multiple units or components may be combined or may be integrated into another system, or some features may be omitted, or may not be executed. Meanwhile, the integrated unit can be realized in a hardware form, and can also be realized in a software functional unit form.
In some implementations, the present invention also provides a terminal comprising a processor and a memory, the memory storing a computer program, the processor invoking the computer program to perform: a method for predicting drug-drug interaction events based on a multi-modal deep neural network.
In some implementations, the invention also provides a readable storage medium storing a computer program for invocation by a processor to perform: a method for predicting drug-drug interaction events based on a multi-modal deep neural network.
For the implementation process of each step, please refer to the specific implementation process of the foregoing method, which is not described herein again.
It should be understood that in the embodiments of the present invention, the Processor may be a Central Processing Unit (CPU), and the Processor may also be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, and the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The memory may include both read-only memory and random access memory, and provides instructions and data to the processor. The portion of memory may also include non-volatile random access memory. For example, the memory may also store device type information.
The readable storage medium is a computer readable storage medium, which may be an internal storage unit of the controller according to any of the foregoing embodiments, for example, a hard disk or a memory of the controller. The readable storage medium may also be an external storage device of the controller, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like provided on the controller. Further, the readable storage medium may also include both an internal storage unit of the controller and an external storage device. The readable storage medium is used for storing the computer program and other programs and data required by the controller. The readable storage medium may also be used to temporarily store data that has been output or is to be output.
It should be emphasized that the examples described herein are illustrative and not restrictive, and thus the invention is not to be limited to the examples described herein, but rather to other embodiments that may be devised by those skilled in the art based on the teachings herein, and that various modifications, alterations, and substitutions are possible without departing from the spirit and scope of the present invention.

Claims (9)

1. A method for predicting drug-drug interaction events based on a multi-modal deep neural network is characterized by comprising the following steps: the method comprises the following steps:
step S1: acquiring drug data as a sample, wherein the drug data comprises drug-drug interaction events, drug heterogeneous characteristics and a drug knowledge graph constructed based on drug attributes;
step S2: obtaining a topological embedded representation of the drug in the knowledge graph based on the drug knowledge graph; similarity calculation is carried out on the basis of the heterogeneous characteristics of the medicines to obtain similar embedded representation of the medicines;
step S3: performing splicing and fusion on the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine to construct a medicine-medicine interaction event prediction model, and performing model training by using the medicine-medicine interaction event in the sample in the step S1 to obtain a medicine-medicine interaction event prediction model;
step S4: predicting the drug-drug interaction event by using the drug-drug interaction event prediction model obtained in the step S3;
and inputting the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine in the medicine pair to be predicted into the medicine-medicine interaction event prediction model to obtain a medicine-medicine interaction event prediction result.
2. The method of claim 1, wherein: the drug-drug interaction event prediction model is as follows:
Figure FDA0002980987110000011
wherein the content of the first and second substances,
Figure FDA0002980987110000012
denotes the drug di,djThe predicted outcome of the drug-drug interaction event of (a),
Figure FDA0002980987110000013
are respectively medicine di,djIs then represented by the final embedded representation of (c),
Figure FDA0002980987110000014
denotes the stitching operation, σ is a non-linear activation function softmax, W3,b3Are trainable weights and biases.
3. The method of claim 1, wherein: if the drug heterogeneity characteristics include: substructure FsTarget FtAnd enzyme FeIn step S2, similarity calculation is performed based on the heterogeneous characteristics of the drugs to obtain a similar embedding expression of the drugs as follows:
Figure FDA0002980987110000015
wherein, the drug diIs embedded in
Figure FDA0002980987110000016
Target-like embedding
Figure FDA0002980987110000017
Intercalation similar to enzymes
Figure FDA0002980987110000018
Corresponds to Es、Et、EmRespectively a substructure similarity matrix, a target similarity matrix and an enzymeA similarity matrix;
the elements in the substructure similarity matrix, the target similarity matrix and the enzyme similarity matrix are obtained by calculation by using a medicament heterogeneous characteristic and a similarity value calculation formula of a medicament pair.
4. The method of claim 3, wherein: the elements in the substructure similarity matrix, the target similarity matrix and the enzyme similarity matrix are obtained by Jaccard similarity calculation, wherein the Jaccard similarity calculation formula is as follows:
Figure FDA0002980987110000019
wherein, J (F)i,Fj) Representing a value of similarity between the heterogeneous characteristics of two drugs, Fi,FjAre the heterogeneous characteristics of the two drugs, respectively.
5. The method of claim 1, wherein: the process of obtaining the topologically embedded representation of the drug in the knowledge graph based on the drug knowledge graph in step S2 is as follows:
firstly, initializing embedded representation of each node in a medicine knowledge graph, wherein the nodes comprise medicine nodes and attribute nodes;
then, calculating the weight scores of the edges of the drug nodes and the attribute nodes in the drug knowledge graph, wherein the formula is as follows:
Figure FDA0002980987110000021
wherein the content of the first and second substances,
Figure FDA0002980987110000022
is node d of the drugiAnd the weight scores of the edges corresponding to the attribute nodes,
Figure FDA0002980987110000023
respectively represent the drugs diAnd relation rinIs embedded in the representation, said relation rinFunction description indicating an attribute node, <' > indicating a Hamada product, W1And b1Respectively representing trainable weights and biases;
second, the neighbor-embedded representation of the drug node is calculated according to the different weight scores of each neighbor node:
Figure FDA0002980987110000024
wherein N iss(di) Denotes the drug diOf the sampling of the set of neighboring nodes,
Figure FDA0002980987110000025
indicating drug node diIs embedded in a representation of the neighborhood of (c),
Figure FDA0002980987110000026
denotes the drug diNeighbor node t ofnThe initial embedded representation of (a);
finally, the drug diTopological embedded representation in a knowledge graph:
Figure FDA0002980987110000027
wherein the content of the first and second substances,
Figure FDA0002980987110000028
indicating a splicing operation.
6. The method of claim 5, wherein: the drug knowledge graph constructed based on the drug attributes in step S1 is represented as follows:
Figure FDA0002980987110000029
wherein D represents a drug set, R represents a relationship set, T represents an attribute node set of the drug, D, RdtAnd t represents the drug, relationship and attribute nodes, respectively.
7. A system based on the method of any one of claims 1-6, characterized by: the method comprises the following steps:
a drug data acquisition module: for collecting drug-drug interaction events, drug heterogeneity characteristics;
the medicine knowledge graph building module is used for building a medicine knowledge graph based on the medicine attribute;
the embedded representation acquisition module of the medicine is used for acquiring topological embedded representation of the medicine in the knowledge graph based on the medicine knowledge graph; similarity calculation is carried out on the basis of the heterogeneous characteristics of the medicines to obtain similar embedded representation of the medicines;
the prediction model construction and training module is used for splicing and fusing the topological embedded representation of the medicine in the knowledge graph and the similar embedded representation of the medicine to construct a medicine-medicine interaction event prediction model, and performing model training by using the collected medicine-medicine interaction event to obtain a medicine-medicine interaction event prediction model; and a prediction module, configured to perform drug-drug interaction event prediction by using the drug-drug interaction event prediction model obtained in step S3.
8. A terminal, characterized by: comprising a processor and a memory, the memory storing a computer program that the processor calls to perform: the process steps of any one of claims 1 to 6.
9. A readable storage medium, characterized by: a computer program is stored, which is invoked by a processor to perform: the process steps of any one of claims 1 to 6.
CN202110287239.1A 2021-03-17 2021-03-17 Multi-modal deep neural network based prediction of drug-drug interaction events Active CN113012770B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110287239.1A CN113012770B (en) 2021-03-17 2021-03-17 Multi-modal deep neural network based prediction of drug-drug interaction events

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110287239.1A CN113012770B (en) 2021-03-17 2021-03-17 Multi-modal deep neural network based prediction of drug-drug interaction events

Publications (2)

Publication Number Publication Date
CN113012770A true CN113012770A (en) 2021-06-22
CN113012770B CN113012770B (en) 2022-05-10

Family

ID=76409325

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110287239.1A Active CN113012770B (en) 2021-03-17 2021-03-17 Multi-modal deep neural network based prediction of drug-drug interaction events

Country Status (1)

Country Link
CN (1) CN113012770B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114242162A (en) * 2021-07-29 2022-03-25 北京百度网讯科技有限公司 Method for establishing drug synergy prediction model, prediction method and corresponding device
CN114420309A (en) * 2021-09-13 2022-04-29 北京百度网讯科技有限公司 Method for establishing drug synergy prediction model, prediction method and corresponding device
CN114842927A (en) * 2022-05-20 2022-08-02 华东师范大学 Medicine and pathway association prediction method of knowledge graph attention network
CN116429710A (en) * 2023-06-15 2023-07-14 武汉大学人民医院(湖北省人民医院) Drug component detection method, device, equipment and readable storage medium
CN117079835A (en) * 2023-08-21 2023-11-17 广东工业大学 Multi-view-based medicine-medicine interaction prediction method and system
CN118072862A (en) * 2024-04-24 2024-05-24 临沂大学 Knowledge characterization and reasoning method for multi-mode drug knowledge graph

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109887540A (en) * 2019-01-15 2019-06-14 中南大学 A kind of drug targets interaction prediction method based on heterogeneous network insertion
CN111564186A (en) * 2020-03-25 2020-08-21 湖南大学 Method and system for predicting interaction of graph-volume drug pairs based on knowledge graph
CN112100380A (en) * 2020-09-16 2020-12-18 浙江大学 Generation type zero sample prediction method based on knowledge graph
CN112200317A (en) * 2020-09-28 2021-01-08 西南电子技术研究所(中国电子科技集团公司第十研究所) Multi-modal knowledge graph construction method
CN112270950A (en) * 2020-11-04 2021-01-26 中山大学 Fusion network drug target relation prediction method based on network enhancement and graph regularization

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109887540A (en) * 2019-01-15 2019-06-14 中南大学 A kind of drug targets interaction prediction method based on heterogeneous network insertion
CN111564186A (en) * 2020-03-25 2020-08-21 湖南大学 Method and system for predicting interaction of graph-volume drug pairs based on knowledge graph
CN112100380A (en) * 2020-09-16 2020-12-18 浙江大学 Generation type zero sample prediction method based on knowledge graph
CN112200317A (en) * 2020-09-28 2021-01-08 西南电子技术研究所(中国电子科技集团公司第十研究所) Multi-modal knowledge graph construction method
CN112270950A (en) * 2020-11-04 2021-01-26 中山大学 Fusion network drug target relation prediction method based on network enhancement and graph regularization

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DENT ET AL.: "A multimodal deep learning framework for predicting drup-drug interaction events", 《BIOFINFORMATICS》 *
桑盛田 等: "融合知识图谱与深度学习的药物发现方法", 《模式识别与人工智能》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114242162A (en) * 2021-07-29 2022-03-25 北京百度网讯科技有限公司 Method for establishing drug synergy prediction model, prediction method and corresponding device
CN114242162B (en) * 2021-07-29 2022-11-01 北京百度网讯科技有限公司 Method for establishing drug synergy prediction model, prediction method and corresponding device
CN114420309A (en) * 2021-09-13 2022-04-29 北京百度网讯科技有限公司 Method for establishing drug synergy prediction model, prediction method and corresponding device
CN114420309B (en) * 2021-09-13 2023-11-21 北京百度网讯科技有限公司 Method for establishing medicine synergistic effect prediction model, prediction method and corresponding device
CN114842927A (en) * 2022-05-20 2022-08-02 华东师范大学 Medicine and pathway association prediction method of knowledge graph attention network
CN114842927B (en) * 2022-05-20 2024-06-07 华东师范大学 Association prediction method for medicine and path of knowledge graph attention network
CN116429710A (en) * 2023-06-15 2023-07-14 武汉大学人民医院(湖北省人民医院) Drug component detection method, device, equipment and readable storage medium
CN116429710B (en) * 2023-06-15 2023-09-26 武汉大学人民医院(湖北省人民医院) Drug component detection method, device, equipment and readable storage medium
CN117079835A (en) * 2023-08-21 2023-11-17 广东工业大学 Multi-view-based medicine-medicine interaction prediction method and system
CN117079835B (en) * 2023-08-21 2024-02-20 广东工业大学 Multi-view-based medicine-medicine interaction prediction method and system
CN118072862A (en) * 2024-04-24 2024-05-24 临沂大学 Knowledge characterization and reasoning method for multi-mode drug knowledge graph

Also Published As

Publication number Publication date
CN113012770B (en) 2022-05-10

Similar Documents

Publication Publication Date Title
CN113012770B (en) Multi-modal deep neural network based prediction of drug-drug interaction events
Zhavoronkov et al. Will artificial intelligence for drug discovery impact clinical pharmacology?
JP6916107B2 (en) Bayesian Causal Network Model for Health Examination and Treatment Based on Patient Data
US20220165435A1 (en) Drug repositioning candidate recommendation system, and computer program stored in medium in order to execute each function of system
Farid et al. An adaptive rule-based classifier for mining big biological data
US10360349B2 (en) Personalized medicine service
US11551109B2 (en) System and method for patient health data prediction using knowledge graph analysis
US20140089003A1 (en) Patient health record similarity measure
CN114765075A (en) Medicine recommendation method, device and system, electronic equipment and storage medium
WO2014052921A2 (en) Patient health record similarity measure
Xuan et al. HeteroDualNet: A dual convolutional neural network with heterogeneous layers for drug-disease association prediction via Chou’s five-step rule
Liu et al. Pathogenic gene prediction based on network embedding
Nandhini et al. Hybrid CNN-LSTM and modified wild horse herd Model-based prediction of genome sequences for genetic disorders
KR102321732B1 (en) Device, Method, and Computer-Readable Medium for Predicting Change in Efficacy of a Drug Considering SNP
US20130253892A1 (en) Creating synthetic events using genetic surprisal data representing a genetic sequence of an organism with an addition of context
US20140089004A1 (en) Patient cohort laboratory result prediction
Alkhateeb et al. Deep learning in multi-omics data integration in cancer diagnostic
CN116486899A (en) Method, system, equipment and medium for judging matching of medicine and target point
CN116153527A (en) Attention mechanism-based method and system for predicting side effects of combination of psychotropic drugs
CN112802573B (en) Medicine package recommendation method, device, computer system and readable storage medium
CN112071439B (en) Drug side effect relationship prediction method, system, computer device, and storage medium
EP4315190A1 (en) Graph database techniques for machine learning
Boată et al. Using digital twins in health care
Hilbush In Silico Dreams: How Artificial Intelligence and Biotechnology Will Create the Medicines of the Future
US20200090818A1 (en) System and method for mapping biomedical entities

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant