CN112599207A - Cancer drug sensitivity prediction method based on pathway activity and elastic net - Google Patents
Cancer drug sensitivity prediction method based on pathway activity and elastic net Download PDFInfo
- Publication number
- CN112599207A CN112599207A CN202011539984.2A CN202011539984A CN112599207A CN 112599207 A CN112599207 A CN 112599207A CN 202011539984 A CN202011539984 A CN 202011539984A CN 112599207 A CN112599207 A CN 112599207A
- Authority
- CN
- China
- Prior art keywords
- drug
- matrix
- pathway
- pathway activity
- cell line
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000000694 effects Effects 0.000 title claims abstract description 55
- 230000037361 pathway Effects 0.000 title claims abstract description 50
- 230000035945 sensitivity Effects 0.000 title claims abstract description 30
- 238000000034 method Methods 0.000 title claims abstract description 25
- 239000003560 cancer drug Substances 0.000 title claims abstract description 13
- 229940079593 drug Drugs 0.000 claims abstract description 45
- 239000003814 drug Substances 0.000 claims abstract description 45
- 239000011159 matrix material Substances 0.000 claims abstract description 41
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 33
- 230000014509 gene expression Effects 0.000 claims abstract description 23
- 239000000126 substance Substances 0.000 claims abstract description 22
- 239000002246 antineoplastic agent Substances 0.000 claims abstract description 19
- 229940041181 antineoplastic drug Drugs 0.000 claims abstract description 19
- 239000013598 vector Substances 0.000 claims abstract description 15
- 238000012549 training Methods 0.000 claims abstract description 3
- 108091006146 Channels Proteins 0.000 claims description 33
- 206010028980 Neoplasm Diseases 0.000 claims description 23
- 201000011510 cancer Diseases 0.000 claims description 23
- 230000003993 interaction Effects 0.000 claims description 11
- 238000012795 verification Methods 0.000 claims description 2
- 230000004044 response Effects 0.000 abstract description 4
- 238000002474 experimental method Methods 0.000 abstract description 2
- 101150044508 key gene Proteins 0.000 abstract description 2
- 239000000090 biomarker Substances 0.000 abstract 1
- 238000011156 evaluation Methods 0.000 abstract 1
- AYUNIORJHRXIBJ-TXHRRWQRSA-N tanespimycin Chemical compound N1C(=O)\C(C)=C\C=C/[C@H](OC)[C@@H](OC(N)=O)\C(C)=C\[C@H](C)[C@@H](O)[C@@H](OC)C[C@H](C)CC2=C(NCC=C)C(=O)C=C1C2=O AYUNIORJHRXIBJ-TXHRRWQRSA-N 0.000 description 3
- 229950007866 tanespimycin Drugs 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- YABJJWZLRMPFSI-UHFFFAOYSA-N 1-methyl-5-[[2-[5-(trifluoromethyl)-1H-imidazol-2-yl]-4-pyridinyl]oxy]-N-[4-(trifluoromethyl)phenyl]-2-benzimidazolamine Chemical compound N=1C2=CC(OC=3C=C(N=CC=3)C=3NC(=CN=3)C(F)(F)F)=CC=C2N(C)C=1NC1=CC=C(C(F)(F)F)C=C1 YABJJWZLRMPFSI-UHFFFAOYSA-N 0.000 description 1
- QQWUGDVOUVUTOY-UHFFFAOYSA-N 5-chloro-N2-[2-methoxy-4-[4-(4-methyl-1-piperazinyl)-1-piperidinyl]phenyl]-N4-(2-propan-2-ylsulfonylphenyl)pyrimidine-2,4-diamine Chemical compound COC1=CC(N2CCC(CC2)N2CCN(C)CC2)=CC=C1NC(N=1)=NC=C(Cl)C=1NC1=CC=CC=C1S(=O)(=O)C(C)C QQWUGDVOUVUTOY-UHFFFAOYSA-N 0.000 description 1
- AECDBHGVIIRMOI-UHFFFAOYSA-N 7-[3-(azetidin-1-ylmethyl)cyclobutyl]-5-(3-phenylmethoxyphenyl)pyrrolo[2,3-d]pyrimidin-4-amine Chemical compound C1=2C(N)=NC=NC=2N(C2CC(CN3CCC3)C2)C=C1C(C=1)=CC=CC=1OCC1=CC=CC=C1 AECDBHGVIIRMOI-UHFFFAOYSA-N 0.000 description 1
- MLDQJTXFUGDVEO-UHFFFAOYSA-N BAY-43-9006 Chemical compound C1=NC(C(=O)NC)=CC(OC=2C=CC(NC(=O)NC=3C=C(C(Cl)=CC=3)C(F)(F)F)=CC=2)=C1 MLDQJTXFUGDVEO-UHFFFAOYSA-N 0.000 description 1
- 239000005551 L01XE03 - Erlotinib Substances 0.000 description 1
- 239000005511 L01XE05 - Sorafenib Substances 0.000 description 1
- 239000002136 L01XE07 - Lapatinib Substances 0.000 description 1
- 239000005536 L01XE08 - Nilotinib Substances 0.000 description 1
- 108010042423 LBW242 Proteins 0.000 description 1
- OYONTEXKYJZFHA-SSHUPFPWSA-N PHA-665752 Chemical compound CC=1C(C(=O)N2[C@H](CCC2)CN2CCCC2)=C(C)NC=1\C=C(C1=C2)/C(=O)NC1=CC=C2S(=O)(=O)CC1=C(Cl)C=CC=C1Cl OYONTEXKYJZFHA-SSHUPFPWSA-N 0.000 description 1
- YZDJQTHVDDOVHR-UHFFFAOYSA-N PLX-4720 Chemical compound CCCS(=O)(=O)NC1=CC=C(F)C(C(=O)C=2C3=CC(Cl)=CN=C3NC=2)=C1F YZDJQTHVDDOVHR-UHFFFAOYSA-N 0.000 description 1
- 229930012538 Paclitaxel Natural products 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 1
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- PIQCTGMSNWUMAF-UHFFFAOYSA-N chembl522892 Chemical compound C1CN(C)CCN1C1=CC=C(NC(=N2)C=3C(NC4=CC=CC(F)=C4C=3N)=O)C2=C1 PIQCTGMSNWUMAF-UHFFFAOYSA-N 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 229960001433 erlotinib Drugs 0.000 description 1
- AAKJLRGGTJKAMG-UHFFFAOYSA-N erlotinib Chemical compound C=12C=C(OCCOC)C(OCCOC)=CC2=NC=NC=1NC1=CC=CC(C#C)=C1 AAKJLRGGTJKAMG-UHFFFAOYSA-N 0.000 description 1
- 229960004891 lapatinib Drugs 0.000 description 1
- BCFGMOOMADDAQU-UHFFFAOYSA-N lapatinib Chemical compound O1C(CNCCS(=O)(=O)C)=CC=C1C1=CC=C(N=CN=C2NC=3C=C(Cl)C(OCC=4C=C(F)C=CC=4)=CC=3)C2=C1 BCFGMOOMADDAQU-UHFFFAOYSA-N 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- HHZIURLSWUIHRB-UHFFFAOYSA-N nilotinib Chemical compound C1=NC(C)=CN1C1=CC(NC(=O)C=2C=C(NC=3N=C(C=CN=3)C=3C=NC=CC=3)C(C)=CC=2)=CC(C(F)(F)F)=C1 HHZIURLSWUIHRB-UHFFFAOYSA-N 0.000 description 1
- 229960001346 nilotinib Drugs 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 229960001592 paclitaxel Drugs 0.000 description 1
- 229960005184 panobinostat Drugs 0.000 description 1
- FWZRWHZDXBDTFK-ZHACJKMWSA-N panobinostat Chemical compound CC1=NC2=CC=C[CH]C2=C1CCNCC1=CC=C(\C=C\C(=O)NO)C=C1 FWZRWHZDXBDTFK-ZHACJKMWSA-N 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 229950009919 saracatinib Drugs 0.000 description 1
- OUKYUETWWIPKQR-UHFFFAOYSA-N saracatinib Chemical compound C1CN(C)CCN1CCOC1=CC(OC2CCOCC2)=C(C(NC=2C(=CC=C3OCOC3=2)Cl)=NC=N2)C2=C1 OUKYUETWWIPKQR-UHFFFAOYSA-N 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
- CYOHGALHFOKKQC-UHFFFAOYSA-N selumetinib Chemical compound OCCONC(=O)C=1C=C2N(C)C=NC2=C(F)C=1NC1=CC=C(Br)C=C1Cl CYOHGALHFOKKQC-UHFFFAOYSA-N 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 229960003787 sorafenib Drugs 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 238000012353 t test Methods 0.000 description 1
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 1
- ZRKFYGHZFMAOKI-QMGMOQQFSA-N tgfbeta Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRKFYGHZFMAOKI-QMGMOQQFSA-N 0.000 description 1
- UCFGDBYHRUNTLO-QHCPKHFHSA-N topotecan Chemical compound C1=C(O)C(CN(C)C)=C2C=C(CN3C4=CC5=C(C3=O)COC(=O)[C@]5(O)CC)C4=NC2=C1 UCFGDBYHRUNTLO-QHCPKHFHSA-N 0.000 description 1
- 229960000303 topotecan Drugs 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 229960000241 vandetanib Drugs 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/30—Prediction of properties of chemical compounds, compositions or mixtures
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/70—Machine learning, data mining or chemometrics
Landscapes
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Crystallography & Structural Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides a cancer drug sensitivity prediction method based on pathway activity and elastic network, firstly, selecting key gene expression values which are closely connected with other genes in each pathway network in a gene expression matrix, calculating and analyzing to obtain each pathway activity vector, and then combining all pathway activity vectors into a matrix; combining the molecular description of each drug as a chemical characteristic matrix and a pathway activity characteristic matrix of the drug into a new characteristic matrix; finally, training the feature matrix of the 24 medicines by using an elastic net and optimizing parameters, and using a mean square error as an evaluation standard; the invention can effectively predict the sensitivity level of each drug and the biomarker thereof, has higher precision on most drugs, and can help clinical experiments to reduce time and money cost, thereby effectively predicting drug response; the method of the invention can learn the potential relation of the anti-cancer drugs from multi-dimensional characteristics, and has better biological interpretation.
Description
Technical Field
The invention belongs to the technical field of bioinformatics and machine learning, and particularly relates to a cancer drug sensitivity prediction method based on pathway activity and elastic network regression, and more particularly relates to a prediction method for predicting cancer cell lines sensitive to 24 anti-cancer drugs.
Background
Determining whether a patient will respond to an anticancer drug typically takes months, goes through trial and error, and is accompanied by the possibility of misuse over time. The development of effective anti-cancer drug response prediction methods has been an important topic of oncology research.
At present, most methods predict drug sensitivity by establishing a prediction model of a single gene, and do not consider the interaction relation among genes in a pathway and key genes in the pathway, and the repeatability of the prediction model is low due to biodiversity and the biological relevance of new gene-drug association which is difficult to explain.
Disclosure of Invention
In view of the deficiencies of the prior art, it is an object of the present invention to provide a method for predicting cancer drug sensitivity based on pathway activity and elastic network. The model has high prediction precision, can effectively predict the drug sensitivity, and has better prediction effect and better biological explanation for the cancer drug sensitivity by the method for selecting the expression level of the key gene in the pathway to represent the activity level of the pathway.
In order to achieve the above purpose, the solution of the invention is as follows:
a method for predicting cancer drug sensitivity based on pathway activity and elastic network, comprising the steps of:
s1: acquiring gene expression characteristic data of a cancer cell line and chemical characteristic data of an anti-cancer drug;
s2: downloading and integrating from a KEGG pathway database to obtain an interaction network table among genes in each pathway (gene set), and selecting key genes which are tightly connected with gene expression characteristic data genes of the cancer cell line from the network table;
s3: calculating the channel activity vector of each channel, and combining all the channel activity vectors to obtain a channel activity characteristic matrix;
s4: integrating the chemical characteristic data of the anti-cancer drugs in the step S1 into a chemical characteristic matrix of the drugs, and connecting and merging the chemical characteristic matrix of the drugs and the pathway activity characteristic matrix in the step S3 to obtain a new matrix containing the characteristics of the cancer cell line and the chemical characteristics of the drugs;
s5: constructing an elastic network model for predicting the sensitivity of the anti-cancer drugs based on the steps S1-S4, inputting a new matrix containing the characteristics of the cancer cell line and the chemical characteristics of the drugs into the model as the characteristic tensor to train and predict, and verifying the reproducibility of the model.
As a preferred scheme, in step S1, the chemical characteristics of the anticancer drug and the gene expression characteristics of the cancer cell line are integrated from the internet public database, and a two-dimensional tensor of chemical characteristics and a two-dimensional tensor of gene expression characteristics of the drug are formed corresponding to the known sensitivity level of the anticancer drug. And downloading the channel gene interaction network data from the channel database.
As a preferable embodiment, in step S2, the two-dimensional tensor of gene expression characteristics and the two-dimensional tensor of chemical characteristics of the anticancer drug of the cancer cell line are normalized, then genes with higher degree of selection from the pathway gene interaction network are selected as key genes, and the mean/variance of the gene expression amounts of the genes is taken. Drug response is a synergistic behavior of multiple genes, and applying key genes of gene interaction networks in pathways to drug sensitivity will improve the prediction effect.
As a preferable scheme, in step S3, the mean/variance activity vectors of the expression levels of all key genes in the cancer cell line sample in each pathway of step S2 are combined into a pathway activity vector according to all the pathway data of step S1, when the pathway is characterized, and the sample is a cancer cell line. Wherein, each channel activity vector is the mean value/variance of the expression quantity of the key genes in the channel interaction network table in the gene expression matrix, and the characteristics in the channel activity matrix after all the channels are combined become the channels.
As a preferable scheme, in step S4, the pathway activity characteristic matrix and the pharmaceutical chemistry characteristic matrix are merged into a new characteristic matrix, i.e., a new matrix including the characteristics of the cancer cell line and the pharmaceutical chemistry characteristics, and are used as input values of the elastic network model, so as to perform regression prediction.
As a preferable scheme, in step S5, an elastic network model is constructed for the new matrix and the drug sensitivity level obtained in step S4, and the new matrix including the cancer cell line characteristics and the drug chemical characteristics is used as a model input value, the drug sensitivity level is used as a model output value, and the model is verified by adjusting and optimizing the adjustment parameters according to Mean Square Error (MSE).
Due to the adoption of the scheme, the invention has the beneficial effects that:
first, the method of the present invention for predicting cancer drug sensitivity using pathway activity and elastic network has higher accuracy in most drugs, and can help clinical experiments to reduce time and money costs, thereby effectively predicting drug response.
Secondly, the method for combining the chemical characteristics of the anti-cancer drug and the pathway activity characteristics of the cancer cell line, which is provided by the invention, can learn the potential relationship of the anti-cancer drug from multidimensional characteristics, and has better biological explanation.
Drawings
FIG. 1 is a flow chart of channel activity inference in an embodiment of the invention.
FIG. 2 is a graph of the predicted performance of 24 drugs on a validation set in an example of the present invention.
FIG. 3 is a graph of the pathway activity of drug PF2341066 between sensitive and resistant cell lines in an example of the invention.
FIG. 4 is a graph of the pathway activity of drug 17-AAG between sensitive and resistant cell lines in an example of the invention.
Detailed Description
The invention provides a cancer drug sensitivity prediction method based on pathway activity and elastic network.
The present invention will be further described with reference to the following examples.
Example (b):
the method for predicting cancer drug sensitivity based on pathway activity and elastic network of the present embodiment comprises:
1 materials and analysis
1.1 data Source
The data used for model training and prediction in the present invention is derived from a data set published on the internet. Specifically, the chemical characteristic data set of the anti-cancer drug is derived from the results calculated by the alvaDesc software, the gene expression data and the anti-cancer drug sensitivity value of the cancer cell line are derived from a database CCLE (cancer cell line encyclopedia https:// portals. branched. induced. org/cc /), and the pathway database is derived from a KEGG database (Kyoto gene and genome encyclopedia https:// www.genome.jp/KEGG /).
1.2 model method
FIG. 1 is a flow chart of the inferred pathway activity of the present invention. The invention firstly calculates the expression quantity of high-connectivity genes (key genes) in a channel gene interaction network to obtain channel activity vectors, then combines all the channel activity vectors to obtain a channel activity characteristic matrix, and then combines the channel activity characteristic matrix and a drug chemical characteristic matrix into a new matrix, thereby serving as an input value of an elastic network model and finally predicting drug sensitivity.
1.2.1 pathway Activity inference
Inferring pathway activity comprises the steps of:
the first step is as follows: gene expression data were normalized. And (3) normalization processing, namely subtracting the mean value of the dimensions of all the elements in each dimension of the data tensor, and dividing the mean value by the standard deviation of each dimension to obtain the new data tensor which is the normalized data.
The second step is that: the mean/variance of the gene expression levels with high connectivity in each pathway gene interaction network is taken as the pathway activity vector.
The third step: all pathway activity vectors are combined into a pathway activity signature matrix.
1.2.2 construction of models
Combining the channel activity characteristic matrix and the pharmaceutical chemistry characteristic matrix into a new characteristic matrix, namely, the new matrix containing the cancer cell line characteristics and the pharmaceutical chemistry characteristics, serving as an input value of an elastic net model, corresponding to the cancer drug sensitivity IC50 value, selecting the optimal parameters to predict the drug sensitivity level by using a grid search algorithm, and simultaneously ranking the model coefficients of the elastic net and selecting the channel characteristics which are ranked 10 percent of the first rank for analysis.
2 results and analysis
2.1 channel Activity inference
The channel activity inference considers the interaction relationship among the genes of the channel, selects the genes with the degree of top 10% in the channel, converts the gene characteristics in the gene expression matrix into the channel characteristics in the channel activity matrix by using the gene relationship in the channel, converts the gene characteristics of more than 2 ten thousand in the original gene expression matrix into 388 channel characteristics, and greatly reduces the dimension.
2.2 prediction of cancer drug sensitivity based on pathway Activity and elastic Net
An elastic network is adopted to construct a regression model, input data is a new matrix containing cancer cell line characteristics and drug chemical characteristics, output data is a drug sensitivity IC50 value, a prediction model of pathway activity to drug sensitivity is constructed, and a simulation experiment is carried out on the prediction model by using a glmnet package in R language.
The model is trained, tested and predicted on the test set, and the experimental result obtained on the verification set is shown in figure 2. The experimental results obtained on the test set are shown in table 1.
Table 1 predictive performance on test set of drug susceptibility prediction models based on pathway activity
Medicine | MSE | Medicine | MSE | Medicine | MSE | Medicine | MSE |
17-AAG | 3.730 | L-68545 | 3.464 | Panobinostat | 0.006 | RAF265 | 8.241 |
AEW541 | 6.566 | Lapatinib | 7.265 | PD-0325901 | 9.112 | Sorafenib | 2.753 |
AZD0530 | 7.287 | LBW242 | 3.107 | PD-0332991 | 3.601 | TAE684 | 7.465 |
AZD6244 | 10.381 | Nilotinib | 2.437 | PF2341066 | 4.488 | TKI258 | 5.766 |
Erlotinib | 5.765 | Nutin-3 | 1.614 | PHA-665752 | 1.814 | Topotecan | 4.256 |
lrinotecan | 1.516 | Paclitaxel | 4.945 | PLX4720 | 2.412 | ZD-6474 | 6.595 |
2.2 biological interpretation of pathways and drug sensitivity
In fig. 3 for PF2341066 drug, the 30 cell lines with the lowest and highest IC50 were selected as sensitive and resistant groups, respectively, and the p-value of the t-test provided the difference in pathway activity. TGF-beta signaling pathway ranks high in the drug PF2341066, and it has been found in fact that activation of this pathway leads to resistance to the drug PF 2341066.
The difference in pathway activity of the drug 17-AAG between sensitive and resistant cell lines is also shown in FIG. 4, consistent with the fact.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. It will be readily apparent to those skilled in the art that various modifications to these embodiments and the generic principles defined herein may be applied to other embodiments without the use of the inventive faculty. Therefore, the present invention is not limited to the above-described embodiments. Those skilled in the art should appreciate that many modifications and variations are possible in light of the above teaching without departing from the scope of the invention.
Claims (5)
1. A method for predicting cancer drug sensitivity based on pathway activity and elastic network, characterized by: which comprises the following steps:
s1: acquiring gene expression characteristic data of a cancer cell line and chemical characteristic data of an anti-cancer drug;
s2: obtaining an interaction network table among genes in each channel by using a channel database, and selecting key genes which are tightly connected with gene expression characteristic data genes of the cancer cell line from the network table;
s3: calculating the channel activity vector of each channel, and combining all the channel activity vectors to obtain a channel activity characteristic matrix;
s4: integrating the chemical characteristic data of the anti-cancer drugs in the step S1 into a drug chemical characteristic matrix, and connecting and merging the drug chemical characteristic matrix and the pathway activity characteristic matrix in the step S3 to obtain a new matrix containing the characteristics of the cancer cell line and the drug chemical characteristics;
s5: constructing an elastic network model for predicting the sensitivity of the anti-cancer drugs based on the steps S1-S4, inputting a new matrix containing the characteristics of the cancer cell line and the chemical characteristics of the drugs into the model as the characteristic tensor to train and predict, and verifying the reproducibility of the model.
2. The method of claim 1, wherein the method comprises: in step S1, gene expression profile data of the cancer cell line and chemical profile data of the anticancer drug are collected from the internet, and the sensitivity level of the anticancer drug is learned from the multidimensional profile of the cancer cell line.
3. The method of claim 1, wherein the method comprises: in step S2, genes having a high degree of gene interaction network in the pathway are selected as key genes, and the mean/variance of the gene expression levels of these genes is taken.
4. The method of claim 1, wherein the method comprises: in step S3, pathway activity vectors are combined based on the mean/variance activity vectors of the gene expression levels to obtain a pathway activity feature matrix.
5. The method of claim 1, wherein the method comprises: in step S5, the elastic network model constructed in steps S2-S4 is integrated, a new matrix including cancer cell line characteristics and pharmaceutical chemistry characteristics is used as a model input value, a drug sensitivity level is used as a model output value, and parameters of the optimized model are adjusted according to a mean square error for training and verification.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011539984.2A CN112599207A (en) | 2020-12-23 | 2020-12-23 | Cancer drug sensitivity prediction method based on pathway activity and elastic net |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011539984.2A CN112599207A (en) | 2020-12-23 | 2020-12-23 | Cancer drug sensitivity prediction method based on pathway activity and elastic net |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112599207A true CN112599207A (en) | 2021-04-02 |
Family
ID=75200439
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011539984.2A Pending CN112599207A (en) | 2020-12-23 | 2020-12-23 | Cancer drug sensitivity prediction method based on pathway activity and elastic net |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112599207A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115458188A (en) * | 2022-11-11 | 2022-12-09 | 神州医疗科技股份有限公司 | Mining method and system for drug high-efficiency response candidate marker |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107609326A (en) * | 2017-07-26 | 2018-01-19 | 同济大学 | Drug sensitivity prediction method in the accurate medical treatment of cancer |
CN108830040A (en) * | 2018-06-07 | 2018-11-16 | 中南大学 | A kind of drug sensitivity prediction method based on cell line and drug similitude network |
CN110232978A (en) * | 2019-06-14 | 2019-09-13 | 西安电子科技大学 | Cancer cell system therapeutic agent prediction technique based on multidimensional network |
CN110277174A (en) * | 2019-06-14 | 2019-09-24 | 上海海洋大学 | A kind of prediction technique of anticancer drug synergistic effect neural network based |
-
2020
- 2020-12-23 CN CN202011539984.2A patent/CN112599207A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107609326A (en) * | 2017-07-26 | 2018-01-19 | 同济大学 | Drug sensitivity prediction method in the accurate medical treatment of cancer |
CN108830040A (en) * | 2018-06-07 | 2018-11-16 | 中南大学 | A kind of drug sensitivity prediction method based on cell line and drug similitude network |
CN110232978A (en) * | 2019-06-14 | 2019-09-13 | 西安电子科技大学 | Cancer cell system therapeutic agent prediction technique based on multidimensional network |
CN110277174A (en) * | 2019-06-14 | 2019-09-24 | 上海海洋大学 | A kind of prediction technique of anticancer drug synergistic effect neural network based |
Non-Patent Citations (3)
Title |
---|
LEE E等: "Inferring Pathway Activity toward Precise Disease Classification", 《PLOS COMPUT》, vol. 4, no. 11, 30 November 2008 (2008-11-30), pages 1000217 * |
MENDEN MP等: "Machine Learning Prediction of Cancer Cell Sensitivity to Drugs Based on Genomic and Chemical Properties", 《PLOS ONE》, vol. 8, no. 4, 30 April 2013 (2013-04-30), pages 61318 * |
陈希,秦玉芳,陈明,张重阳: "基于多输入神经网络的药物组合协同作用预测", 《生物医学工程学杂志》, vol. 37, no. 4, 31 August 2020 (2020-08-31) * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115458188A (en) * | 2022-11-11 | 2022-12-09 | 神州医疗科技股份有限公司 | Mining method and system for drug high-efficiency response candidate marker |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zeng et al. | Non-parametric genetic prediction of complex traits with latent Dirichlet process regression models | |
Sánchez‐Ferro et al. | New methods for the assessment of Parkinson's disease (2005 to 2015): a systematic review | |
De Leeuw et al. | The statistical properties of gene-set analysis | |
Sherwin et al. | Fundamentals of population pharmacokinetic modelling: validation methods | |
Li et al. | A prognostic model of Alzheimer's disease relying on multiple longitudinal measures and time-to-event data | |
US20170147743A1 (en) | Rapid identification of pharmacological targets and anti-targets for drug discovery and repurposing | |
Saffian et al. | Warfarin dosing algorithms underpredict dose requirements in patients requiring≥ 7 mg daily: A systematic review and meta‐analysis | |
Guidi et al. | Parametric approaches in population pharmacokinetics | |
Masoudi-Nejad et al. | RETRACTED ARTICLE: Candidate gene prioritization | |
Li et al. | Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data | |
Zhang et al. | Predicting tumor cell response to synergistic drug combinations using a novel simplified deep learning model | |
Alden et al. | Using emulation to engineer and understand simulations of biological systems | |
Iturria-Medina et al. | Integrating molecular, histopathological, neuroimaging and clinical neuroscience data with NeuroPM-box | |
Lin et al. | Machine learning in neural networks | |
Wang et al. | Efficient gene–environment interaction tests for large biobank‐scale sequencing studies | |
Pinol et al. | Rare disease discovery: An optimized disease ranking system | |
Khalilimeybodi et al. | Context-specific network modeling identifies new crosstalk in β-adrenergic cardiac hypertrophy | |
Iddi et al. | Power and Sample Size for Longitudinal Models in R-The longpower Package and Shiny App. | |
CN112599207A (en) | Cancer drug sensitivity prediction method based on pathway activity and elastic net | |
WO2014121257A1 (en) | Prescription decision support system and method using comprehensive multiplex drug monitoring | |
KR101067352B1 (en) | System and method comprising algorithm for mode-of-action of microarray experimental data, experiment/treatment condition-specific network generation and experiment/treatment condition relation interpretation using biological network analysis, and recording media having program therefor | |
Li et al. | End-to-end interpretable disease–gene association prediction | |
US20240079142A1 (en) | A system and method to predict health outcomes and optimize health interventions | |
JP2007505405A (en) | Apparatus and method for identifying therapeutic targets using a computer model | |
Lip et al. | Transforming Clinical Trials with Artificial Intelligence |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |