WO2023107570A1 - Charge mutationnelle tumorale pondérée par l'expression en tant que biomarqueur oncologique - Google Patents

Charge mutationnelle tumorale pondérée par l'expression en tant que biomarqueur oncologique Download PDF

Info

Publication number
WO2023107570A1
WO2023107570A1 PCT/US2022/052153 US2022052153W WO2023107570A1 WO 2023107570 A1 WO2023107570 A1 WO 2023107570A1 US 2022052153 W US2022052153 W US 2022052153W WO 2023107570 A1 WO2023107570 A1 WO 2023107570A1
Authority
WO
WIPO (PCT)
Prior art keywords
cancer
tmb
genes
rna
sequencing
Prior art date
Application number
PCT/US2022/052153
Other languages
English (en)
Inventor
David Y. Zhang
Alessandro Pinto
Cailin B. WELLER
Deepak THIRUNAVUKARASU
Original Assignee
Nuprobe Usa, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nuprobe Usa, Inc. filed Critical Nuprobe Usa, Inc.
Publication of WO2023107570A1 publication Critical patent/WO2023107570A1/fr

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/40ICT specially adapted for the handling or processing of patient-related medical or healthcare data for data related to laboratory analysis, e.g. patient specimen analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H15/00ICT specially adapted for medical reports, e.g. generation or transmission thereof
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H20/00ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
    • G16H20/10ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance relating to drugs or medications, e.g. for ensuring correct administration to patients
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H20/00ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
    • G16H20/40ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance relating to mechanical, radiation or invasive therapies, e.g. surgery, laser therapy, dialysis or acupuncture
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment

Definitions

  • the present disclosure relates to the field of determining tumor mutational burden (TMB) using sequencing data. More particularly, it relates to methods for expression-weighted tumor mutational burden (EW-TMB) analysis, and the use of EW- TMB as an oncology biomarker.
  • TMB tumor mutational burden
  • EW-TMB expression-weighted tumor mutational burden
  • TMB Tumor mutational burden
  • ICIs immune checkpoint inhibitors
  • New methods of EW-TMB can be used in the prognostication of patient outcome and the determination of treatment prescription for patients.
  • this disclosure provides a method comprising: obtaining a RNA sample, performing RNA sequencing (RNA Seq), analyzing the RNA sequencing data to determine an expression level E for each gene i (Ei) within a set of genes X, analyzing the RNA sequencing data to determine a number of coding mutations M for each gene i (Mi) within the same set of genes X, and determining an EW-TMB based on Ei and Mi for each gene i within the set of genes X.
  • RNA Seq RNA sequencing
  • Figure 1 depicts an illustration of the genome with mutations in coding regions (exons), introns, and intergenic regions. Only mutations in coding regions result in potential neoantigens.
  • Figure 2 comprises panels A and B.
  • Figure 2 depicts an illustration of a comparison of low EW-TMB samples (panel A) and high EW-TMB samples (panel B), showing E (expression analysis) and M (mutation analysis).
  • Low EW-TMB samples present few mutations on a limited amount of genes.
  • FIG. 3 depicts an illustration of the EW-TMB analysis workflow.
  • the RNA is extracted and either used as input for next-generation sequencing (NGS) or long-read sequencing.
  • NGS next-generation sequencing
  • the RNA is reverse transcribed, and adapters are attached throughout library preparation steps.
  • the adapters are attached to the crude RNA molecules.
  • the resulting reads are aligned and assembled as necessary, and the data set is further analyzed to establish E (analysis of the expression level) and M (analysis of the presence of mutations). From E and M, the EW- TMB is computed.
  • Figure 4 depicts an illustration of RNA library preparation methods for NGS read-out.
  • RNA Seq methods routinely used can be adapted to collect the dataset necessary for the EW-TMB calculation.
  • Figure 5 depicts an illustration of the EW-TMB Logistic and Heaviside functions.
  • composition provided herein is specifically envisioned for use with any applicable method provided herein.
  • any and all combinations of the members that make up that grouping of alternatives is specifically envisioned. For example, if an item is selected from a group consisting of A, B, C, and D, the inventors specifically envision each alternative individually (e.g., A alone, B alone, etc.), as well as combinations such as A, B, and D; A and C; B and C; etc.
  • the term “and/or” when used in a list of two or more items means any one of the listed items by itself or in combination with any one or more of the other listed items.
  • the expression “A and/or B” is intended to mean either or both of A and B - i.e., A alone, B alone, or A and B in combination.
  • the expression “A, B and/or C” is intended to mean A alone, B alone, C alone, A and B in combination, A and C in combination, B and C in combination, or A, B, and C in combination.
  • range is understood to inclusive of the edges of the range as well as any number between the defined edges of the range.
  • “between 1 and 10” includes any number between 1 and 10, as well as the number 1 and the number 10.
  • a compound or “at least one compound” may include a plurality of compounds, including mixtures thereof.
  • plural refers to any number greater than one.
  • DNA refers to deoxyribonucleic acid. DNA can be either single-stranded or double-stranded. DNA typically comprises four nucleotides: cytosine (C), guanine (G), adenine (A), and thymine (T). In an aspect, the sequence of a DNA molecule provided herein comprises one or more degenerate nucleotides. As used herein, a “degenerate nucleotide” refers to a nucleotide that can perform the same function or yield the same output as a structurally different nucleotide.
  • Non-limiting examples of degenerate nucleotides include a C, G, or T nucleotide (B); an A, G, or T nucleotide (D); an A, C, or T nucleotide (H); a G or T nucleotide (K); an A or C nucleotide (M); any nucleotide (N); an A or G nucleotide (R); a G or C nucleotide (S); an A, C, or G nucleotide (V); an A or T nucleotide (W), and a C or T nucleotide (Y).
  • RNA refers to ribonucleic acid. RNA can be either singlestranded or double-stranded. RNA typically comprises four nucleotides: cytosine (C), guanine (G), adenine (A), and uracil (U). In an aspect, the sequence of an RNA molecule provided herein comprises one or more degenerate nucleotides. As used herein, a “degenerate nucleotide” refers to a nucleotide that can perform the same function or yield the same output as a structurally different nucleotide.
  • Non-limiting examples of degenerate nucleotides include a C, G, or U nucleotide (B); an A, G, or U nucleotide (D); an A, C, or U nucleotide (H); a G or U nucleotide (K); an A or C nucleotide (M); any nucleotide (N); an A or G nucleotide (R); a G or C nucleotide (S); an A, C, or G nucleotide (V); an A or U nucleotide (W), and a C or U nucleotide (Y).
  • B C, G, or U nucleotide
  • D A, G, or U nucleotide
  • H A, C, or U nucleotide
  • K G or U nucleotide
  • M any nucleotide (N); an A or G nucleotide (R); a G or C nucleotide (S); an A,
  • RNA is present in, or obtained from, a sample (e.g, a “biological sample”).
  • a “biological sample” refers to any biological material that is capable of being analyzed by or subjected to the methods and/or compositions provided herein. Any suitable method known in the art can be used to obtain a nucleic acid (e.g, an RNA molecule) from a sample.
  • a sample comprises RNA.
  • a sample comprises RNA and DNA.
  • a sample comprises cells.
  • a sample comprises cell-free nucleic acids.
  • sample and “biological sample” are intended to be interchangeable.
  • a sample can include at least one cell, fetal cell, cell culture, tissue specimen, blood, serum, plasma, saliva, urine, tear, vaginal secretion, sweat, lymph fluid, cerebrospinal fluid, mucosa secretion, peritoneal fluid, ascites fluid, fecal matter, body exudates, umbilical cord blood, chorionic villi, amniotic fluid, embryonic tissue, multicellular embryo, lysate, extract, solution, or reaction mixture suspected of containing nucleic acids.
  • a sample can comprise DNA, RNA, or both.
  • a sample is from a healthy organism.
  • a sample is from a diseased organism.
  • a sample is from a mutagenized sample.
  • the nucleic acids obtained from a sample can be converted to cDNA.
  • a sample includes one or more cells associated with a tumor.
  • a sample includes one or more circulating tumor cells (CTC).
  • CTC circulating tumor cells
  • a sample includes one or more cells not associated with a tumor.
  • a cell is a prokaryotic cell.
  • a cell is a eukaryotic cell.
  • a cell is an animal cell.
  • a cell is a plant cell.
  • a cell is a fungal cell.
  • a cell is a mammal cell.
  • a cell is a primate cell.
  • a cell is a human cell.
  • a cell is a human cancer cell.
  • a sample is obtained from a subject.
  • a “subject” refers to an animal (e.g, without being limiting, a mammal, reptile, bird, fish, amphibian) or other organism, such as, without being limiting, a plant or fungus.
  • a subject can be a healthy individual, an individual that has or is suspected of having a disease or a predisposition to the disease, or an individual that is in need of therapy or suspected of needing therapy.
  • the term “individual” and “subject” are intended to be interchangeable.
  • a subject is a eukaryote.
  • a subject is a prokaryote.
  • a subject is a virus. In an aspect, a subject is an animal. In an aspect, a subject is a plant. In an aspect, a subject is a fungus. In an aspect, a subject is a mammal. In an aspect, a subject is a rodent. In an aspect, a subject is a mouse. In an aspect, a subject is a rat. In an aspect, a subject is a rabbit. In an aspect, a subject is a cat. In an aspect, a subject is a dog. In an aspect, a subject is a horse. In an aspect, a subject is a cow. In an aspect, a subject is a pig. In an aspect, a subject is a primate.
  • a subject is a monkey. In an aspect, a subject is a chimpanzee. In an aspect, a subject is a human. In an aspect, a subject is a bird. In an aspect, a subject is a chicken. In an aspect, a subject is a fish. In an aspect, a subject is a reptile. In an aspect, a subject is an amphibian. In an aspect, a subject is an insect. In an aspect, a subject is an arachnid. In an aspect, a subject is a crustacean. In an aspect, a subject is a mollusk. In an aspect, a subject is a nematode. In an aspect, a subject is an annelid.
  • a subject has, or is suspected of having, cancer.
  • a subject has, or is suspected of having, colorectal cancer.
  • a subject has, or is suspected of having, breast cancer.
  • a subject has, or is suspected of having, gastric cancer.
  • a subject has, or is suspected of having, endometrial cancer.
  • a subject has, or is suspected of having, bladder cancer.
  • a subject has, or is suspected of having, cervical cancer.
  • a subject has, or is suspected of having, colon cancer.
  • a subject has, or is suspected of having, head and neck cancer.
  • a subject has, or is suspected of having, liver cancer.
  • a subject has, or is suspected of having, renal cell cancer. In an aspect, a subject has, or is suspected of having, skin cancer. In an aspect, a subject has, or is suspected of having, stomach cancer. In an aspect, a subject has, or is suspected of having, rectal cancer. In an aspect, a subject has, or is suspected of having, lymphoma. In an aspect, a subject has, or is suspected of having, non-Hodgkin lymphoma. In an aspect, a subject has, or is suspected of having, Hodgkin lymphoma. In an aspect, a subject has, or is suspected of having, solid tumors. In an aspect, a subject has, or is suspected of having, non-solid tumors. In an aspect, a subject has, or is suspected of having, a genetic-based disease, disorder, or condition.
  • RNA can originate from and/or be isolated from any types of cancer for use with the methods and compositions provided herein.
  • Samples can be obtained from any type of cancer.
  • cancers include biliary tract cancer, bladder cancer, transitional cell carcinoma, urothelial carcinoma, brain cancer, gliomas, astrocytomas, breast carcinoma, metaplastic carcinoma, cervical cancer, cervical squamous cell carcinoma, rectal cancer, colorectal carcinoma, colon cancer, hereditary nonpolyposis colorectal cancer, colorectal adenocarcinomas, gastrointestinal stromal tumors (GISTs), endometrial carcinoma, endometrial stromal sarcomas, esophageal cancer, esophageal squamous cell carcinoma, esophageal adenocarcinoma, ocular melanoma, uveal melanoma, gallbladder carcinomas, gallbladder adenocarcinoma, renal cell carcinoma
  • Prostate cancer prostate adenocarcinoma, skin cancer, melanoma, malignant melanoma, cutaneous melanoma, small intestine carcinomas, stomach cancer, gastric carcinoma, gastrointestinal stromal tumor (GIST), uterine cancer, or uterine sarcoma.
  • a sample comprises a cell.
  • a sample comprises a tissue.
  • a sample comprises an organ.
  • a sample comprises blood.
  • a sample comprises plasma.
  • a sample comprises urine.
  • a sample comprises feces. Additional non-limiting examples of samples include serum, sputum, semen, vaginal fluid, synovial fluid, spinal fluid, cerebral spinal fluid, amniotic fluid, peritoneal fluid, interstitial fluid, bone marrow aspirate, and saliva.
  • a sample provided herein is obtained from a source selected from the group consisting of formalin-fixed paraffin-embedded tissue, whole blood, peripheral blood mononuclear cell, plasma, buffy coat, fresh frozen tissue, fresh tissue, biopsy, and tissue or cells from a surgical margin.
  • a sample provided herein is obtained from formalin-fixed paraffin-embedded tissue.
  • a sample provided herein is obtained from whole blood.
  • a sample provided herein is obtained from peripheral blood mononuclear cell.
  • a sample provided herein is obtained from plasma.
  • a sample provided herein is obtained from buffy coat.
  • a sample provided herein is obtained from fresh frozen tissue.
  • a sample provided herein is obtained from fresh tissue. In an aspect, a sample provided herein is obtained from a biopsy. In an aspect, a sample provided herein is obtained from tissue or cells from a surgical margin. In an aspect, a sample provided herein is a human sample. In an aspect, a sample provided herein is an animal sample.
  • a sample provided herein comprises less than or equal to 100 ng of RNA. In an aspect, a sample provided herein comprises less than or equal to 75 ng of RNA. In an aspect, a sample provided herein comprises less than or equal to 50 ng of
  • RNA in an aspect, comprises less than or equal to 25 ng of
  • RNA in an aspect, comprises less than or equal to 20 ng of
  • RNA in an aspect, comprises less than or equal to 15 ng of
  • RNA in an aspect, comprises less than or equal to 10 ng of
  • RNA comprises less than or equal to 5 ng of RNA. In an aspect, a sample provided herein comprises less than or equal to 1 ng of RNA.
  • the sample is tumor sample or sample derived from a tumor.
  • the sample is acquired from a solid tumor.
  • the sample is acquired from a non-solid tumor.
  • the sample is obtained from a subject having a cancer.
  • the sample is obtained from a healthy subject.
  • the sample is obtained from a subject who is receiving a therapy or has received a therapy.
  • the sample is obtained from a subject who is receiving immune checkpoint blockade therapy.
  • the sample is obtained from a subject who has received immune checkpoint blockade therapy.
  • the sample is obtained from a subject who is receiving neoadjuvant therapy.
  • the sample is obtained from a subject who has received neoadjuvant therapy. In an aspect, the sample is obtained from a subject who is receiving adjuvant therapy. In an aspect, the sample is obtained from a subject who has received adjuvant therapy.
  • cancer refers to malignant cancers, premalignant cancers, solid tumors, non-solid tumors, liquid tumors, soft tissue tumors, blood cancers, metastatic lesions, carcinoma, sarcoma, myeloma, leukemia, and lymphoma.
  • cancer or “tumor” is used interchangeably herein.
  • the a coding mutation can include, but are not limited to a single nucleotide polymorphism (SNP), a base substitution, a point mutation, an indel, an inversion, a duplication, an amplification, a translocation, an inter- and intra- chromosomal rearrangement, a missense mutation, a nonsense mutation, a frameshift mutation, and a silent mutation.
  • SNP single nucleotide polymorphism
  • a coding mutation is a SNP.
  • a coding mutation is a base substitution.
  • a coding mutation is a point mutation.
  • a coding mutation is an indel.
  • a coding mutation is an insertion.
  • a coding mutation is a deletion. In an aspect, a coding mutation is an inversion. In an aspect, a coding mutation is a duplication. In an aspect, a coding mutation is an amplification. In an aspect, a coding mutation is a translocation. In an aspect, a coding mutation is a translocation. In an aspect, a coding mutation is an inter- chromosomal rearrangement. In an aspect, a coding mutation is an intra-chromosomal rearrangement. In an aspect, a coding mutation is a missense mutation. In an aspect, a coding mutation is a nonsense mutation. In an aspect, a coding mutation is a frameshift mutation.
  • a coding mutation is a silent mutation.
  • an “indel” refers to an insertion, a deletion, or both, of one or more nucleotides in a nucleic acid of a cell.
  • an indel includes both an insertion and a deletion of one or more nucleotides, where both the insertion and deletion are nearby on the nucleic acid.
  • the indel results in a net change in the total number of nucleotides.
  • the indel results in no net change in the total number of nucleotides.
  • the indel results in a net change of about 1 to 10 nucleotides.
  • the indel results in a net change of about 1 to 20 nucleotides. In an aspect, the indel results in a net change of about 1 to 30 nucleotides. In an aspect, the indel results in a net change of about 1 to 40 nucleotides. In an aspect, the indel results in a net change of about 1 to 50 nucleotides. In an aspect, the indel results in a net change of about 1 to 75 nucleotides. In an aspect, the indel results in a net change of about 1 to 100 nucleotides. In an aspect, the coding mutation results in a net change in the total number of nucleotides.
  • the coding mutation results in a net change of about 1 to 10 nucleotides. In an aspect, the coding mutation results in a net change of about 1 to 25 nucleotides. In an aspect, the coding mutation results in a net change of about 1 to 50 nucleotides. In an aspect, the coding mutation results in a net change of about 1 to 75 nucleotides. In an aspect, the coding mutation results in a net change of about 1 to 100 nucleotides. In an aspect, the coding mutation results in no net change in the total number of nucleotides.
  • a set of genes X provided herein comprises between 1 gene and 10,000 genes. In an aspect, a set of genes X provided herein comprises between 1 gene and 100,000 genes. In an aspect, a set of genes X provided herein comprises between 1 gene and 1000 genes. In an aspect, a set of genes X provided herein comprises between 1 gene and 500 genes. In an aspect, a set of genes X provided herein comprises between 1 gene and 100 genes. In an aspect, a set of genes X provided herein comprises between 1 gene and 10 genes. In an aspect, a set of genes X provided herein comprises between 5 genes and 10 genes. In an aspect, a set of genes X provided herein comprises between 5 genes and 50 genes. In an aspect, a set of genes X provided herein comprises between 5 genes and 100 genes. In an aspect, a set of genes X provided herein comprises between 10 genes and 50 genes.
  • a set of genes X provided herein comprises at least 1 gene or fragment thereof. In an aspect, a set of genes X provided herein comprises at least 2 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises at least 5 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises at least 10 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises at least 25 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises at least 50 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises at least 100 genes or fragments thereof.
  • a set of genes X provided herein comprises at least 1000 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises at least 10,000 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises at least 100,000 genes or fragments thereof. In an aspect, a set of genes X provided herein comprises a pre-determined number of genes or fragments thereof.
  • a set of genes X comprises substantially all exons in a genome. In an aspect, a set of genes X comprises a subset of exome. In an aspect, a set of genes X comprises one or more exons, or fragments thereof. In an aspect, a set of genes X comprises one or more genes or fragments thereof associated with cancer or cancerous phenotypes. In an aspect, a set of genes X comprises one or more genes or fragments thereof not associated with cancer or cancerous phenotypes.
  • a set of genes X comprises at least one exon, or fragment thereof, which includes one or more mutations selected from the group consisting of single nucleotide polymorphisms (SNPs), base substitutions, point mutations, indels, inversions, duplications, amplifications, translocations, inter- and intra-chromosomal rearrangements, missense mutations, nonsense mutations, frameshift mutations, and silent mutations.
  • SNPs single nucleotide polymorphisms
  • a set of genes X comprises one or more exons or fragments thereof with mutations that are associated with cancer.
  • a set of genes X comprises one or more exons or fragments thereof with mutations in which the gene products are associated with cancer.
  • TMB tumor mutation burden
  • Expression-Weighted TMB determines the level of mutations in a pre-determined set of genes from factors including, but not limited to, the expression level of each gene within the pre-determined set of genes, and the number of coding mutations for each gene within the pre-determined set of genes.
  • EW-TMB can be determined for the whole exome.
  • EW-TMB can be determined for a subset of exome.
  • EW-TMB determined from a subset of exome can be extrapolated to determine the whole exome TMB.
  • EW-TMB determined from a subset of exome can be extrapolated to determine the whole genome TMB. In an aspect, EW-TMB determined from a sample can be extrapolated to the subject from which the sample is obtained. In an aspect, the EW-TMB determined from a subset of exome correlates with the whole exome TMB. In an aspect, the EW-TMB determined from a subset of exome correlates with the whole genome TMB.
  • the EW-TMB determined from a sample or a subject can be compared to EW-TMB determined from a reference sample or a reference subject.
  • the reference sample is obtained from subjects in a reference population.
  • the reference population comprises patients having the same type of cancer as the subject.
  • the reference population comprises patients who are receiving, or have received, the same type of therapy, as the subject.
  • the reference population comprises patients who are in the same category as the subject based on one or more demographics.
  • the expression level E is expressed linearly or in log format.
  • an “expression level E” refers to the RNA expression level for one gene within a pre-defined set of genes.
  • RNA sequencing data is used to determine expression level E for each gene within a pre-defined set of genes.
  • RNA sequencing data is used to determine expression level E for each gene within a set of genes X.
  • the expression level E is expressed linearly.
  • the expression level E is expressed in log format.
  • the function f(E) is selected from the group consisting of a linear function, a super-linear function, and a sub-linear function.
  • the function f(E) is linear in E.
  • f(E) a*E + b, where a and b are constants.
  • the function f(E) is super-linear in E.
  • f(E) a*E A 2.
  • the function f(E) is sublinear in E.
  • f(E) log(E).
  • the function f(E) is selected from the group consisting of a sigmoidal function, a logistic function, and a hyperbolic function.
  • the function f(E) is sigmoidal in E.
  • f(E) erf(E)
  • erf the Gauss error function.
  • the function f(E) is logistic function in E.
  • f(E) a / (1 + e A (-b(E - c))).
  • the function f(E) is hyperbolic function in E.
  • a hyperbolic tangent (tanh) function is a non-limiting example of a hyperbolic function.
  • the function f(E) is selected from the group consisting of a Heaviside step function, and a rectified linear activation function.
  • the function f(E) is a Heaviside step function in E.
  • f(Ei) a if Ei > T
  • f(Ei) b if Ei ⁇ T
  • T is a pre-determined threshold that may be different for each value of i.
  • the function f(E) is a rectified linear activation function in E.
  • f(Ei) a * max(0, Ei-Ti) + b, where a and b are constants, and where T is a pre-determined threshold constant.
  • the number of coding mutations M is expressed linearly.
  • a “number of coding mutations M” refers to the number of coding mutations for one gene within a pre-defined set of genes.
  • RNA sequencing data is used to determine the number of coding mutations M for each gene within a pre-defined set of genes.
  • RNA sequencing data is used to determine the number of coding mutations M for each gene within a set of genes X.
  • the function g(M) is selected from the group consisting of a linear function, and a Heaviside step function.
  • the function g(M) is a linear function in M.
  • g(M) a*M + b, and where a and b are constants.
  • the function g(M) is a Heaviside step function in M.
  • g(Mi) a if Mi > T
  • g(Mi) b if Mi ⁇ T
  • T is a pre-determined threshold that may be different for each value of i.
  • the determination of the expression level E for each gene within a set of genes X is performed via methods selected from the group consisting of the use of de novo transcriptome assembly approach, and a genome-guided transcriptome assembly. In an aspect, the determination of the expression level E for each gene within a set of genes X is performed via the use of de novo transcriptome assembly approach, where contiguous sequences (contigs) are assembled in silico from pair-wise overlapping reads. In an aspect, the determination of the expression level E for each gene within a set of genes X is performed via the use of a genome-guided transcriptome assembly, where reads are aligned to reference genomes or transcript sequences.
  • a method comprises the use of a reverse transcription primer.
  • a reverse transcription primer refers to a primer used in a reverse transcription reaction, where RNA in an RNA sample is converted to complementary DNA (cDNA).
  • complementary DNA or “cDNA” refers to a DNA copy of a messenger RNA (mRNA) molecule produced by a reverse transcriptase.
  • a reverse transcription primer comprises at least 1 degenerate nucleotide.
  • a reverse transcription primer comprises at least 2 degenerate nucleotides.
  • a reverse transcription primer comprises at least 3 degenerate nucleotides.
  • a reverse transcription primer comprises at least 4 degenerate nucleotides. In an aspect, a reverse transcription primer comprises at least 5 degenerate nucleotides. In an aspect, a reverse transcription primer comprises at least 6 degenerate nucleotides. In an aspect, a reverse transcription primer comprises at least 7 degenerate nucleotides. In an aspect, a reverse transcription primer comprises at least 8 degenerate nucleotides. In an aspect, a reverse transcription primer comprises at least 9 degenerate nucleotides. In an aspect, a reverse transcription primer comprises at least 10 degenerate nucleotides. In an aspect, a reverse transcription primer comprises between 1 and 5 degenerate nucleotides.
  • a reverse transcription primer comprises between 1 and 10 degenerate nucleotides. In an aspect, a reverse transcription primer comprises between 1 and 15 degenerate nucleotides. In an aspect, a reverse transcription primer comprises a random hexamer. As is known in the art, a “hexamer” comprises six nucleotides. In an aspect, a reverse transcription primer comprises a polyT string.
  • a “polyT string” refers to at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 25, or at least 30 consecutive thymine nucleobases.
  • this disclosure provides at least one DNA polymerase.
  • a “DNA polymerase” refers to an enzyme that is capable of catalyzing the synthesis of a DNA molecule from nucleoside triphosphates.
  • DNA polymerases add a nucleotide to the 3' end of a DNA strand one nucleotide at a time, creating an antiparallel DNA strand as compared to a template DNA strand. DNA polymerases are unable to begin a new DNA molecule de novo; they require a primer to which it can add a first new nucleotide.
  • a “reagent” refers to any substance or compound added to a mixture to cause a chemical reaction or to test if a chemical reaction occurs.
  • a reagent comprises a component selected from the group consisting of magnesium, at least one dNTP, phosphatase, betaine, dimethyl sulfoxide (DMSO), and tetramethylammonium chloride (TMAC).
  • Non-limiting examples of reagents and buffers needed for DNA polymerase extension include Tris-HCl, potassium chloride, magnesium chloride, oligonucleotide primers, deoxynucleotides (dNTPs), betaine, and dimethyl sulfoxide.
  • Tris-HCl Tris-HCl
  • potassium chloride potassium chloride
  • magnesium chloride oligonucleotide primers
  • dNTPs deoxynucleotides
  • betaine betaine
  • dimethyl sulfoxide dimethyl sulfoxide
  • DNA polymerases can extend primers at different temperatures, depending on the DNA polymerase.
  • a DNA polymerase extends primers at a temperature of at least 40°C.
  • a DNA polymerase extends primers at a temperature of at least 50°C.
  • a DNA polymerase extends primers at a temperature of at least 55°C.
  • a DNA polymerase extends primers at a temperature of at least 60°C.
  • a DNA polymerase extends primers at a temperature of at least 65°C.
  • a DNA polymerase extends primers at a temperature of at least 70°C.
  • a DNA polymerase extends primers at a temperature of at least 75°C.
  • a DNA polymerase extends primers at a temperature of at least 80°C.
  • Primers can bind, or anneal, to a complementary sequence at a variety of temperatures, depending on the structure and length of the sequences involved.
  • primer binding occurs at a temperature of at least 35°C.
  • primer binding occurs at a temperature of at least 40°C.
  • primer binding occurs at a temperature of at least 45°C.
  • primer binding occurs at a temperature of at least 50°C.
  • primer binding occurs at a temperature of at least 55°C.
  • primer binding occurs at a temperature of at least 60°C.
  • primer binding occurs at a temperature of at least 65°C.
  • primer binding occurs at a temperature of at least 70°C.
  • DNA polymerase extension and primer binding occur at different temperatures. In an aspect, DNA polymerase extension and primer binding occur at the same temperature.
  • a DNA polymerase is a thermostable DNA polymerase.
  • a “thermostable DNA polymerase” refers to DNA polymerases that can function at high temperatures (e.g, greater than 65°C) and can survive higher temperatures (e.g, up to about 100°C). Thermostable DNA polymerases often have maximal catalytic activity at temperatures between 70°C and 80°C.
  • a thermostable DNA polymerase is selected from the group consisting of comprising Taq DNA polymerase, Phusion® DNA polymerase, Q5® DNA polymerase, and KAPA High Fidelity DNA polymerase.
  • a DNA polymerase is a non-thermostable DNA polymerase.
  • a “non-thermostable DNA polymerase” refers to DNA polymerases that cannot function at high temperatures.
  • a non-thermostable DNA polymerase is selected from the group consisting of phi29 DNA polymerase and Bst DNA polymerase.
  • an “amplicon” refers to a copy of DNA made via PCR.
  • a method comprises preparing a sequencing library.
  • sequencing library refers to a pool of nucleic acid molecules.
  • the sequencing library comprises a pool of whole genomic sequences, subgenomic fragments, cDNA, cDNA fragments, RNA, mRNA, RNA fragments, or a combination thereof.
  • the sequencing library comprises a pool of whole genomic sequences.
  • the sequencing library comprises a pool of subgenomic fragments.
  • the sequencing library comprises a pool of cDNA.
  • the sequencing library comprises a pool of cDNA fragments.
  • the sequencing library comprises a pool of RNA.
  • the sequencing library comprises a pool of mRNA. In an aspect, the sequencing library comprises a pool of RNA fragments. In an aspect, the sequencing library comprises a pool of nucleic acid molecules with adapter sequences attached. In an aspect, a sequencing library comprises a pool of DNA with adapter sequences attached. In an aspect, a sequencing library comprises a pool of RNA with adapter sequences attached. In an aspect, the sequencing library comprises a pool of whole genomic sequences with adapter sequences attached. In an aspect, the sequencing library comprises a pool of subgenomic fragments with adapter sequences attached. In an aspect, the sequencing library comprises a pool of cDNA with adapter sequences attached.
  • the sequencing library comprises a pool of cDNA fragments with adapter sequences attached. In an aspect, the sequencing library comprises a pool of mRNA with adapter sequences attached. In an aspect, the sequencing library comprises a pool of RNA fragments with adapter sequences attached. In an aspect, the adapter sequence can be located at one or both ends. In an aspect, a portion or all of the sequencing library comprises an adapter sequence. In an aspect, the adapter sequence can be useful for a sequencing method. In an aspect, the adapter sequence is useful for NGS. In an aspect, the adapter sequence is useful for amplification. In an aspect, the adapter sequence is useful for reverse transcription. In an aspect, the adapter sequence is useful for cloning into a vector.
  • a method comprises at least one enrichment step of targeted RNA or cDNA molecules during preparation of sequencing libraries.
  • the enrichment step is selected from the group consisting of pulldown probes, and amplification by target specific primers.
  • the enrichment of targeted RNA or cDNA molecules during preparation of sequencing libraries is performed via pulldown probes.
  • the enrichment of targeted RNA or cDNA molecules during preparation of sequencing libraries is performed via amplification by target specific primers.
  • preparing the sequencing library comprises a step selected from the group consisting of reverse transcription, PCR amplification, ligation, ribosomal RNA depletion, labeling with unique molecular identifier (UMI).
  • preparing the sequencing library comprises a reverse transcription step.
  • preparing the sequencing library comprises a reverse transcription step with random primers.
  • preparing the sequencing library comprises a PCR amplification step.
  • preparing the sequencing library comprises a PCR amplification step with random primers.
  • preparing the sequencing library comprises a ligation step.
  • preparing the sequencing library comprises a ribosomal RNA depletion step.
  • preparing the sequencing library comprises a labeling with UMI step.
  • a method comprises high-throughput sequencing.
  • a method comprises subjecting a sequencing library to high-throughput sequencing.
  • “high-throughput sequencing” refers to any sequencing method that is capable of sequencing multiple (e.g, tens, hundreds, thousands, millions, hundreds of millions) nucleic acid molecules in parallel.
  • Sanger sequencing is not high- throughput sequencing.
  • high-throughput sequencing comprises the use of a sequencing-by-synthesis (SBS) flow cell.
  • SBS flow cell is selected from the group consisting of an Illumina SBS flow cell and a Pacific Biosciences (PacBio) SBS flow cell.
  • high-throughput sequencing is performed via electrical current measurements in conjunction with an Oxford nanopore.
  • a method comprises sequencing-by-synthesis.
  • a method comprises nanopore-based sequencing.
  • a method comprises targeted RNA sequencing for a pre-specified set of genes.
  • a method comprises long- read sequencing.
  • a “sequencing read” or “read” refers to a nucleotide sequence of a single nucleic acid molecule generated via a high-throughput sequencing method.
  • sequencing reads are provided in a FASTX or FASTQ file type.
  • a sequencing read comprises a UMI sequence.
  • a sequencing read comprises a sequence from a gene.
  • a sequencing read comprises a UMI sequence and a sequence from a gene.
  • the term “sequencing read” and “read” are intended to be interchangeable.
  • RNA sequencing data comprises reads, and the minimal length of each read is selected from the group consisting of 25 nucleotides long, 50 nucleotides long, 75 nucleotides long, 150 nucleotides long, 200 nucleotides long, 300 nucleotides long, 1000 nucleotides long, 5000 nucleotides long, 10000 nucleotides long, and 50000 nucleotides long.
  • the minimal length of each read from RNA sequencing data is 25 nucleotides long.
  • the minimal length of each read from RNA sequencing data is 50 nucleotides long.
  • the minimal length of each read from RNA sequencing data is 75 nucleotides long.
  • the minimal length of each read from RNA sequencing data is 150 nucleotides long. In an aspect, the minimal length of each read from RNA sequencing data is 200 nucleotides long. In an aspect, the minimal length of each read from RNA sequencing data is 300 nucleotides long. In an aspect, the minimal length of each read from RNA sequencing data is 1000 nucleotides long. In an aspect, the minimal length of each read from RNA sequencing data is 5000 nucleotides long. In an aspect, the minimal length of each read from RNA sequencing data is 10000 nucleotides long. In an aspect, the minimal length of each read from RNA sequencing data is 50000 nucleotides long. In an aspect, the minimal length of each read from RNA sequencing data is variable. In an aspect, the minimal length of each read from RNA sequencing data is identical.
  • a sequencing read comprises at least 10 nucleotides. In an aspect, a sequencing read comprises at least 25 nucleotides. In an aspect, a sequencing read comprises at least 50 nucleotides. In an aspect, a sequencing read comprises at least 100 nucleotides. In an aspect, a sequencing read comprises at least 250 nucleotides. In an aspect, a sequencing read comprises at least 500 nucleotides. In an aspect, a sequencing read comprises at least 1000 nucleotides. [0064] In an aspect, a sequencing read comprises between 10 nucleotides and 10,000 nucleotides. In an aspect, a sequencing read comprises between 10 nucleotides and 5000 nucleotides.
  • a sequencing read comprises between 10 nucleotides and 1000 nucleotides. In an aspect, a sequencing read comprises between 10 nucleotides and 500 nucleotides. In an aspect, a sequencing read comprises between 10 nucleotides and 100 nucleotides. In an aspect, a sequencing read comprises between 25 nucleotides and 150 nucleotides.
  • the determination of the expression level E for each gene within a set of genes X comprises considering the total amount of reads mapped to one particular transcript. In an aspect, the determination of the expression level E for each gene within a set of genes X comprises a function L of the total amount of reads mapped to one particular transcript and the total length of the transcripts.
  • the determination of the expression level E for each gene within a set of genes X comprises normalizing the value.
  • a set of genes Y is used to normalize or is compared to the expression level E for each gene within the set of genes X.
  • the expression level E for each gene within a set of genes X from one particular sample or sample type is normalized or compared to the expression level E for each gene within the same set of genes X from a different sample or sample type.
  • the number of coding mutations M is estimated by the homologous number of reads (align) to a reference genome or trans criptome, with at least one gap, one base change or one nucleotide change, and up to 10% of the total bases or total nucleotides are different.
  • the number of coding mutations M is estimated by the number of contiguous sequences (contigs) assembled in silico from pair-wise overlapping reads that is homologous to a reference genome or transcriptome with at least one gap, one base change or one nucleotide change, and up to 10% of the total bases or total nucleotides are different.
  • sequencing an amplicon, or a plurality of amplicons is performed using next-generation sequencing technologies.
  • sequencing an amplicon, or a plurality of amplicons is performed using a sequencing instrument selected from the group consisting of an Oxford Nanopore sequencer, a PacBio sequencer, an Illumina Miseq sequencer, an Illumina MiniSeq sequencer, an Illumina NextS eq sequencer, an Ion Torrent sequencer, and an Illumina Hiseq sequencer to generate at least one sequencing read.
  • a sequencing read is aligned to a reference sequence to identify an RNA splicing variant.
  • next generation sequencing refers to any high-throughput sequencing technology.
  • next-generation sequencing include single-molecule real-time sequencing (e.g., Pacific Biosciences), Ion Torrent sequencing, sequencing-by-synthesis (e.g., Illumina), sequencing by ligation (SOLiD sequencing), nanopore sequencing, and GenapSys sequencing.
  • high-throughput sequencing refers to any sequences method that is capable of sequencing multiple (e.g, tens, hundreds, thousands, millions, hundreds of millions) DNA molecules in parallel. In an aspect, Sanger sequencing is not high-throughput sequencing. In an aspect, high- throughput sequencing comprises the use of a sequencing-by-synthesis (SBS) flow cell.
  • SBS sequencing-by-synthesis
  • an SBS flow cell is selected from the group consisting of an Illumina SBS flow cell and a Pacific Biosciences (PacBio) SBS flow cell.
  • high-throughput sequencing is performed via electrical current measurements in conjunction with an Oxford nanopore.
  • one nucleic acid molecule can be complementary to a second nucleic acid molecule (e.g, a Target RNA Region subsequence).
  • a second nucleic acid molecule e.g, a Target RNA Region subsequence
  • the sequence of a nucleic acid molecule need not be 100% complementary to that of its target nucleic acid molecule to be specifically hybridizable or hybridizable.
  • a polynucleotide may hybridize over one or more segments such that intervening or adjacent segments are not involved in the hybridization event (e.g., a loop structure or hairpin structure).
  • an antisense nucleic acid molecule in which 18 of 20 nucleotides of the antisense compound are complementary to a target region, and would therefore specifically hybridize would represent 90 percent complementarity.
  • the remaining noncomplementary nucleotides may be clustered or interspersed with complementary nucleotides and need not be contiguous to each other or to complementary nucleotides. Percent complementarity between particular stretches of nucleic acid sequences within nucleic acids can be determined routinely using BLAST® programs (basic local alignment search tools) and PowerBLAST programs known in the art (see Altschul et al., J. Mol.
  • a DNA molecule comprises a UMI.
  • an RNA molecule comprises a UMI.
  • a primer comprises a UMI.
  • a “unique molecular identifier” refers to a unique nucleotide sequence that serves as a molecular barcode for an individual molecule. UMIs are often attached to DNA molecules in a sample library to uniquely tag each molecule. UMIs enable error correction and increased accuracy during sequencing of DNA molecules.
  • a UMI sequence comprises between 7 nucleotides and 30 nucleotides. In an aspect, a UMI sequence comprises between 5 nucleotides and 40 nucleotides. In an aspect, a UMI sequence comprises between 10 nucleotides and 20 nucleotides. In an aspect, a UMI sequence comprises at least 5 nucleotides. In an aspect, a UMI sequence comprises at least 7 nucleotides. In an aspect, a UMI sequence comprises at least 10 nucleotides. In an aspect, a UMI sequence comprises at least 15 nucleotides. In an aspect, a UMI sequence comprises fewer than 50 nucleotides. In an aspect, a UMI sequence comprises fewer than 40 nucleotides. In an aspect, a UMI sequence comprises fewer than 30 nucleotides. In an aspect, a UMI sequence comprises fewer than 20 nucleotides.
  • a UMI sequence comprises between 7 degenerate nucleotides and 30 degenerate nucleotides. In an aspect, a UMI sequence comprises between 5 degenerate nucleotides and 40 degenerate nucleotides. In an aspect, a UMI sequence comprises between 10 degenerate nucleotides and 20 degenerate nucleotides. In an aspect, a UMI sequence comprises at least 5 degenerate nucleotides. In an aspect, a UMI sequence comprises at least 7 degenerate nucleotides. In an aspect, a UMI sequence comprises at least 10 degenerate nucleotides.
  • a UMI sequence comprises at least 15 degenerate nucleotides. In an aspect, a UMI sequence comprises fewer than 50 degenerate nucleotides. In an aspect, a UMI sequence comprises fewer than 40 degenerate nucleotides. In an aspect, a UMI sequence comprises fewer than 30 degenerate nucleotides. In an aspect, a UMI sequence comprises fewer than 20 degenerate nucleotides.
  • each degenerate nucleotide in a UMI sequence is individually selected from the group consisting of N, B, D, H, V, S, W, Y, R, M, and K.
  • a UMI sequence comprises between 7 degenerate nucleotides and 30 degenerate nucleotides, where each degenerate nucleotide is selected from the group consisting of N, B, D, H, V, S, W, Y, R, M, and K.
  • a method comprises removal of sequencing reads where the UMI sequence of the sequencing reads do not comprise a predefined UMI degenerate base design pattern.
  • a “predefined UMI degenerate base design pattern” refers to a UMI sequence comprising the expected number of degenerate bases and the expected type of degenerate bases for a given method. Non-limiting examples of inappropriate degenerate base designs would include UMI sequences comprising too many degenerate bases or too few degenerate bases.
  • a method comprises removal of at least one sequencing read where the UMI sequence of the at least one sequencing read does not comprise a predefined UMI degenerate base design pattern.
  • a method comprises removal of at least two sequencing reads where the UMI sequence of the at least two sequencing reads do not comprise a predefined UMI degenerate base design pattern. In an aspect, a method comprises removal of at least three sequencing reads where the UMI sequence of the at least three sequencing reads do not comprise a predefined UMI degenerate base design pattern. In an aspect, a method comprises removal of at least four sequencing reads where the UMI sequence of the at least four sequencing reads do not comprise a predefined UMI degenerate base design pattern. In an aspect, a method comprises removal of at least five sequencing reads where the UMI sequence of the at least five sequencing reads do not comprise a predefined UMI degenerate base design pattern.
  • the determination of the expression level E for each gene within a set of genes X comprises considering the total amount of similar molecules harboring the same UMI. In an aspect, the determination of the expression level E for each gene within a set of genes X comprises a function L of the total amount of similar molecules harboring the same UMI mapping one particular transcript and the total length of the transcripts.
  • the prognostication of an outcome for a patient comprises differentially prognosticating the outcome of the patient based on the value of the EW- TMB.
  • the EW-TMB is used to determine the high tumor mutation burden (TMB-H) for solid and non-solid tumors selected from the group consisting of breast cancer, bladder cancer, cervical cancer, colon cancer, head and neck cancer, liver cancer, renal cell cancer, skin cancer, stomach cancer, rectal cancer, lymphomas, Non-Hodgkin lymphoma, Hodgkin lymphoma, and any other type of cancer.
  • the EW-TMB is used to determine the TMB-H for breast cancer.
  • the EW-TMB is used to determine the TMB-H for bladder cancer. In an aspect, the EW-TMB is used to determine the TMB-H for cervical cancer. In an aspect, the EW-TMB is used to determine the TMB-H for colon cancer. In an aspect, the EW-TMB is used to determine the TMB-H for head and neck cancer. In an aspect, the EW-TMB is used to determine the TMB-H for liver cancer. In an aspect, the EW-TMB is used to determine the TMB-H for renal cell cancer. In an aspect, the EW-TMB is used to determine the TMB-H for skin cancer. In an aspect, the EW-TMB is used to determine the TMB-H for stomach cancer.
  • the EW-TMB is used to determine the TMB-H for rectal cancer. In an aspect, the EW- TMB is used to determine the TMB-H for lymphomas. In an aspect, the EW-TMB is used to determine the TMB-H for Non-Hodgkin lymphoma. In an aspect, the EW-TMB is used to determine the TMB-H for Hodgkin lymphoma.
  • the EW-TMB is used to determine the minimum residual disease (MRD) for solid and non-solid tumors selected from the group consisting of breast cancer, bladder cancer, cervical cancer, colon cancer, head and neck cancer, liver cancer, renal cell cancer, skin cancer, stomach cancer, rectal cancer, lymphomas, Non-Hodgkin lymphoma, Hodgkin lymphoma, and any other type of cancer.
  • MRD minimum residual disease
  • the EW-TMB is used to determine the MRD for breast cancer.
  • the EW-TMB is used to determine the MRD for bladder cancer.
  • the EW-TMB is used to determine the MRD for cervical cancer.
  • the EW-TMB is used to determine the MRD for colon cancer.
  • the EW-TMB is used to determine the MRD for head and neck cancer. In an aspect, the EW-TMB is used to determine the MRD for liver cancer. In an aspect, the EW-TMB is used to determine the MRD for renal cell cancer. In an aspect, the EW-TMB is used to determine the MRD for skin cancer. In an aspect, the EW-TMB is used to determine the MRD for stomach cancer. In an aspect, the EW-TMB is used to determine the MRD for rectal cancer. In an aspect, the EW-TMB is used to determine the MRD for lymphomas. In an aspect, the EW-TMB is used to determine the MRD for NonHodgkin lymphoma. In an aspect, the EW-TMB is used to determine the MRD for Hodgkin lymphoma.
  • determining treatment prescription for a patient comprises differentially prescribing treatments for the patient based on the value of the EW-TMB.
  • the EW-TMB is used as a predictive biomarker of response to immune checkpoint blockade therapy, with drugs targeting one or more specific sites selected from the group consisting of CTLA-5, PD-1 and PD-L1.
  • the EW-TMB is used as a predictive biomarker of response to drugs targeting CTLA-5.
  • the EW-TMB is used as a predictive biomarker of response to drugs targeting PD-1.
  • the EW-TMB is used as a predictive biomarker of response to drugs targeting PD- Ll.
  • the EW-TMB is used as a predictive biomarker for solid and non-solid tumors selected from the group consisting of breast cancer, bladder cancer, cervical cancer, colon cancer, head and neck cancer, liver cancer, renal cell cancer, skin cancer, stomach cancer, rectal cancer, lymphomas, Non-Hodgkin lymphoma, Hodgkin lymphoma, and any other type of cancer.
  • the EW-TMB is used as a predictive biomarker for solid cancer.
  • the EW-TMB is used as a predictive biomarker for non-solid cancer.
  • the EW-TMB is used as a predictive biomarker for breast cancer.
  • the EW-TMB is used as a predictive biomarker for bladder cancer. In an aspect, the EW-TMB is used as a predictive biomarker for cervical cancer. In an aspect, the EW-TMB is used as a predictive biomarker for colon cancer. In an aspect, the EW-TMB is used as a predictive biomarker for head and neck cancer. In an aspect, the EW-TMB is used as a predictive biomarker for liver cancer. In an aspect, the EW-TMB is used as a predictive biomarker for renal cell cancer. In an aspect, the EW-TMB is used as a predictive biomarker for skin cancer. In an aspect, the EW-TMB is used as a predictive biomarker for stomach cancer.
  • the EW- TMB is used as a predictive biomarker for rectal cancer.
  • the EW-TMB is used as a predictive biomarker for lymphomas.
  • the EW-TMB is used as a predictive biomarker for Non-Hodgkin lymphoma.
  • the EW-TMB is used as a predictive biomarker for Hodgkin lymphoma.
  • the EW-TMB is used as a predictive biomarker of response to neoadjuvant therapy (before the primary treatment).
  • the neoadjuvant therapy is selected from the group consisting of chemotherapy, radiation therapy, hormone therapy, targeted therapy, and immune therapy.
  • the EW-TMB is used as a predictive biomarker of response to neoadjuvant chemotherapy.
  • the EW- TMB is used as a predictive biomarker of response to neoadjuvant radiation therapy.
  • the EW-TMB is used as a predictive biomarker of response to neoadjuvant hormone therapy.
  • the EW-TMB is used as a predictive biomarker of response to neoadjuvant targeted therapy. In an aspect, the EW-TMB is used as a predictive biomarker of response to neoadjuvant immune therapy.
  • the EW-TMB is used as a predictive biomarker of response to adjuvant therapy (after the primary treatment).
  • the adjuvant therapy is selected from the group consisting of chemotherapy, radiation therapy, hormone therapy, targeted therapy, and immune therapy.
  • the EW-TMB is used as a predictive biomarker of response to adjuvant chemotherapy.
  • the EW-TMB is used as a predictive biomarker of response to adjuvant radiation therapy.
  • the EW- TMB is used as a predictive biomarker of response to adjuvant hormone therapy.
  • the EW-TMB is used as a predictive biomarker of response to adjuvant targeted therapy.
  • the EW-TMB is used as a predictive biomarker of response to adjuvant immune therapy.
  • a method further comprises classifying the sample or the subject from which the sample was obtained as responsive to immune checkpoint blockade therapy. In an aspect, a method further comprises classifying the sample as responsive to immune checkpoint blockade therapy. In an aspect, a method further comprises classifying the subject from which the sample was obtained as responsive to immune checkpoint blockade therapy.
  • a method further comprises classifying the sample or the subject from which the sample was obtained as responsive to neoadjuvant therapy. In an aspect, a method further comprises classifying the sample as responsive to neoadjuvant therapy. In an aspect, a method further comprises classifying the subject from which the sample was obtained as responsive to neoadjuvant therapy.
  • a method further comprises classifying the sample or the subject from which the sample was obtained as responsive to adjuvant therapy. In an aspect, a method further comprises classifying the sample as responsive to adjuvant therapy. In an aspect, a method further comprises classifying the subject from which the sample was obtained as responsive to adjuvant therapy.
  • a method further comprises generating and delivering a report to the subject, or another person or entity.
  • the report is selected from the group consisting of an electronic report, a web-based report, a digital report, and a paper report.
  • the other person or entity is selected from the group consisting of a caregiver, a physician, an oncologist, a database, a server, a hospital, a clinic, a third- party payer, an insurance company, and a government office.
  • the report contains information that is selected from the group consisting of an EW-TMB, a TMB-
  • the report comprises the EW- TMB. In an aspect, the report comprises the TMB-H. In an aspect, the report comprises the results derived from the methods described herein. In an aspect, the report comprises information on the likelihood of patient response to a therapy. In an aspect, the report comprises information on the likelihood of patient response to immune check point blockade therapy. In an aspect, the report comprises information on the prognostication of patient outcome. In an aspect, the report comprises information on the determination of treatment prescription for the patient.
  • a method for calculating an Expression-Weighted Tumor Mutational Burden (EW- TMB) from a biological sample comprising: extracting RNA from the biological sample, performing RNA sequencing (RNA Seq) on the RNA, analyzing the RNA sequencing data to determine an expression level E for each gene i within a set of genes X, analyzing the RNA sequencing data to determine a number of coding mutations M for each gene i within the same set of genes X, and calculating an EW-TMB as the sum of the product of a function f(») of Ei and a function g(») of Mi for each gene i within the set of genes X, mathematically expressed as:
  • RNA sequencing is targeted RNA sequencing for a pre-specified set of genes.
  • RNA sequencing data comprises reads
  • the minimal length of each read is selected from the group consisting of 25 nucleotides long, 50 nucleotides long, 75 nucleotides long, 150 nucleotides long, 200 nucleotides long, 300 nucleotides long, 1000 nucleotides long, 5000 nucleotides long, 10000 nucleotides long, and 50000 nucleotides long.
  • RNA sequencing data comprises reads
  • determination of the expression level E for each gene within the set of genes X comprises considering the total amount of reads mapped to one particular transcript.
  • RNA sequencing data comprises reads
  • determination of the expression level E for each gene within the set of genes X comprises a function L of the total amount of reads mapped to one particular transcript and the total length of the transcripts.
  • FFPE Formalin-Fixed Paraffin-Embedded
  • a method for prognosticating the outcome of a patient comprising: collecting a biological sample from the patient, performing the method of any one of embodiments 1 to 33 to determine an EW-TMB, and differentially prognosticating the outcome of the patient based on the value of the EW-TMB.
  • the EW-TMB is used to determine the high tumor mutation burden (TMB-H) for solid and non-solid tumors selected from the group consisting of breast cancer, bladder cancer, cervical cancer, colon cancer, head and neck cancer, liver cancer, renal cell cancer, skin cancer, stomach cancer, rectal cancer, lymphomas, Non-Hodgkin lymphoma, and Hodgkin lymphoma.
  • EW-TMB is used to determine minimum residual disease (MRD) for solid and non-solid tumors selected from the group consisting of lymphomas, breast cancer, bladder cancer, cervical cancer, colon cancer, head and neck cancer, liver cancer, renal cell cancer, skin cancer, stomach cancer, rectal cancer, Non-Hodgkin lymphoma, and Hodgkin lymphoma.
  • MRD minimum residual disease
  • a method for determining treatment prescription for a patient comprising: collecting a biological sample from the patient, performing the method of any one of embodiments 1 to 33 to determine an EW-TMB, and differentially prescribing treatments for the patient based on the value of the EW- TMB.
  • RNA Seq RNA sequencing
  • Ei expression level of each gene i
  • Mi number of coding mutations M for each gene i (Mi) within the same set of genes X
  • EW-TMB based on Ei and Mi for each gene i within the set of genes X.

Landscapes

  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Public Health (AREA)
  • Primary Health Care (AREA)
  • Epidemiology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Physics & Mathematics (AREA)
  • Genetics & Genomics (AREA)
  • Biophysics (AREA)
  • Chemical & Material Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Molecular Biology (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biomedical Technology (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Pathology (AREA)
  • Medicinal Chemistry (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Surgery (AREA)
  • Urology & Nephrology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

La présente invention concerne des procédés de calcul d'une charge mutationnelle tumorale pondérée par l'expression à partir d'un échantillon biologique. L'ARN est extrait de l'échantillon biologique, suivi du séquençage de l'ARN et de l'analyse pondérée par l'expression des données de séquençage de l'ARN. La charge mutationnelle tumorale pondérée par l'expression calculée à partir de ces procédés peut être utilisée dans le pronostic sur l'évolution de l'état de patients et la détermination d'une prescription de traitement pour des patients.
PCT/US2022/052153 2021-12-08 2022-12-07 Charge mutationnelle tumorale pondérée par l'expression en tant que biomarqueur oncologique WO2023107570A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163287455P 2021-12-08 2021-12-08
US63/287,455 2021-12-08

Publications (1)

Publication Number Publication Date
WO2023107570A1 true WO2023107570A1 (fr) 2023-06-15

Family

ID=86731165

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2022/052153 WO2023107570A1 (fr) 2021-12-08 2022-12-07 Charge mutationnelle tumorale pondérée par l'expression en tant que biomarqueur oncologique

Country Status (1)

Country Link
WO (1) WO2023107570A1 (fr)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190018926A1 (en) * 2017-07-14 2019-01-17 Cofactor Genomics, Inc. Immuno-oncology applications using next generation sequencing
WO2020136133A1 (fr) * 2018-12-23 2020-07-02 F. Hoffmann-La Roche Ag Classification de tumeur basée sur une charge mutationnelle tumorale prédite

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190018926A1 (en) * 2017-07-14 2019-01-17 Cofactor Genomics, Inc. Immuno-oncology applications using next generation sequencing
WO2020136133A1 (fr) * 2018-12-23 2020-07-02 F. Hoffmann-La Roche Ag Classification de tumeur basée sur une charge mutationnelle tumorale prédite

Similar Documents

Publication Publication Date Title
US20220195530A1 (en) Identification and use of circulating nucleic acid tumor markers
Xuan et al. Next-generation sequencing in the clinic: promises and challenges
Newman et al. An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage
KR102393608B1 (ko) 희귀 돌연변이 및 카피수 변이를 검출하기 위한 시스템 및 방법
EP3322816B1 (fr) Système et méthodologie pour l'analyse de données génomiques obtenues à partir d'un sujet
US20230119938A1 (en) Methods of Preparing Dual-Indexed DNA Libraries for Bisulfite Conversion Sequencing
JP2022058904A (ja) 無細胞核酸の多重解像度分析のための方法
Strausberg et al. Sequence-based cancer genomics: progress, lessons and opportunities
US20200327954A1 (en) Methods and systems for differentiating somatic and germline variants
CN111534580A (zh) 用于检测遗传变异的方法和系统
US20130317083A1 (en) Non-coding transcripts for determination of cellular states
JP2022505050A (ja) プーリングを介した多数の試料の効率的な遺伝子型決定のための方法および試薬
Suela et al. DNA profiling analysis of 100 consecutive de novo acute myeloid leukemia cases reveals patterns of genomic instability that affect all cytogenetic risk groups
US20240013857A1 (en) Methods and systems for analyzing nucleic acid sequences
JP7407193B2 (ja) 可変の複製多重pcrを使用した配列決定方法
De Carvalho et al. miRNA genetic variants alter their secondary structure and expression in patients with RASopathies syndromes
US20240141425A1 (en) Correcting for deamination-induced sequence errors
US20200071754A1 (en) Methods and systems for detecting contamination between samples
JP2022512848A (ja) エピジェネティック区画アッセイを較正するための方法、組成物およびシステム
JP2021518106A (ja) 治療用核酸構築物の非侵襲的な検出およびモニタリングのための方法
WO2023107570A1 (fr) Charge mutationnelle tumorale pondérée par l'expression en tant que biomarqueur oncologique
EP3847276A2 (fr) Procédés et systèmes pour détecter un déséquilibre allélique dans des échantillons d'acides nucléiques acellulaires
Amr et al. Targeted Hybrid Capture for Inherited Disease Panels
Rebollar-Vega et al. Clinical Applications of Next-Generation Sequencing
WO2023056300A1 (fr) Biopsies liquides personnalisées dans le cadre du cancer en utilisant des amorces provenant d'une banque d'amorces

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22905085

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE