CN116157533A - Capturing genetic targets using hybridization methods - Google Patents
Capturing genetic targets using hybridization methods Download PDFInfo
- Publication number
- CN116157533A CN116157533A CN202180031221.3A CN202180031221A CN116157533A CN 116157533 A CN116157533 A CN 116157533A CN 202180031221 A CN202180031221 A CN 202180031221A CN 116157533 A CN116157533 A CN 116157533A
- Authority
- CN
- China
- Prior art keywords
- capture
- analyte
- nucleic acid
- sequence
- extended
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 283
- 238000009396 hybridization Methods 0.000 title abstract description 82
- 230000002068 genetic effect Effects 0.000 title description 29
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 589
- 239000000523 sample Substances 0.000 claims abstract description 557
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 553
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 553
- 239000012491 analyte Substances 0.000 claims abstract description 448
- 108091034117 Oligonucleotide Proteins 0.000 claims abstract description 385
- 239000012472 biological sample Substances 0.000 claims abstract description 322
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 212
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 claims abstract description 209
- 239000000758 substrate Substances 0.000 claims abstract description 151
- 238000012163 sequencing technique Methods 0.000 claims abstract description 96
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 81
- 230000000295 complement effect Effects 0.000 claims description 165
- 239000003795 chemical substances by application Substances 0.000 claims description 152
- 210000004027 cell Anatomy 0.000 claims description 110
- 230000027455 binding Effects 0.000 claims description 106
- 206010028980 Neoplasm Diseases 0.000 claims description 91
- 239000002299 complementary DNA Substances 0.000 claims description 83
- 201000011510 cancer Diseases 0.000 claims description 82
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 76
- 108010090804 Streptavidin Proteins 0.000 claims description 54
- 239000011324 bead Substances 0.000 claims description 54
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 48
- 230000037361 pathway Effects 0.000 claims description 48
- 108090001008 Avidin Proteins 0.000 claims description 46
- 229960002685 biotin Drugs 0.000 claims description 40
- 239000011616 biotin Substances 0.000 claims description 40
- 239000003153 chemical reaction reagent Substances 0.000 claims description 39
- 102000053602 DNA Human genes 0.000 claims description 38
- 108020004414 DNA Proteins 0.000 claims description 38
- 235000020958 biotin Nutrition 0.000 claims description 38
- 125000003729 nucleotide group Chemical group 0.000 claims description 36
- 239000000203 mixture Substances 0.000 claims description 35
- 239000002773 nucleotide Substances 0.000 claims description 35
- 201000010099 disease Diseases 0.000 claims description 28
- 102000004190 Enzymes Human genes 0.000 claims description 25
- 108090000790 Enzymes Proteins 0.000 claims description 25
- 230000002074 deregulated effect Effects 0.000 claims description 24
- WZUVPPKBWHMQCE-UHFFFAOYSA-N Haematoxylin Chemical compound C12=CC(O)=C(O)C=C2CC2(O)C1C1=CC=C(O)C(O)=C1OC2 WZUVPPKBWHMQCE-UHFFFAOYSA-N 0.000 claims description 22
- 150000003384 small molecules Chemical class 0.000 claims description 22
- 208000035475 disorder Diseases 0.000 claims description 20
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 claims description 17
- 150000001720 carbohydrates Chemical class 0.000 claims description 13
- 210000002865 immune cell Anatomy 0.000 claims description 13
- YQGOJNYOYNNSMM-UHFFFAOYSA-N eosin Chemical compound [Na+].OC(=O)C1=CC=CC=C1C1=C2C=C(Br)C(=O)C(Br)=C2OC2=C(Br)C(O)=C(Br)C=C21 YQGOJNYOYNNSMM-UHFFFAOYSA-N 0.000 claims description 10
- 238000012165 high-throughput sequencing Methods 0.000 claims description 9
- 238000010166 immunofluorescence Methods 0.000 claims description 9
- 108091036066 Three prime untranslated region Proteins 0.000 claims description 7
- 239000012188 paraffin wax Substances 0.000 claims description 7
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 6
- 230000005754 cellular signaling Effects 0.000 claims description 6
- 210000003061 neural cell Anatomy 0.000 claims description 6
- 238000003364 immunohistochemistry Methods 0.000 claims description 5
- 238000001514 detection method Methods 0.000 abstract description 35
- 230000002708 enhancing effect Effects 0.000 abstract description 3
- 210000001519 tissue Anatomy 0.000 description 93
- 235000018102 proteins Nutrition 0.000 description 66
- 229920002477 rna polymer Polymers 0.000 description 56
- 230000014509 gene expression Effects 0.000 description 52
- 108010029485 Protein Isoforms Proteins 0.000 description 51
- 102000001708 Protein Isoforms Human genes 0.000 description 51
- 238000010186 staining Methods 0.000 description 42
- 230000000875 corresponding effect Effects 0.000 description 35
- 108020004999 messenger RNA Proteins 0.000 description 33
- 108091028043 Nucleic acid sequence Proteins 0.000 description 31
- 238000004458 analytical method Methods 0.000 description 31
- 210000005036 nerve Anatomy 0.000 description 27
- 239000000872 buffer Substances 0.000 description 25
- 230000003321 amplification Effects 0.000 description 24
- 229940088598 enzyme Drugs 0.000 description 24
- 238000011223 gene expression profiling Methods 0.000 description 24
- 238000003384 imaging method Methods 0.000 description 24
- 238000003199 nucleic acid amplification method Methods 0.000 description 24
- 238000006243 chemical reaction Methods 0.000 description 23
- 238000003776 cleavage reaction Methods 0.000 description 23
- 230000007017 scission Effects 0.000 description 23
- 239000000243 solution Substances 0.000 description 23
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 21
- 238000012732 spatial analysis Methods 0.000 description 21
- -1 antibodies Proteins 0.000 description 20
- 230000008823 permeabilization Effects 0.000 description 17
- 238000003752 polymerase chain reaction Methods 0.000 description 16
- 238000005406 washing Methods 0.000 description 15
- 230000001900 immune effect Effects 0.000 description 14
- 239000000543 intermediate Substances 0.000 description 14
- 238000002360 preparation method Methods 0.000 description 14
- 238000010839 reverse transcription Methods 0.000 description 13
- 238000011282 treatment Methods 0.000 description 13
- 102100031780 Endonuclease Human genes 0.000 description 12
- 235000014633 carbohydrates Nutrition 0.000 description 12
- 238000010586 diagram Methods 0.000 description 12
- 230000008685 targeting Effects 0.000 description 12
- 230000000903 blocking effect Effects 0.000 description 11
- 239000008191 permeabilizing agent Substances 0.000 description 11
- 102000003960 Ligases Human genes 0.000 description 10
- 108090000364 Ligases Proteins 0.000 description 10
- 239000003599 detergent Substances 0.000 description 10
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 9
- 206010006187 Breast cancer Diseases 0.000 description 9
- 208000026310 Breast neoplasm Diseases 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 9
- 239000000427 antigen Substances 0.000 description 9
- 108091007433 antigens Proteins 0.000 description 9
- 102000036639 antigens Human genes 0.000 description 9
- 239000012530 fluid Substances 0.000 description 9
- 230000002055 immunohistochemical effect Effects 0.000 description 9
- 230000036961 partial effect Effects 0.000 description 9
- 239000002253 acid Substances 0.000 description 8
- 239000003431 cross linking reagent Substances 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 230000003993 interaction Effects 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 108020004418 ribosomal RNA Proteins 0.000 description 8
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 7
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 7
- 108020004566 Transfer RNA Proteins 0.000 description 7
- 150000001875 compounds Chemical class 0.000 description 7
- 238000001914 filtration Methods 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 239000003960 organic solvent Substances 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 108090000765 processed proteins & peptides Proteins 0.000 description 7
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 7
- 230000000007 visual effect Effects 0.000 description 7
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 6
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 6
- 108020004635 Complementary DNA Proteins 0.000 description 6
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 6
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 6
- 102000005593 Endopeptidases Human genes 0.000 description 6
- 108010059378 Endopeptidases Proteins 0.000 description 6
- 210000001744 T-lymphocyte Anatomy 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 229940098773 bovine serum albumin Drugs 0.000 description 6
- 238000004113 cell culture Methods 0.000 description 6
- 238000004042 decolorization Methods 0.000 description 6
- 238000013461 design Methods 0.000 description 6
- 230000001965 increasing effect Effects 0.000 description 6
- 230000003834 intracellular effect Effects 0.000 description 6
- 108091070501 miRNA Proteins 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 5
- 101150058571 OLIG2 gene Proteins 0.000 description 5
- 239000002981 blocking agent Substances 0.000 description 5
- 238000010804 cDNA synthesis Methods 0.000 description 5
- 238000012512 characterization method Methods 0.000 description 5
- 238000004132 cross linking Methods 0.000 description 5
- 238000001035 drying Methods 0.000 description 5
- 239000000975 dye Substances 0.000 description 5
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 5
- 208000005017 glioblastoma Diseases 0.000 description 5
- 238000007901 in situ hybridization Methods 0.000 description 5
- 238000011065 in-situ storage Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 238000005457 optimization Methods 0.000 description 5
- 238000003753 real-time PCR Methods 0.000 description 5
- 125000006850 spacer group Chemical group 0.000 description 5
- 238000003860 storage Methods 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- 102000012410 DNA Ligases Human genes 0.000 description 4
- 108010061982 DNA Ligases Proteins 0.000 description 4
- 108010042407 Endonucleases Proteins 0.000 description 4
- 108700024394 Exon Proteins 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 108090000284 Pepsin A Proteins 0.000 description 4
- 102000057297 Pepsin A Human genes 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 4
- 206010035226 Plasma cell myeloma Diseases 0.000 description 4
- 206010060862 Prostate cancer Diseases 0.000 description 4
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 4
- 101710086015 RNA ligase Proteins 0.000 description 4
- 108091023045 Untranslated Region Proteins 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 238000001574 biopsy Methods 0.000 description 4
- 210000000481 breast Anatomy 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 239000003596 drug target Substances 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- 230000001605 fetal effect Effects 0.000 description 4
- 238000000799 fluorescence microscopy Methods 0.000 description 4
- 238000010438 heat treatment Methods 0.000 description 4
- 208000026278 immune system disease Diseases 0.000 description 4
- 238000002372 labelling Methods 0.000 description 4
- 230000001537 neural effect Effects 0.000 description 4
- 229940111202 pepsin Drugs 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 108091093088 Amplicon Proteins 0.000 description 3
- 101000980980 Arabidopsis thaliana Phosphatidate cytidylyltransferase 5, chloroplastic Proteins 0.000 description 3
- 206010021143 Hypoxia Diseases 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- 108091005461 Nucleic proteins Proteins 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 102000035195 Peptidases Human genes 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 208000007660 Residual Neoplasm Diseases 0.000 description 3
- 108700009124 Transcription Initiation Site Proteins 0.000 description 3
- 238000002679 ablation Methods 0.000 description 3
- 238000003491 array Methods 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000012620 biological material Substances 0.000 description 3
- 239000003535 biological staining Substances 0.000 description 3
- 239000000090 biomarker Substances 0.000 description 3
- 238000004061 bleaching Methods 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 210000005013 brain tissue Anatomy 0.000 description 3
- 230000022131 cell cycle Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000002144 chemical decomposition reaction Methods 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 239000006167 equilibration buffer Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000007954 hypoxia Effects 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 238000011901 isothermal amplification Methods 0.000 description 3
- 210000001985 kidney epithelial cell Anatomy 0.000 description 3
- 238000000608 laser ablation Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 230000002503 metabolic effect Effects 0.000 description 3
- 239000002207 metabolite Substances 0.000 description 3
- 238000000386 microscopy Methods 0.000 description 3
- 201000000050 myeloid neoplasm Diseases 0.000 description 3
- 230000007472 neurodevelopment Effects 0.000 description 3
- 230000000926 neurological effect Effects 0.000 description 3
- 238000007481 next generation sequencing Methods 0.000 description 3
- 230000009871 nonspecific binding Effects 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 239000008363 phosphate buffer Substances 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 235000019419 proteases Nutrition 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 239000003161 ribonuclease inhibitor Substances 0.000 description 3
- 238000010008 shearing Methods 0.000 description 3
- 230000019491 signal transduction Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 2
- 101150021183 65 gene Proteins 0.000 description 2
- 102100024085 Alpha-aminoadipic semialdehyde dehydrogenase Human genes 0.000 description 2
- 108091023037 Aptamer Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 2
- 108091006146 Channels Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 2
- 101100310856 Drosophila melanogaster spri gene Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108010067770 Endopeptidase K Proteins 0.000 description 2
- KRHYYFGTRYWZRS-UHFFFAOYSA-N Fluorane Chemical compound F KRHYYFGTRYWZRS-UHFFFAOYSA-N 0.000 description 2
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 2
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 2
- 108091092584 GDNA Proteins 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000875075 Homo sapiens Cannabinoid receptor 2 Proteins 0.000 description 2
- 101000990902 Homo sapiens Matrix metalloproteinase-9 Proteins 0.000 description 2
- 101001102154 Homo sapiens Phosphatidylserine synthase 2 Proteins 0.000 description 2
- 101001116931 Homo sapiens Protocadherin alpha-6 Proteins 0.000 description 2
- 101000620856 Homo sapiens Rab5 GDP/GTP exchange factor Proteins 0.000 description 2
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 2
- 101000990915 Homo sapiens Stromelysin-1 Proteins 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 2
- 102100027998 Macrophage metalloelastase Human genes 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 2
- 102100030412 Matrix metalloproteinase-9 Human genes 0.000 description 2
- 108091027974 Mature messenger RNA Proteins 0.000 description 2
- 108020005196 Mitochondrial DNA Proteins 0.000 description 2
- 108010058682 Mitochondrial Proteins Proteins 0.000 description 2
- 102000006404 Mitochondrial Proteins Human genes 0.000 description 2
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 2
- 208000012902 Nervous system disease Diseases 0.000 description 2
- 208000025966 Neurological disease Diseases 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- 102000043276 Oncogene Human genes 0.000 description 2
- 102000016304 Origin Recognition Complex Human genes 0.000 description 2
- 108010067244 Origin Recognition Complex Proteins 0.000 description 2
- 102100039300 Phosphatidylserine synthase 2 Human genes 0.000 description 2
- 108091007412 Piwi-interacting RNA Proteins 0.000 description 2
- 102100024616 Platelet endothelial cell adhesion molecule Human genes 0.000 description 2
- 108091036407 Polyadenylation Proteins 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- 239000004372 Polyvinyl alcohol Substances 0.000 description 2
- 102100033721 Pro-MCH Human genes 0.000 description 2
- 102100034433 Protein kinase C-binding protein NELL2 Human genes 0.000 description 2
- 102100038669 Protein quaking Human genes 0.000 description 2
- 102100024278 Protocadherin alpha-6 Human genes 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- 108050007310 Rab39 Proteins 0.000 description 2
- 102100022851 Rab5 GDP/GTP exchange factor Human genes 0.000 description 2
- 102100022122 Ras-related C3 botulinum toxin substrate 1 Human genes 0.000 description 2
- 102100031921 Ras-related protein Rab-34, isoform NARR Human genes 0.000 description 2
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- 102100031206 Serine/threonine-protein kinase N1 Human genes 0.000 description 2
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- 241000191940 Staphylococcus Species 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 102100030416 Stromelysin-1 Human genes 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 102000008579 Transposases Human genes 0.000 description 2
- 108010020764 Transposases Proteins 0.000 description 2
- 239000007984 Tris EDTA buffer Substances 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 102000015098 Tumor Suppressor Protein p53 Human genes 0.000 description 2
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 230000004721 adaptive immunity Effects 0.000 description 2
- 230000001464 adherent effect Effects 0.000 description 2
- 230000003376 axonal effect Effects 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 230000031018 biological processes and functions Effects 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000012830 cancer therapeutic Substances 0.000 description 2
- BVKZGUZCCUSVTD-UHFFFAOYSA-N carbonic acid Chemical compound OC(O)=O BVKZGUZCCUSVTD-UHFFFAOYSA-N 0.000 description 2
- 230000003915 cell function Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 238000005202 decontamination Methods 0.000 description 2
- 230000003588 decontaminative effect Effects 0.000 description 2
- 230000003831 deregulation Effects 0.000 description 2
- 238000007847 digital PCR Methods 0.000 description 2
- 230000008482 dysregulation Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 210000002919 epithelial cell Anatomy 0.000 description 2
- 238000011067 equilibration Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 238000007672 fourth generation sequencing Methods 0.000 description 2
- 238000013467 fragmentation Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 230000004547 gene signature Effects 0.000 description 2
- 230000015788 innate immune response Effects 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 210000002751 lymph Anatomy 0.000 description 2
- 238000007403 mPCR Methods 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 210000003470 mitochondria Anatomy 0.000 description 2
- 230000002438 mitochondrial effect Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 210000004498 neuroglial cell Anatomy 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 210000000440 neutrophil Anatomy 0.000 description 2
- 238000001921 nucleic acid quantification Methods 0.000 description 2
- 229920002113 octoxynol Polymers 0.000 description 2
- 210000004248 oligodendroglia Anatomy 0.000 description 2
- 230000006712 oncogenic signaling pathway Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 235000011007 phosphoric acid Nutrition 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 238000006116 polymerization reaction Methods 0.000 description 2
- 102000054765 polymorphisms of proteins Human genes 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 229920002451 polyvinyl alcohol Polymers 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000004393 prognosis Methods 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000003908 quality control method Methods 0.000 description 2
- 238000010791 quenching Methods 0.000 description 2
- 230000000171 quenching effect Effects 0.000 description 2
- 108010062302 rac1 GTP Binding Protein Proteins 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 229910052709 silver Inorganic materials 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- 238000001847 surface plasmon resonance imaging Methods 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 238000001356 surgical procedure Methods 0.000 description 2
- 208000011580 syndromic disease Diseases 0.000 description 2
- 238000013422 transcriptome assay Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 239000008096 xylene Substances 0.000 description 2
- 150000003738 xylenes Chemical class 0.000 description 2
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 description 1
- HWFKCAFKXZFOQT-UHFFFAOYSA-N 1-(3,6-dibromocarbazol-9-yl)-3-piperazin-1-ylpropan-2-ol;dihydrochloride Chemical compound Cl.Cl.C12=CC=C(Br)C=C2C2=CC(Br)=CC=C2N1CC(O)CN1CCNCC1 HWFKCAFKXZFOQT-UHFFFAOYSA-N 0.000 description 1
- XQUPVDVFXZDTLT-UHFFFAOYSA-N 1-[4-[[4-(2,5-dioxopyrrol-1-yl)phenyl]methyl]phenyl]pyrrole-2,5-dione Chemical compound O=C1C=CC(=O)N1C(C=C1)=CC=C1CC1=CC=C(N2C(C=CC2=O)=O)C=C1 XQUPVDVFXZDTLT-UHFFFAOYSA-N 0.000 description 1
- 102100030389 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-2 Human genes 0.000 description 1
- 102100030388 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-3 Human genes 0.000 description 1
- 102100038366 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-4 Human genes 0.000 description 1
- 102100026210 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase gamma-2 Human genes 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- 108010041801 2',3'-Cyclic Nucleotide 3'-Phosphodiesterase Proteins 0.000 description 1
- GNENVASJJIUNER-UHFFFAOYSA-N 2,4,6-tricyclohexyloxy-1,3,5,2,4,6-trioxatriborinane Chemical compound C1CCCCC1OB1OB(OC2CCCCC2)OB(OC2CCCCC2)O1 GNENVASJJIUNER-UHFFFAOYSA-N 0.000 description 1
- BMYNFMYTOJXKLE-UHFFFAOYSA-N 3-azaniumyl-2-hydroxypropanoate Chemical compound NCC(O)C(O)=O BMYNFMYTOJXKLE-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- OLQIKGSZDTXODA-UHFFFAOYSA-N 4-[3-(4-hydroxy-2-methylphenyl)-1,1-dioxo-2,1$l^{6}-benzoxathiol-3-yl]-3-methylphenol Chemical compound CC1=CC(O)=CC=C1C1(C=2C(=CC(O)=CC=2)C)C2=CC=CC=C2S(=O)(=O)O1 OLQIKGSZDTXODA-UHFFFAOYSA-N 0.000 description 1
- 102100035923 4-aminobutyrate aminotransferase, mitochondrial Human genes 0.000 description 1
- 102100036009 5'-AMP-activated protein kinase catalytic subunit alpha-2 Human genes 0.000 description 1
- WBSMIPAMAXNXFS-UHFFFAOYSA-N 5-Nitro-2-(3-phenylpropylamino)benzoic acid Chemical compound OC(=O)C1=CC([N+]([O-])=O)=CC=C1NCCCC1=CC=CC=C1 WBSMIPAMAXNXFS-UHFFFAOYSA-N 0.000 description 1
- 108020004565 5.8S Ribosomal RNA Proteins 0.000 description 1
- 108020005075 5S Ribosomal RNA Proteins 0.000 description 1
- 101150039504 6 gene Proteins 0.000 description 1
- CDWGDLKZKCYUFO-UHFFFAOYSA-N 6-(trifluoromethyl)-1h-indole-2-carboxylic acid Chemical compound C1=C(C(F)(F)F)C=C2NC(C(=O)O)=CC2=C1 CDWGDLKZKCYUFO-UHFFFAOYSA-N 0.000 description 1
- 102100026802 72 kDa type IV collagenase Human genes 0.000 description 1
- 101150092476 ABCA1 gene Proteins 0.000 description 1
- 102100028220 ABI gene family member 3 Human genes 0.000 description 1
- 108091007504 ADAM10 Proteins 0.000 description 1
- 108091007505 ADAM17 Proteins 0.000 description 1
- 102000017920 ADRB1 Human genes 0.000 description 1
- 102000017919 ADRB2 Human genes 0.000 description 1
- 102100033793 ALK tyrosine kinase receptor Human genes 0.000 description 1
- 102100028777 AP-1 complex subunit sigma-1A Human genes 0.000 description 1
- 102100022974 AP-2 complex subunit alpha-2 Human genes 0.000 description 1
- 102100033347 AP-2 complex subunit beta Human genes 0.000 description 1
- 102100040058 AP-4 complex subunit sigma-1 Human genes 0.000 description 1
- 102000000872 ATM Human genes 0.000 description 1
- 108700005241 ATP Binding Cassette Transporter 1 Proteins 0.000 description 1
- 102100021921 ATP synthase subunit a Human genes 0.000 description 1
- 102100033350 ATP-dependent translocase ABCB1 Human genes 0.000 description 1
- 102100021176 ATP-sensitive inward rectifier potassium channel 10 Human genes 0.000 description 1
- 102100030913 Acetylcholine receptor subunit alpha Human genes 0.000 description 1
- 102100022144 Achaete-scute homolog 2 Human genes 0.000 description 1
- 102100030374 Actin, cytoplasmic 2 Human genes 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 1
- 102100034540 Adenomatous polyposis coli protein Human genes 0.000 description 1
- 102100033346 Adenosine receptor A1 Human genes 0.000 description 1
- 102100032161 Adenylate cyclase type 5 Human genes 0.000 description 1
- 102100032153 Adenylate cyclase type 8 Human genes 0.000 description 1
- 102100032156 Adenylate cyclase type 9 Human genes 0.000 description 1
- 102100032605 Adhesion G protein-coupled receptor B1 Human genes 0.000 description 1
- 102100031934 Adhesion G-protein coupled receptor G1 Human genes 0.000 description 1
- 102100027211 Albumin Human genes 0.000 description 1
- 102100040069 Aldehyde dehydrogenase 1A1 Human genes 0.000 description 1
- 102100040121 Allograft inflammatory factor 1 Human genes 0.000 description 1
- 102100034163 Alpha-actinin-1 Human genes 0.000 description 1
- 102100032047 Alsin Human genes 0.000 description 1
- 102100032044 Amphoterin-induced protein 1 Human genes 0.000 description 1
- 102100036439 Amyloid beta precursor protein binding family B member 1 Human genes 0.000 description 1
- 102000052587 Anaphase-Promoting Complex-Cyclosome Apc3 Subunit Human genes 0.000 description 1
- 108700004606 Anaphase-Promoting Complex-Cyclosome Apc3 Subunit Proteins 0.000 description 1
- 102100028118 Annexin A11 Human genes 0.000 description 1
- 102100023005 Anoctamin-3 Human genes 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- 102100033715 Apolipoprotein A-I Human genes 0.000 description 1
- 102100027308 Apoptosis regulator BAX Human genes 0.000 description 1
- 108050006685 Apoptosis regulator BAX Proteins 0.000 description 1
- 102100034524 Apoptotic protease-activating factor 1 Human genes 0.000 description 1
- 108010036280 Aquaporin 4 Proteins 0.000 description 1
- 102000012002 Aquaporin 4 Human genes 0.000 description 1
- 101100379160 Arabidopsis thaliana AP3M gene Proteins 0.000 description 1
- 101100339431 Arabidopsis thaliana HMGB2 gene Proteins 0.000 description 1
- 101100523940 Arabidopsis thaliana RAD23A gene Proteins 0.000 description 1
- 101100523944 Arabidopsis thaliana RAD23B gene Proteins 0.000 description 1
- 101100194005 Arabidopsis thaliana RAD23C gene Proteins 0.000 description 1
- 101100194006 Arabidopsis thaliana RAD23D gene Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 102100028493 Armadillo repeat-containing protein 10 Human genes 0.000 description 1
- DJHGAFSJWGLOIV-UHFFFAOYSA-N Arsenic acid Chemical compound O[As](O)(O)=O DJHGAFSJWGLOIV-UHFFFAOYSA-N 0.000 description 1
- 108010004586 Ataxia Telangiectasia Mutated Proteins Proteins 0.000 description 1
- 102000007371 Ataxin-3 Human genes 0.000 description 1
- 108010032947 Ataxin-3 Proteins 0.000 description 1
- 102000007368 Ataxin-7 Human genes 0.000 description 1
- 108010032953 Ataxin-7 Proteins 0.000 description 1
- 102100039339 Atrial natriuretic peptide receptor 1 Human genes 0.000 description 1
- 108010092776 Autophagy-Related Protein 5 Proteins 0.000 description 1
- 102000016614 Autophagy-Related Protein 5 Human genes 0.000 description 1
- 102100035656 BCL2/adenovirus E1B 19 kDa protein-interacting protein 3 Human genes 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 102100035080 BDNF/NT-3 growth factors receptor Human genes 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- 108091032955 Bacterial small RNA Proteins 0.000 description 1
- 102100021663 Baculoviral IAP repeat-containing protein 5 Human genes 0.000 description 1
- 102100022440 Battenin Human genes 0.000 description 1
- 102100026596 Bcl-2-like protein 1 Human genes 0.000 description 1
- 101150008012 Bcl2l1 gene Proteins 0.000 description 1
- 102100021251 Beclin-1 Human genes 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 102100034732 Beta-1,3-glucuronyltransferase LARGE1 Human genes 0.000 description 1
- 102100027386 Beta-1,4-galactosyltransferase 6 Human genes 0.000 description 1
- 102100029648 Beta-arrestin-2 Human genes 0.000 description 1
- 102100021257 Beta-secretase 1 Human genes 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 206010005949 Bone cancer Diseases 0.000 description 1
- 208000018084 Bone neoplasm Diseases 0.000 description 1
- 101001042041 Bos taurus Isocitrate dehydrogenase [NAD] subunit beta, mitochondrial Proteins 0.000 description 1
- 102000004219 Brain-derived neurotrophic factor Human genes 0.000 description 1
- 108090000715 Brain-derived neurotrophic factor Proteins 0.000 description 1
- 102100028243 Breast carcinoma-amplified sequence 1 Human genes 0.000 description 1
- 102100022595 Broad substrate specificity ATP-binding cassette transporter ABCG2 Human genes 0.000 description 1
- 102100031151 C-C chemokine receptor type 2 Human genes 0.000 description 1
- 101710149815 C-C chemokine receptor type 2 Proteins 0.000 description 1
- 102100035875 C-C chemokine receptor type 5 Human genes 0.000 description 1
- 101710149870 C-C chemokine receptor type 5 Proteins 0.000 description 1
- 102100032367 C-C motif chemokine 5 Human genes 0.000 description 1
- 102100031478 C-type natriuretic peptide Human genes 0.000 description 1
- 102100027209 CD2-associated protein Human genes 0.000 description 1
- 101150013553 CD40 gene Proteins 0.000 description 1
- 102100032912 CD44 antigen Human genes 0.000 description 1
- 102100037904 CD9 antigen Human genes 0.000 description 1
- 101150108242 CDC27 gene Proteins 0.000 description 1
- 102100030933 CDK-activating kinase assembly factor MAT1 Human genes 0.000 description 1
- 102100038451 CDK5 regulatory subunit-associated protein 2 Human genes 0.000 description 1
- 102100038460 CDK5 regulatory subunit-associated protein 3 Human genes 0.000 description 1
- 102000056162 CELF1 Human genes 0.000 description 1
- 108700015925 CELF1 Proteins 0.000 description 1
- 101150107790 CELF1 gene Proteins 0.000 description 1
- 101150085314 CERS4 gene Proteins 0.000 description 1
- 101150061077 CERS6 gene Proteins 0.000 description 1
- 102000017925 CHRM3 Human genes 0.000 description 1
- 102000017923 CHRM5 Human genes 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 102100025805 Cadherin-1 Human genes 0.000 description 1
- 102100036364 Cadherin-2 Human genes 0.000 description 1
- 101100452003 Caenorhabditis elegans ape-1 gene Proteins 0.000 description 1
- 102100021851 Calbindin Human genes 0.000 description 1
- 102100025588 Calcitonin gene-related peptide 1 Human genes 0.000 description 1
- 102100023073 Calcium-activated potassium channel subunit alpha-1 Human genes 0.000 description 1
- 102100022533 Calcium-binding protein 39 Human genes 0.000 description 1
- 102100032582 Calcium-dependent secretion activator 1 Human genes 0.000 description 1
- 102100022789 Calcium/calmodulin-dependent protein kinase type IV Human genes 0.000 description 1
- 102100025580 Calmodulin-1 Human genes 0.000 description 1
- 102100036419 Calmodulin-like protein 5 Human genes 0.000 description 1
- 102100025172 Calpain-1 catalytic subunit Human genes 0.000 description 1
- 102100021849 Calretinin Human genes 0.000 description 1
- 102100035605 Cas scaffolding protein family member 4 Human genes 0.000 description 1
- 108090000397 Caspase 3 Proteins 0.000 description 1
- 102100035904 Caspase-1 Human genes 0.000 description 1
- 102100029855 Caspase-3 Human genes 0.000 description 1
- 102100038918 Caspase-6 Human genes 0.000 description 1
- 102100038902 Caspase-7 Human genes 0.000 description 1
- 102100026548 Caspase-8 Human genes 0.000 description 1
- 102100026550 Caspase-9 Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 102100035888 Caveolin-1 Human genes 0.000 description 1
- 108091007854 Cdh1/Fizzy-related Proteins 0.000 description 1
- 102100024046 Cell adhesion molecule 3 Human genes 0.000 description 1
- 102100034787 Cell cycle exit and neuronal differentiation protein 1 Human genes 0.000 description 1
- 102100027047 Cell division control protein 6 homolog Human genes 0.000 description 1
- 108010051109 Cell-Penetrating Peptides Proteins 0.000 description 1
- 102000020313 Cell-Penetrating Peptides Human genes 0.000 description 1
- 102100023441 Centromere protein J Human genes 0.000 description 1
- 102100038741 Centrosomal protein of 135 kDa Human genes 0.000 description 1
- 101710117958 Centrosomal protein of 135 kDa Proteins 0.000 description 1
- 102100024503 Centrosomal protein of 41 kDa Human genes 0.000 description 1
- 101710193262 Centrosomal protein of 41 kDa Proteins 0.000 description 1
- 102100035401 Ceramide synthase 2 Human genes 0.000 description 1
- 102100035418 Ceramide synthase 4 Human genes 0.000 description 1
- 102100035434 Ceramide synthase 6 Human genes 0.000 description 1
- 102100025745 Cerberus Human genes 0.000 description 1
- 102100034505 Ceroid-lipofuscinosis neuronal protein 5 Human genes 0.000 description 1
- 101150087263 Cers2 gene Proteins 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 108010066813 Chitinase-3-Like Protein 1 Proteins 0.000 description 1
- 102100038196 Chitinase-3-like protein 1 Human genes 0.000 description 1
- 241000282552 Chlorocebus aethiops Species 0.000 description 1
- 101710147336 Choline/ethanolamine kinase Proteins 0.000 description 1
- 102100032404 Cholinesterase Human genes 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 102100040509 Chromatin modification-related protein MEAF6 Human genes 0.000 description 1
- 102100038214 Chromodomain-helicase-DNA-binding protein 4 Human genes 0.000 description 1
- 108010038447 Chromogranin A Proteins 0.000 description 1
- 102000010792 Chromogranin A Human genes 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 108091028075 Circular RNA Proteins 0.000 description 1
- 208000005443 Circulating Neoplastic Cells Diseases 0.000 description 1
- 102100039588 Claudin-15 Human genes 0.000 description 1
- 102100038446 Claudin-5 Human genes 0.000 description 1
- 102100023692 Coiled-coil-helix-coiled-coil-helix domain-containing protein 2 Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 206010009944 Colon cancer Diseases 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 102100032903 Copper chaperone for superoxide dismutase Human genes 0.000 description 1
- 102100022785 Creatine kinase B-type Human genes 0.000 description 1
- 102100023583 Cyclic AMP-dependent transcription factor ATF-6 alpha Human genes 0.000 description 1
- 108010058546 Cyclin D1 Proteins 0.000 description 1
- 102100036883 Cyclin-H Human genes 0.000 description 1
- 102100026865 Cyclin-dependent kinase 5 activator 1 Human genes 0.000 description 1
- 102100026810 Cyclin-dependent kinase 7 Human genes 0.000 description 1
- 102100034746 Cyclin-dependent kinase-like 5 Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 102100034560 Cytosol aminopeptidase Human genes 0.000 description 1
- 102100039077 Cytosolic 10-formyltetrahydrofolate dehydrogenase Human genes 0.000 description 1
- 230000005778 DNA damage Effects 0.000 description 1
- 231100000277 DNA damage Toxicity 0.000 description 1
- 102100037373 DNA-(apurinic or apyrimidinic site) endonuclease Human genes 0.000 description 1
- 101000802964 Dendroaspis angusticeps Muscarinic toxin 1 Proteins 0.000 description 1
- 108010082610 Deoxyribonuclease (Pyrimidine Dimer) Proteins 0.000 description 1
- 102000004099 Deoxyribonuclease (Pyrimidine Dimer) Human genes 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- 101100216227 Dictyostelium discoideum anapc3 gene Proteins 0.000 description 1
- 101100086373 Dictyostelium discoideum rcbA gene Proteins 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- RWSOTUBLDIXVET-UHFFFAOYSA-N Dihydrogen sulfide Chemical compound S RWSOTUBLDIXVET-UHFFFAOYSA-N 0.000 description 1
- 102100039673 Disintegrin and metalloproteinase domain-containing protein 10 Human genes 0.000 description 1
- 102100031113 Disintegrin and metalloproteinase domain-containing protein 15 Human genes 0.000 description 1
- 102100031111 Disintegrin and metalloproteinase domain-containing protein 17 Human genes 0.000 description 1
- 201000010374 Down Syndrome Diseases 0.000 description 1
- 102100031480 Dual specificity mitogen-activated protein kinase kinase 1 Human genes 0.000 description 1
- 102100023266 Dual specificity mitogen-activated protein kinase kinase 2 Human genes 0.000 description 1
- 102000012199 E3 ubiquitin-protein ligase Mdm2 Human genes 0.000 description 1
- 108050002772 E3 ubiquitin-protein ligase Mdm2 Proteins 0.000 description 1
- 102100039368 ER lumen protein-retaining receptor 2 Human genes 0.000 description 1
- 101150029707 ERBB2 gene Proteins 0.000 description 1
- 102100029722 Ectonucleoside triphosphate diphosphohydrolase 1 Human genes 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000283070 Equus zebra Species 0.000 description 1
- 229910052691 Erbium Inorganic materials 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 102100024165 G1/S-specific cyclin-D1 Human genes 0.000 description 1
- 102100033512 GTP:AMP phosphotransferase AK3, mitochondrial Human genes 0.000 description 1
- 102100030708 GTPase KRas Human genes 0.000 description 1
- 108010051815 Glutamyl endopeptidase Proteins 0.000 description 1
- 102100034473 H(+)/Cl(-) exchange transporter 6 Human genes 0.000 description 1
- 102100040485 HLA class II histocompatibility antigen, DRB1 beta chain Human genes 0.000 description 1
- 108010039343 HLA-DRB1 Chains Proteins 0.000 description 1
- 108700010013 HMGB1 Proteins 0.000 description 1
- 101150021904 HMGB1 gene Proteins 0.000 description 1
- 101150096895 HSPB1 gene Proteins 0.000 description 1
- 101150017422 HTR1 gene Proteins 0.000 description 1
- 102100028761 Heat shock 70 kDa protein 6 Human genes 0.000 description 1
- 102100039165 Heat shock protein beta-1 Human genes 0.000 description 1
- 208000002250 Hematologic Neoplasms Diseases 0.000 description 1
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 1
- 102100028006 Heme oxygenase 1 Human genes 0.000 description 1
- 241000711549 Hepacivirus C Species 0.000 description 1
- 102100028895 Heterogeneous nuclear ribonucleoprotein M Human genes 0.000 description 1
- 102100035616 Heterogeneous nuclear ribonucleoproteins A2/B1 Human genes 0.000 description 1
- 102100037907 High mobility group protein B1 Human genes 0.000 description 1
- 230000004655 Hippo pathway Effects 0.000 description 1
- 102100028993 Hippocalcin-like protein 1 Human genes 0.000 description 1
- 102100039542 Homeobox protein Hox-A2 Human genes 0.000 description 1
- 102100027893 Homeobox protein Nkx-2.1 Human genes 0.000 description 1
- 102100028096 Homeobox protein Nkx-6.2 Human genes 0.000 description 1
- 102100023607 Homer protein homolog 1 Human genes 0.000 description 1
- 101000583066 Homo sapiens 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-2 Proteins 0.000 description 1
- 101000583069 Homo sapiens 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-3 Proteins 0.000 description 1
- 101000605565 Homo sapiens 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-4 Proteins 0.000 description 1
- 101000691589 Homo sapiens 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase gamma-2 Proteins 0.000 description 1
- 101001000686 Homo sapiens 4-aminobutyrate aminotransferase, mitochondrial Proteins 0.000 description 1
- 101000783681 Homo sapiens 5'-AMP-activated protein kinase catalytic subunit alpha-2 Proteins 0.000 description 1
- 101000627872 Homo sapiens 72 kDa type IV collagenase Proteins 0.000 description 1
- 101000724234 Homo sapiens ABI gene family member 3 Proteins 0.000 description 1
- 101000779641 Homo sapiens ALK tyrosine kinase receptor Proteins 0.000 description 1
- 101000768000 Homo sapiens AP-1 complex subunit sigma-1A Proteins 0.000 description 1
- 101000757394 Homo sapiens AP-2 complex subunit alpha-2 Proteins 0.000 description 1
- 101000732341 Homo sapiens AP-2 complex subunit beta Proteins 0.000 description 1
- 101000890244 Homo sapiens AP-4 complex subunit sigma-1 Proteins 0.000 description 1
- 101000753741 Homo sapiens ATP synthase subunit a Proteins 0.000 description 1
- 101000722210 Homo sapiens ATP-dependent DNA helicase DDX11 Proteins 0.000 description 1
- 101000614696 Homo sapiens ATP-sensitive inward rectifier potassium channel 10 Proteins 0.000 description 1
- 101000726895 Homo sapiens Acetylcholine receptor subunit alpha Proteins 0.000 description 1
- 101000901109 Homo sapiens Achaete-scute homolog 2 Proteins 0.000 description 1
- 101000773237 Homo sapiens Actin, cytoplasmic 2 Proteins 0.000 description 1
- 101000924577 Homo sapiens Adenomatous polyposis coli protein Proteins 0.000 description 1
- 101000799712 Homo sapiens Adenosine receptor A1 Proteins 0.000 description 1
- 101000775478 Homo sapiens Adenylate cyclase type 5 Proteins 0.000 description 1
- 101000775481 Homo sapiens Adenylate cyclase type 8 Proteins 0.000 description 1
- 101000775499 Homo sapiens Adenylate cyclase type 9 Proteins 0.000 description 1
- 101000614487 Homo sapiens Adenylate kinase 4, mitochondrial Proteins 0.000 description 1
- 101000796780 Homo sapiens Adhesion G protein-coupled receptor B1 Proteins 0.000 description 1
- 101000775042 Homo sapiens Adhesion G-protein coupled receptor G1 Proteins 0.000 description 1
- 101000693913 Homo sapiens Albumin Proteins 0.000 description 1
- 101000890570 Homo sapiens Aldehyde dehydrogenase 1A1 Proteins 0.000 description 1
- 101000890626 Homo sapiens Allograft inflammatory factor 1 Proteins 0.000 description 1
- 101000799406 Homo sapiens Alpha-actinin-1 Proteins 0.000 description 1
- 101000690235 Homo sapiens Alpha-aminoadipic semialdehyde dehydrogenase Proteins 0.000 description 1
- 101000776160 Homo sapiens Alsin Proteins 0.000 description 1
- 101000776170 Homo sapiens Amphoterin-induced protein 1 Proteins 0.000 description 1
- 101000928670 Homo sapiens Amyloid beta precursor protein binding family B member 1 Proteins 0.000 description 1
- 101000768066 Homo sapiens Annexin A11 Proteins 0.000 description 1
- 101000757285 Homo sapiens Anoctamin-3 Proteins 0.000 description 1
- 101000733802 Homo sapiens Apolipoprotein A-I Proteins 0.000 description 1
- 101000924629 Homo sapiens Apoptotic protease-activating factor 1 Proteins 0.000 description 1
- 101000769233 Homo sapiens Armadillo repeat-containing protein 10 Proteins 0.000 description 1
- 101000961044 Homo sapiens Atrial natriuretic peptide receptor 1 Proteins 0.000 description 1
- 101000803294 Homo sapiens BCL2/adenovirus E1B 19 kDa protein-interacting protein 3 Proteins 0.000 description 1
- 101000596896 Homo sapiens BDNF/NT-3 growth factors receptor Proteins 0.000 description 1
- 101000901683 Homo sapiens Battenin Proteins 0.000 description 1
- 101000894649 Homo sapiens Beclin-1 Proteins 0.000 description 1
- 101000892264 Homo sapiens Beta-1 adrenergic receptor Proteins 0.000 description 1
- 101001090662 Homo sapiens Beta-1,3-glucuronyltransferase LARGE1 Proteins 0.000 description 1
- 101000937502 Homo sapiens Beta-1,4-galactosyltransferase 6 Proteins 0.000 description 1
- 101000959437 Homo sapiens Beta-2 adrenergic receptor Proteins 0.000 description 1
- 101000894895 Homo sapiens Beta-secretase 1 Proteins 0.000 description 1
- 101000894883 Homo sapiens Beta-secretase 2 Proteins 0.000 description 1
- 101000935635 Homo sapiens Breast carcinoma-amplified sequence 1 Proteins 0.000 description 1
- 101000797762 Homo sapiens C-C motif chemokine 5 Proteins 0.000 description 1
- 101100287682 Homo sapiens CAMK2G gene Proteins 0.000 description 1
- 101100126883 Homo sapiens CAMK4 gene Proteins 0.000 description 1
- 101000914499 Homo sapiens CD2-associated protein Proteins 0.000 description 1
- 101000868273 Homo sapiens CD44 antigen Proteins 0.000 description 1
- 101000738354 Homo sapiens CD9 antigen Proteins 0.000 description 1
- 101000583935 Homo sapiens CDK-activating kinase assembly factor MAT1 Proteins 0.000 description 1
- 101000882873 Homo sapiens CDK5 regulatory subunit-associated protein 2 Proteins 0.000 description 1
- 101000882982 Homo sapiens CDK5 regulatory subunit-associated protein 3 Proteins 0.000 description 1
- 101000714537 Homo sapiens Cadherin-2 Proteins 0.000 description 1
- 101000898082 Homo sapiens Calbindin Proteins 0.000 description 1
- 101000741445 Homo sapiens Calcitonin Proteins 0.000 description 1
- 101000932890 Homo sapiens Calcitonin gene-related peptide 1 Proteins 0.000 description 1
- 101001049859 Homo sapiens Calcium-activated potassium channel subunit alpha-1 Proteins 0.000 description 1
- 101000899411 Homo sapiens Calcium-binding protein 39 Proteins 0.000 description 1
- 101000867747 Homo sapiens Calcium-dependent secretion activator 1 Proteins 0.000 description 1
- 101000984164 Homo sapiens Calmodulin-1 Proteins 0.000 description 1
- 101000714353 Homo sapiens Calmodulin-like protein 5 Proteins 0.000 description 1
- 101000934069 Homo sapiens Calpain-1 catalytic subunit Proteins 0.000 description 1
- 101000898072 Homo sapiens Calretinin Proteins 0.000 description 1
- 101000710899 Homo sapiens Cannabinoid receptor 1 Proteins 0.000 description 1
- 101000947106 Homo sapiens Cas scaffolding protein family member 4 Proteins 0.000 description 1
- 101000715398 Homo sapiens Caspase-1 Proteins 0.000 description 1
- 101000741087 Homo sapiens Caspase-6 Proteins 0.000 description 1
- 101000741014 Homo sapiens Caspase-7 Proteins 0.000 description 1
- 101000983528 Homo sapiens Caspase-8 Proteins 0.000 description 1
- 101000983523 Homo sapiens Caspase-9 Proteins 0.000 description 1
- 101000715467 Homo sapiens Caveolin-1 Proteins 0.000 description 1
- 101000910449 Homo sapiens Cell adhesion molecule 3 Proteins 0.000 description 1
- 101000945882 Homo sapiens Cell cycle exit and neuronal differentiation protein 1 Proteins 0.000 description 1
- 101000914465 Homo sapiens Cell division control protein 6 homolog Proteins 0.000 description 1
- 101000907924 Homo sapiens Centromere protein J Proteins 0.000 description 1
- 101000914195 Homo sapiens Cerberus Proteins 0.000 description 1
- 101000710208 Homo sapiens Ceroid-lipofuscinosis neuronal protein 5 Proteins 0.000 description 1
- 101000943274 Homo sapiens Cholinesterase Proteins 0.000 description 1
- 101000817406 Homo sapiens Chromatin modification-related protein MEAF6 Proteins 0.000 description 1
- 101000883749 Homo sapiens Chromodomain-helicase-DNA-binding protein 4 Proteins 0.000 description 1
- 101000888605 Homo sapiens Claudin-15 Proteins 0.000 description 1
- 101000882896 Homo sapiens Claudin-5 Proteins 0.000 description 1
- 101000906986 Homo sapiens Coiled-coil-helix-coiled-coil-helix domain-containing protein 2 Proteins 0.000 description 1
- 101000868241 Homo sapiens Copper chaperone for superoxide dismutase Proteins 0.000 description 1
- 101000905751 Homo sapiens Cyclic AMP-dependent transcription factor ATF-6 alpha Proteins 0.000 description 1
- 101000713120 Homo sapiens Cyclin-H Proteins 0.000 description 1
- 101000911952 Homo sapiens Cyclin-dependent kinase 7 Proteins 0.000 description 1
- 101000945692 Homo sapiens Cyclin-dependent kinase-like 5 Proteins 0.000 description 1
- 101000924389 Homo sapiens Cytosol aminopeptidase Proteins 0.000 description 1
- 101000959030 Homo sapiens Cytosolic 10-formyltetrahydrofolate dehydrogenase Proteins 0.000 description 1
- 101000804964 Homo sapiens DNA polymerase subunit gamma-1 Proteins 0.000 description 1
- 101000806846 Homo sapiens DNA-(apurinic or apyrimidinic site) endonuclease Proteins 0.000 description 1
- 101000777455 Homo sapiens Disintegrin and metalloproteinase domain-containing protein 15 Proteins 0.000 description 1
- 101000722054 Homo sapiens Dynamin-like 120 kDa protein, mitochondrial Proteins 0.000 description 1
- 101000812465 Homo sapiens ER lumen protein-retaining receptor 2 Proteins 0.000 description 1
- 101001012447 Homo sapiens Ectonucleoside triphosphate diphosphohydrolase 1 Proteins 0.000 description 1
- 101001065295 Homo sapiens Fas-binding factor 1 Proteins 0.000 description 1
- 101000998053 Homo sapiens GTP:AMP phosphotransferase AK3, mitochondrial Proteins 0.000 description 1
- 101000584612 Homo sapiens GTPase KRas Proteins 0.000 description 1
- 101000876511 Homo sapiens General transcription and DNA repair factor IIH helicase subunit XPD Proteins 0.000 description 1
- 101000710240 Homo sapiens H(+)/Cl(-) exchange transporter 6 Proteins 0.000 description 1
- 101001078680 Homo sapiens Heat shock 70 kDa protein 6 Proteins 0.000 description 1
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 1
- 101001079623 Homo sapiens Heme oxygenase 1 Proteins 0.000 description 1
- 101000839073 Homo sapiens Heterogeneous nuclear ribonucleoprotein M Proteins 0.000 description 1
- 101000854026 Homo sapiens Heterogeneous nuclear ribonucleoproteins A2/B1 Proteins 0.000 description 1
- 101000838883 Homo sapiens Hippocalcin-like protein 1 Proteins 0.000 description 1
- 101000962636 Homo sapiens Homeobox protein Hox-A2 Proteins 0.000 description 1
- 101000632178 Homo sapiens Homeobox protein Nkx-2.1 Proteins 0.000 description 1
- 101000578258 Homo sapiens Homeobox protein Nkx-6.2 Proteins 0.000 description 1
- 101001048469 Homo sapiens Homer protein homolog 1 Proteins 0.000 description 1
- 101000977643 Homo sapiens Immunoglobulin superfamily containing leucine-rich repeat protein 2 Proteins 0.000 description 1
- 101000691610 Homo sapiens Inactive phospholipase C-like protein 2 Proteins 0.000 description 1
- 101001037256 Homo sapiens Indoleamine 2,3-dioxygenase 1 Proteins 0.000 description 1
- 101000975421 Homo sapiens Inositol 1,4,5-trisphosphate receptor type 2 Proteins 0.000 description 1
- 101000975401 Homo sapiens Inositol 1,4,5-trisphosphate receptor type 3 Proteins 0.000 description 1
- 101001044342 Homo sapiens Insulin-degrading enzyme Proteins 0.000 description 1
- 101000599951 Homo sapiens Insulin-like growth factor I Proteins 0.000 description 1
- 101001044940 Homo sapiens Insulin-like growth factor-binding protein 2 Proteins 0.000 description 1
- 101000994369 Homo sapiens Integrin alpha-5 Proteins 0.000 description 1
- 101000599852 Homo sapiens Intercellular adhesion molecule 1 Proteins 0.000 description 1
- 101000599858 Homo sapiens Intercellular adhesion molecule 2 Proteins 0.000 description 1
- 101000599940 Homo sapiens Interferon gamma Proteins 0.000 description 1
- 101001032345 Homo sapiens Interferon regulatory factor 8 Proteins 0.000 description 1
- 101001076407 Homo sapiens Interleukin-1 receptor antagonist protein Proteins 0.000 description 1
- 101001076418 Homo sapiens Interleukin-1 receptor type 1 Proteins 0.000 description 1
- 101001083151 Homo sapiens Interleukin-10 receptor subunit alpha Proteins 0.000 description 1
- 101001003135 Homo sapiens Interleukin-13 receptor subunit alpha-1 Proteins 0.000 description 1
- 101001003140 Homo sapiens Interleukin-15 receptor subunit alpha Proteins 0.000 description 1
- 101000960234 Homo sapiens Isocitrate dehydrogenase [NADP] cytoplasmic Proteins 0.000 description 1
- 101000599886 Homo sapiens Isocitrate dehydrogenase [NADP], mitochondrial Proteins 0.000 description 1
- 101001050321 Homo sapiens Junctional adhesion molecule C Proteins 0.000 description 1
- 101000992298 Homo sapiens Kappa-type opioid receptor Proteins 0.000 description 1
- 101000604641 Homo sapiens Katanin p60 ATPase-containing subunit A1 Proteins 0.000 description 1
- 101000971879 Homo sapiens Kell blood group glycoprotein Proteins 0.000 description 1
- 101000971521 Homo sapiens Kinetochore scaffold 1 Proteins 0.000 description 1
- 101001003581 Homo sapiens Lamin-B1 Proteins 0.000 description 1
- 101000972491 Homo sapiens Laminin subunit alpha-2 Proteins 0.000 description 1
- 101001008558 Homo sapiens Laminin subunit beta-2 Proteins 0.000 description 1
- 101000946306 Homo sapiens Laminin subunit gamma-1 Proteins 0.000 description 1
- 101000620451 Homo sapiens Leucine-rich glioma-inactivated protein 1 Proteins 0.000 description 1
- 101000981680 Homo sapiens Leucine-rich repeat and immunoglobulin-like domain-containing nogo receptor-interacting protein 1 Proteins 0.000 description 1
- 101001039157 Homo sapiens Leucine-rich repeat-containing protein 25 Proteins 0.000 description 1
- 101001017833 Homo sapiens Leucine-rich repeat-containing protein 4 Proteins 0.000 description 1
- 101000579805 Homo sapiens Leucine-rich repeat-containing protein 61 Proteins 0.000 description 1
- 101000692954 Homo sapiens Lysine-specific demethylase PHF2 Proteins 0.000 description 1
- 101001047677 Homo sapiens Lysocardiolipin acyltransferase 1 Proteins 0.000 description 1
- 101000966782 Homo sapiens Lysophosphatidic acid receptor 1 Proteins 0.000 description 1
- 101000979046 Homo sapiens Lysosomal alpha-mannosidase Proteins 0.000 description 1
- 101001059427 Homo sapiens MAP/microtubule affinity-regulating kinase 4 Proteins 0.000 description 1
- 101100456626 Homo sapiens MEF2A gene Proteins 0.000 description 1
- 101000577881 Homo sapiens Macrophage metalloelastase Proteins 0.000 description 1
- 101000934372 Homo sapiens Macrosialin Proteins 0.000 description 1
- 101000575454 Homo sapiens Major facilitator superfamily domain-containing protein 8 Proteins 0.000 description 1
- 101001011906 Homo sapiens Matrix metalloproteinase-14 Proteins 0.000 description 1
- 101001011886 Homo sapiens Matrix metalloproteinase-16 Proteins 0.000 description 1
- 101001011896 Homo sapiens Matrix metalloproteinase-19 Proteins 0.000 description 1
- 101000627858 Homo sapiens Matrix metalloproteinase-24 Proteins 0.000 description 1
- 101001120868 Homo sapiens Meckel syndrome type 1 protein Proteins 0.000 description 1
- 101000955333 Homo sapiens Mediator of RNA polymerase II transcription subunit 10 Proteins 0.000 description 1
- 101000614988 Homo sapiens Mediator of RNA polymerase II transcription subunit 12 Proteins 0.000 description 1
- 101001057156 Homo sapiens Melanoma-associated antigen C2 Proteins 0.000 description 1
- 101001057137 Homo sapiens Melanoma-associated antigen E1 Proteins 0.000 description 1
- 101000587539 Homo sapiens Metallothionein-1A Proteins 0.000 description 1
- 101001027956 Homo sapiens Metallothionein-1B Proteins 0.000 description 1
- 101001027945 Homo sapiens Metallothionein-1E Proteins 0.000 description 1
- 101001027943 Homo sapiens Metallothionein-1F Proteins 0.000 description 1
- 101001027938 Homo sapiens Metallothionein-1G Proteins 0.000 description 1
- 101001013794 Homo sapiens Metallothionein-1H Proteins 0.000 description 1
- 101001013797 Homo sapiens Metallothionein-1L Proteins 0.000 description 1
- 101001013796 Homo sapiens Metallothionein-1M Proteins 0.000 description 1
- 101001013799 Homo sapiens Metallothionein-1X Proteins 0.000 description 1
- 101001028019 Homo sapiens Metastasis-associated protein MTA2 Proteins 0.000 description 1
- 101000615613 Homo sapiens Mineralocorticoid receptor Proteins 0.000 description 1
- 101001052493 Homo sapiens Mitogen-activated protein kinase 1 Proteins 0.000 description 1
- 101000628949 Homo sapiens Mitogen-activated protein kinase 10 Proteins 0.000 description 1
- 101001052490 Homo sapiens Mitogen-activated protein kinase 3 Proteins 0.000 description 1
- 101000950695 Homo sapiens Mitogen-activated protein kinase 8 Proteins 0.000 description 1
- 101000950669 Homo sapiens Mitogen-activated protein kinase 9 Proteins 0.000 description 1
- 101000987094 Homo sapiens Moesin Proteins 0.000 description 1
- 101001122476 Homo sapiens Mu-type opioid receptor Proteins 0.000 description 1
- 101001114675 Homo sapiens Multimerin-2 Proteins 0.000 description 1
- 101000928919 Homo sapiens Muscarinic acetylcholine receptor M3 Proteins 0.000 description 1
- 101000720516 Homo sapiens Muscarinic acetylcholine receptor M5 Proteins 0.000 description 1
- 101000970561 Homo sapiens Myc box-dependent-interacting protein 1 Proteins 0.000 description 1
- 101000982010 Homo sapiens Myelin proteolipid protein Proteins 0.000 description 1
- 101000616778 Homo sapiens Myelin-associated glycoprotein Proteins 0.000 description 1
- 101000934338 Homo sapiens Myeloid cell surface antigen CD33 Proteins 0.000 description 1
- 101001030182 Homo sapiens Myogenesis-regulating glycosidase Proteins 0.000 description 1
- 101001000109 Homo sapiens Myosin-10 Proteins 0.000 description 1
- 101000991534 Homo sapiens NACHT domain- and WD repeat-containing protein 1 Proteins 0.000 description 1
- 101000604411 Homo sapiens NADH-ubiquinone oxidoreductase chain 1 Proteins 0.000 description 1
- 101001124388 Homo sapiens NPC intracellular cholesterol transporter 1 Proteins 0.000 description 1
- 101001109579 Homo sapiens NPC intracellular cholesterol transporter 2 Proteins 0.000 description 1
- 101001128158 Homo sapiens Nanos homolog 2 Proteins 0.000 description 1
- 101001128156 Homo sapiens Nanos homolog 3 Proteins 0.000 description 1
- 101000928278 Homo sapiens Natriuretic peptides B Proteins 0.000 description 1
- 101000979297 Homo sapiens Negative elongation factor A Proteins 0.000 description 1
- 101000636823 Homo sapiens Neogenin Proteins 0.000 description 1
- 101000978743 Homo sapiens Nephrocystin-1 Proteins 0.000 description 1
- 101000604463 Homo sapiens Netrin-G1 Proteins 0.000 description 1
- 101000604469 Homo sapiens Netrin-G2 Proteins 0.000 description 1
- 101001112222 Homo sapiens Neural cell adhesion molecule L1-like protein Proteins 0.000 description 1
- 101001108436 Homo sapiens Neurexin-1 Proteins 0.000 description 1
- 101001108433 Homo sapiens Neurexin-1-beta Proteins 0.000 description 1
- 101000996111 Homo sapiens Neuroligin-4, X-linked Proteins 0.000 description 1
- 101000604168 Homo sapiens Neuromedin-B Proteins 0.000 description 1
- 101000603399 Homo sapiens Neuronal PAS domain-containing protein 4 Proteins 0.000 description 1
- 101000745163 Homo sapiens Neuronal acetylcholine receptor subunit alpha-3 Proteins 0.000 description 1
- 101000745167 Homo sapiens Neuronal acetylcholine receptor subunit alpha-4 Proteins 0.000 description 1
- 101000745175 Homo sapiens Neuronal acetylcholine receptor subunit alpha-5 Proteins 0.000 description 1
- 101000822103 Homo sapiens Neuronal acetylcholine receptor subunit alpha-7 Proteins 0.000 description 1
- 101000726901 Homo sapiens Neuronal acetylcholine receptor subunit beta-2 Proteins 0.000 description 1
- 101000602176 Homo sapiens Neurotensin/neuromedin N Proteins 0.000 description 1
- 101000634196 Homo sapiens Neurotrophin-3 Proteins 0.000 description 1
- 101001112229 Homo sapiens Neutrophil cytosol factor 1 Proteins 0.000 description 1
- 101000996058 Homo sapiens Nicotinamide/nicotinic acid mononucleotide adenylyltransferase 2 Proteins 0.000 description 1
- 101001124309 Homo sapiens Nitric oxide synthase, endothelial Proteins 0.000 description 1
- 101001124991 Homo sapiens Nitric oxide synthase, inducible Proteins 0.000 description 1
- 101000603107 Homo sapiens Noelin-3 Proteins 0.000 description 1
- 101000604123 Homo sapiens Noggin Proteins 0.000 description 1
- 101000979342 Homo sapiens Nuclear factor NF-kappa-B p105 subunit Proteins 0.000 description 1
- 101000896414 Homo sapiens Nuclear nucleic acid-binding protein C1D Proteins 0.000 description 1
- 101001109698 Homo sapiens Nuclear receptor subfamily 4 group A member 2 Proteins 0.000 description 1
- 101000973960 Homo sapiens Nucleolar protein 3 Proteins 0.000 description 1
- 101001128742 Homo sapiens Nucleoside diphosphate kinase homolog 5 Proteins 0.000 description 1
- 101000721384 Homo sapiens OTU domain-containing protein 4 Proteins 0.000 description 1
- 101001086785 Homo sapiens Occludin Proteins 0.000 description 1
- 101001137060 Homo sapiens Oligophrenin-1 Proteins 0.000 description 1
- 101000586302 Homo sapiens Oncostatin-M-specific receptor subunit beta Proteins 0.000 description 1
- 101000992164 Homo sapiens One cut domain family member 2 Proteins 0.000 description 1
- 101000992283 Homo sapiens Optineurin Proteins 0.000 description 1
- 101000986786 Homo sapiens Orexin/Hypocretin receptor type 1 Proteins 0.000 description 1
- 101001134134 Homo sapiens Oxidation resistance protein 1 Proteins 0.000 description 1
- 101001098179 Homo sapiens P2X purinoceptor 4 Proteins 0.000 description 1
- 101001098175 Homo sapiens P2X purinoceptor 7 Proteins 0.000 description 1
- 101001120086 Homo sapiens P2Y purinoceptor 12 Proteins 0.000 description 1
- 101001071236 Homo sapiens PHD finger protein 19 Proteins 0.000 description 1
- 101000595929 Homo sapiens POLG alternative reading frame Proteins 0.000 description 1
- 101001094700 Homo sapiens POU domain, class 5, transcription factor 1 Proteins 0.000 description 1
- 101000613577 Homo sapiens Paired box protein Pax-2 Proteins 0.000 description 1
- 101000613490 Homo sapiens Paired box protein Pax-3 Proteins 0.000 description 1
- 101000981502 Homo sapiens Pantothenate kinase 2, mitochondrial Proteins 0.000 description 1
- 101000735228 Homo sapiens Paralemmin-1 Proteins 0.000 description 1
- 101000987581 Homo sapiens Perforin-1 Proteins 0.000 description 1
- 101001000631 Homo sapiens Peripheral myelin protein 22 Proteins 0.000 description 1
- 101000922137 Homo sapiens Peripheral plasma membrane protein CASK Proteins 0.000 description 1
- 101000987578 Homo sapiens Peripherin Proteins 0.000 description 1
- 101001082860 Homo sapiens Peroxisomal membrane protein 2 Proteins 0.000 description 1
- 101000741790 Homo sapiens Peroxisome proliferator-activated receptor gamma Proteins 0.000 description 1
- 101001120056 Homo sapiens Phosphatidylinositol 3-kinase regulatory subunit alpha Proteins 0.000 description 1
- 101000595741 Homo sapiens Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit beta isoform Proteins 0.000 description 1
- 101001102158 Homo sapiens Phosphatidylserine synthase 1 Proteins 0.000 description 1
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 101000600387 Homo sapiens Phosphoglycerate mutase 1 Proteins 0.000 description 1
- 101000595800 Homo sapiens Phospholipase A and acyltransferase 3 Proteins 0.000 description 1
- 101000929663 Homo sapiens Phospholipid-transporting ATPase ABCA7 Proteins 0.000 description 1
- 101000735539 Homo sapiens Pituitary adenylate cyclase-activating polypeptide Proteins 0.000 description 1
- 101000728115 Homo sapiens Plasma membrane calcium-transporting ATPase 3 Proteins 0.000 description 1
- 101000582929 Homo sapiens Plasmolipin Proteins 0.000 description 1
- 101000596041 Homo sapiens Plastin-1 Proteins 0.000 description 1
- 101001116302 Homo sapiens Platelet endothelial cell adhesion molecule Proteins 0.000 description 1
- 101001064282 Homo sapiens Platelet-activating factor acetylhydrolase IB subunit beta Proteins 0.000 description 1
- 101001001799 Homo sapiens Pleckstrin homology domain-containing family O member 2 Proteins 0.000 description 1
- 101001067168 Homo sapiens Plexin-B3 Proteins 0.000 description 1
- 101001094872 Homo sapiens Plexin-C1 Proteins 0.000 description 1
- 101000887201 Homo sapiens Polyamine-transporting ATPase 13A2 Proteins 0.000 description 1
- 101000872170 Homo sapiens Polycomb complex protein BMI-1 Proteins 0.000 description 1
- 101000611427 Homo sapiens Polyglutamine-binding protein 1 Proteins 0.000 description 1
- 101001064864 Homo sapiens Polyunsaturated fatty acid lipoxygenase ALOX12 Proteins 0.000 description 1
- 101001050878 Homo sapiens Potassium channel subfamily K member 9 Proteins 0.000 description 1
- 101000994626 Homo sapiens Potassium voltage-gated channel subfamily A member 1 Proteins 0.000 description 1
- 101000997292 Homo sapiens Potassium voltage-gated channel subfamily B member 1 Proteins 0.000 description 1
- 101001135496 Homo sapiens Potassium voltage-gated channel subfamily C member 3 Proteins 0.000 description 1
- 101001135471 Homo sapiens Potassium voltage-gated channel subfamily D member 3 Proteins 0.000 description 1
- 101001122801 Homo sapiens Pre-mRNA-processing factor 17 Proteins 0.000 description 1
- 101000642431 Homo sapiens Pre-mRNA-splicing factor SPF27 Proteins 0.000 description 1
- 101001003584 Homo sapiens Prelamin-A/C Proteins 0.000 description 1
- 101000617536 Homo sapiens Presenilin-1 Proteins 0.000 description 1
- 101000617546 Homo sapiens Presenilin-2 Proteins 0.000 description 1
- 101001018494 Homo sapiens Pro-MCH Proteins 0.000 description 1
- 101000690940 Homo sapiens Pro-adrenomedullin Proteins 0.000 description 1
- 101000725565 Homo sapiens Pro-opiomelanocortin Proteins 0.000 description 1
- 101000874919 Homo sapiens Probable arginine-tRNA ligase, mitochondrial Proteins 0.000 description 1
- 101001000545 Homo sapiens Probable hydrolase PNKD Proteins 0.000 description 1
- 101000600395 Homo sapiens Probable phosphoglycerate mutase 4 Proteins 0.000 description 1
- 101000595907 Homo sapiens Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 Proteins 0.000 description 1
- 101000577619 Homo sapiens Profilin-1 Proteins 0.000 description 1
- 101000690268 Homo sapiens Proline-rich AKT1 substrate 1 Proteins 0.000 description 1
- 101000577696 Homo sapiens Proline-rich transmembrane protein 2 Proteins 0.000 description 1
- 101001043564 Homo sapiens Prolow-density lipoprotein receptor-related protein 1 Proteins 0.000 description 1
- 101000614345 Homo sapiens Prolyl 4-hydroxylase subunit alpha-1 Proteins 0.000 description 1
- 101001136986 Homo sapiens Proteasome subunit beta type-8 Proteins 0.000 description 1
- 101001136981 Homo sapiens Proteasome subunit beta type-9 Proteins 0.000 description 1
- 101000710213 Homo sapiens Protein CLN8 Proteins 0.000 description 1
- 101000995264 Homo sapiens Protein kinase C-binding protein NELL2 Proteins 0.000 description 1
- 101001100767 Homo sapiens Protein quaking Proteins 0.000 description 1
- 101001116937 Homo sapiens Protocadherin alpha-4 Proteins 0.000 description 1
- 101001072243 Homo sapiens Protocadherin-19 Proteins 0.000 description 1
- 101000730612 Homo sapiens Puratrophin-1 Proteins 0.000 description 1
- 101000798015 Homo sapiens RAC-beta serine/threonine-protein kinase Proteins 0.000 description 1
- 101000798007 Homo sapiens RAC-gamma serine/threonine-protein kinase Proteins 0.000 description 1
- 101001069891 Homo sapiens RAS guanyl-releasing protein 1 Proteins 0.000 description 1
- 101000591115 Homo sapiens RNA-binding protein Musashi homolog 1 Proteins 0.000 description 1
- 101000604114 Homo sapiens RNA-binding protein Nova-1 Proteins 0.000 description 1
- 101000620554 Homo sapiens Ras-related protein Rab-38 Proteins 0.000 description 1
- 101000579226 Homo sapiens Renin receptor Proteins 0.000 description 1
- 101000927778 Homo sapiens Rho guanine nucleotide exchange factor 10 Proteins 0.000 description 1
- 101000984753 Homo sapiens Serine/threonine-protein kinase B-raf Proteins 0.000 description 1
- 101001129076 Homo sapiens Serine/threonine-protein kinase N1 Proteins 0.000 description 1
- 101001123846 Homo sapiens Serine/threonine-protein kinase Nek1 Proteins 0.000 description 1
- 101000605835 Homo sapiens Serine/threonine-protein kinase PINK1, mitochondrial Proteins 0.000 description 1
- 101000799194 Homo sapiens Serine/threonine-protein kinase receptor R3 Proteins 0.000 description 1
- 101000611254 Homo sapiens Serine/threonine-protein phosphatase 2B catalytic subunit beta isoform Proteins 0.000 description 1
- 101000611251 Homo sapiens Serine/threonine-protein phosphatase 2B catalytic subunit gamma isoform Proteins 0.000 description 1
- 101001026232 Homo sapiens Small conductance calcium-activated potassium channel protein 3 Proteins 0.000 description 1
- 101000753178 Homo sapiens Sodium/potassium-transporting ATPase subunit alpha-3 Proteins 0.000 description 1
- 101000716102 Homo sapiens T-cell surface glycoprotein CD4 Proteins 0.000 description 1
- 101000946843 Homo sapiens T-cell surface glycoprotein CD8 alpha chain Proteins 0.000 description 1
- 101000613001 Homo sapiens Tetraspanin-6 Proteins 0.000 description 1
- 101000773116 Homo sapiens Thioredoxin domain-containing protein 3 Proteins 0.000 description 1
- 101000904724 Homo sapiens Transmembrane glycoprotein NMB Proteins 0.000 description 1
- 101000823316 Homo sapiens Tyrosine-protein kinase ABL1 Proteins 0.000 description 1
- 101000863873 Homo sapiens Tyrosine-protein phosphatase non-receptor type substrate 1 Proteins 0.000 description 1
- 101000610640 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp3 Proteins 0.000 description 1
- 101000610557 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp31 Proteins 0.000 description 1
- 101000965658 Homo sapiens U6 snRNA-associated Sm-like protein LSm7 Proteins 0.000 description 1
- 101000806419 Homo sapiens V-type proton ATPase subunit G 2 Proteins 0.000 description 1
- 101000852150 Homo sapiens V-type proton ATPase subunit d 1 Proteins 0.000 description 1
- 101000749359 Homo sapiens V-type proton ATPase subunit e 1 Proteins 0.000 description 1
- 101000639096 Homo sapiens V-type proton ATPase subunit e 2 Proteins 0.000 description 1
- 101000596394 Homo sapiens Vesicle-fusing ATPase Proteins 0.000 description 1
- 241000276450 Hucho Species 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 108060006678 I-kappa-B kinase Proteins 0.000 description 1
- 102000001284 I-kappa-B kinase Human genes 0.000 description 1
- 102100023540 Immunoglobulin superfamily containing leucine-rich repeat protein 2 Human genes 0.000 description 1
- 102100026208 Inactive phospholipase C-like protein 2 Human genes 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 102100040061 Indoleamine 2,3-dioxygenase 1 Human genes 0.000 description 1
- 102100024037 Inositol 1,4,5-trisphosphate receptor type 2 Human genes 0.000 description 1
- 102100024035 Inositol 1,4,5-trisphosphate receptor type 3 Human genes 0.000 description 1
- 102100021496 Insulin-degrading enzyme Human genes 0.000 description 1
- 102100037852 Insulin-like growth factor I Human genes 0.000 description 1
- 102100022710 Insulin-like growth factor-binding protein 2 Human genes 0.000 description 1
- 102100032817 Integrin alpha-5 Human genes 0.000 description 1
- 102100032832 Integrin alpha-7 Human genes 0.000 description 1
- 102100037877 Intercellular adhesion molecule 1 Human genes 0.000 description 1
- 102100037872 Intercellular adhesion molecule 2 Human genes 0.000 description 1
- 102100037850 Interferon gamma Human genes 0.000 description 1
- 102100038069 Interferon regulatory factor 8 Human genes 0.000 description 1
- 102100026018 Interleukin-1 receptor antagonist protein Human genes 0.000 description 1
- 102100026016 Interleukin-1 receptor type 1 Human genes 0.000 description 1
- 102100030236 Interleukin-10 receptor subunit alpha Human genes 0.000 description 1
- 102100020791 Interleukin-13 receptor subunit alpha-1 Human genes 0.000 description 1
- 102100020789 Interleukin-15 receptor subunit alpha Human genes 0.000 description 1
- 102000003810 Interleukin-18 Human genes 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- 102000000588 Interleukin-2 Human genes 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 208000037396 Intraductal Noninfiltrating Carcinoma Diseases 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 102100039905 Isocitrate dehydrogenase [NADP] cytoplasmic Human genes 0.000 description 1
- 102100037845 Isocitrate dehydrogenase [NADP], mitochondrial Human genes 0.000 description 1
- OWIKHYCFFJSOEH-UHFFFAOYSA-N Isocyanic acid Chemical compound N=C=O OWIKHYCFFJSOEH-UHFFFAOYSA-N 0.000 description 1
- 102100023429 Junctional adhesion molecule C Human genes 0.000 description 1
- 108010006746 KCNQ2 Potassium Channel Proteins 0.000 description 1
- 108010038888 KCNQ3 Potassium Channel Proteins 0.000 description 1
- 102100029874 Kappa-casein Human genes 0.000 description 1
- 102100031819 Kappa-type opioid receptor Human genes 0.000 description 1
- 102100038197 Katanin p60 ATPase-containing subunit A1 Human genes 0.000 description 1
- 102000004034 Kelch-Like ECH-Associated Protein 1 Human genes 0.000 description 1
- 108090000484 Kelch-Like ECH-Associated Protein 1 Proteins 0.000 description 1
- 102100021447 Kell blood group glycoprotein Human genes 0.000 description 1
- 102100021464 Kinetochore scaffold 1 Human genes 0.000 description 1
- 101710159002 L-lactate oxidase Proteins 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- 102100026517 Lamin-B1 Human genes 0.000 description 1
- 102100022745 Laminin subunit alpha-2 Human genes 0.000 description 1
- 102100027454 Laminin subunit beta-2 Human genes 0.000 description 1
- 241000222722 Leishmania <genus> Species 0.000 description 1
- 108010020246 Leucine-Rich Repeat Serine-Threonine Protein Kinase-2 Proteins 0.000 description 1
- 102100022275 Leucine-rich glioma-inactivated protein 1 Human genes 0.000 description 1
- 102100024102 Leucine-rich repeat and immunoglobulin-like domain-containing nogo receptor-interacting protein 1 Human genes 0.000 description 1
- 102100032693 Leucine-rich repeat serine/threonine-protein kinase 2 Human genes 0.000 description 1
- 102100040695 Leucine-rich repeat-containing protein 25 Human genes 0.000 description 1
- 102100033304 Leucine-rich repeat-containing protein 4 Human genes 0.000 description 1
- 102100028214 Leucine-rich repeat-containing protein 61 Human genes 0.000 description 1
- 102100032352 Leukemia inhibitory factor Human genes 0.000 description 1
- 108090000581 Leukemia inhibitory factor Proteins 0.000 description 1
- HLFSDGLLUJUHTE-SNVBAGLBSA-N Levamisole Chemical compound C1([C@H]2CN3CCSC3=N2)=CC=CC=C1 HLFSDGLLUJUHTE-SNVBAGLBSA-N 0.000 description 1
- 108010013563 Lipoprotein Lipase Proteins 0.000 description 1
- 102100022119 Lipoprotein lipase Human genes 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 102100026395 Lysine-specific demethylase PHF2 Human genes 0.000 description 1
- 102100024033 Lysocardiolipin acyltransferase 1 Human genes 0.000 description 1
- 102100040607 Lysophosphatidic acid receptor 1 Human genes 0.000 description 1
- 102100023231 Lysosomal alpha-mannosidase Human genes 0.000 description 1
- 108010009254 Lysosomal-Associated Membrane Protein 1 Proteins 0.000 description 1
- 102100035133 Lysosome-associated membrane glycoprotein 1 Human genes 0.000 description 1
- 108010068342 MAP Kinase Kinase 1 Proteins 0.000 description 1
- 108010068353 MAP Kinase Kinase 2 Proteins 0.000 description 1
- 102100034069 MAP kinase-activated protein kinase 2 Human genes 0.000 description 1
- 108010041955 MAP-kinase-activated kinase 2 Proteins 0.000 description 1
- 102100028913 MAP/microtubule affinity-regulating kinase 4 Human genes 0.000 description 1
- 101150083522 MECP2 gene Proteins 0.000 description 1
- 101150053046 MYD88 gene Proteins 0.000 description 1
- 102100025136 Macrosialin Human genes 0.000 description 1
- 101100018141 Magnaporthe oryzae (strain 70-15 / ATCC MYA-4617 / FGSC 8958) HTR5 gene Proteins 0.000 description 1
- 102100025613 Major facilitator superfamily domain-containing protein 8 Human genes 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 101001129124 Mannheimia haemolytica Outer membrane lipoprotein 1 Proteins 0.000 description 1
- 108010000684 Matrix Metalloproteinases Proteins 0.000 description 1
- 102000002274 Matrix Metalloproteinases Human genes 0.000 description 1
- 102100030216 Matrix metalloproteinase-14 Human genes 0.000 description 1
- 102100030200 Matrix metalloproteinase-16 Human genes 0.000 description 1
- 102100030218 Matrix metalloproteinase-19 Human genes 0.000 description 1
- 102100024129 Matrix metalloproteinase-24 Human genes 0.000 description 1
- 102100026048 Meckel syndrome type 1 protein Human genes 0.000 description 1
- 102100038976 Mediator of RNA polymerase II transcription subunit 10 Human genes 0.000 description 1
- 102100021070 Mediator of RNA polymerase II transcription subunit 12 Human genes 0.000 description 1
- 101800002739 Melanin-concentrating hormone Proteins 0.000 description 1
- 102100027252 Melanoma-associated antigen C2 Human genes 0.000 description 1
- 108010047230 Member 1 Subfamily B ATP Binding Cassette Transporter Proteins 0.000 description 1
- 108010090306 Member 2 Subfamily G ATP Binding Cassette Transporter Proteins 0.000 description 1
- 102100029698 Metallothionein-1A Human genes 0.000 description 1
- 102100037511 Metastasis-associated protein MTA2 Human genes 0.000 description 1
- 102100039124 Methyl-CpG-binding protein 2 Human genes 0.000 description 1
- 102100021316 Mineralocorticoid receptor Human genes 0.000 description 1
- 102100024193 Mitogen-activated protein kinase 1 Human genes 0.000 description 1
- 102100026931 Mitogen-activated protein kinase 10 Human genes 0.000 description 1
- 102100024192 Mitogen-activated protein kinase 3 Human genes 0.000 description 1
- 102100037808 Mitogen-activated protein kinase 8 Human genes 0.000 description 1
- 102100037809 Mitogen-activated protein kinase 9 Human genes 0.000 description 1
- 102100027869 Moesin Human genes 0.000 description 1
- 102100028647 Mu-type opioid receptor Human genes 0.000 description 1
- 102100023346 Multimerin-2 Human genes 0.000 description 1
- 208000034578 Multiple myelomas Diseases 0.000 description 1
- 101100079042 Mus musculus Myef2 gene Proteins 0.000 description 1
- 102100021970 Myc box-dependent-interacting protein 1 Human genes 0.000 description 1
- 102100026784 Myelin proteolipid protein Human genes 0.000 description 1
- 102100021831 Myelin-associated glycoprotein Human genes 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 1
- 102100025243 Myeloid cell surface antigen CD33 Human genes 0.000 description 1
- 102100024134 Myeloid differentiation primary response protein MyD88 Human genes 0.000 description 1
- 102100021148 Myocyte-specific enhancer factor 2A Human genes 0.000 description 1
- 102100038899 Myogenesis-regulating glycosidase Human genes 0.000 description 1
- 102100036640 Myosin-10 Human genes 0.000 description 1
- 101001055320 Myxine glutinosa Insulin-like growth factor Proteins 0.000 description 1
- 102100030894 NACHT domain- and WD repeat-containing protein 1 Human genes 0.000 description 1
- 102100038625 NADH-ubiquinone oxidoreductase chain 1 Human genes 0.000 description 1
- 108010071382 NF-E2-Related Factor 2 Proteins 0.000 description 1
- 102100029565 NPC intracellular cholesterol transporter 1 Human genes 0.000 description 1
- 102100022737 NPC intracellular cholesterol transporter 2 Human genes 0.000 description 1
- 102100029166 NT-3 growth factor receptor Human genes 0.000 description 1
- 102100036836 Natriuretic peptides B Human genes 0.000 description 1
- 102100023062 Negative elongation factor A Human genes 0.000 description 1
- 102100031900 Neogenin Human genes 0.000 description 1
- 102000048850 Neoplasm Genes Human genes 0.000 description 1
- 108700019961 Neoplasm Genes Proteins 0.000 description 1
- 102100023187 Nephrocystin-1 Human genes 0.000 description 1
- 108090000028 Neprilysin Proteins 0.000 description 1
- 108010074223 Netrin-1 Proteins 0.000 description 1
- 102000009065 Netrin-1 Human genes 0.000 description 1
- 102100038699 Netrin-G2 Human genes 0.000 description 1
- 108010012255 Neural Cell Adhesion Molecule L1 Proteins 0.000 description 1
- 102100024964 Neural cell adhesion molecule L1 Human genes 0.000 description 1
- 102100023616 Neural cell adhesion molecule L1-like protein Human genes 0.000 description 1
- 102000048238 Neuregulin-1 Human genes 0.000 description 1
- 108090000556 Neuregulin-1 Proteins 0.000 description 1
- 102100021582 Neurexin-1-beta Human genes 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- 102100034441 Neuroligin-4, X-linked Human genes 0.000 description 1
- 208000005890 Neuroma Diseases 0.000 description 1
- 102100038877 Neuronal PAS domain-containing protein 4 Human genes 0.000 description 1
- 102100039908 Neuronal acetylcholine receptor subunit alpha-3 Human genes 0.000 description 1
- 102100039909 Neuronal acetylcholine receptor subunit alpha-4 Human genes 0.000 description 1
- 102100039907 Neuronal acetylcholine receptor subunit alpha-5 Human genes 0.000 description 1
- 102100021511 Neuronal acetylcholine receptor subunit alpha-7 Human genes 0.000 description 1
- 102100030912 Neuronal acetylcholine receptor subunit beta-2 Human genes 0.000 description 1
- 108090000770 Neuropilin-2 Proteins 0.000 description 1
- 101100355599 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) mus-11 gene Proteins 0.000 description 1
- 102100037590 Neurotensin/neuromedin N Human genes 0.000 description 1
- 102100029268 Neurotrophin-3 Human genes 0.000 description 1
- 102100023620 Neutrophil cytosol factor 1 Human genes 0.000 description 1
- 102100034450 Nicotinamide/nicotinic acid mononucleotide adenylyltransferase 2 Human genes 0.000 description 1
- 108010008858 Nitric Oxide Synthase Type I Proteins 0.000 description 1
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 1
- 102100022397 Nitric oxide synthase, brain Human genes 0.000 description 1
- 102100028452 Nitric oxide synthase, endothelial Human genes 0.000 description 1
- 102100029438 Nitric oxide synthase, inducible Human genes 0.000 description 1
- IOVCWXUNBOPUCH-UHFFFAOYSA-N Nitrous acid Chemical compound ON=O IOVCWXUNBOPUCH-UHFFFAOYSA-N 0.000 description 1
- 102100037046 Noelin-3 Human genes 0.000 description 1
- 102100038454 Noggin Human genes 0.000 description 1
- 102000001756 Notch2 Receptor Human genes 0.000 description 1
- 108010029751 Notch2 Receptor Proteins 0.000 description 1
- 102000001760 Notch3 Receptor Human genes 0.000 description 1
- 108010029756 Notch3 Receptor Proteins 0.000 description 1
- 102000001753 Notch4 Receptor Human genes 0.000 description 1
- 108010029741 Notch4 Receptor Proteins 0.000 description 1
- 102100023050 Nuclear factor NF-kappa-B p105 subunit Human genes 0.000 description 1
- 102100031701 Nuclear factor erythroid 2-related factor 2 Human genes 0.000 description 1
- 102100021713 Nuclear nucleic acid-binding protein C1D Human genes 0.000 description 1
- 102100030569 Nuclear receptor corepressor 2 Human genes 0.000 description 1
- 101710153660 Nuclear receptor corepressor 2 Proteins 0.000 description 1
- 102100022676 Nuclear receptor subfamily 4 group A member 2 Human genes 0.000 description 1
- 102100022400 Nucleolar protein 3 Human genes 0.000 description 1
- 102100032210 Nucleoside diphosphate kinase homolog 5 Human genes 0.000 description 1
- 102100025193 OTU domain-containing protein 4 Human genes 0.000 description 1
- 102100032604 Occludin Human genes 0.000 description 1
- 101000761187 Odontomachus monticola U-poneritoxin(01)-Om1a Proteins 0.000 description 1
- 108010019644 Oligodendrocyte Transcription Factor 2 Proteins 0.000 description 1
- 102000005803 Oligodendrocyte Transcription Factor 2 Human genes 0.000 description 1
- 102100035592 Oligophrenin-1 Human genes 0.000 description 1
- 102100030098 Oncostatin-M-specific receptor subunit beta Human genes 0.000 description 1
- 102100031943 One cut domain family member 2 Human genes 0.000 description 1
- 102100031822 Optineurin Human genes 0.000 description 1
- 102100028141 Orexin/Hypocretin receptor type 1 Human genes 0.000 description 1
- 101100236423 Oryza sativa subsp. japonica MADS3 gene Proteins 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 102100025386 Oxidized low-density lipoprotein receptor 1 Human genes 0.000 description 1
- 102100037601 P2X purinoceptor 4 Human genes 0.000 description 1
- 102100037602 P2X purinoceptor 7 Human genes 0.000 description 1
- 102100026171 P2Y purinoceptor 12 Human genes 0.000 description 1
- 101700056750 PAK1 Proteins 0.000 description 1
- 108010032788 PAX6 Transcription Factor Proteins 0.000 description 1
- KJWZYMMLVHIVSU-IYCNHOCDSA-N PGK1 Chemical compound CCCCC[C@H](O)\C=C\[C@@H]1[C@@H](CCCCCCC(O)=O)C(=O)CC1=O KJWZYMMLVHIVSU-IYCNHOCDSA-N 0.000 description 1
- 102100036870 PHD finger protein 19 Human genes 0.000 description 1
- 102100035196 POLG alternative reading frame Human genes 0.000 description 1
- 102100035423 POU domain, class 5, transcription factor 1 Human genes 0.000 description 1
- 102100040852 Paired box protein Pax-2 Human genes 0.000 description 1
- 102100040891 Paired box protein Pax-3 Human genes 0.000 description 1
- 102100037506 Paired box protein Pax-6 Human genes 0.000 description 1
- 108020002591 Palmitoyl protein thioesterase Proteins 0.000 description 1
- 102000005327 Palmitoyl protein thioesterase Human genes 0.000 description 1
- 108010067372 Pancreatic elastase Proteins 0.000 description 1
- 102000016387 Pancreatic elastase Human genes 0.000 description 1
- 102100024127 Pantothenate kinase 2, mitochondrial Human genes 0.000 description 1
- 102100035006 Paralemmin-1 Human genes 0.000 description 1
- 102100037499 Parkinson disease protein 7 Human genes 0.000 description 1
- 108010067902 Peptide Library Proteins 0.000 description 1
- 102100028467 Perforin-1 Human genes 0.000 description 1
- 102100028465 Peripherin Human genes 0.000 description 1
- 102100030564 Peroxisomal membrane protein 2 Human genes 0.000 description 1
- 102100038825 Peroxisome proliferator-activated receptor gamma Human genes 0.000 description 1
- 102100028960 Peroxisome proliferator-activated receptor gamma coactivator 1-alpha Human genes 0.000 description 1
- 102100038223 Phenylalanine-4-hydroxylase Human genes 0.000 description 1
- 101710125939 Phenylalanine-4-hydroxylase Proteins 0.000 description 1
- 102100026169 Phosphatidylinositol 3-kinase regulatory subunit alpha Human genes 0.000 description 1
- 102100036061 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit beta isoform Human genes 0.000 description 1
- 102100039298 Phosphatidylserine synthase 1 Human genes 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 102100037389 Phosphoglycerate mutase 1 Human genes 0.000 description 1
- 102100036066 Phospholipase A and acyltransferase 3 Human genes 0.000 description 1
- 102100033616 Phospholipid-transporting ATPase ABCA1 Human genes 0.000 description 1
- 102100036620 Phospholipid-transporting ATPase ABCA7 Human genes 0.000 description 1
- 108010089430 Phosphoproteins Proteins 0.000 description 1
- 102000007982 Phosphoproteins Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 208000007913 Pituitary Neoplasms Diseases 0.000 description 1
- 102100035733 Pituitary adenylate cyclase-activating polypeptide Human genes 0.000 description 1
- 102100029744 Plasma membrane calcium-transporting ATPase 3 Human genes 0.000 description 1
- 102100030265 Plasmolipin Human genes 0.000 description 1
- 102100035181 Plastin-1 Human genes 0.000 description 1
- 102100030655 Platelet-activating factor acetylhydrolase IB subunit beta Human genes 0.000 description 1
- 102100036245 Pleckstrin homology domain-containing family O member 2 Human genes 0.000 description 1
- 102100034390 Plexin-B3 Human genes 0.000 description 1
- 102100035381 Plexin-C1 Human genes 0.000 description 1
- 108010064218 Poly (ADP-Ribose) Polymerase-1 Proteins 0.000 description 1
- 102100023712 Poly [ADP-ribose] polymerase 1 Human genes 0.000 description 1
- 102100039917 Polyamine-transporting ATPase 13A2 Human genes 0.000 description 1
- 102100033566 Polycomb complex protein BMI-1 Human genes 0.000 description 1
- 102100040748 Polyglutamine-binding protein 1 Human genes 0.000 description 1
- 102100031949 Polyunsaturated fatty acid lipoxygenase ALOX12 Human genes 0.000 description 1
- 102100024986 Potassium channel subfamily K member 9 Human genes 0.000 description 1
- 102100034368 Potassium voltage-gated channel subfamily A member 1 Human genes 0.000 description 1
- 102100034310 Potassium voltage-gated channel subfamily B member 1 Human genes 0.000 description 1
- 102100033172 Potassium voltage-gated channel subfamily C member 3 Human genes 0.000 description 1
- 102100033184 Potassium voltage-gated channel subfamily D member 3 Human genes 0.000 description 1
- 102100034354 Potassium voltage-gated channel subfamily KQT member 2 Human genes 0.000 description 1
- 102100034360 Potassium voltage-gated channel subfamily KQT member 3 Human genes 0.000 description 1
- 102100028730 Pre-mRNA-processing factor 17 Human genes 0.000 description 1
- 102100036347 Pre-mRNA-splicing factor SPF27 Human genes 0.000 description 1
- 102100026531 Prelamin-A/C Human genes 0.000 description 1
- 102100022033 Presenilin-1 Human genes 0.000 description 1
- 102100022036 Presenilin-2 Human genes 0.000 description 1
- 102100026651 Pro-adrenomedullin Human genes 0.000 description 1
- 102100027467 Pro-opiomelanocortin Human genes 0.000 description 1
- 102100036134 Probable arginine-tRNA ligase, mitochondrial Human genes 0.000 description 1
- 102100035920 Probable hydrolase PNKD Human genes 0.000 description 1
- 102100035198 Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 Human genes 0.000 description 1
- 102100028857 Profilin-1 Human genes 0.000 description 1
- 102100024091 Proline-rich AKT1 substrate 1 Human genes 0.000 description 1
- 102100028840 Proline-rich transmembrane protein 2 Human genes 0.000 description 1
- 102100040477 Prolyl 4-hydroxylase subunit alpha-1 Human genes 0.000 description 1
- 102100035760 Proteasome subunit beta type-8 Human genes 0.000 description 1
- 102100035764 Proteasome subunit beta type-9 Human genes 0.000 description 1
- 102100034479 Protein CLN8 Human genes 0.000 description 1
- 108010032428 Protein Deglycase DJ-1 Proteins 0.000 description 1
- 108091000521 Protein-Arginine Deiminase Type 2 Proteins 0.000 description 1
- 102100035735 Protein-arginine deiminase type-2 Human genes 0.000 description 1
- 102100026858 Protein-lysine 6-oxidase Human genes 0.000 description 1
- 108010026552 Proteome Proteins 0.000 description 1
- 102100024261 Protocadherin alpha-4 Human genes 0.000 description 1
- 102100036389 Protocadherin-19 Human genes 0.000 description 1
- 102100032590 Puratrophin-1 Human genes 0.000 description 1
- 102100032315 RAC-beta serine/threonine-protein kinase Human genes 0.000 description 1
- 102100032314 RAC-gamma serine/threonine-protein kinase Human genes 0.000 description 1
- 101150105148 RAD23 gene Proteins 0.000 description 1
- 101150006234 RAD52 gene Proteins 0.000 description 1
- 102100034220 RAS guanyl-releasing protein 1 Human genes 0.000 description 1
- 102100034026 RNA-binding protein Musashi homolog 1 Human genes 0.000 description 1
- 102100038427 RNA-binding protein Nova-1 Human genes 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 102000053062 Rad52 DNA Repair and Recombination Human genes 0.000 description 1
- 108700031762 Rad52 DNA Repair and Recombination Proteins 0.000 description 1
- 102100034485 Ras-related protein Rab-2A Human genes 0.000 description 1
- 102100022305 Ras-related protein Rab-38 Human genes 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 102100028254 Renin receptor Human genes 0.000 description 1
- 102100033203 Rho guanine nucleotide exchange factor 10 Human genes 0.000 description 1
- 102000002278 Ribosomal Proteins Human genes 0.000 description 1
- 108010000605 Ribosomal Proteins Proteins 0.000 description 1
- 102100025831 Scavenger receptor cysteine-rich type 1 protein M130 Human genes 0.000 description 1
- 102100027103 Serine/threonine-protein kinase B-raf Human genes 0.000 description 1
- 102100028751 Serine/threonine-protein kinase Nek1 Human genes 0.000 description 1
- 102100038376 Serine/threonine-protein kinase PINK1, mitochondrial Human genes 0.000 description 1
- 102100034136 Serine/threonine-protein kinase receptor R3 Human genes 0.000 description 1
- 102100040321 Serine/threonine-protein phosphatase 2B catalytic subunit beta isoform Human genes 0.000 description 1
- 102100040320 Serine/threonine-protein phosphatase 2B catalytic subunit gamma isoform Human genes 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 102100037442 Small conductance calcium-activated potassium channel protein 3 Human genes 0.000 description 1
- 102100021952 Sodium/potassium-transporting ATPase subunit alpha-3 Human genes 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 1
- YCUVUDODLRLVIC-UHFFFAOYSA-N Sudan black B Chemical compound C1=CC(=C23)NC(C)(C)NC2=CC=CC3=C1N=NC(C1=CC=CC=C11)=CC=C1N=NC1=CC=CC=C1 YCUVUDODLRLVIC-UHFFFAOYSA-N 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 1
- 108010002687 Survivin Proteins 0.000 description 1
- 230000006044 T cell activation Effects 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 1
- 102100034922 T-cell surface glycoprotein CD8 alpha chain Human genes 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- 102100040869 Tetraspanin-6 Human genes 0.000 description 1
- 241000205188 Thermococcus Species 0.000 description 1
- 108090001109 Thermolysin Proteins 0.000 description 1
- 101000803959 Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8) DNA ligase Proteins 0.000 description 1
- 102100030271 Thioredoxin domain-containing protein 3 Human genes 0.000 description 1
- 102100032120 Toll/interleukin-1 receptor domain-containing adapter protein Human genes 0.000 description 1
- 108091093130 Toxic Small RNA Proteins 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102100023935 Transmembrane glycoprotein NMB Human genes 0.000 description 1
- 208000003721 Triple Negative Breast Neoplasms Diseases 0.000 description 1
- 206010044688 Trisomy 21 Diseases 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 1
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 1
- 102100040245 Tumor necrosis factor receptor superfamily member 5 Human genes 0.000 description 1
- 102100022596 Tyrosine-protein kinase ABL1 Human genes 0.000 description 1
- 102100029948 Tyrosine-protein phosphatase non-receptor type substrate 1 Human genes 0.000 description 1
- 102100040374 U4/U6 small nuclear ribonucleoprotein Prp3 Human genes 0.000 description 1
- 102100040118 U4/U6 small nuclear ribonucleoprotein Prp31 Human genes 0.000 description 1
- 102100040951 U6 snRNA-associated Sm-like protein LSm7 Human genes 0.000 description 1
- 108010005656 Ubiquitin Thiolesterase Proteins 0.000 description 1
- 102000005918 Ubiquitin Thiolesterase Human genes 0.000 description 1
- 108010005705 Ubiquitinated Proteins Proteins 0.000 description 1
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 1
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 102100037430 V-type proton ATPase subunit G 2 Human genes 0.000 description 1
- 102100036507 V-type proton ATPase subunit d 1 Human genes 0.000 description 1
- 102100040563 V-type proton ATPase subunit e 1 Human genes 0.000 description 1
- 102100031384 V-type proton ATPase subunit e 2 Human genes 0.000 description 1
- 108010053099 Vascular Endothelial Growth Factor Receptor-2 Proteins 0.000 description 1
- 102100033177 Vascular endothelial growth factor receptor 2 Human genes 0.000 description 1
- 102100035054 Vesicle-fusing ATPase Human genes 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000726445 Viroids Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 102000013814 Wnt Human genes 0.000 description 1
- 108050003627 Wnt Proteins 0.000 description 1
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 239000000061 acid fraction Substances 0.000 description 1
- RZUBARUFLYGOGC-MTHOTQAESA-L acid fuchsin Chemical compound [Na+].[Na+].[O-]S(=O)(=O)C1=C(N)C(C)=CC(C(=C\2C=C(C(=[NH2+])C=C/2)S([O-])(=O)=O)\C=2C=C(C(N)=CC=2)S([O-])(=O)=O)=C1 RZUBARUFLYGOGC-MTHOTQAESA-L 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- DPKHZNPWBDQZCN-UHFFFAOYSA-N acridine orange free base Chemical compound C1=CC(N(C)C)=CC2=NC3=CC(N(C)C)=CC=C3C=C21 DPKHZNPWBDQZCN-UHFFFAOYSA-N 0.000 description 1
- 230000001919 adrenal effect Effects 0.000 description 1
- 238000007605 air drying Methods 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 238000002669 amniocentesis Methods 0.000 description 1
- 208000036878 aneuploidy Diseases 0.000 description 1
- 231100001075 aneuploidy Toxicity 0.000 description 1
- XLJMAIOERFSOGZ-UHFFFAOYSA-N anhydrous cyanic acid Natural products OC#N XLJMAIOERFSOGZ-UHFFFAOYSA-N 0.000 description 1
- 230000030741 antigen processing and presentation Effects 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 101150070427 argC gene Proteins 0.000 description 1
- 101150089042 argC2 gene Proteins 0.000 description 1
- 229940000488 arsenic acid Drugs 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 210000003651 basophil Anatomy 0.000 description 1
- 108700000711 bcl-X Proteins 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- DZBUGLKDJFMEHC-UHFFFAOYSA-N benzoquinolinylidene Natural products C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 1
- 108010032967 beta-Arrestin 2 Proteins 0.000 description 1
- 239000003181 biological factor Substances 0.000 description 1
- 230000008236 biological pathway Effects 0.000 description 1
- FUHMZYWBSHTEDZ-UHFFFAOYSA-M bispyribac-sodium Chemical compound [Na+].COC1=CC(OC)=NC(OC=2C(=C(OC=3N=C(OC)C=C(OC)N=3)C=CC=2)C([O-])=O)=N1 FUHMZYWBSHTEDZ-UHFFFAOYSA-M 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 210000001772 blood platelet Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 210000004958 brain cell Anatomy 0.000 description 1
- 238000000339 bright-field microscopy Methods 0.000 description 1
- 239000008364 bulk solution Substances 0.000 description 1
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 1
- 238000002619 cancer immunotherapy Methods 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 235000012730 carminic acid Nutrition 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000004640 cellular pathway Effects 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 210000001638 cerebellum Anatomy 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- BPHHNXJPFPEJOF-UHFFFAOYSA-J chembl296966 Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]S(=O)(=O)C1=CC(S([O-])(=O)=O)=C(N)C2=C(O)C(N=NC3=CC=C(C=C3OC)C=3C=C(C(=CC=3)N=NC=3C(=C4C(N)=C(C=C(C4=CC=3)S([O-])(=O)=O)S([O-])(=O)=O)O)OC)=CC=C21 BPHHNXJPFPEJOF-UHFFFAOYSA-J 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- KRVSOGSZCMJSLX-UHFFFAOYSA-L chromic acid Substances O[Cr](O)(=O)=O KRVSOGSZCMJSLX-UHFFFAOYSA-L 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 210000000078 claw Anatomy 0.000 description 1
- 108090001092 clostripain Proteins 0.000 description 1
- 230000008045 co-localization Effects 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000004624 confocal microscopy Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001054 cortical effect Effects 0.000 description 1
- 210000005257 cortical tissue Anatomy 0.000 description 1
- 101150021854 cosT gene Proteins 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 1
- 230000009615 deamination Effects 0.000 description 1
- 238000006481 deamination reaction Methods 0.000 description 1
- 230000018044 dehydration Effects 0.000 description 1
- 238000006297 dehydration reaction Methods 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 210000004443 dendritic cell Anatomy 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 208000028715 ductal breast carcinoma in situ Diseases 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 229940066758 endopeptidases Drugs 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 230000037149 energy metabolism Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 210000003979 eosinophil Anatomy 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- UYAHIZSMUZPPFV-UHFFFAOYSA-N erbium Chemical compound [Er] UYAHIZSMUZPPFV-UHFFFAOYSA-N 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 210000001808 exosome Anatomy 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- AWJWCTOOIBYHON-UHFFFAOYSA-N furo[3,4-b]pyrazine-5,7-dione Chemical compound C1=CN=C2C(=O)OC(=O)C2=N1 AWJWCTOOIBYHON-UHFFFAOYSA-N 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- 230000000762 glandular Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000002518 glial effect Effects 0.000 description 1
- 210000002288 golgi apparatus Anatomy 0.000 description 1
- 210000003714 granulocyte Anatomy 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 210000003630 histaminocyte Anatomy 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- XMBWDFGMSWQBCA-UHFFFAOYSA-N hydrogen iodide Chemical compound I XMBWDFGMSWQBCA-UHFFFAOYSA-N 0.000 description 1
- 229910000037 hydrogen sulfide Inorganic materials 0.000 description 1
- 229940071870 hydroiodic acid Drugs 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000036737 immune function Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000012744 immunostaining Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 208000027866 inflammatory disease Diseases 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 108010092830 integrin alpha7beta1 Proteins 0.000 description 1
- PNDPGZBMCMUPRI-UHFFFAOYSA-N iodine Chemical compound II PNDPGZBMCMUPRI-UHFFFAOYSA-N 0.000 description 1
- JEIPFZHSYJVQDO-UHFFFAOYSA-N iron(III) oxide Inorganic materials O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 208000017169 kidney disease Diseases 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 229960001614 levamisole Drugs 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 101150094164 lysY gene Proteins 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 239000006148 magnetic separator Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 239000001630 malic acid Substances 0.000 description 1
- 235000011090 malic acid Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000008774 maternal effect Effects 0.000 description 1
- 101150014102 mef-2 gene Proteins 0.000 description 1
- 210000003593 megakaryocyte Anatomy 0.000 description 1
- 210000004779 membrane envelope Anatomy 0.000 description 1
- 208000030159 metabolic disease Diseases 0.000 description 1
- DWCZIOOZPIDHAB-UHFFFAOYSA-L methyl green Chemical compound [Cl-].[Cl-].C1=CC(N(C)C)=CC=C1C(C=1C=CC(=CC=1)[N+](C)(C)C)=C1C=CC(=[N+](C)C)C=C1 DWCZIOOZPIDHAB-UHFFFAOYSA-L 0.000 description 1
- 108010065059 methylaspartate ammonia-lyase Proteins 0.000 description 1
- CXKWCBBOMKCUKX-UHFFFAOYSA-M methylene blue Chemical compound [Cl-].C1=CC(N(C)C)=CC2=[S+]C3=CC(N(C)C)=CC=C3N=C21 CXKWCBBOMKCUKX-UHFFFAOYSA-M 0.000 description 1
- 229960000907 methylthioninium chloride Drugs 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- VLAPMBHFAWRUQP-UHFFFAOYSA-L molybdic acid Chemical compound O[Mo](O)(=O)=O VLAPMBHFAWRUQP-UHFFFAOYSA-L 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 210000000066 myeloid cell Anatomy 0.000 description 1
- SHXOKQKTZJXHHR-UHFFFAOYSA-N n,n-diethyl-5-iminobenzo[a]phenoxazin-9-amine;hydrochloride Chemical compound [Cl-].C1=CC=C2C3=NC4=CC=C(N(CC)CC)C=C4OC3=CC(=[NH2+])C2=C1 SHXOKQKTZJXHHR-UHFFFAOYSA-N 0.000 description 1
- 210000000581 natural killer T-cell Anatomy 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 210000002501 natural regulatory T cell Anatomy 0.000 description 1
- 238000002663 nebulization Methods 0.000 description 1
- 230000001272 neurogenic effect Effects 0.000 description 1
- 108010064131 neuronal Cdk5 activator (p25-p35) Proteins 0.000 description 1
- PGSADBUBUOPOJS-UHFFFAOYSA-N neutral red Chemical compound Cl.C1=C(C)C(N)=CC2=NC3=CC(N(C)C)=CC=C3N=C21 PGSADBUBUOPOJS-UHFFFAOYSA-N 0.000 description 1
- VOFUROIFQGPCGE-UHFFFAOYSA-N nile red Chemical compound C1=CC=C2C3=NC4=CC=C(N(CC)CC)C=C4OC3=CC(=O)C2=C1 VOFUROIFQGPCGE-UHFFFAOYSA-N 0.000 description 1
- 229910017604 nitric acid Inorganic materials 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 210000002220 organoid Anatomy 0.000 description 1
- 239000012285 osmium tetroxide Substances 0.000 description 1
- 229910000489 osmium tetroxide Inorganic materials 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 101150029755 park gene Proteins 0.000 description 1
- 235000012736 patent blue V Nutrition 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 108010060054 peroxisome-proliferator-activated receptor-gamma coactivator-1 Proteins 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 208000028591 pheochromocytoma Diseases 0.000 description 1
- 102000013451 phosphoglycerate mutase activity proteins Human genes 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 208000010916 pituitary tumor Diseases 0.000 description 1
- 229920003192 poly(bis maleimide) Polymers 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000012913 prioritisation Methods 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- XJMOSONTPMZWPB-UHFFFAOYSA-M propidium iodide Chemical compound [I-].[I-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CCC[N+](C)(CC)CC)=C1C1=CC=CC=C1 XJMOSONTPMZWPB-UHFFFAOYSA-M 0.000 description 1
- ODLMAHJVESYWTB-UHFFFAOYSA-N propylbenzene Chemical compound CCCC1=CC=CC=C1 ODLMAHJVESYWTB-UHFFFAOYSA-N 0.000 description 1
- 238000002331 protein detection Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 108010067765 rab2 GTP Binding protein Proteins 0.000 description 1
- 102000016914 ras Proteins Human genes 0.000 description 1
- 102000027426 receptor tyrosine kinases Human genes 0.000 description 1
- 108091008598 receptor tyrosine kinases Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- OARRHUQTFTUEOS-UHFFFAOYSA-N safranin Chemical compound [Cl-].C=12C=C(N)C(C)=CC2=NC2=CC(C)=C(N)C=C2[N+]=1C1=CC=CC=C1 OARRHUQTFTUEOS-UHFFFAOYSA-N 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 210000003079 salivary gland Anatomy 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 108700004121 sarkosyl Proteins 0.000 description 1
- 108091005418 scavenger receptor class E Proteins 0.000 description 1
- SPVXKVOXSXTJOY-UHFFFAOYSA-N selane Chemical compound [SeH2] SPVXKVOXSXTJOY-UHFFFAOYSA-N 0.000 description 1
- 229910000058 selane Inorganic materials 0.000 description 1
- 229940000207 selenious acid Drugs 0.000 description 1
- MCAHWIHFGHIESP-UHFFFAOYSA-N selenous acid Chemical compound O[Se](O)=O MCAHWIHFGHIESP-UHFFFAOYSA-N 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000012174 single-cell RNA sequencing Methods 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 239000012279 sodium borohydride Substances 0.000 description 1
- 229910000033 sodium borohydride Inorganic materials 0.000 description 1
- KSAVQLQVUXSOCR-UHFFFAOYSA-M sodium lauroyl sarcosinate Chemical compound [Na+].CCCCCCCCCCCC(=O)N(C)CC([O-])=O KSAVQLQVUXSOCR-UHFFFAOYSA-M 0.000 description 1
- 229940045885 sodium lauroyl sarcosinate Drugs 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 210000000278 spinal cord Anatomy 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 238000011476 stem cell transplantation Methods 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- WWJZWCUNLNYYAU-UHFFFAOYSA-N temephos Chemical compound C1=CC(OP(=S)(OC)OC)=CC=C1SC1=CC=C(OP(=S)(OC)OC)C=C1 WWJZWCUNLNYYAU-UHFFFAOYSA-N 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000011222 transcriptome analysis Methods 0.000 description 1
- 210000003956 transport vesicle Anatomy 0.000 description 1
- YNJBWRMUSHSURL-UHFFFAOYSA-N trichloroacetic acid Chemical compound OC(=O)C(Cl)(Cl)Cl YNJBWRMUSHSURL-UHFFFAOYSA-N 0.000 description 1
- 208000022679 triple-negative breast carcinoma Diseases 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 108010064892 trkC Receptor Proteins 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000007482 whole exome sequencing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6834—Enzymatic or biochemical coupling of nucleic acids to a solid phase
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Provided herein are methods of determining the location of an analyte using hybridization as a method of enhancing analyte detection. In particular, capture probes comprising a spatial barcode and a capture domain are used to capture analytes in a biological sample in contact with a substrate. The analyte may be a nucleic acid or a protein. Decoy oligonucleotides are used to enrich for nucleic acids of interest prior to sequencing.
Description
Cross Reference to Related Applications
The present application claims U.S. provisional patent application Ser. No. 62/979,652, filed 2/21/2020; U.S. provisional patent application Ser. No. 62/980,124, filed 2/21/2020; and priority of U.S. provisional patent application No. 63/077,019 filed on 9/11/2020. The contents of the above-mentioned applications are incorporated herein by reference in their entirety.
Background
Cells within a subject tissue differ in cell morphology and/or function due to different analyte abundances (e.g., gene and/or protein expression) within different cells. Specific locations of cells within a tissue (e.g., locations of cells relative to neighboring cells or locations of cells relative to the tissue microenvironment) may affect, for example, morphology, differentiation, fate, viability, proliferation, behavior of cells, and signal transduction and crosstalk with other cells in the tissue.
Spatial heterogeneity has previously been investigated using techniques that provide data for only a small amount of analyte in whole or part of tissue, or for a large amount of analyte data for a single cell, but not information about the location of a single cell in a parent biological sample (e.g., a tissue sample).
Whole exome sequencing provides coverage for each transcript in the sample. There is a need in the art for transcriptome-specific methods for high fidelity enrichment of nucleic acid molecules for targeted sequencing while reducing cost, maximizing efficiency, and minimizing redundancy.
Sequence listing
The present application contains a sequence listing submitted by compact disc and incorporated herein by reference in its entirety. The optical disc created at 2021, 2 and 19 is named 47706-0198WO1_sequence_running_CORRECTED. Txt, size 316,281,109 bytes. 3 copies of the disc were submitted.
Summary of The Invention
Disclosed herein are methods for identifying the abundance and location of an analyte in a biological sample, the method comprising: (a) Contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, in some embodiments, the plurality of nucleic acids comprises an extended nucleic acid comprising (i) a spatial barcode or complement thereof and (ii) an analyte, analyte derivative, or complement thereof, all or part of the sequence; the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise: (i) A capture domain that hybridizes to all or part of the sequence of an analyte, analyte derivative, or complement thereof, and (ii) a molecular tag; (b) Capturing decoy oligonucleotides bound to the extended nucleic acids using a substrate comprising an agent that binds to a molecular tag; and (c) determining all or part of the sequence of (i) the spatial barcode or its complement and all or part of the sequence of (ii) the extended nucleic acid, and using the sequences determined in (i) and (ii) to identify the abundance and location of the analyte in the biological sample.
In some embodiments, the analyte is a nucleic acid.
In some embodiments, the method further comprises generating a plurality of nucleic acids, the method comprising: (a) Contacting a biological sample with a substrate comprising a plurality of attached capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that binds to a sequence present in a nucleic acid; (b) hybridizing the capture probe to the nucleic acid; (c) Extending the 3' end of the capture probe using the nucleic acid bound to the capture domain as a template to generate an extended capture probe; and (d) amplifying the extended capture probe to produce an extended nucleic acid.
In some embodiments, the extended nucleic acid is released from the extended capture probe.
In some embodiments, the analyte is a protein.
In some embodiments, the analyte derivative is an oligonucleotide comprising an analyte binding moiety barcode, and an analyte capture sequence.
In some embodiments, the method further comprises generating a plurality of nucleic acids, the method comprising: (a) Contacting a plurality of analyte capture agents with a biological sample, in some embodiments, the analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that binds to a protein and (ii) an oligonucleotide comprising an analyte binding moiety barcode and an analyte capture sequence; (b) Contacting a plurality of analyte capture agents with a substrate comprising a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, in some embodiments, the capture domain binds to an analyte capture sequence; (c) Extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and (d) amplifying the extended capture probe to produce an extended nucleic acid.
In some embodiments, the extended nucleic acid is released from the extended capture probe.
In some embodiments, step (a) and step (b) are performed substantially simultaneously. In some embodiments, step (a) is performed before step (b). In some embodiments, step (b) is performed before step (a).
Also provided herein are methods for enriching a biological sample for an analyte or analyte derivative, the method comprising: (a) Contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, in some embodiments, the plurality of nucleic acids comprises an extended nucleic acid comprising (i) a spatial barcode or complement thereof and (ii) an analyte, analyte derivative, or complement thereof, all or part of the sequence; the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise: (i) A capture domain that hybridizes to all or part of the sequence of an analyte, analyte derivative, or complement thereof, and (ii) a molecular tag; (b) Capturing a complex of decoy oligonucleotides bound to an extended nucleic acid using a substrate comprising an agent bound to a molecular tag; and (c) isolating the complex of decoy oligonucleotides bound to the extended nucleic acid, thereby enriching the biological sample for the analyte or analyte derivative.
In some embodiments, the analyte is a nucleic acid.
In some embodiments, the method further comprises generating a plurality of nucleic acids, the method comprising: (a) Contacting a biological sample with a substrate comprising a plurality of attached capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that binds to a sequence present in a nucleic acid; (b) hybridizing the capture probe to the nucleic acid; (c) Extending the 3' end of the capture probe using the nucleic acid bound to the capture domain as a template to generate an extended capture probe; and (d) amplifying the extended capture probe to produce an extended nucleic acid.
In some embodiments, the extended nucleic acid is released from the extended capture probe.
In some embodiments, the analyte is a protein.
In some embodiments, the analyte derivative is an oligonucleotide comprising an analyte binding moiety barcode, and an analyte capture sequence.
In some embodiments, the method further comprises generating a plurality of nucleic acids, the method comprising: (a) Contacting a plurality of analyte capture agents with a biological sample, in some embodiments, the analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that binds to a protein and (ii) an oligonucleotide comprising an analyte binding moiety barcode and an analyte capture sequence; (b) Contacting a plurality of analyte capture agents with a substrate comprising a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, in some embodiments, the capture domain binds to an analyte capture sequence; (c) Extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and (d) amplifying the extended capture probe to produce an extended nucleic acid.
In some embodiments, the extended nucleic acid is released from the extended capture probe.
In some embodiments, step (a) and step (b) are performed substantially simultaneously. In some embodiments, step (a) is performed before step (b). In some embodiments, step (b) is performed before step (a).
In some embodiments, the analyte from the biological sample is associated with a disease or disorder.
In some embodiments, the capture domain of the decoy oligonucleotide binds to a 3 'portion, a 5' portion, an intron, an exon, a 3 'untranslated region, or a 5' untranslated region of the sequence of the analyte, the analyte derivative, or the complement thereof.
In some embodiments, the capture domain of the decoy oligonucleotide comprises a total of about 10 nucleotides to about 300 nucleotides.
In some embodiments, the molecular tag comprises a protein, a small molecule, a nucleic acid, or a carbohydrate.
In some embodiments, the molecular tag is streptavidin, avidin, biotin, or a fluorophore.
In some embodiments, the agent that binds to the molecular tag comprises a protein (e.g., an antibody), a nucleic acid, or a small molecule.
In some embodiments, the molecular tag is biotin and the agent bound to the molecular tag is avidin or streptavidin.
In some embodiments, an agent that specifically binds to a molecular tag is attached to a substrate. In some embodiments, the substrate is a bead, well, or slide.
In some embodiments, the extended nucleic acid is a DNA molecule (e.g., a cDNA molecule).
In some embodiments, the extended nucleic acid further comprises a primer sequence or complement thereof; a unique molecular sequence or its complement; or other primer binding sequences or complements thereof.
In some embodiments, the biological sample is a tissue sample selected from a formalin fixed, paraffin embedded (FFPE) tissue sample or a frozen tissue sample.
In some embodiments, the biological sample is pre-stained with a detectable label, hematoxylin and eosin (H & E) dye, immunofluorescence, or immunohistochemistry.
In some embodiments, the biological sample is a permeabilized biological sample.
In some embodiments, the determining step described herein comprises sequencing all or part of (i) the sequence of the spatial barcode or its complement and (ii) the sequence of all or part of a nucleic acid from the biological sample.
In some embodiments, the sequencing is high throughput sequencing.
In some embodiments, the analyte is deregulated or differentially expressed in cancer cells, immune cells, cell signaling pathways, or neural cells.
Also disclosed herein are methods for identifying nucleic acid abundance and location in a biological sample, the method comprising: (a) Contacting a biological sample with a substrate comprising a plurality of attached capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that binds to a sequence present in a nucleic acid; (b) hybridizing the capture probe to the nucleic acid; (c) Extending the 3' end of the capture probe using the nucleic acid bound to the capture domain as a template to generate an extended capture probe; and (d) amplifying the extended capture probes to produce extended nucleic acids; in some embodiments, the extended nucleic acid comprises (i) a spatial barcode or complement thereof and (ii) all or part of the sequence of the nucleic acid or complement thereof; (e) releasing the extended nucleic acid from the extended capture probe; (f) Contacting a plurality of released nucleic acids with a plurality of decoy oligonucleotides, the released nucleic acids comprising the extended nucleic acids from step (e), in some embodiments the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise: (i) A capture domain that hybridizes to all or a portion of a nucleic acid or complement thereof and (ii) a molecular tag; (g) Capturing decoy oligonucleotides bound to the extended nucleic acids using a substrate comprising an agent that binds to a molecular tag; and (h) determining all or part of the sequence of (i) the spatial barcode or its complement and all or part of the sequence of (ii) the extended nucleic acid, and using the sequences determined in (i) and (ii) to identify the abundance and location of the nucleic acid in the biological sample.
Also disclosed herein are methods for identifying protein abundance and location in a biological sample, the method comprising: (a) Contacting a plurality of analyte capture agents with a biological sample, in some embodiments, the analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that binds to a protein and (ii) an oligonucleotide comprising an analyte binding moiety barcode and an analyte capture sequence; (b) Contacting a plurality of analyte capture agents with a substrate comprising a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, in some embodiments, the capture domain binds to an analyte capture sequence; (c) Extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; (d) Amplifying the extended capture probes to produce extended nucleic acids; in some embodiments, the extended nucleic acid comprises all or part of the sequence of (i) a spatial barcode or complement thereof and (ii) an oligonucleotide or complement thereof; (e) releasing the extended nucleic acid from the extended capture probe; (f) Contacting a plurality of released nucleic acids with a plurality of decoy oligonucleotides, the released nucleic acids comprising the extended nucleic acids from step (e), in some embodiments, the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise (i) a capture domain that hybridizes to all or a portion of the sequence of the oligonucleotide or its complement and (ii) a molecular tag; (g) Capturing decoy oligonucleotides bound to the extended nucleic acids using a substrate comprising an agent that binds to a molecular tag; and (h) determining all or part of the sequence of (i) the spatial barcode or its complement and all or part of the sequence of (ii) the extended nucleic acid, and using the sequences determined in (i) and (ii) to identify the abundance and location of the protein in the biological sample.
Also disclosed herein are compositions comprising a decoy oligonucleotide and an extended nucleic acid, in some embodiments, the decoy oligonucleotide is bound to the extended nucleic acid, in some embodiments, the extended nucleic acid comprises (i) a spatial barcode or complement thereof and (ii) an analyte, an analyte derivative, or a complement thereof, in whole or in part; in some embodiments, the decoy oligonucleotide comprises a molecular tag, in some embodiments, the molecular tag is selected from streptavidin, avidin, biotin, or a fluorophore.
In some embodiments, the decoy oligonucleotide binds to the extended nucleic acid through a capture domain that hybridizes to all or a portion of the sequence of the analyte, analyte derivative, or complement thereof.
In some embodiments, the compositions as described herein further comprise an agent that binds to the molecular tag.
In some embodiments, the agent that binds to the molecular tag comprises a protein (e.g., an antibody), a nucleic acid, or a small molecule.
In some embodiments, the molecular tag is biotin and the agent bound to the molecular tag is avidin or streptavidin.
In some embodiments, the compositions as described herein further comprise a substrate, in some embodiments, an agent that binds to the molecular tag is attached to the substrate. In some embodiments, the substrate is a bead, well, or slide.
Also provided herein are kits comprising: an array comprising a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain; a plurality of decoy oligonucleotides; and instructions for performing the methods described herein.
Also provided herein are kits comprising: a plurality of analyte capture agents, in some embodiments, the analyte capture agents of the plurality of analyte capture agents comprise an analyte binding moiety, an analyte binding moiety barcode, and an analyte capture sequence; an array comprising a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain; a plurality of decoy oligonucleotides; and instructions for performing the methods described herein.
In some embodiments, the kits described herein further comprise reagents and/or enzymes for performing the methods.
Also disclosed herein are methods for identifying the location of a nucleic acid in a biological sample, the method comprising: (a) Contacting a plurality of nucleic acids from a biological sample with a plurality of decoy oligonucleotides, in some embodiments, the plurality of nucleic acids comprises (i) a spatial barcode or complement thereof and (ii) a partial sequence of a nucleic acid or complement thereof from the biological sample; a decoy oligonucleotide of the plurality of decoy oligonucleotides comprises a capture domain that specifically binds to all or part of a sequence of a nucleic acid or complement thereof from a biological sample and a molecular tag; (b) Capturing decoy oligonucleotides that specifically bind to nucleic acids using a substrate comprising a reagent that specifically binds to a molecular tag; and (c) determining (i) all or part of the sequence of the spatial barcode or its complement and (ii) all or part of the sequence of the nucleic acid from the biological sample, and using the sequences determined in (i) and (ii) to identify the location of the nucleic acid in the biological sample.
In some embodiments, the capture domain of the decoy oligonucleotide binds specifically to all or part of the sequence of a nucleic acid from a biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to a 3' portion of the sequence of a nucleic acid or complement thereof from a biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to a 5' portion of the sequence of a nucleic acid or complement thereof from a biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to an intron in the sequence of a nucleic acid or complement thereof from the biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to an exon in the sequence of a nucleic acid or complement thereof from a biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to a 3' untranslated region of a nucleic acid or its complement from a biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to a 5' untranslated region of a nucleic acid or its complement from a biological sample.
In some embodiments, the nucleic acid from the biological sample is associated with a disease or disorder. In some embodiments, the nucleic acid from the biological sample comprises a mutation. In some embodiments, the nucleic acid from the biological sample comprises a Single Nucleotide Polymorphism (SNP). In some embodiments, the nucleic acid from the biological sample comprises a trinucleotide repeat.
In some embodiments, the capture domain of the decoy oligonucleotide comprises a total of about 10 nucleotides to about 300 nucleotides.
In some embodiments, the molecular tag includes a moiety (moiety). In some embodiments, the moiety is streptavidin, avidin, biotin, or a fluorophore. In some embodiments, the molecular tag comprises a small molecule. In some embodiments, the molecular tag comprises a nucleic acid. In some embodiments, the molecular tag comprises a carbohydrate. In some embodiments, the molecular tag is located 5' to the capture domain in the decoy oligonucleotide. In some embodiments, the molecular tag is located at the 3' position of the capture domain in the decoy oligonucleotide. In some embodiments, the agent that specifically binds to the molecular tag comprises a protein. In some embodiments, the protein is an antibody. In some embodiments, the agent that specifically binds to the molecular tag comprises a nucleic acid. In some embodiments, the agent that specifically binds to the molecular tag comprises a small molecule. In some embodiments, an agent that specifically binds to a molecular tag is attached to a substrate. In some embodiments, the substrate is a bead. In some embodiments, the substrate is a well. In some embodiments, the substrate is a slide.
In some embodiments, the nucleic acid is DNA. In some embodiments, the nucleic acid further comprises a functional sequence. In some embodiments, the functional sequence is a primer sequence or a complement thereof. In some embodiments, the nucleic acid further comprises a unique molecular sequence or complement thereof. In some embodiments, the nucleic acid further comprises other primer binding sequences or complements thereof.
In some embodiments, the biological sample is a tissue sample. In some embodiments, the tissue sample is a formalin fixed, paraffin embedded (FFPE) tissue sample or a frozen tissue sample. In some embodiments, the biological sample is pre-stained with a detectable label. In some embodiments, the biological sample is pre-stained. In some embodiments, the biological sample is pre-stained with hematoxylin and eosin (H & E). In some embodiments, the biological sample is pre-stained using immunofluorescence or immunohistochemistry. In some embodiments, the biological sample is a permeabilized biological sample that has been permeabilized with a permeabilizing agent selected from the group consisting of organic solvents, cross-linking agents, detergents, and enzymes, or a combination thereof. In some embodiments, the permeabilizing agent is a cross-linking agent. In some embodiments, the analyte is an RNA molecule. In some embodiments, the RNA molecule is an mRNA molecule.
In some embodiments, the determining in step (c) comprises sequencing all or part of the sequence of (i) the spatial barcode or its complement and (ii) the sequence of all or part of the nucleic acid from the biological sample. In some embodiments, the sequencing is high throughput sequencing. In some embodiments, sequencing comprises ligating an adapter to the nucleic acid.
In some embodiments, the method further comprises generating a plurality of nucleic acids comprising: (a) Contacting a biological sample with a substrate comprising a plurality of attached capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that specifically binds to a sequence present in an analyte; (b) Extending the 3' end of the capture probe using the analyte that specifically binds to the capture domain as a template to generate an extended capture probe; and (c) amplifying the extended capture probe to produce nucleic acid. In some embodiments, the amplification is isothermal.
In some embodiments, the nucleic acid produced is released from the extended capture probe.
In some embodiments, the nucleic acid is deregulated or differentially expressed in cancer cells. In some embodiments, the nucleic acid is deregulated or differentially expressed in immune cells. In some embodiments, the nucleic acid is deregulated in the cell signaling pathway. In some embodiments, the nucleic acid is deregulated or differentially expressed in the neural cells.
Also disclosed herein are methods for identifying the location of a nucleic acid in a biological sample, the method comprising: (a) Contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, in some embodiments, the nucleic acids of the plurality of nucleic acids comprise (i) a spatial barcode or complement thereof and (ii) a moiety that binds to a partial barcode or complement thereof; a decoy oligonucleotide of the plurality of decoy oligonucleotides comprises a capture domain that specifically binds to (i) all or part of a nucleic acid or complement thereof and/or (ii) to all or part of a partial barcode or complement thereof and a molecular tag; (b) Capturing a complex of decoy oligonucleotides that specifically bind to a nucleic acid or complement thereof or a complex of decoy oligonucleotides that specifically bind to an analyte binding moiety barcode or complement thereof using a substrate comprising an agent that specifically binds to a molecular tag; and (c) determining all or part of the sequence of (i) the nucleic acid or complement thereof and/or (ii) the analyte binding moiety barcode or complement thereof, and using the sequences determined in (i) and (ii) to identify the location of the nucleic acid in the biological sample.
In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to all or part of a nucleic acid sequence from a biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to all or part of the analyte binding moiety barcode.
In some embodiments, the nucleic acid from the biological sample is associated with a disease or disorder.
In some embodiments, the capture domain of the decoy oligonucleotide comprises a total of about 10 nucleotides to about 300 nucleotides.
In some embodiments, the molecular tag comprises a protein. In some embodiments, the protein is streptavidin, avidin, or biotin. In some embodiments, the molecular tag comprises a small molecule. In some embodiments, the molecular tag comprises a nucleic acid. In some embodiments, the molecular tag comprises a carbohydrate. In some embodiments, the molecular tag is located 5' to the domain in the decoy oligonucleotide. In some embodiments, the molecular tag is located 3' of the domain in the decoy oligonucleotide. In some embodiments, the agent that specifically binds to the molecular tag comprises a protein. In some embodiments, the protein is an antibody. In some embodiments, the agent that specifically binds to the molecular tag comprises a nucleic acid. In some embodiments, the agent that specifically binds to the molecular tag comprises a small molecule.
In some embodiments, an agent that specifically binds to a molecular tag is attached to a substrate. In some embodiments, the substrate is a bead. In some embodiments, the substrate is a well. In some embodiments, the substrate is a slide.
In some embodiments, the nucleic acid is DNA. In some embodiments, the nucleic acid further comprises a primer binding sequence or a complement thereof. In some embodiments, the nucleic acid further comprises a unique molecular sequence or complement thereof. In some embodiments, the nucleic acid further comprises other primer binding sequences or complements thereof.
In some embodiments, the biological sample is a tissue sample. In some embodiments, the tissue sample is a formalin fixed, paraffin embedded (FFPE) tissue sample or a frozen tissue sample. In some embodiments, the biological sample is pre-stained with a detectable label. In some embodiments, the biological sample is pre-stained. In some embodiments, the biological sample is pre-stained with hematoxylin and eosin (H & E). In some embodiments, the biological sample is pre-stained using immunofluorescence or immunohistochemistry. In some embodiments, the biological sample is a permeabilized biological sample that has been permeabilized with a permeabilizing agent selected from the group consisting of organic solvents, cross-linking agents, detergents, and enzymes, or a combination thereof. In some embodiments, the permeabilizing agent is selected from the group consisting of organic solvents, cross-linking agents, detergents, and enzymes, or combinations thereof. In some embodiments, the analyte is an RNA molecule. In some embodiments, the RNA molecule is an mRNA molecule.
In some embodiments, the determining in step (c) comprises sequencing all or part of the sequence of (i) the nucleic acid or complement thereof and (ii) the analyte binding moiety barcode. In some embodiments, the sequencing is high throughput sequencing. In some embodiments, sequencing comprises ligating an adapter to the nucleic acid.
In some embodiments, the method further comprises generating a plurality of nucleic acids comprising: (a) Contacting a plurality of capture agents with a biological sample disposed on a substrate, in some embodiments, the capture agents of the plurality of capture agents comprise (i) a binding moiety that specifically binds to nucleic acids in the biological sample, (ii) a binding moiety barcode, and (iii) a capture sequence; the substrate comprises a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, in some embodiments, the capture domain specifically binds to an analyte capture sequence; and (b) extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and (c) amplifying the extended capture probe to produce nucleic acid. In some embodiments, the amplification is isothermal.
In some embodiments, the nucleic acid produced is released from the extended capture probe. In some embodiments, the nucleic acid is a nucleic acid that is deregulated or differentially expressed in cancer cells. In some embodiments, the nucleic acid is a nucleic acid that is deregulated or differentially expressed in immune cells. In some embodiments, the nucleic acid is a nucleic acid that is deregulated in a cell signaling pathway. In some embodiments, the nucleic acid is a nucleic acid that is deregulated or differentially expressed in neural cells.
Also disclosed herein are methods for enriching nucleic acids in a biological sample, the method comprising: (a) Contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, in some embodiments, the nucleic acids of the plurality of nucleic acids comprise (i) a spatial barcode or complement thereof and (ii) a moiety that binds to a partial barcode or complement thereof; a decoy oligonucleotide of the plurality of decoy oligonucleotides comprises a capture domain that specifically binds to (i) all or a portion of a nucleic acid or a complement thereof and/or (ii) to all or a portion of a partial barcode or a complement thereof and a molecular tag; (b) Capturing a complex of decoy oligonucleotides that specifically bind to a nucleic acid or complement thereof or a complex of decoy oligonucleotides that specifically bind to an analyte binding moiety barcode or complement thereof using a substrate comprising an agent that specifically binds to a molecular tag; and (c) isolating the complex of decoy oligonucleotides that specifically bind to the nucleic acid or its complement or the complex of decoy oligonucleotides that specifically bind to the analyte binding moiety barcode or its complement, thereby enriching the nucleic acid in the biological sample.
In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to all or part of a nucleic acid sequence from a biological sample. In some embodiments, the capture domain of the decoy oligonucleotide specifically binds to all or part of the analyte binding moiety barcode. In some embodiments, the nucleic acid from the biological sample is associated with a disease or disorder. In some embodiments, the capture domain of the decoy oligonucleotide comprises a total of about 10 nucleotides to about 300 nucleotides.
In some embodiments, the molecular tag comprises a protein. In some embodiments, the protein is streptavidin, avidin, or biotin. In some embodiments, the molecular tag comprises a small molecule. In some embodiments, the molecular tag comprises a nucleic acid. In some embodiments, the molecular tag comprises a carbohydrate. In some embodiments, the molecular tag is located 5' to the domain in the decoy oligonucleotide. In some embodiments, the molecular tag is located 3' of the domain in the decoy oligonucleotide. In some embodiments, the agent that specifically binds to the molecular tag comprises a protein. In some embodiments, the protein is an antibody. In some embodiments, the agent that specifically binds to the molecular tag comprises a nucleic acid. In some embodiments, the agent that specifically binds to the molecular tag comprises a small molecule. In some embodiments, an agent that specifically binds to a molecular tag is attached to a substrate. In some embodiments, the substrate is a bead. In some embodiments, the substrate is a well. In some embodiments, the substrate is a slide.
In some embodiments, the nucleic acid is DNA. In some embodiments, the nucleic acid further comprises a primer binding sequence or a complement thereof. In some embodiments, the nucleic acid further comprises a unique molecular sequence or complement thereof. In some embodiments, the nucleic acid further comprises other primer binding sequences or complements thereof.
In some embodiments, the biological sample is a tissue sample. In some embodiments, the tissue sample is a formalin fixed, paraffin embedded (FFPE) tissue sample or a frozen tissue sample. In some embodiments, the biological sample is pre-stained with a detectable label. In some embodiments, the biological sample is pre-stained. In some embodiments, the biological sample is pre-stained with hematoxylin and eosin (H & E). In some embodiments, the biological sample is pre-stained using immunofluorescence or immunohistochemistry.
In some embodiments, the biological sample is a permeabilized biological sample that has been permeabilized with a permeabilizing agent selected from the group consisting of organic solvents, cross-linking agents, detergents, and enzymes, or a combination thereof. In some embodiments, the permeabilizing agent is selected from the group consisting of organic solvents, cross-linking agents, detergents, and enzymes, or combinations thereof.
In some embodiments, the analyte is an RNA molecule. In some embodiments, the RNA molecule is an mRNA molecule.
In some embodiments, the method further comprises generating a plurality of nucleic acids comprising: (a) Contacting a plurality of capture agents with a biological sample disposed on a substrate, in some embodiments, the capture agents of the plurality of capture agents comprise (i) a binding moiety that specifically binds to nucleic acids in the biological sample, (ii) a binding moiety barcode, and (iii) a capture sequence; the substrate comprises a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, in some embodiments, the capture domain specifically binds to an analyte capture sequence; and (b) extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and (c) amplifying the extended capture probe to produce nucleic acid. In some embodiments, the amplification is isothermal.
In some embodiments, the nucleic acid produced is released from the extended capture probe.
In some embodiments, the nucleic acid is a nucleic acid that is deregulated or differentially expressed in cancer cells. In some embodiments, the nucleic acid is a nucleic acid that is deregulated or differentially expressed in immune cells. In some embodiments, the nucleic acid is a nucleic acid that is deregulated in a cell signaling pathway. In some embodiments, the nucleic acid is a nucleic acid that is deregulated or differentially expressed in neural cells.
Also disclosed herein are methods for identifying the location of a nucleic acid in a biological sample, the method comprising: (a) Contacting a plurality of capture agents with a biological sample disposed on a substrate, in some embodiments, the capture agents of the plurality of capture agents comprise (i) a binding moiety that specifically binds to nucleic acids in the biological sample, (ii) a binding moiety barcode, and (iii) a capture sequence; the substrate comprises a plurality of capture probes, in some embodiments, the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, in some embodiments, the capture domain specifically binds to an analyte capture sequence; and (b) extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and (c) amplifying the extended capture probes to produce nucleic acids; in some embodiments, the nucleic acid comprises (i) a spatial barcode or complement thereof and (ii) a partial analyte binding moiety barcode or complement thereof; (d) releasing the resulting nucleic acid from the extended capture probe; (e) Contacting the plurality of released nucleic acids from step (d) with a plurality of decoy oligonucleotides, in some embodiments, the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise a capture domain that specifically binds to all or part of (i) the nucleic acid or complement thereof and/or (ii) the analyte binding moiety barcode or complement thereof; (f) Capturing a complex of decoy oligonucleotides that specifically bind to a nucleic acid or complement thereof or a complex of decoy oligonucleotides that specifically bind to an analyte binding moiety barcode or complement thereof using a substrate comprising an agent that specifically binds to a molecular tag; and (g) determining all or part of the sequence of (i) the nucleic acid or complement thereof and/or (ii) the analyte binding moiety barcode or complement thereof, and using the sequences determined in (i) and (ii) to identify the location of the nucleic acid in the biological sample.
Also disclosed herein are compositions comprising decoy oligonucleotides bound to nucleic acids, in some embodiments, the nucleic acids comprise (i) a spatial barcode or complement thereof and (ii) a partial analyte binding moiety barcode or complement thereof.
In some embodiments, the decoy oligonucleotide binds to the nucleic acid through a capture domain that specifically binds to (i) all or a portion of the nucleic acid and/or (ii) all or a portion of the analyte binding moiety barcode or its complement.
In some embodiments, the composition further comprises a molecular tag. In some embodiments, the composition further comprises an agent that specifically binds to the molecular tag. In some embodiments, the molecular tag comprises a protein. In some embodiments, the protein is streptavidin, avidin, or biotin. In some embodiments, the molecular tag comprises a small molecule. In some embodiments, the molecular tag comprises a nucleic acid. In some embodiments, the molecular tag comprises a carbohydrate. In some embodiments, the molecular tag is located 5' to the domain in the decoy oligonucleotide. In some embodiments, the molecular tag is located 3' of the domain in the decoy oligonucleotide.
In some embodiments, the agent comprises streptavidin, avidin, or biotin. In some embodiments, the agent that specifically binds to the molecular tag comprises a protein. In some embodiments, the protein is an antibody. In some embodiments, the agent that specifically binds to the molecular tag comprises a nucleic acid. In some embodiments, the agent that specifically binds to the molecular tag comprises a small molecule.
In some embodiments, the composition further comprises a substrate. In some embodiments, the substrate is a bead. In some embodiments, the substrate is a well. In some embodiments, the substrate is a slide.
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent application, or information item was specifically and individually indicated to be incorporated by reference. To the extent that publications, patents, patent applications, and information items incorporated by reference contradict the disclosure contained in this specification, it is intended that this specification take precedence over any conflicting material.
Where a range is recited, it is understood that the description includes disclosure of all possible sub-ranges within the range, as well as disclosure of particular values within the range, whether or not the particular values or sub-ranges are explicitly recited.
The term "each" when referring to a group of items is intended to identify an individual item in the collection, but does not necessarily refer to each item in the collection unless specifically stated otherwise or unless the context of the usage clearly indicates otherwise.
As used herein, the term "analyte derivative" refers to a molecule (e.g., a nucleic acid or protein molecule) derived from an analyte (e.g., a nucleic acid) or conveying information (e.g., homology information) with respect to the analyte (e.g., a protein). In some embodiments, the analyte derivative includes all or part of an analyte (e.g., a nucleic acid) described herein. In some embodiments, the analyte derivative includes all or part of the analyte capture agents described herein (e.g., all or part of the analyte capture moiety, analyte binding moiety barcode, and/or analyte capture sequence).
Various embodiments of features of the present invention are described herein. However, it should be understood that these embodiments are provided by way of example only and that many changes, modifications, and substitutions may be made by one of ordinary skill in the art without departing from the scope of the present disclosure. It should also be understood that various alternatives to the specific embodiments described herein are also within the scope of the present disclosure.
Drawings
The following drawings illustrate certain embodiments of the features and advantages of the present invention. These embodiments are not intended to limit the scope of the appended claims in any way. Like reference symbols in the drawings indicate like elements.
Fig. 1 is a schematic diagram illustrating an example of a barcoded capture probe as described herein.
FIG. 2 is a schematic diagram illustrating a cleavable capture probe, wherein the cleaved capture probe can enter a non-permeabilized cell and bind to a target analyte within a sample.
FIG. 3 is a schematic diagram of an exemplary multiple spatial barcoded feature.
FIG. 4 is a schematic diagram of an exemplary analyte capture agent.
FIG. 5 is a schematic diagram depicting an exemplary interaction between a feature-immobilized capture probe 524 and an analyte capture agent 526.
Fig. 6A-6C are schematic diagrams illustrating how streptavidin cell tags are utilized in an array-based system to generate spatially barcoded cells or cell contents.
FIG. 7 is a schematic of a workflow showing targeted spatial gene expression.
FIGS. 8A-8B show mid-target, enrichment and complexity values, wherein different spatial libraries were used. The spatial library comprises samples from heart, breast cancer or lymph. Either hl1k_200.1 group or Immune group was used.
FIG. 9 is an exemplary workflow showing targeted spatial gene expression.
FIG. 10 is an exemplary workflow for hybridization and capture-based enrichment of targeted nucleic acid sequences using a spatial gene expression library.
FIG. 11 is a schematic showing correlation between UMI counts in mouse brain tissue space versus Visium control whole transcriptome analysis (X-axis) using 65 gene targeted enriched nerve groups (Y-axis).
Fig. 12A-12B are exemplary pictures showing a) OLIG2 gene expression from a spatial array of the visual full transcriptome (control) and B) OLIG2 gene expression from a 65 gene targeted enriched nerve group. The picture is associated with the data of fig. 11.
Fig. 13A-13B are exemplary graphs showing UMI count correlation between a) map readings of whole transcriptome Visium versus 4 Visium-targeted genomes and B) Visium whole transcriptome control (X axis) versus Visium-targeted pan-oncogene genome (Y axis).
Fig. 14A-14B are exemplary graphs showing targeting metrics for a spatial array of 12 human tissue types using 4 targeted gene-enriched sets compared to a) a gene portion recovered for a matched Visium full transcriptome and B) a UMI R square (R-squared) consensus Visium full transcriptome spatial array for a matched Visium full transcriptome.
FIGS. 15A-15C show human cortex clusters of 6 gene clusters of A) Visium holoton and B) neuroscience targeted gene-enriched group. Fig. 15C shows the UMI correlation between fig. 15A and 15B.
FIGS. 16A-16D show spatial gene expression in human breast cancer tissue sections from the pan-oncogene genome; a) pathologist annotation (control), B) full transcriptome Visium data, C) 196 breast oncogenes from the pan-cancer enriched group, and D) ERBB2 gene expression from the pan-cancer enriched group.
17A-17D provide details regarding genetic targets included in a cancer group according to embodiments of the present disclosure.
18A-18C provide details regarding the genetic targets contained in the immunological probe sets according to embodiments of the present disclosure.
19A-19D provide details regarding the genetic targets contained in a pathway set according to embodiments of the disclosure.
Figures 20A-20D provide details regarding genetic targets contained in a neural group according to embodiments of the present disclosure.
Detailed Description
I. Introduction to the invention
The spatial analysis methods and compositions described herein can provide expression data for a large number of analytes and/or a wide variety of analytes within a biological sample with high spatial resolution while preserving natural spatial background information. Spatial analysis methods and compositions can include, for example, the use of capture probes that include a spatial barcode (e.g., a nucleic acid sequence that provides information about the location or orientation of an analyte within a cell or tissue sample (e.g., a mammalian cell or mammalian tissue sample), and a capture domain that is capable of binding to an analyte (e.g., a protein and/or nucleic acid) that is produced by the cell and/or is present therein. The spatial analysis methods and compositions may also include the use of capture probes with capture domains that capture intermediates (intermediate agent) for the indirect detection of analytes. For example, an intermediate may include a nucleic acid sequence (e.g., a barcode) associated with the intermediate. Thus, detection of the intermediate is indicative of the analyte in the cell or tissue sample.
Non-limiting aspects of methods and compositions of spatial analysis are described in U.S. patent nos. 10,774,374, 10,724,078, 10,480,022, 10,059,990, 10,041,949, 10,002,316,9,879,313,9,783,841,9,727,810,9,593,365,8,951,726,8,604,182,7,709,198, U.S. patent application publication nos. 2020/239946, 2020/080136, 2020/0277663, 2020/024341, 2019/330617, 2019/264268, 2020/256867, 2020/224244, 2019/194709, 2019/161796, 2019/085383, 2019/055594, 2018/216161, 2018/051322, 2018/0245142, 2017/241911, 2017/089811, 2017/067096, 2017/029875, 2017/0016053, 2016/108458, 2015/000854, 2013/171621, wo 2018/091676, wo 2020/176788, rodriliques et al, science 363 (6434): 1463-1467, 2019; lee et al, nat. Protoc.10 (3): 442-458, 2015; trejo et al, PLoS ONE 14 (2): e0212031 2019; chen et al, science 348 (6233): aaa6090, 2015; gao et al, BMC biol 15:50 2017; and Gupta et al, nature biotechnology.36: 1197-1202, 2018; visium spatial gene expression kit user guide (Visium Spatial Gene Expression Reagent Kits User Guide) (e.g., rev C, month 6 of date 2020), and/or Visium spatial tissue optimization kit user guide (Visium Spatial Tissue Optimization Reagent Kits User Guide) (e.g., rev C, month 7 of date 2020), both available from 10x Genomics Inc. (10 x Genomics) support document sites, which can be used in any combination. Other non-limiting aspects of the spatial analysis methods and compositions are described herein.
Some general terms that may be used in the present disclosure may be found in part (I) (b) of WO2020/176788 and/or U.S. patent application publication No. 2020/0277663. Typically, a "barcode" is a label or identifier that conveys or is capable of conveying information (e.g., information about analytes, beads, and/or capture probes in a sample). The barcode may be part of the analyte or may be independent of the analyte. The barcode may be attached to the analyte. Certain bar codes may be unique relative to other bar codes. For purposes of the present invention, an "analyte" may include any biological substance, structure, moiety or component to be analyzed. The term "target" may similarly refer to an analyte of interest.
Analytes can be broadly divided into two categories: nucleic acid analytes and non-nucleic acid analytes. Examples of non-nucleic acid analytes include, but are not limited to, lipids, carbohydrates, peptides, proteins, glycoproteins (N-linked or O-linked), lipoproteins, phosphoproteins, specific phosphorylated or acetylated variants of proteins, amidated variants of proteins, hydroxylated variants of proteins, methylated variants of proteins, ubiquitinated variants of proteins, sulfated variants of proteins, viral proteins (e.g., viral capsids, viral envelopes, viral shells, viral appendages, viral glycoproteins, viral spikes, etc.), extracellular and intracellular proteins, antibodies, and antigen binding fragments. In some embodiments, the analyte can be localized to a subcellular location, including, for example, organelles such as mitochondria, golgi apparatus, endoplasmic reticulum, chloroplast, endocytic vesicle, efflux vesicle, vacuole, lysosome, and the like. In some embodiments, the analyte may be a peptide or protein, including but not limited to antibodies and enzymes. Other examples of analytes can be found in WO2020/176788 part (I) (c) and/or U.S. patent application publication No. 2020/0277663. In some embodiments, the analyte can be detected indirectly, for example, by detecting an intermediate, such as a ligated probe (e.g., ligation product) or an analyte capture agent (e.g., an oligonucleotide-coupled antibody), such as those described herein.
A "biological sample" is typically obtained from a subject for analysis using any of a variety of techniques, including but not limited to biopsy, surgery, and Laser Capture Microscopy (LCM), and typically includes cells and/or other biological material from the subject. In some embodiments, the biological sample may be a tissue slice. In some embodiments, the biological sample may be a fixed and/or stained biological sample (e.g., a fixed and/or stained tissue section). Non-limiting examples of staining agents include tissue staining agents (e.g., hematoxylin and/or eosin) and immunostaining agents (e.g., fluorescent staining agents). In some embodiments, a biological sample (e.g., a fixed and/or stained biological sample) can be imaged. Biological samples are also described in WO2020/176788 part (I) (d) and/or U.S. patent application publication No. 2020/0277663.
In some embodiments, the biological sample is permeabilized with one or more permeabilization reagents. For example, permeabilization of a biological sample can facilitate capture of an analyte. Exemplary permeabilizing agents and conditions are described in WO2020/176788 part (I) (d) (ii) (13) or exemplary embodiment part and/or U.S. patent application publication No. 2020/0277663.
Array-based spatial analysis methods involve transferring one or more analytes from a biological sample to an array of features on a substrate, where each feature is associated with a unique spatial location on the array. Subsequent analysis of the transferred analyte includes determining identity of the analyte and spatial location of the analyte in the biological sample. The spatial location of the analyte in the biological sample is determined based on the characteristics of the array to which the analyte binds (e.g., directly or indirectly) and the relative spatial location of the characteristics on the array.
"capture probe" refers to any molecule capable of capturing (directly or indirectly) and/or labeling an analyte (e.g., an analyte of interest) in a biological sample. In some embodiments, the capture probe is a nucleic acid or a polypeptide. In some embodiments, the capture probe includes a barcode (e.g., a spatial barcode and/or a Unique Molecular Identifier (UMI)) and a capture domain. In some embodiments, the capture probes can include cleavage domains and/or functional domains (e.g., primer binding sites, e.g., for Next Generation Sequencing (NGS)).
Fig. 1 is a schematic diagram illustrating an exemplary capture probe as described herein. As shown, capture probes 102 are optionally coupled to features 103 through cleavage domains 101, e.g., disulfide bonds. The capture probe may include a functional sequence 104 useful for subsequent processing. Functional sequence 104 may include all or part of a sequencer-specific flow cell attachment sequence (e.g., a P5 or P7 sequence), all or part of a sequencing primer sequence (e.g., an R1 primer binding site, an R2 primer binding site), or a combination thereof. The acquisition probe may also include a spatial barcode 10. The capture probe may also include a Unique Molecular Identifier (UMI) sequence 106. Although fig. 1 shows spatial barcode 105 located upstream (5 ') of UMI sequence 106, it should be understood that capture probes wherein UMI sequence 106 is located upstream (5') of spatial barcode 105 are also suitable for use in any of the methods described herein. The capture probes may also include a capture domain 107 to facilitate capture of target analytes. In some embodiments, the capture probes comprise one or more additional functional sequences, which may be located, for example, between the spatial barcode 105 and the UMI sequence 106, between the UMI sequence 106 and the capture domain 107, or after the capture domain 107. The capture domain may have a sequence complementary to the nucleic acid analyte sequence. The capture domain may have a sequence complementary to the ligated probes described herein. The capture domain may have a sequence complementary to a capture handle (handle) sequence present in the analyte capture agent. The capture domain may have a sequence complementary to a splint (splint) oligonucleotide. Such splint oligonucleotides may have, in addition to the sequence complementary to the capture domain of the capture probe, the sequence of the nucleic acid analyte, the sequence complementary to a portion of the ligated probe described herein, and/or the capture handle sequence described herein.
The functional sequence may generally be selected to be compatible with any of a variety of different sequencing systems, such as ion torrent protons (Ion Torrent Proton) or PGMs, illumina sequencers, pacbrio, oxford nanopores (Oxford nanopores), and the like, and the requirements thereof. In some embodiments, the functional sequences may be selected to be compatible with non-commercial sequencing systems. Examples of such sequencing systems and techniques that may use suitable functional sequences include, but are not limited to, ion-torrent proton or PGM sequencing, illumina sequencing, pacbrio SMRT sequencing, and oxford nanopore sequencing. Furthermore, in some embodiments, the functional sequences may be selected to be compatible with other sequencing systems (including non-commercial sequencing systems).
In some embodiments, the spatial barcode 105 and the functional sequence 104 are common to all probes attached to a given feature. In some embodiments, the UMI sequence 106 of the capture probe attached to a given feature is different from the UMI sequence of a different capture probe attached to a given feature.
FIG. 2 is a schematic diagram illustrating a cleavable capture probe, wherein the cleaved capture probe can enter a non-permeabilized cell and bind to an analyte within a sample. The capture probe 201 comprises a cleavage domain 202, a cell penetrating peptide 203, a reporter 204 and a disulfide (-S-). 205 represents all other parts of the capture probe, e.g. the spatial barcode and the capture domain.
FIG. 3 is a schematic diagram of an exemplary multiple spatial barcoded feature. In fig. 3, feature 301 may be coupled to a spatially barcoded capture probe, where the spatially barcoded probe of a particular feature may have the same spatial barcode, but have different capture domains designed to associate the spatial barcode of the feature with multiple target analytes. For example, the features may be coupled to four different types of spatially barcoded capture probes, each type of spatially barcoded capture probe having a spatial barcode 302. One type of capture probe associated with this feature includes a combination of a spatial barcode 303 and a poly (T) capture domain 403, which is designed to capture an mRNA target analyte. The second type of capture probes associated with this feature include a combination of spatial barcodes 304 and random N-mer capture domains 804 for gDNA analysis. A third type of capture probe associated with this feature comprises a spatial barcode 302 in combination with a capture domain complementary to the capture handle sequence of the analyte capture agent of interest 305. A fourth type of capture probe associated with this feature includes a spatial barcode 306 in combination with a capture probe that can specifically bind to a nucleic acid molecule 806 that can function in a CRISPR assay (e.g., CRISPR/Cas 9). Although only four different capture probe barcoded constructs are shown in fig. 3, the capture probe barcoded constructs can be tailored for analysis of any given analyte associated with a nucleic acid and can bind to such constructs. For example, the protocol shown in fig. 3 may also be used for simultaneous analysis of other analytes disclosed herein, including but not limited to: (a) mRNA, lineage-tracking constructs, cell surface or intracellular proteins and metabolites, and gDNA; (b) mRNA, available chromatin (e.g., ATAC-seq, dnase-seq, and/or mnazyme-seq) cell surface or intracellular proteins and metabolites, and perturbation agents (e.g., CRISPR-crRNA/sgRNA, TALEN, zinc finger nucleases, and/or antisense oligonucleotides as described herein); (c) mRNA, cell surface or intracellular proteins and/or metabolites, barcoded markers (e.g., MHC multimers described herein) and V (D) J sequences of immune cell receptors (e.g., T cell receptors). In some embodiments, the perturbation agent may be a small molecule, an antibody, a drug, an aptamer, a miRNA, a physical environment (e.g., a temperature change), or any other known perturbation agent. See, for example, WO2020/176788, section (II) (b) (e.g., sections (i) - (vi)) and/or U.S. patent application publication No. 2020/0277663. The generation of capture probes may be achieved by any suitable method, including those described in section (II) (d) (II) of WO2020/176788 and/or U.S. patent application publication No. 2020/0277663.
In some embodiments, any suitable multiplexing technique (e.g., as described in section (IV) of WO 2020/176788 and/or U.S. patent application publication No. 2020/0277663) may be employed to detect (e.g., simultaneously or sequentially detect) more than one analyte type (e.g., nucleic acid and protein) from a biological sample.
In some embodiments, detection of one or more analytes (e.g., protein analytes) may be performed using one or more analyte capture agents. As used herein, an "analyte capture agent" refers to a substance that interacts with an analyte (e.g., an analyte in a biological sample) and with a capture probe (e.g., a capture probe attached to a substrate or feature) to identify the analyte. In some embodiments, the analyte capture agent comprises: (i) An analyte binding moiety (e.g., that binds to an analyte), such as an antibody or antigen binding fragment thereof; (ii) an analyte binding moiety barcode; and (iii) a capture handle sequence (e.g., an analyte capture sequence). As used herein, the term "analyte binding moiety barcode" refers to a barcode associated with or otherwise identifying an analyte binding moiety. As used herein, the term "analyte capture sequence" or "capture handle sequence" refers to a region or portion that is configured to hybridize to, bind to, couple to, or otherwise interact with a capture domain of a capture probe. In some embodiments, the capture handle sequence is complementary to the capture domain of the capture probe. In some cases, the analyte binding moiety bar code (or portion thereof) may be capable of being removed (e.g., cleaved) from the analyte capture agent.
FIG. 4 is a schematic diagram of an exemplary analyte capture agent 402 comprised of an analyte binding moiety 404 and an analyte binding moiety barcode 408. Exemplary analyte binding moieties 404 are molecules capable of binding to analyte 406, and a reactive species capture agent is capable of interacting with a spatially barcoded capture probe. The analyte binding moiety can bind to analyte 406 with high affinity and/or high specificity. The analyte capture agent can include an analyte binding moiety barcode domain 408, a nucleotide sequence (e.g., an oligonucleotide) that hybridizes to at least a portion or all of the capture domain of the capture probe. Reactive species binding moiety barcode processing and use 408 may comprise an analyte binding moiety barcode and capture handle sequence as described herein. Analyte binding moiety 404 may include a polypeptide and/or an aptamer. Analyte binding moiety 404 can include an antibody or antibody fragment (e.g., an antigen binding fragment).
FIG. 5 is a schematic diagram depicting an exemplary interaction between a feature-immobilized capture probe 524 and an analyte capture agent 526. The feature-immobilized capture probe 524 may include a spatial barcode 508 and functional sequences 506 and UMI 510, as described elsewhere herein. The capture probes may also include capture domains 512 capable of binding to analyte capture agents 526. The analyte capture agent 526 may include a functional sequence 518, an analyte binding moiety barcode 516, and a capture handle sequence 514 capable of binding to the capture domain 512 of the capture probe 524. The analyte capture agent can also include a linker 520 that couples the capture agent barcode domain 516 to the analyte binding moiety 522.
Fig. 6A, 6B and 6C are schematic diagrams illustrating how streptavidin cell tags are utilized in an array-based system to generate spatially barcoded cells or cell contents. For example, as shown in fig. 6A, the peptide-bound Major Histocompatibility Complex (MHC) may be associated with biotin (β2m) alone and bound to a streptavidin moiety such that the streptavidin moiety comprises multiple pMHC moieties. Each of these moieties can bind to the TCR such that streptavidin binds to the target T cell through multiple MCH/TCR binding interactions. The multiple interactions act synergistically to greatly increase binding affinity. This improved affinity may improve the labeling of T cells and may also reduce the likelihood of dissociation of the label from the T cell surface. As shown in fig. 6B, the capture agent barcode domain 601 may be modified by streptavidin 602 and contacted with multiple molecules of biotinylated MHC603 such that the biotinylated MHC603 molecules are coupled to the streptavidin-coupled capture agent barcode domain 601. The result is a barcoded MHC multimeric complex 605. As shown in fig. 6B, the capture agent barcode domain sequence 601 may recognize MHC as its associated tag, and also include optional functional sequences, such as sequences for hybridization with other oligonucleotides. In certain instances, the capture probe 606 may be first associated with and released from a feature (e.g., a gel bead), in other embodiments, the capture probe 606 may hybridize to the capture agent barcode domain 605 of the MHC-oligonucleotide complex 601, the hybridized oligonucleotides (spacer C C and spacer rGrGrG) may then be extended in a primer extension reaction such that a construct is generated comprising sequences corresponding to each of the two spatial barcode sequences (spatial barcode associated with the capture probe, the barcode associated with the MHC-oligonucleotide complex), in certain instances, the capture probe 606 may be first associated with and released from a feature (e.g., a gel bead), in other embodiments, the capture probe 606 may be hybridized to the capture agent barcode domain 605 of the MHC-oligonucleotide complex 601, the spacer C C and spacer rGrG) may then be extended in a primer extension reaction such that a construct comprising sequences corresponding to each of the two spatial barcode sequences (spatial barcode associated with the capture probe), in certain instances, one or both of the corresponding sequences may be the capture probe 606 may be attached to the capture probe and optionally be cleaved together at the capture domain, such as the capture probe 606 may be further processed and/or the capture domain may be further processed, and sequences derived from the spatial barcode sequence on the capture agent barcode domain 601 can be used to recognize specific peptide MHC complexes 604 bound on the cell surface (e.g., when using MHC peptide libraries for screening immune cells or immune cell populations).
Additional descriptions of analyte capture agents can be found in WO2020/176788 part (II) (b) (ix) and/or U.S. patent application publication No. 2020/0277663 part (II) (b) (viii).
There are at least two methods of associating a spatial barcode with one or more adjacent cells, such that the spatial barcode identifies the one or more cells and/or the content of the one or more cells as being associated with a particular spatial location. One approach is to facilitate removal of the analyte or analyte surrogate (proxy) (e.g., an intermediate) from the cell and toward a spatial barcoded array (e.g., including a spatial barcoded capture probe). Another approach is to cleave spatially barcoded capture probes from the array and facilitate the spatially barcoded capture probes toward and/or into or onto the biological sample.
In some cases, the capture probes may be used to prime, replicate and optionally generate a barcoded extension product from a template (e.g., a DNA or RNA template, such as an analyte or intermediate (e.g., a conjugated probe (e.g., ligation product) or analyte capture agent) or a portion thereof) or derivative thereof (see, e.g., the (II) (b) (vii) portion of WO2020/176788 and/or U.S. patent application publication No. 2020/0277663 for extended capture probes). In some cases, the capture probes can be used to form bound probes (e.g., ligation products) with templates (e.g., DNA or RNA templates, such as analytes or intermediates or portions thereof), thereby producing ligation products that serve as template substitutes.
As used herein, an "extended capture probe" refers to a capture probe having additional nucleotides added to the end (e.g., the 3 'or 5' end) of the capture probe to extend the total length of the capture probe. For example, "extended 3 'end" means that additional nucleotides are added to the most 3' nucleotide of the capture probe to extend the length of the capture probe, e.g., by polymerization reactions for extended nucleic acid molecules, including templated polymerization catalyzed by a polymerase (e.g., DNA polymerase or reverse transcriptase). In some embodiments, the extended capture probe comprises adding to the 3' end of the capture probe a nucleic acid sequence that is complementary to a nucleic acid sequence of an analyte or intermediate that specifically binds to the capture domain of the capture probe. In some embodiments, the capture probe uses reverse transcription extension. In some embodiments, the capture probes are extended using one or more DNA polymerases. The extended capture probes include the sequence of the capture probes and the spatial barcode sequences of the capture probes.
In some embodiments, the extended capture probes are amplified (e.g., in bulk solution or on an array) to produce an amount sufficient for downstream analysis (e.g., by DNA sequencing). In some embodiments, the extended capture probes (e.g., DNA molecules) serve as templates for an amplification reaction (e.g., polymerase chain reaction).
Other variations of the spatial analysis method, including in some embodiments an imaging step, are described in WO2020/176788, part (II) (a) and/or U.S. patent application publication No. 2020/0277663. Analysis of captured analytes (and/or intermediates or portions thereof), for example, includes sample removal, extension of the capture probes, sequencing (e.g., sequencing of cleaved extended capture probes and/or cDNA molecules complementary to extended capture probes), sequencing on an array (e.g., using, for example, in situ hybridization or in situ ligation methods), time domain analysis and/or proximity capture, as described in WO2020/176788, section (II) (g) and/or U.S. patent application publication No. 2020/0277663. Some quality control measures are also described in WO2020/176788 part (II) (h) and/or U.S. patent application publication No. 2020/0277663.
The spatial information may provide information of biological and/or medical importance. For example, the methods and compositions described herein may allow for: identifying one or more biomarkers of a disease or disorder (e.g., diagnosis, prognosis, and/or for determining treatment efficacy); determining candidate drug targets for treating a disease or disorder; identifying (e.g., diagnosing) a subject as having a disease or disorder; identifying a stage and/or prognosis of a disease or disorder in a subject; identifying the subject as having an increased likelihood of developing a disease or disorder; monitoring the progress of a disease or disorder in a subject; determining the efficacy of treating a disease or disorder in a subject; determining a patient subpopulation for which treatment is effective against the disease or disorder; modification of treatment of a subject suffering from a disease or disorder; selecting a subject for participation in a clinical trial; and/or selecting a treatment for a subject suffering from a disease or disorder.
The spatial information may provide information of biological importance. For example, the methods and compositions described herein may allow for: identifying transcriptome and/or proteome expression profiles (e.g., in healthy and/or diseased tissue); identifying multiple analyte types at close range (e.g., nearest neighbor analysis); determining genes and/or proteins up-regulated and/or down-regulated in diseased tissue; characterization of tumor microenvironment; characterization of tumor immune response; characterization of cell types and their co-localization in tissues; identification of genetic variation within a tissue (e.g., based on gene and/or protein expression profiles associated with specific disease or disorder biomarkers).
Typically, for spatial array-based methods, the substrate serves to support the attachment of capture probes directly or indirectly to the array features. A "feature" is an entity that serves as a support or repository for various molecular entities used in spatial analysis. In some embodiments, some or all of the features in the array are functionalized for analyte capture. Exemplary substrates are described in WO2020/176788, section (II) (c) and/or U.S. patent application publication No. 2020/0277663. Exemplary features and geometrical properties of the arrays can be found in sections (II) (d) (i), (II) (d) (iii) and (II) (d) (iv) of WO2020/176788 and/or in U.S. patent application publication No. 2020/0277663.
Typically, the analyte and/or intermediate (or portion thereof) may be captured when the biological sample is contacted with a substrate comprising capture probes (e.g., a substrate having capture probes embedded, spotted, printed, fabricated on the substrate, or a substrate having features (e.g., beads, wells) comprising capture probes). As used herein, contacting a biological sample with a substrate refers to any contact (e.g., direct or indirect) such that the capture probes can interact (e.g., covalently or non-covalently bind (e.g., hybridize)) with an analyte from the biological sample. The capturing may be effected actively (e.g., using electrophoresis) or passively (e.g., using diffusion). Analyte capture is further described in WO2020/176788 part (II) (e) and/or U.S. patent application publication No. 2020/0277663.
In some cases, spatial analysis may be performed by attaching and/or introducing molecules (e.g., peptides, lipids, or nucleic acid molecules) having a barcode (e.g., a spatial barcode) to a biological sample (e.g., to cells in a biological sample). In some embodiments, a plurality of molecules (e.g., a plurality of nucleic acid molecules) having a plurality of barcodes (e.g., a plurality of spatial barcodes) are introduced into a biological sample (e.g., a plurality of cells in a biological sample) for spatial analysis. In some embodiments, after attaching and/or introducing the molecule with the barcode to the biological sample, the biological sample may be physically separated (e.g., dissociated) into single cells or cell populations for analysis. Some such spatial analysis methods are described in WO2020/176788 part (III) and/or U.S. patent application publication No. 2020/0277663.
In some cases, spatial analysis may be performed by detecting a plurality of oligonucleotides hybridized to the analyte. In some cases, for example, spatial analysis may be performed using RNA Template Ligation (RTL). The method of RTL has been described previously. See, e.g., credle et al, nucleic Acids res.2017, 8, 21; 45 (14): e128. typically, RTL involves hybridization of two oligonucleotides to adjacent sequences on an analyte (e.g., an RNA molecule, such as an mRNA molecule). In some cases, the oligonucleotide is a DNA molecule. In some cases, one of the oligonucleotides comprises at least two ribonucleobases at the 3 'end and/or the other oligonucleotide comprises a phosphorylated nucleotide at the 5' end. In some cases, one of the two oligonucleotides includes a capture domain (e.g., a poly (a) sequence, a non-homopolymeric sequence). After hybridization to the analyte, a ligase (e.g., a SplintR ligase) ligates the two oligonucleotides together, producing a bound probe (e.g., ligation product). In some cases, two oligonucleotides hybridize to sequences that are not adjacent to each other. For example, hybridization of two oligonucleotides creates a gap between hybridized oligonucleotides. In some cases, a polymerase (e.g., a DNA polymerase) may extend one of the oligonucleotides prior to ligation. After ligation, the bound probes (e.g., ligation products) are released from the analyte. In some cases, the bound probe (e.g., ligation product) is released using an endonuclease (e.g., rnase H). The released bound probes (e.g., ligation products) may then be captured by capture probes on the array (e.g., in lieu of direct capture of the analyte), optionally amplified and sequenced, to thereby determine the location and optionally abundance of the analyte in the biological sample.
During spatial information analysis, sequence information of the spatial barcode associated with the analyte is obtained and can be used to provide information about the spatial distribution of the analyte in the biological sample. Various methods may be used to obtain the spatial information. In some embodiments, specific capture probes and analytes they capture are associated with specific locations in the feature array on the substrate. For example, a particular spatial barcode may be associated with a particular array location prior to array fabrication, and a sequence of spatial barcodes may be stored (e.g., in a database) with particular array location information such that each spatial barcode is uniquely mapped to a particular array location.
Alternatively, a particular spatial barcode may be deposited at predetermined locations in the array of features during manufacture such that at each location there is only one type of spatial barcode, whereby the spatial barcode is uniquely associated with a single feature of the array. If desired, the array may be decoded using any of the methods described herein so that the spatial bar code is uniquely associated with the array feature locations, and the mapping may be stored as described above.
When sequence information for the capture probes and/or analytes is obtained during spatial information analysis, the location of the capture probes and/or analytes may be determined by reference to stored information that uniquely correlates each spatial barcode with a characteristic location of the array. In this way, specific capture probes and capture analytes are associated with specific locations in the feature array. Each array feature location represents a location of a coordinate reference point (e.g., array location, fiducial marker) relative to the array. Thus, each feature location has an "address" or location in the coordinate space of the array.
Some exemplary spatial analysis workflows are described in the exemplary embodiments section of WO2020/176788 and/or in U.S. patent application publication No. 2020/0277663. See, for example, WO2020/176788 and/or U.S. patent application publication 2020/0277663 for some non-limiting examples of workflows described herein, a sample may be immersed in an exemplary embodiment beginning with … … ". See also, e.g., visium spatial gene expression kit user guide (Visium Spatial Gene Expression Reagent Kits User Guide) (e.g., rev C, month 6 of 2020), targeted gene expression spatial user guide (the Targeted Gene Expression-Spatial User Guide) (e.g., rev A, month 10 of 2020), and/or Visium spatial tissue optimization kit user guide (Visium Spatial Tissue Optimization Reagent Kits User Guide) (e.g., rev C, month 7 of 2020).
In some embodiments, spatial analysis may be performed using dedicated hardware and/or software, such as part (II) (e) (II) and/or (V) of WO2020/176788 and/or any of the systems described in U.S. patent application publication No. 2020/0277663, or any one or more of the devices or methods described in the control slide for imaging, the method of using the control slide and substrate for imaging, the system of using the control slide and substrate for imaging and/or the sample and array alignment device and method, the information tag of WO 2020/123320.
Suitable systems for performing spatial analysis may include components such as a chamber (e.g., a flow cell or sealable fluid tight chamber) for containing a biological sample. The biological sample may be immobilized, for example, in a biological sample container. One or more fluid chambers may be connected to the chambers and/or sample containers by fluid conduits, and fluids may be delivered into the chambers and/or sample containers by fluid pumps, vacuum sources, or other devices connected to the fluid conduits that create pressure gradients to drive the fluid flow. One or more valves may also be connected to the fluid conduit to regulate the flow of reagents from the reservoir to the chamber and/or sample container.
The system may optionally include a control unit comprising one or more electronic processors, input interfaces, output interfaces (e.g., a display), and storage units (e.g., solid-state storage media such as, but not limited to, magnetic, optical, or other solid-state, persistent, writable, and/or rewritable storage media). The control unit may optionally be connected to one or more remote devices via a network. The control unit (and its components) may generally perform any of the steps and functions described herein. Where the system is connected to a remote device, the remote device (or devices) may perform any of the steps or features described herein. The system may optionally include one or more detectors (e.g., CCD, CMOS) for capturing images. The system may also optionally include one or more light sources (e.g., LED-based, diode-based, laser-based) for illuminating the sample, a substrate having features, analytes from the biological sample captured on the substrate, and various control and calibration media.
The system may optionally include software instructions encoded and/or implemented in one or more tangible storage media and hardware components (e.g., application specific integrated circuits). The software instructions, when executed by a control unit (particularly an electronic processor) or integrated circuit, may cause the control unit, integrated circuit, or other component executing the software instructions to perform any of the method steps or functions described herein.
In some cases, the systems described herein can detect (e.g., register images) biological samples on an array. Exemplary methods of detecting biological samples on an array are described in PCT application No. 2020/061064 and/or U.S. patent application Ser. No. 16/951,854.
The biological sample may be aligned with the array prior to transferring the analyte from the biological sample to the array of features on the substrate. Alignment of the biological sample and the feature array comprising capture probes may facilitate spatial analysis, which may be used to detect differences in the presence and/or level of an analyte in different locations in the biological sample, e.g., to generate a three-dimensional map of the presence and/or level of the analyte. Exemplary methods for generating two-and/or three-dimensional maps of analyte presence and/or level are described in PCT application No. 2020/053655, spatial analysis methods are generally described in WO2020/061108 and/or U.S. patent application serial No. 16/951,864.
In some cases, one or more fiducial markers may be used to align a map of analyte presence and/or level with an image of a biological sample, e.g., an object placed in the field of view of an imaging system appears in the generated image, as described in the substrate properties portion of WO2020/123320, the control slide portion for imaging, PCT application No. 2020/061066, and/or U.S. patent application serial No. 16/951,843. Fiducial markers may be used as reference points or measurement scales for alignment (e.g., to align a sample and an array, to align two substrates, to determine the position of a sample or array on a substrate relative to a fiducial marker) and/or for quantitative measurement of size and/or distance.
The sandwich process is described in PCT patent application publication WO 2020/123320, which is incorporated herein by reference in its entirety.
Targeted spatial gene expression profiling by hybridization and capture of spatial cDNA
(a) Introduction to the invention
Spatial analysis methods using capture probes and/or analyte capture agents provide information about the abundance and location of an analyte (e.g., a nucleic acid or protein). Traditionally, sequencing results contained unwanted genes, ribosomal or mitochondrial transcripts, as well as other uninteresting reads. Detection of the analyte of interest is largely dependent on the sequencing capability of all captured analytes on the (at least part of) array. Disclosed herein are methods and compositions for capturing target analytes of interest using decoy oligonucleotides specific for the target analytes of interest. In this way, the sequencing result contains a higher percentage of reads from the target analyte of interest, which increases the spatial resolution and sequencing cost of the target analyte of interest
Disclosed herein are methods of capturing target analytes of interest using decoy oligonucleotides specific for the target analytes of interest. As disclosed herein, decoy oligonucleotides are short (40 bp to 160 bp) oligonucleotides that hybridize to transcribed (e.g., mRNA) sequences, thereby detecting mRNA and its expression. The decoy oligonucleotide has been identified to hybridize to the 5 'end of the transcript, the 3' end of the transcript, or any intervening sequence of the transcript. In particular, there are certain advantages to designing probes to hybridize to the insertion sequences (i.e., not the 5 'and 3' ends) of transcripts. For example, many transcripts in the human genome have different sequences at the 5 'and 3' ends. Thus, designing a decoy that will target an intervening transcript sequence, particularly a decoy that targets a conserved sequence, can allow a single decoy to hybridize to multiple isoforms of the same gene. Thus, disclosed herein are compositions and methods comprising decoy oligonucleotides that hybridize to multiple isoforms of the same gene.
Thus, provided herein are methods that include designing and using probes (e.g., decoy oligonucleotides, nucleic acid decoys, etc.) to capture full-length (e.g., non-fragmented) cDNA for sequencing analysis, rather than decoy oligonucleotides for capturing final library fragments comprising the 3' utr. By targeting full-length cdnas, it is possible to utilize regions, e.g., coding sequences, of the target gene that are more reliably annotated for individual transcripts (e.g., isoforms) of the individual genes. Thus, the step of using the method to identify a target of interest in a sample includes, but is not limited to, preparing a nucleic acid library so that a nucleic acid decoy can hybridize to one or more analytes of interest; hybridizing the nucleic acid decoy to one or more analytes of interest; and determining the location and abundance of the target analyte in the biological sample.
Also featured herein are methods of detecting an analyte of interest that has been captured by a capture probe on a substrate (i.e., a spatial array). In some cases, after the analyte of interest is captured on the spatial array, a probe (e.g., a nucleic acid decoy) is used to identify the target analyte of interest. Thus, the step of using the method to identify a target of interest in a sample includes, but is not limited to, capturing an analyte in the sample using a capture probe; amplifying the hybridized capture probes/analyte products to create a cDNA library; preparing a cDNA library so that the nucleic acid bait can hybridize to one or more analytes of interest; hybridizing the nucleic acid decoy to one or more analytes of interest; and determining the location and abundance of the target analyte in the biological sample.
Profiling the phenotype of a biological sample using any of the methods described above, as well as those further illustrated herein, helps to avoid confounding biological factors, such as detection of strongly expressed genes that are not relevant to the study or cell cycle phenotypes that may mask biological differences. In addition, enhancing detection of one or more analytes of interest helps to significantly reduce the sequencing cost of obtaining information about the genome of interest by minimizing the number of reads that are consumed on genes that are not in the group (e.g., ribosomal protein transcripts, mitochondrial transcripts, etc.). As such, the methods described herein are also cost effective.
(b) Biological sample, analyte and preparation thereof
(i) Biological sample and analyte
In some embodiments, the biological sample used in the methods disclosed herein is a cell culture sample. In some cases, the biological sample is a tissue sample. In some embodiments, the biological sample is a tissue sample slice. In some embodiments, the biological sample is a fresh tissue sample. In some embodiments, the biological sample is a fresh frozen tissue sample. In some embodiments, the biological sample is a tissue sample that has been Formalin Fixed and Paraffin Embedded (FFPE) (i.e., FFPE sample). In some embodiments, the biological sample is a tissue sample embedded in an Optimal Cutting Temperature (OCT) compound. In some embodiments, the biological sample has been pre-stained (e.g., immunohistochemical (IHC) or histological staining) and imaged, and optionally decolorized.
In some embodiments, biological samples are obtained from a subject for analysis using any of a variety of techniques, including but not limited to biopsies, surgery, and Laser Capture Microscopy (LCM), and generally include cells and/or other biological materials from the subject. The biological sample may also be obtained from a prokaryote, such as a bacterium, e.g., escherichia coli, staphylococcus (staphylococcus) or mycoplasma pneumoniae (Mycoplasma pneumoniae); archaebacteria; viruses, such as hepatitis c virus or human immunodeficiency virus; or a viroid. Biological samples may also be obtained from eukaryotic organisms, such as patient-derived organoids (PDOs) or patient-derived xenografts (PDXs). The biological sample may be from a mammal, such as a human, a non-human mammal (e.g., a mouse), and the like. The subject from which the biological sample may be obtained may be a healthy or asymptomatic individual, an individual having or suspected of having a disease (e.g., a patient having a disease such as cancer), or an individual having a pretreatment for a disease, and/or an individual in need of treatment or suspected of requiring treatment.
The biological sample may include any number of macromolecules, such as cellular macromolecules and organelles (e.g., mitochondria and nuclei). The biological sample may be a nucleic acid sample and/or a protein sample. The biological sample may be a carbohydrate sample or a lipid sample. The biological sample may be obtained as a tissue sample, such as a tissue section, biopsy, core biopsy, needle aspirate, or fine needle aspirate. The sample may be a liquid sample, such as a blood sample, a urine sample, or a saliva sample. The sample may be a skin sample, a colon sample, a cheek swab, a histological sample, a histopathological sample, a plasma or serum sample, a tumor sample, living cells, cultured cells, a clinical sample, such as whole blood or blood derived products, blood cells or cultured tissues or cells, including cell suspensions.
The biological sample may be from a homogeneous culture or population of objects or organisms described herein, or alternatively from a collection of several different organisms in, for example, a community or an ecosystem. In some embodiments, the biological sample is a human sample.
The biological sample may include one or more diseased cells. The diseased cells may have altered metabolic characteristics, gene expression, protein expression, and/or morphological characteristics. Examples of diseases include inflammatory disorders, metabolic disorders, neurological disorders, and cancers. Cancer cells may be derived from solid tumors, hematological malignancies, and cell lines, and may also be obtained in the form of circulating tumor cells.
The biological sample may also include fetal cells. For example, procedures such as amniocentesis may be performed to obtain a fetal cell sample from the maternal circulation. Fetal cell sequencing can be used to identify any of a number of genetic diseases, including, for example, aneuploidy, such as down's syndrome, edwardsies syndrome, and pampers Tao Zeng syndrome. In addition, the cell surface characteristics of fetal cells may be used to identify any of a variety of diseases or conditions.
The biological sample may also include immune cells. Sequence analysis of immune functions of such cells, including genomics, proteomics, and cell surface features, can provide rich information that helps to understand the state and function of the immune system. For example, determining the status of Minimal Residual Disease (MRD) (e.g., negative or positive) in a Multiple Myeloma (MM) patient after autologous stem cell transplantation is considered a predictor of MRD in MM patients (see, e.g., U.S. patent application publication No. 2018/0156784, which is incorporated herein by reference in its entirety).
Examples of immune cells in a biological sample include, but are not limited to, B cells, T cells (e.g., cytotoxic T cells, natural killer T cells, regulatory T cells, and helper T cells), natural killer cells, cytokine-induced killer (CIK) cells, myeloid cells, such as granulocytes (basophils, eosinophils, neutrophils/multisection nuclear neutrophils), monocytes/macrophages, mast cells, platelets/megakaryocytes, and dendritic cells.
In some embodiments, the biological sample is fixed to a slide. In some embodiments, the sample is stained prior to creating the nucleic acid library (e.g., the plurality of nucleic acids). In some embodiments, the biological sample is stained while on a slide. In some embodiments, the stained biological sample is imaged prior to creating the nucleic acid library (e.g., the plurality of nucleic acids).
In some embodiments, staining includes biological staining techniques, such as H & E staining. In some embodiments, staining includes identifying the analyte using a fluorescent conjugated antibody (e.g., immunofluorescence). In some embodiments, the biological sample is stained using two or more different types of stains or two or more different staining techniques (e.g., IF, IHC, and/or H & E staining). For example, a biological sample may be prepared by: staining and imaging are performed by using one technique (e.g., H & E staining and bright field imaging), bleaching (e.g., quenching or photobleaching), and then staining and imaging the same biological sample using another technique (e.g., IHC/IF staining and fluorescence microscopy).
In some embodiments, the biological sample may be decolorized prior to creating the nucleic acid space library. Methods of decontamination or decolorization of biological samples are known in the art and generally depend on the nature of the stain applied to the sample.
In some embodiments, the analyte in the biological sample is a nucleic acid. In some embodiments, the analyte (e.g., nucleic acid) is obtained from a biological sample. In some embodiments, the nucleic acid is DNA (e.g., genomic DNA, mitochondrial DNA, or exosome DNA). In some embodiments, the nucleic acid is RNA. In some embodiments, the RNA is mRNA. Other examples of RNAs are, for example, various types of coding and non-coding RNAs. Different types of RNA analytes include messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), small RNA (miRNA), and viral RNA. The RNA can be a transcript (e.g., present in a tissue section). The RNA can be small (e.g., less than 200 nucleobases in length) or large (e.g., RNA greater than 200 nucleobases in length). The micrornas include mainly 5.8S ribosomal RNAs (rRNA), 5S rRNA, transfer RNAs (tRNA), micrornas (miRNA), small interfering RNAs (siRNA), micronucleolar RNAs (snRNA), piwi interacting RNAs (piRNA), tRNA-derived micrornas (tsRNA), and small rDNA-derived RNAs (srrrna). The RNA may be double-stranded RNA or single-stranded RNA. The RNA may be circular RNA. The RNA may be bacterial rRNA (e.g., 16s rRNA or 23s rRNA).
In some embodiments, the nucleic acid comprises DNA. Examples of DNA include genomic DNA, methylated DNA, specific methylated DNA sequences, fragmented DNA, mitochondrial DNA, PCR products synthesized in situ, and RNA/DNA hybrids.
(ii) Imaging and staining
In some cases, the biological sample may be stained using a variety of stains and staining techniques. In some cases, the biological sample is a slice of tissue (e.g., a 10 μm slice). In some cases, the biological sample is dried after being placed on the slide. In some cases, the biological sample is dried at 42 ℃. In some cases, drying occurs for about 1 hour, about 2 hours, about 3 hours, or until the slice becomes transparent. In some cases, the biological sample may be dried overnight (e.g., in a desiccator at room temperature).
In some embodiments, the sample may be stained using any number of biological stains including, but not limited to, acridine orange, bismaleimide brown, carmine, coomassie blue, cresol purple, DAPI, eosin, ethidium bromide, acid fuchsin, hematoxylin, holtz stain, iodine, methyl green, methylene blue, neutral red, nile blue, nile red, osmium tetroxide, propidium iodide, rhodamine, or safranine. In some cases, the methods disclosed herein comprise imaging a biological sample. In some cases, sample imaging occurs prior to biological sample deamination. In some cases, the sample may be stained using known staining techniques, including candlen (Can-Grunwald), giemsa (Giemsa), hematoxylin and eosin (H & E), hucho's (Jenner's), leishmania (Leishman), masson's) trichromatism, papanicolaou (Papanicolaou), luo Manla s-based (roman sky), silver (silver), sudan (Sudan), rayleigh's (Wright's), and/or PAS staining techniques. PAS staining is usually performed after formaldehyde or acetone fixation. In some cases, the dye is an H & E dye.
In some embodiments, a biological sample may be stained with a detectable label (e.g., radioisotope, fluorophore, chemiluminescent compound, bioluminescent compound, and dye) as described elsewhere herein. In some embodiments, the biological sample is stained using only one type of stain or one technique. In some embodiments, staining includes biological staining techniques, such as H & E staining. In some embodiments, staining comprises identifying the analyte using a fluorescent conjugated antibody. In some embodiments, the biological sample is stained using two or more different types of stains or two or more different staining techniques. For example, a biological sample may be prepared by staining and imaging using one technique (e.g., H & E staining and bright field imaging) followed by staining and imaging the same biological sample using another technique (e.g., IHC/IF staining and fluorescence microscopy).
In some embodiments, the biological sample may be decolorized. Methods of decontamination or decolorization of biological samples are known in the art and generally depend on the nature of the stain applied to the sample. For example, H & E staining can be decolorized by washing the sample in HCl or any other acid (e.g., selenoic acid, sulfuric acid, hydroiodic acid, benzoic acid, carbonic acid, malic acid, phosphoric acid, oxalic acid, succinic acid, salicylic acid, tartaric acid, sulfurous acid, trichloroacetic acid, hydrobromic acid, hydrochloric acid, nitric acid, orthophosphoric acid, arsenic acid, selenious acid, chromic acid, citric acid, hydrofluoric acid, nitrous acid, isocyanic acid, formic acid, hydrogen selenide, molybdic acid, lactic acid, acetic acid, carbonic acid, hydrogen sulfide, or combinations thereof). In some embodiments, decolorization may include 1, 2, 3, 4, 5 or more washes in acid (e.g., HCl). In some embodiments, decolorizing may include adding HCl to a downstream solution (e.g., permeabilization solution). In some embodiments, decolorization can include dissolving an enzyme (e.g., pepsin) used in the disclosed methods in an acid (e.g., HCl) solution. In some embodiments, after the hematoxylin is decolorized with acid, other reagents may be added to the decolorized solution to raise the pH for other applications. For example, SDS may be added to the acid decolorization solution to raise the pH as compared to the acid decolorization solution alone. As another example, in some embodiments, one or more immunofluorescent stains are applied to the sample by antibody coupling. These stains can be removed using techniques such as cleavage of disulfide bonds by reducing agent and detergent wash treatments, hygrosalt treatments, antigen recovery solution treatments, and acidic glycine buffer treatments. For example, bolognesi et al, J.Histochem. Cytochem.2017;65 (8): 431-444, lin et al, nat Commun.2015;6:8390, pirici et al, j. Histochem. Cytochem.2009;57:567-75, and Glass et al, j. Histochem. Cytochem.2009;57:899-905, the entire contents of which are incorporated herein by reference, describes a method of multiple staining and bleaching.
In some embodiments, immunofluorescence or immunohistochemical protocols (direct and indirect staining techniques) may be performed as part of or in addition to the exemplary spatial workflow presented herein. For example, tissue sections may be fixed according to the methods described herein. The biological sample may be transferred to an array (e.g., a capture probe array) where the analyte (e.g., protein) is detected using an immunofluorescence protocol. For example, samples can be rehydrated, blocked and permeabilized (3 XSSC,2% BSA, 0.1% Triton X, 1U/. Mu.l RNase inhibitor, 4℃for 10 min) and then stained with fluorescent primary antibodies (1:100 at 3XSSC,2% BSA, 0.1% Triton X, 1U/. Mu.l RNase inhibitor, 4℃for 30 min). Biological samples may be washed, coverslipped (in glycerol+1u/. Mu.l rnase inhibitor), imaged (e.g., using confocal microscopy or other device capable of fluorescence detection), washed, and processed (according to the analyte capture or spatial workflow described herein).
In some cases, a glycerol solution and a cover slip may be added to the sample. In some cases, the glycerol solution may include a counterstain (e.g., DAPI).
As used herein, antigen retrieval buffers may improve antibody capture in IF/IHC protocols. An exemplary protocol for antigen retrieval may be to preheat the antigen retrieval buffer (e.g., to 95 ℃), immerse the biological sample in the heated antigen retrieval buffer for a predetermined time, then remove the biological sample from the antigen retrieval buffer and wash the biological sample.
In some embodiments, optimizing permeabilization may be useful for identifying intracellular analytes. Permeabilization optimization may include selection of permeabilizing agent, concentration of permeabilizing agent, and duration of permeabilization. Tissue permeabilization is discussed elsewhere herein.
In some embodiments, blocking the array and/or biological sample during preparation of the labeled (labeling) biological sample reduces non-specific binding (background reduction) of antibodies to the array and/or biological sample. Some embodiments provide a blocking buffer/blocking solution that may be applied prior to and/or during administration of the label, wherein the blocking buffer may include a blocking agent and optionally a surfactant and/or salt solution. In some embodiments, the blocking agent may be Bovine Serum Albumin (BSA), serum, gelatin (e.g., fish gelatin), milk (e.g., skim milk powder), casein, polyethylene glycol (PEG), polyvinyl alcohol (PVA), or polyvinylpyrrolidone (PVP), biotin blocking agent, peroxidase blocking agent, levamisole, carnoy's solution, glycine, lysine, sodium borohydride, pantamine sky blue (pontamine sky blue), sudan black, trypan blue, FITC blocking agent, and/or acetic acid. The blocking buffer/blocking solution may be applied to the array and/or the biological sample prior to and/or during the labeling of the biological sample (e.g., the application of fluorophore-conjugated antibodies).
(iii) Sample preparation for probe applications
In some cases, the biological sample is dewaxed. Dewaxing may be accomplished using any method known in the art. For example, in some cases, a biological sample is treated with a series of wash solutions comprising xylenes and various concentrations of ethanol. In some cases, the dewaxing process includes treatment with xylenes (e.g., 3 washes 5 minutes each). In some cases, the method further comprises treating with ethanol (e.g., 100% ethanol twice per wash for 10 minutes, 95% ethanol twice per wash for 10 minutes, 70% ethanol twice per wash for 10 minutes, 50% ethanol twice per wash for 10 minutes). In some cases, after the ethanol wash, the biological sample may be washed with deionized water (e.g., twice for 5 minutes each time). It will be appreciated that one skilled in the art can adjust these methods to optimize dewaxing.
In some cases, the biological sample is de-crosslinked. In some cases, the biological sample is de-crosslinked in a solution containing TE buffer (including Tris and EDTA). In some cases, the TE buffer is alkaline (e.g., at a pH of about 9). In some cases, the de-crosslinking occurs at about 50 ℃ to about 80 ℃. In some cases, the de-crosslinking occurs at about 70 ℃. In some cases, the de-crosslinking occurs at 70 ℃ for about 1 hour. Just prior to de-crosslinking, the biological sample may be treated with an acid (e.g., 0.1M HCl, about 1 minute). After the de-crosslinking step, the biological sample may be washed (e.g., with 1 x PBST).
In some cases, a method of preparing a biological sample for a probe application includes permeabilizing the sample. In some cases, the biological sample is permeabilized using phosphate buffer. In some cases, the phosphate buffer is PBS (e.g., 1 x PBS). In some cases, the phosphate buffer is PBST (e.g., 1 x PBST). In some cases, the permeabilization step is performed multiple times (e.g., 3 times per 5 minutes).
In some cases, the method of preparing a biological sample for a probe application includes the steps of equilibrating and blocking the biological sample. In some cases, equilibration is performed using pre-hybridization (pre-Hyb) buffer. In some cases, the pre-Hyb buffer is free of rnase. In some cases, the pre-Hyb buffer is free of Bovine Serum Albumin (BSA), denhardt's solution, or other biological material that may be contaminated with nucleases.
In some cases, the equilibration step is performed multiple times (e.g., 2 times for 5 minutes each, 3 times for 5 minutes each). In some cases, the biological sample is blocked with a blocking buffer. In some cases, the blocking buffer includes a carrier (e.g., tRNA), such as a yeast tRNA from Saccharomyces cerevisiae (e.g., at a final concentration of 10-20 μg/mL). In some cases, the closing may be performed for 5, 10, 15, 20, 25, or 30 minutes.
Any of the foregoing steps may be optimized for performance. For example, the temperature may be changed. In some cases, the prehybridization process is performed at room temperature. In some cases, the prehybridization method is performed at 4 ℃ (in some cases, the time ranges provided herein are altered).
(c) Nucleic acid library preparation
(i) Single cell library preparation
Disclosed herein are methods of preparing a nucleic acid library (e.g., a plurality of nucleic acids) from a cell or population of cells. In some embodiments, the biological sample may be obtained from a cell culture grown in vitro. Samples from cell cultures may include one or more suspended cells that are anchored independently within the cell culture. Examples of such cells include, but are not limited to, cell lines derived from hematopoietic cells and from the following cell lines: colo205, CCRF-CEM, HL-60, K562, MOLT-4, RPMI-8226, SR, HOP-92, NCI-H322M and MALME-3M.
Samples from cell cultures may include one or more adherent cells grown on the surface of a vessel containing the culture medium. Non-limiting examples of adherent cells include DU145 (prostate cancer) cells, H295R (adrenal cortical cancer) cells, heLa (cervical cancer) cells, KBM-7 (chronic granulocytic leukemia) cells, LNCaP (prostate cancer) cells, MCF-7 (breast cancer) cells, MDA-MB-468 (breast cancer) cells, PC3 (prostate cancer) cells, saOS-2 (bone cancer) cells, SH-SY5Y (neuroblastoma, cloned from myeloma) cells, T-47D (breast cancer) cells, THP-1 (acute myeloid leukemia) cells, U87 (glioblastoma) cells, national cancer institute 60 cancer cell line panel (NCI 60) vero (african green monkey kidney epithelial cell line) cells, MC3T3 (embryo) cells, GH3 (pituitary tumor) cells, PC12 (pheochromocytoma) cells, dog MDCK kidney epithelial cells, claw A6 kidney epithelial cells, AB9 insect epithelial cells and zebra 9 insect epithelial cells.
Other examples of samples that are considered cells or cell populations include, but are not limited to, those listed in priority file U.S. provisional patent application Ser. Nos. 62/979,652, 62/980,124, and 63/077,019, each of which is incorporated herein by reference in its entirety. It should be understood that the population of cells in the single cell library preparation may be from any of the cells or cell culture populations disclosed herein.
In some embodiments, a nucleic acid library (e.g., a plurality of nucleic acids) includes one or more nucleic acids of interest, for which detection may be enhanced using hybridization methods. Analytes may be isolated, amplified, and/or otherwise processed for subsequent analysis, such as fragmentation and sequencing library preparation.
In some embodiments, the biological sample is permeabilized using any of the methods described herein. The biological sample is permeabilized such that the analyte is released from one or more cells in the biological sample. In some embodiments, obtaining an analyte set (e.g., mRNA) from a biological sample includes any nucleic acid extraction and/or isolation method known in the art and/or described herein. In some embodiments, the released and obtained analytes may be amplified. In some embodiments, the polyadenylation mRNA collection is obtained from a biological sample in preparation for a sequencing method (e.g., by any of the methods contained in the sequencing library preparation workflow).
Other reagents may be added to the biological sample to perform various functions prior to analyzing the sample. In some embodiments, dnase and rnase inactivating agents or inhibitors and/or chelating agents such as EDTA may be added to the sample. In some embodiments, the sample may be treated with one or more enzymes. For example, one or more endonucleases for fragmenting DNA or RNA, as well as polymerases, may be added for amplifying nucleic acids. Enzymes that may be added to the sample include, but are not limited to, polymerases, transposases, ligases, dnases, and rnases.
In some embodiments, reverse transcriptase may be added to a sample, including enzymes, primers, and switching oligonucleotides having terminal transferase activity. Template switching can be used to increase the length of the cDNA, for example by appending a predefined nucleic acid sequence to the cDNA, whereby nucleic acid extension can be performed.
In some embodiments, obtaining the plurality of cDNA sequences includes first strand cDNA synthesis by reverse transcription of a corresponding set of analytes (e.g., polyadenylation mRNA). In some such embodiments, the cDNA is generated using a primer-containing poly (T). In some embodiments, the generated cDNA is barcoded using a capture probe characterized by having a barcode sequence (and optionally a UMI sequence) that hybridizes to at least a portion of the generated cDNA. In some embodiments, a Unique Molecular Identifier (UMI) is attached to the generated cDNA using a capture probe characterized by having a UMI that hybridizes to at least a portion of the generated cDNA. In some embodiments, the template switching oligonucleotide hybridizes to a poly (C) tail that is added to the 3' end of the cDNA by reverse transcriptase. In some such embodiments, the original mRNA template and template switching oligonucleotide are denatured from the cDNA, and the barcoded capture probe, having an optional UMI, is hybridized to the cDNA to generate a complement of the cDNA.
In some embodiments, obtaining the plurality of cDNA sequences further comprises amplifying (e.g., PCR amplifying) and/or adaptor extending each of the plurality of cDNA sequences. In some embodiments, adapter extension occurs prior to cDNA amplification. In some embodiments, adapter extension occurs during first strand cDNA synthesis. In some such embodiments, the extension of the adapter occurs through hybridization of the RNA molecule to the capture probe. In some such embodiments, the extension of the adapter occurs through hybridization of the cDNA molecule to the capture probe.
In some embodiments, the plurality of cDNA sequences are not fragmented, such that each of the plurality of cDNA sequences is a full-length cDNA sequence. In some such embodiments, the plurality of cDNA sequences comprises intermediates of a gene expression library preparation workflow (e.g., an unfractionated cDNA sequence generated using a 3' gene expression library preparation method).
In some embodiments, the plurality of cDNA sequences comprises a first subset of cDNA sequences, wherein each respective cDNA sequence in the first subset of cDNA sequences maps to each respective gene in the plurality of genes. The plurality of genes may be targeted genomes (e.g., genomes of interest). In some embodiments, the plurality of genes is between 5 genes and 20000 genes. In some embodiments, the plurality of genes is between 100 genes and 10000 genes. In some embodiments, the plurality of genes is between 500 genes and 2000 genes. In some embodiments, the plurality of genes is more than 10, more than 50, more than 100, more than 500, more than 1000, more than 2000, more than 5000, more than 10000, more than 15000, or more than 20000 genes.
In some embodiments, the plurality of cDNA sequences comprises a second subset of cDNA sequences, wherein each respective cDNA sequence in the second subset of cDNA sequences maps to a reference genome portion represented by the plurality of genes that is not mapped to by the first set of cDNA sequences. The second subset of cDNA sequences may include cDNA sequences that map to regions of the reference genome that are not targeted and/or other genes that are not included in the targeted genome.
In some embodiments, the plurality of cDNA sequences consists of a first subset of cDNA sequences and a second subset of cDNA sequences. In some such embodiments, each cDNA sequence in the plurality of cDNA sequences maps to a non-target region of the gene or reference genome of interest.
Each respective gene of the plurality of genes may be characterized by any number of transcripts. For example, in some embodiments, 3 or more transcripts correspond to each respective gene. For example, in some embodiments, 4 or more transcripts correspond to each respective gene. In some embodiments, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more transcripts correspond to each respective gene. In some embodiments, the plurality of transcripts corresponding to each respective gene in the plurality of genes comprises a plurality of isoforms of each respective gene.
An isoform of a gene refers to an mRNA molecule (and corresponding cDNA molecule) derived from the same genomic locus but comprising different nucleic acid sequences, including, but not limited to, transcription initiation sites (TSSs), protein Coding DNA Sequences (CDS), and/or untranslated regions (UTRs). These differences are caused by alternative splicing, use of alternative promoters, gene fusion or deletion, single Nucleotide Polymorphisms (SNPs) and/or other mutations or post-transcriptional genetic modifications. Due to differences in the mRNA sequences of the coding sequences and/or the cis-regulatory elements in the promoter sequences, isoforms of the gene may have different functional capabilities.
Identification of different isoforms of a gene is an important step in accurately enriching the cDNA sequence of the corresponding gene. For example, when different nucleic acid sequences of two or more isoforms of a target gene include different transcription initiation sites and/or untranslated regions, a decoy probe complementary to the 3 'or 5' annotated end of the gene may not capture all possible isoforms. If a nucleic acid decoy is designed to hybridize to a region of a first isoform that extends beyond either end of a second isoform, it may occur that while a minimal set of decoy probes is ideal for cost-effective and linear targeted sequencing analysis, in some cases, when two or more isoforms have such significantly different lengths that they do not overlap, it may be necessary to design multiple nucleic acid decoys so that the various isoforms can be bound and enriched. In some such cases, each non-overlapping isoform hybridizes to a different unique bait probe.
Thus, appropriate annotation of genomic regions (e.g., coding and/or non-coding sequences) spanned by the various isoforms is necessary to ensure that the decoy probe design generates multiple decoy probes (e.g., hybridizable) that represent each transcript of the target gene.
In some embodiments, each transcript of the plurality of transcripts is protein encoded. Alternatively, one or more transcripts of the plurality of transcripts may comprise non-coding sequences (e.g., 3 'or 5' untranslated regions). For example, one or more transcripts of the plurality of transcripts may comprise a 3'utr sequence downstream of a stop codon or a 5' utr upstream of a start codon. In some embodiments, one or more transcripts of the plurality of transcripts are protein-encoded, but comprise an incomplete coding sequence. For example, in some such embodiments, the transcript of the plurality of transcripts corresponding to the respective gene is a coding sequence (CDS) 3 'incomplete, a CDS 5' incomplete, or a 3 'and CDS 5' incomplete. As used herein, CDS 3' incomplete refers to protein encoded transcripts that do not include a stop codon due to incomplete verification (authentication). As used herein, a CDS 5' imperfection refers to a protein encoded transcript that does not include the initiation codon due to incomplete verification.
In some embodiments, the reference database can be used to align transcripts corresponding to respective genes, e.g., reference genome genode version 33 (grch 38. P13). For example, the annotation of the corresponding gene can be used as a reference database within the GENCODE Consortium (Consortium). In other cases, annotations of the corresponding genes can be obtained by Ensembl project, see Harrow et al 2012, "GENCODE: reference human Genome annotation of the code project (genode: the reference human Genome annotation for The ENCODE Project), "Genome res.22 (9): 1760-1774: doi:10.1101/gr.135350.111; and flick et al, 2014, "Ensembl 2014," Nucleic Acids Res.42 (database issue): D749-D755: doi:10.1093/nar/gkt1196, the entire contents of which are incorporated herein by reference.
(ii) Spatial library preparation
Disclosed herein are methods of generating a plurality of extended nucleic acids (e.g., any of the extended nucleic acids described herein) for detecting analytes including proteins and nucleic acids. In some cases, the method includes detecting the nucleic acid. In some embodiments, the method comprises detecting a protein.
As used herein, "extended nucleic acid" refers to a nucleic acid having additional nucleotides added to the end (e.g., the 3 'or 5' end) of the nucleic acid to extend the total length of the nucleic acid.
Disclosed herein are methods of making a library of target nucleic acids using decoy oligonucleotides, wherein the library of target nucleic acids is initially made from nucleic acids that hybridize to probes on a capture space array (i.e., a substrate). For example, the nucleic acid sequences captured on the substrate may be passed through a spatial workflow to provide the resulting nucleic acids that may be isolated, amplified, and/or otherwise processed for subsequent analysis (e.g., fragmentation and sequencing library preparation). Capture probes, substrates and arrays have been described in the previous section of this application and are incorporated into this section.
In some cases, embodiments of any of the methods described herein can include contacting a biological sample with a substrate comprising a plurality of attached capture probes, wherein the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that specifically binds to a sequence (e.g., a nucleic acid) present in an analyte; extending the 3' end of the capture probe using the analyte that specifically binds to the capture domain as a template to generate an extended capture probe; and amplifying the extended capture probe. In some cases, embodiments of any of the methods described herein can include contacting a plurality of analyte capture agents with a biological sample, wherein the analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that binds to an analyte (e.g., a protein) and (ii) an oligonucleotide comprising an analyte binding moiety barcode and an analyte capture sequence; contacting a plurality of analyte capture agents with a substrate, the substrate comprising a plurality of capture probes, wherein the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, wherein the capture domain binds to an analyte capture sequence; extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and amplifying the extended capture probe. Individual method steps and system features may be present in combination in many different embodiments; the particular combinations described herein are not limited in any way to other combinations of steps or features.
When a biological sample is contacted with a substrate (e.g., an array) that includes capture probes, for example, the analyte can be captured on the array. The capture probes interact with analytes released from the biological sample through the capture domains described throughout this application to capture the analytes. For example, the capture domain captures the analyte by hybridization to a nucleic acid sequence in a target nucleic acid molecule from a biological sample. In some cases, the sequence complementary to the capture domain is a polyadenylation (poly (a)) sequence. In some cases, the sequence complementary to the capture domain is designed to be specific for the sequence of interest (i.e., in the analyte of interest).
In a nucleic acid detection setting, a nucleic acid analyte hybridizes directly to capture probes on an array (e.g., a substrate). In a protein detection setting, referring to FIG. 4, analyte binding moiety 402 comprises a protein binding moiety 404 and an oligonucleotide 408. In some cases, the analyte binding moiety is added to the biological sample. After the protein binds to the protein binding moiety 404, the oligonucleotide is captured by a capture domain of the array (e.g., a substrate). Thus, an analyte binding moiety as used herein can be considered an analyte derivative, and its oligonucleotides can be captured and analyzed in a similar manner as the analysis of a nucleic acid analyte. Embodiments of analyte binding moieties have been previously described, for example in WO 2020/176788 and/or U.S. patent application publication No. 2020/0277663, each of which is incorporated herein by reference in its entirety.
Alternatively, a surrogate (proxy) of the analyte may be captured by the capture domain of the capture probes on the array. For example, two probes can hybridize to a target nucleic acid, whereby they can be ligated together to produce a ligation product that can be used as a surrogate for the target sequence. The ligation product may comprise a sequence substantially complementary to the capture domain of the capture probes on the array. Ligation products can be captured on the array and serve as a surrogate for the presence of target nucleic acids.
In some embodiments, the sample is stained prior to creating the nucleic acid library. In some embodiments, the biological sample is stained while on a slide. In some embodiments, the stained biological sample is imaged prior to creating the nucleic acid library. In some embodiments, staining includes biological staining techniques, such as H & E staining. In some embodiments, staining includes identifying the analyte using a fluorescent conjugated antibody (e.g., immunofluorescence). In some embodiments, the biological sample is stained using two or more different types of stains or two or more different staining techniques (e.g., IF, IHC, and/or H & E staining). For example, a biological sample may be prepared by: staining and imaging are performed by using one technique (e.g., H & E staining and bright field imaging), bleaching (e.g., quenching or photobleaching), and then staining and imaging the same biological sample using another technique (e.g., IHC/IF staining and fluorescence microscopy).
In some embodiments, a target-specific reaction is performed in the biological sample to enhance detection of one or more targets of interest prior to interaction with capture probes on the array. Examples of target-specific reactions include, but are not limited to, ligation of target-specific adaptors, probes, and/or other oligonucleotides, target-specific amplification using primers specific for one or more analytes, and target-specific detection using in situ hybridization, DNA microscopy, and/or antibody detection. In some embodiments, the capture probes comprise a capture domain (e.g., amplification or ligation) that is targeted to a target-specific product. The target analyte may then be captured by a capture probe (e.g., as described throughout herein).
In some embodiments, the methods provided herein include a permeabilization step to release an analyte from a biological sample. In some embodiments, permeabilization occurs using a protease. In some embodiments, the protease is an endopeptidase. Endopeptidases that may be used include, but are not limited to, trypsin, chymotrypsin, elastase, thermolysin, pepsin, clostripain, glutamyl endopeptidase (GluC), argC, peptidyl-aspartic endopeptidase (ApsN), endopeptidase LysC, and endopeptidase LysN. In some embodiments, the endopeptidase is pepsin.
In some embodiments, the methods provided herein include permeabilization of the biological sample, such that the capture probes can more readily bind to the analyte or analyte derivative (i.e., as compared to no permeabilization). In some embodiments, a Reverse Transcription (RT) reagent may be added to the permeabilized biological sample. Incubation with RT reagents can produce spatially barcoded full-length cdnas from captured analytes (e.g., polyadenylated mRNA). A second strand reagent (e.g., a second strand primer, an enzyme) may be added to the biological sample on the slide to initiate second strand synthesis.
In some cases, the permeabilizing step includes applying a permeabilizing buffer to the biological sample. In some cases, the permeabilization buffer comprises a buffer (e.g., tris pH 7.5), mgCl 2 Sarcosinyl detergent (e.g., sodium lauroyl sarcosinate) or other detergents, enzymes (e.g., proteinase K, pepsin, collagenase, etc.), and water without ribozymes. In some cases, the permeabilization step is performed at 37 ℃. In some cases, the permeabilization step is performed at about 20Minutes to 2 hours (e.g., about 20 minutes, about 30 minutes, about 40 minutes, about 50 minutes, about 1 hour, about 1.5 hours, or about 2 hours). In some cases, the releasing step is performed for about 40 minutes.
After permeabilization, in some cases, the analyte is captured by a capture probe and/or an analyte capture agent. In some embodiments, such methods of increasing the capture efficiency of a spatial array described herein comprise contacting the spatial array with a biological sample and allowing an analyte to interact with a capture probe and/or an analyte capture agent.
In some embodiments, using the hybridized analyte as a template, a capture probe hybridized to the analyte can be extended with a polymerase (e.g., reverse transcriptase) to produce an extended capture probe. In some embodiments, using the hybridized analyte as a template, a polymerase (e.g., reverse transcriptase) can be used to extend the 3' end of the capture probe hybridized to the analyte to generate an extended capture probe. In some embodiments, using the hybridized analyte as a template, a polymerase (e.g., reverse transcriptase) can be used to extend the 5' end of the capture probe hybridized to the analyte to generate an extended capture probe. The extended capture probe can be amplified (e.g., by second strand synthesis) to generate a single-stranded nucleic acid comprising a sequence complementary to the extended capture probe. Single stranded nucleic acids comprising sequences complementary to the extended capture probes may be used to generate a nucleic acid library or may be part of a nucleic acid library.
After hybridization of the analyte with the capture probe, the hybridized product is amplified. For example, obtaining a cDNA sequence library also includes amplifying (e.g., PCR amplifying) and/or adaptor extending the cDNA sequence. In some embodiments, adapter extension occurs prior to cDNA amplification. In some embodiments, adapter extension occurs during first strand cDNA synthesis.
In some embodiments, the method comprises amplifying all or a portion of the analyte using isothermal amplification or non-isothermal amplification. In some embodiments, amplification produces an amplification product comprising (i) all or part of the sequence of the analyte or analyte derivative bound to the capture probe or its complement, and (ii) all or part of the sequence of the spatial barcode or its complement. In some embodiments, the determining step comprises sequencing. A non-limiting example of sequencing that can be used to determine the sequence of an analyte, analyte derivative, and/or spatial barcode is in situ sequencing. In some aspects, in situ sequencing is performed by Sequencing By Synthesis (SBS), sequential fluorescent hybridization, ligation sequencing, nucleic acid hybridization, or high throughput digital sequencing techniques. In some embodiments, the analyte is DNA or RNA. In some embodiments, the analyte is a protein.
In some embodiments, after contacting the biological sample with the substrate comprising capture probes, a removal step may optionally be performed to remove all or a portion of the biological sample from the substrate. In some embodiments, the removing step comprises enzymatic and/or chemical degradation of the biological sample cells. For example, the removing step may include treating the biological sample with an enzyme (e.g., a protease, e.g., proteinase K) to remove at least a portion of the biological sample from the substrate. In some embodiments, the removing step may include ablation of tissue (e.g., laser ablation).
In some embodiments, provided herein are methods for spatially detecting an analyte (e.g., detecting a location of an analyte (e.g., a biological analyte)) from a biological sample (e.g., present in a biological sample), the methods comprising: (a) Optionally staining and/or imaging a biological sample on a substrate; (b) Permeabilizing a biological sample on a substrate (e.g., providing a solution comprising a permeabilizing reagent); (c) Contacting the biological sample with an array comprising a plurality of capture probes, wherein the capture probes capture a biological analyte; and (d) analyzing the captured biological analyte, thereby spatially detecting the biological analyte; wherein the biological sample is removed from the substrate in whole or in part.
In some embodiments, the biological sample is not removed from the substrate. For example, the biological sample is not removed from the substrate until the capture probes (e.g., capture probes bound to the analyte) are released from the substrate. In some embodiments, such release comprises cleavage of the capture probes from the substrate (e.g., by cleavage of the domains). In some embodiments, such release does not include release of the capture probes from the substrate (e.g., a copy of the analyte-bound capture probes may be prepared and the copy may be released from the substrate, e.g., by denaturation). In some embodiments, after release of the biological sample from the substrate, the biological sample is not removed from the substrate prior to analysis of the analyte bound to the capture probes. In some embodiments, the biological sample remains on the substrate during removal of the capture probes from the substrate and/or during analysis of analytes bound to the capture probes after release from the substrate. In some embodiments, the biological sample remains on the substrate during removal (e.g., by denaturation) of the copy of the capture probe (e.g., complement). In some embodiments, analysis of analytes bound to capture probes from a substrate can be performed without subjecting the biological sample to enzymatic and/or chemical degradation of cells (e.g., permeabilized cells) or ablation of tissue (e.g., laser ablation).
In some embodiments, at least a portion of the biological sample is not removed from the substrate. For example, a portion of the biological sample may remain on the substrate prior to release of the capture probes (e.g., analyte-bound capture probes) from the substrate and/or analysis of the analyte released from the substrate that is bound to the capture probes. In some embodiments, at least a portion of the biological sample is not subjected to enzymatic and/or chemical degradation of cells (e.g., permeabilized cells) or ablation of tissue (e.g., laser ablation) prior to analysis of analytes bound to capture probes from the substrate.
In some embodiments, provided herein are methods for spatially detecting an analyte (e.g., detecting a location of the analyte, e.g., a biological analyte) from a biological sample (e.g., present in a biological sample), comprising: (a) Optionally staining and/or imaging a biological sample on a substrate; (b) Permeabilizing a biological sample on a substrate (e.g., providing a solution comprising a permeabilizing reagent); (c) Contacting the biological sample with an array comprising a plurality of capture probes, wherein the capture probes of the plurality of capture probes capture a biological analyte; and (d) analyzing the captured biological analyte, thereby spatially detecting the biological analyte; wherein the biological sample is not removed from the substrate.
In some embodiments, provided herein are methods for detecting a biological analyte of interest from a biological sample space, comprising: (a) staining and imaging a biological sample on a substrate; (b) Providing a solution comprising a permeabilizing reagent to a biological sample on a substrate; (c) Contacting the biological sample with an array on a substrate, wherein the array comprises one or more capture probes, such that the one or more capture probes capture a biological analyte of interest; and (d) analyzing the captured biological analyte to spatially detect the biological analyte of interest; wherein the biological sample is not removed from the substrate.
In some embodiments, the method further comprises performing a spatial transcriptomic analysis of the region of interest in the biological sample. In some embodiments, the one or more capture probes comprise a capture domain. In some embodiments, the one or more capture probes comprise a Unique Molecular Identifier (UMI). In some embodiments, the one or more capture probes comprise a cleavage domain. In some embodiments, the cleavage domain comprises a sequence recognized and cleaved by uracil DNA glycosylase, purine-free/pyrimidine-free (AP) endonuclease (APE 1), U uracil-specific excision reagent (USER), and/or endonuclease VIII. In some embodiments, the one or more capture probes do not comprise a cleavage domain and are not cleaved from the array.
In some embodiments, the capture probes may be extended ("extended capture probes", e.g., as described herein). For example, the extended capture probe may comprise generating cDNA from captured (hybridized) RNA. This process involves synthesizing complementary strands of the hybridized nucleic acid, e.g., generating cDNA based on the captured RNA template (RNA hybridized to the capture domain of the capture probe). Thus, in an initial step of extending the capture probe (e.g., cDNA generation), the captured (hybridized) nucleic acid (e.g., RNA) acts as a template for extension (e.g., reverse transcription step).
In some embodiments, the capture probe uses reverse transcription extension. For example, reverse transcription involves the synthesis of cDNA (complementary or copy DNA) from RNA (e.g., messenger RNA) using reverse transcriptase. In some embodiments, reverse transcription is performed while the tissue is still in place, producing an analyte library, wherein the analyte library comprises a spatial barcode from a proximity capture probe. In some embodiments, the capture probes are extended using one or more DNA polymerases.
In some embodiments, the capture domain of the capture probe includes primers for generating a complementary strand of nucleic acid hybridized to the capture probe, e.g., primers for DNA polymerase and/or reverse transcription. Nucleic acid (e.g., DNA and/or cDNA) molecules resulting from the extension reaction incorporate the sequence of the capture probe. Extension of the capture probes, such as DNA polymerase and/or reverse transcription reactions, may be performed using a variety of suitable enzymes and protocols.
In some embodiments, full-length DNA (e.g., cDNA) molecules are produced. In some embodiments, a "full-length" DNA molecule refers to the entire captured nucleic acid molecule. However, if the nucleic acid (e.g., RNA) is partially degraded in the tissue sample, the captured nucleic acid molecules will be of a different length than the original RNA in the tissue sample. In some embodiments, the 3' end of the extended probe (e.g., the first strand cDNA molecule) is modified. For example, a linker or adapter may be attached to the 3' end of the extended probe. This can be accomplished by using a single-stranded ligase such as T4 RNA ligase or CircleGase TM (available from the company Lu Cigen of Midelton, wisconsin) (Lucigen). In some embodiments, the template switch oligonucleotide is used to extend the cDNA to produce full-length cDNA (or as close as possible to full-length cDNA). In some embodiments, a second strand synthesis auxiliary probe (a partially double-stranded DNA molecule capable of hybridizing to the 3 'end of the extended capture probe) may be ligated to the 3' end of the extended probe using a double-stranded ligase (e.g., T4DNA ligase), such as a first strand cDNA molecule. Other enzymes suitable for use in the ligation step are known in the art and include, for example, tth DNA ligase, taq DNA ligase, thermococcus (strain 9 DEG N) DNA ligase (9 DEG N) TM DNA ligase, new England Biolabs (New England Biolabs)), amphigenase TM (available from Lu Cigen company (Lucigen) of middeton, wisconsin) and splattr (available from the new england biology laboratory of ibos, massachusetts). In some embodiments, the polynucleotide tail (e.g., poly (a) tail) incorporates an extended probe molecule3' of (3). In some embodiments, a terminal transferase active enzyme is used to incorporate the polynucleotide tail.
In some embodiments, the double-stranded extended capture probes are treated to remove any unextended capture probes prior to amplification and/or analysis (e.g., sequence analysis). This can be accomplished by a variety of methods, for example, using enzymes to degrade non-extended probes, such as exonucleases or purification columns.
In some embodiments, the extended capture probe is amplified to produce an amount sufficient for analysis, such as by DNA sequencing. In some embodiments, the first strand of the extended capture probe (e.g., a DNA and/or cDNA molecule) is used as a template for an amplification reaction (e.g., a polymerase chain reaction).
In some embodiments, the amplification reaction uses primers that include affinity groups to incorporate the affinity groups onto an extended capture probe (e.g., RNA-cDNA hybridization). In some embodiments, the primer includes an affinity group and the extended capture probe includes an affinity group. The affinity group may correspond to any of the affinity groups described previously.
In some embodiments, extended capture probes comprising an affinity group may be coupled to a substrate specific for the affinity group. In some embodiments, the substrate may comprise an antibody or antibody fragment. In some embodiments, the substrate comprises avidin or streptavidin, and the affinity group comprises biotin. In some embodiments, the substrate comprises maltose and the affinity group comprises a maltose binding protein. In some embodiments, the substrate comprises a maltose binding protein and the affinity group comprises maltose. In some embodiments, amplifying the extended capture probes may act to release the extended probes from the substrate surface as long as a copy of the extended probes is not immobilized to the substrate.
In some embodiments, the extended capture probe or its complement or amplicon is released. The step of releasing the extended capture probes or their complements or amplicons from the surface of the substrate can be accomplished in a variety of ways. In some embodiments, the extended capture probes or their complements are released from the array by nucleic acid cleavage and/or denaturation (e.g., denaturation of double-stranded molecules by heating).
In some embodiments, the extended capture probes or their complements or amplicons are physically released from the surface (e.g., array) of the substrate. For example, when the extended capture probes are immobilized indirectly on the array substrate, such as by hybridization to surface probes, it can be sufficient to disrupt the interaction between the extended capture probes and the surface probes. Methods of disrupting the interaction between nucleic acid molecules include denaturing double-stranded nucleic acid molecules as known in the art. A straightforward method of releasing DNA molecules (i.e. stripping the extended probe array) is to use a solution that interferes with double-stranded molecular hydrogen bonding. In some embodiments, the extended capture probe is released by applying a heated solution (e.g., water or buffer at least 85 ℃, such as water at least 90 ℃, 91 ℃, 92 ℃, 93 ℃, 94 ℃, 95 ℃, 96 ℃, 97 ℃, 98 ℃, or 99 ℃). In some embodiments, a solution is added that includes salts, surfactants, etc., that may further destabilize interactions between nucleic acid molecules to release extended capture probes from the substrate.
In some embodiments, the extended capture probes comprise cleavage domains, and the extended capture probes are released from the substrate surface by cleavage. For example, the cleavage domain of the extended capture probe may be cleaved by any of the methods described herein. In some embodiments, the extended capture probes are released from the substrate surface prior to the step of amplifying the extended capture probes, for example by cleaving cleavage domains in the extended capture probes.
In some embodiments, the entire sample may be sequenced by direct barcoding of the sample by hybridization with a capture probe or analyte capture agent that hybridizes, binds or associates with, or is introduced into the cell surface, as described above.
A wide variety of different sequencing methods are available for analyzing the barcoded analytes or moieties. In general, the sequenced polynucleotide may be, for example, a nucleic acid molecule, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), including variants or derivatives thereof (e.g., single stranded DNA or DNA/RNA hybrids, as well as nucleic acid molecules having nucleotide analogs).
Sequencing of polynucleotides can be performed by various systems. More generally, nucleic acid amplification, polymerase Chain Reaction (PCR) (e.g., digital PCR and drop digital PCR (ddPCR), quantitative PCR, real-time PCR, multiplex PCR, PCR-based single-sided methods, emulsion PCR), and/or isothermal amplification can be used for sequencing. Non-limiting examples of methods of sequencing genetic material include, but are not limited to, DNA hybridization methods (e.g., southern blotting), restriction enzyme digestion methods, sanger sequencing methods, next generation sequencing methods (e.g., single molecule real-time sequencing, sequencing by synthesis, nanopore sequencing, and poony sequencing), ligation methods, and microarray methods.
(iii) Treatment of nucleic acid libraries
In some embodiments, after a nucleic acid library (e.g., a plurality of nucleic acids) is created, one or more decoy oligonucleotides (e.g., from one or more of the sets described herein) are hybridized to the plurality of nucleic acids. The nucleic acid comprises all or part of the sequence of the analyte of interest or its complement and/or comprises all or part of the spatial barcode of interest or its complement. In some embodiments, one or more decoy oligonucleotides are hybridized to a nucleic acid that comprises all or a portion of the sequence of an analyte of interest or its complement.
In some embodiments, one or more nucleic acid libraries are pooled. In some cases, one or more libraries are incubated with Cot DNA (e.g., human Cot DNA). In some cases, one or more libraries are incubated with universal blocker nucleic acids that hybridize to one or more well-expressed nucleic acids to prevent unwanted nucleic acids from hybridizing to decoy oligonucleotides.
In other embodiments, the library, nucleic acid, or enriched nucleic acid may be quantified using quantitative PCR (qPCR) prior to hybridization of the decoy oligonucleotide. In some embodiments, the library, nucleic acid, or enriched nucleic acid may be fragmented. In some embodiments, the library, nucleic acid, or enriched nucleic acid may be fragmented by an enzyme-based method (e.g., by restriction enzymes, nicking enzymes, and/or transposases). In some embodiments, the library, nucleic acid, or enriched nucleic acid may be fragmented by an endonuclease. In some embodiments, the library, nucleic acid, or enriched nucleic acid may be fragmented by mechanical shearing (e.g., sonic shearing, hydrodynamic shearing, and/or nebulization). In some embodiments, the library, nucleic acid, or enriched nucleic acid may be fragmented by a pooled enzyme-based method and mechanical cleavage. In some embodiments, the library, nucleic acid, or enriched nucleic acid may be fragmented by end repair, poly-a tailing, or a combination thereof. In some embodiments, an adapter is ligated to each nucleic acid or enriched nucleic acid sequence. In some embodiments, the adapter is ligated to the 3' end of the nucleic acid or enriched nucleic acid sequence. In some embodiments, the adapter is ligated to the 5' end of the nucleic acid sequence or the enriched nucleic acid sequence. In some embodiments, the adapter may be an additional functional nucleic acid sequence, e.g., a spacer sequence, a primer sequence/site, a barcode sequence, a Unique Molecular Identifier (UMI) sequence, a linker, and/or a sequencing adapter.
In some embodiments, the methods disclosed herein include Sample Index (SI) PCR that adds a nucleic acid sequence (e.g., a barcode) to the 5 'and/or 3' end of the nucleic acid sequence or enriched nucleic acid sequence. In some cases, the SI-PCR reaction is performed at 67 ℃. In some embodiments, SI-PCR is a PCR reaction that introduces sample index sequences (e.g., i5 and i 7) into the nucleic acid sequence or enriches the 5 'and/or 3' ends of the nucleic acid sequence. In some embodiments, the method for SI-PCR adds an i5 sample index sequence. In some embodiments, the method for SI-PCR adds an i7 sample index sequence. In some embodiments, the P5 adapter is added to the nucleic acid sequence or the enriched nucleic acid sequence. In some embodiments, the P7 adapter is added to the nucleic acid sequence or the enriched nucleic acid sequence. In some embodiments, SI-PCR is performed prior to enrichment of decoy oligonucleotides for the nucleic acid of interest. In some embodiments, SI-PCR is performed after enrichment of decoy oligonucleotides for the nucleic acid of interest.
In some embodiments, the nucleic acid of interest (either before or after enrichment with decoy oligonucleotides) or the library generated therefrom may be dried. In some embodiments, drying includes a dehydration process, such as heating, vacuum, lyophilization, drying, filtration, and air drying. In some embodiments, a vacuum centrifuge is used to dry the sample. In some cases, drying is performed at about 50 ℃, about 55 ℃, about 60 ℃, about 63 ℃, about 65 ℃, about 67 ℃, about 70 ℃, or about 75 ℃. In some embodiments, drying may be performed for at least 1 hour, at least 2 hours, at least 3 hours, or at least 4 hours. If the sample is not used immediately, the sample may be stored (e.g., at-20 ℃). In some embodiments, the nucleic acid of interest (either before or after enrichment with decoy oligonucleotides) or the library generated therefrom is not dried.
(d) Hybridization targeting of capture analytes using decoy oligonucleotides
After preparing a nucleic acid library from a biological sample, the library may be incubated with a plurality of decoy oligonucleotides, thereby selectively enriching the library of targets of interest. The target analytes can be separated from the library, thereby producing an enriched population of target analytes. While whole transcriptome spatial analysis provides very useful information, more targeted gene enrichment is able to spatially localize a subset of targets of particular interest, such as for cancer or disease detection-related cancer or disease-related genes and gene expression. Thus, research can focus on subsets of genes of interest and maximize knowledge of the spatial knowledge of these genes, while minimizing the costs and reagents associated with spatial whole transcriptome workflow.
(i) Rust erbium oligonucleotide design
In some embodiments, the decoy oligonucleotide sets are designed to target and hybridize to multiple nucleic acids (e.g., a prepared spatial library, such as a prepared cDNA library). In some embodiments, the decoy oligonucleotide sets hybridize to target nucleic acids (e.g., cdnas) from a broader library nucleic acid set. In some embodiments, the hybridized product (e.g., decoy oligonucleotide and hybridized nucleic acid) is then captured by streptavidin beads. In some embodiments, the hybridized product (e.g., decoy oligonucleotide and hybridized nucleic acid) is then captured by an avidin bead. Unhybridized nucleic acids are washed away. The target product is again amplified and sequenced. In some embodiments, the re-amplified target product may be fragmented, ligated to a linker sequence, and amplified by SI-PCR.
Disclosed herein are methods of designing and testing candidate decoy oligonucleotide sequences. Candidate decoy oligonucleotide sequences are designed such that each decoy oligonucleotide sequence theoretically hybridizes to a unique target of interest. Thus, the decoy oligonucleotide is designed to be at least 40 nucleotides in length. To identify decoy oligonucleotides of interest, aligners designed to align RNA-seq data were used to identify and align 40 nucleotide portions of the human transcriptome that are unique to the genome. In some embodiments, decoy oligonucleotides are designed to hybridize to a particular exon. In some embodiments, the decoy oligonucleotide is designed to span an exon-exon junction (exon-exon junction). In some embodiments, decoy oligonucleotides can hybridize to a target, thereby being able to recognize spliced and alternatively spliced transcripts in the transcriptome. Alignment is used to identify and classify sequences aligned one or more times with the genome. Each designed bait may be tested against (i.e., compared to) sequences identified in the genome. If the decoy oligonucleotide and sequences in the genome do not match, the decoy may be tested in one or more groups as disclosed herein.
The present disclosure provides a method of designing nucleic acid decoys for full-length cdnas comprising obtaining coding sequences for each transcript (e.g., isoform) of each target gene in a targeted genome. When the coding sequence is less than a threshold length (e.g., 120 base pairs), the complete mRNA sequence may be used instead. The method further comprises, for each 120 base pair sequence in each coding sequence, obtaining a count of the number of transcripts in which the 120 base pair sequence occurs. The subsequences are ordered by the obtained count and the first subsequence is selected from the subsequences that occur in the largest number of transcripts of the corresponding gene. The first subsequence is further subjected to filtering criteria (e.g., uniqueness, mappability, absence of repeated subsequences and/or overall GC content). In some embodiments, if a first subsequence fails to meet one or more filtering criteria, the first subsequence is rejected and a new first subsequence is selected from subsequences that occur in the largest number of transcripts of the corresponding gene that further meet the filtering criteria. In some embodiments, if the first subsequence fails to meet one or more filtering criteria, the first subsequence is modified (e.g., by truncating the first subsequence such that it meets one or more filtering criteria and/or by shifting the first subsequence along the reference genome such that it meets one or more filtering criteria).
The method further includes selecting a second subsequence (e.g., a transcript that is not the first subsequence) from the subsequences that occur in the largest number of remaining transcripts of the corresponding gene. The method is iterated over all remaining transcripts until no transcripts remain (e.g., at least one subsequence of the plurality of selected subsequences is present in each of the plurality of transcripts of the target gene).
The methods provided in the present disclosure improve upon the current technology by utilizing full-length cDNA sequences rather than 3' fragmented sequencing libraries, allowing access to larger regions of reliably annotated sequences for nucleic acid bait design. For example, full-length cDNA sequences contain coding sequences that are generally well annotated, ensuring better targeted hybridization results and mid-targeting rates.
Full-length cDNA sequences, including common sequences shared between transcripts (e.g., isoforms), can be designed to target multiple transcripts and reduce the number of nucleic acid decoys required by a factor of 10. Thus, the resulting plurality of nucleic acid decoys can be reduced in size and linearized, thereby resulting in greater efficiency and lower cost for users who need nucleic acid decoys for large and/or custom targeted genomes. For example, while a typical "tiled" plurality of nucleic acid baits would comprise 38,000 to 68,000 nucleic acid baits for 1-fold coverage of 1000 genomes, a plurality of nucleic acid baits designed using the methods of the present disclosure would comprise 2,500 to 4,000 nucleic acid baits for the same 1000 genomes. Furthermore, the methods of the present disclosure may be used in any application where targeted analysis is desired (e.g., single cell RNA sequencing and/or spatial gene expression profiling). In some cases, the plurality of nucleic acid baits comprises at least 500, at least 1000, at least 2000, at least 3000, at least 4000, or at least 5000 nucleic acid baits.
In some cases, each respective nucleic acid decoy of the plurality of nucleic acid decoys hybridized to the cDNA sequences of the respective genes of the plurality of genes hybridizes (i) selectively to a first subset of transcripts in a corresponding plurality of transcripts of the plurality of transcripts corresponding to the respective genes, or (ii) to another subset of transcripts other than the first subset of transcripts in a corresponding plurality of transcripts of the plurality of transcripts corresponding to the respective genes. Each respective transcript of the plurality of transcripts corresponding to each respective gene of the plurality of genes is hybridizable to a nucleic acid decoy of the plurality of nucleic acid decoys.
For example, the nucleic acid decoy may hybridize to (e.g., may comprise a nucleic acid sequence complementary to) one or more nucleic acid sequences corresponding to the target gene. In some cases, the one or more nucleic acid sequences hybridized to the nucleic acid decoy represent a respective one or more transcripts or isoforms of the target gene. As used herein, a subset of transcripts (e.g., a subset of isoforms) is defined as a set of transcripts (e.g., isoforms) of a target gene that hybridize to a corresponding nucleic acid decoy.
In some embodiments, a nucleic acid decoy that hybridizes to a cDNA sequence of a target gene selectively hybridizes to each of a plurality of transcripts of the corresponding gene. For example, a single nucleic acid decoy may hybridize to all isoforms of a target gene such that a first subset of isoforms consists of multiple isoforms of the target gene. In some such cases, no other subset of isoforms are defined (e.g., multiple isoforms may be grouped into a single or first subset of isoforms).
In some cases, multiple transcripts of a target gene may be subdivided into multiple transcript subsets. The plurality of transcript subsets may include first and second transcript subsets.
In some embodiments, each of the plurality of transcript subsets is defined as a set of transcripts that hybridize to a corresponding nucleic acid decoy. In this case, the first subset of transcripts is defined as the set of transcripts with which at least the first nucleic acid decoy hybridizes, and the second subset of transcripts is defined as the set of transcripts with which at least the second nucleic acid decoy hybridizes. For example, a first set of transcripts (e.g., isoforms) of a target gene that hybridizes to a first nucleic acid decoy is defined as a first subset of transcripts (e.g., a first subset of isoforms). In addition, a second set of transcripts (e.g., isoforms) of the target gene that hybridizes to the second nucleic acid decoy is defined as a second subset of transcripts (e.g., a second subset of isoforms)
In some embodiments, the second subset of transcripts consists of transcripts not included in the first subset of transcripts (e.g., the first subset of isoforms and the second subset of isoforms may include mutually exclusive sets of isoforms).
In some cases, the first subset of isoforms may be selected by counting matches of possible subsequences of a given length (e.g., subsequences of the target gene coding sequence or mRNA sequence) of the candidate bait sequence.
Hybridization matches between each possible decoy subsequence and each position in each isoform were repeated for each isoform of the corresponding gene. The number of isoforms (e.g., comprising complementary sequences) that match each decoy subsequence is counted, the decoy subsequence with the highest number of matches is selected as the first nucleic acid decoy, and the subset of isoforms that match the first nucleic acid decoy is defined as the first subset of isoforms. In some cases, when the corresponding first subset of isoforms fails to account for all isoforms of the target gene, the process is repeated for all remaining isoforms that fail to match the first nucleic acid decoy. Thus, the decoy subsequence having the highest number of matches to the remaining isoforms (e.g., isoforms not included in the first subset of isoforms) is selected as the second nucleic acid decoy, and the second subset of isoforms that match the second nucleic acid decoy is defined as the second subset of isoforms.
In some cases, the process can be repeated as many times as desired until a plurality of nucleic acid decoys are identified such that all of the plurality of transcripts of the respective genes are hybridizable to at least one nucleic acid decoy of the plurality of nucleic acid decoys.
In some embodiments, the plurality of transcript subsets comprises at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten transcript subsets. In some embodiments, each subset of transcripts corresponds to a single nucleic acid decoy. In some alternative embodiments, each transcript subset corresponds to a plurality of nucleic acid decoys.
In some embodiments, the subset of transcripts other than the first subset of transcripts consists of one or more, two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more transcripts. In some embodiments, the plurality of nucleic acid baits comprises at least 2 x 10 3 At least 3 x 10 3 At least 4 x 10 3 At least 5 x 10 3 At least 1 x 10 4 At least 2 x 10 4 At least 3 x 10 4 At least 4 x 10 4 At least 5 x 10 4 At least 6 x 10 4 At least 7 x 10 4 Or at least 1 x 10 5 Nucleic acid decoys.
In some embodiments, the plurality of nucleic acid decoys comprises a minimum number of decoys required for selective hybridization of each respective transcript of the plurality of transcripts corresponding to the respective gene of the plurality of genes. For example, each respective transcript of the plurality of transcripts corresponding to each respective gene of the plurality of genes is capable of hybridizing to at least one nucleic acid decoy of the plurality of nucleic acid decoys.
In some embodiments, each nucleic acid decoy of the plurality of nucleic acid decoys is capable of hybridizing to a single transcript of the plurality of transcripts, and the number of nucleic acid decoys in the plurality of nucleic acid decoys is equal to the number of transcripts in the plurality of transcripts of each respective gene of the plurality of genes. In some embodiments, each nucleic acid decoy of the plurality of nucleic acid decoys is capable of hybridizing to a plurality of transcripts, and the number of nucleic acid decoys in the plurality of nucleic acid decoys is less than the number of transcripts in the plurality of transcripts of each respective gene of the plurality of genes.
In some such embodiments, the decoy coverage for each respective transcript in the plurality of transcripts of the respective gene is less than 1X. In some such embodiments, the bait for each respective isoform of the plurality of isoforms of the first genetic target covers less than 0.8X, less than 0.6X, less than 0.4X, less than 0.2X, or less than 0.1X.
In some embodiments, each respective nucleic acid decoy in the plurality of nucleic acid decoys has less than a threshold percentage of sequence identity to any other nucleic acid decoy in the plurality of nucleic acid decoys. For example, in some embodiments, each respective nucleic acid bait of the plurality of nucleic acid baits has less than 100%, less than 98%, less than 96%, less than 94%, less than 92%, less than 90%, less than 88%, less than 86%, less than 84%, less than 82%, less than 80, less than 70%, less than 60%, less than 50%, or less than 40% identity to any other nucleic acid bait of the plurality of nucleic acid baits. In some embodiments, the percentage sequence identity threshold is 10%, 20%, 30%, or 5% -50%. In some embodiments, a threshold value of shared sequence identity between a respective nucleic acid decoy of the plurality of nucleic acid decoys and any other nucleic acid decoys of the plurality of nucleic acid decoys determines a level of cross-hybridization of each respective nucleic acid decoy to an off-target sequence reading. In some embodiments, each respective nucleic acid decoy in the plurality of nucleic acid decoys comprises a nucleic acid sequence that has at least 90% minimal identity to a reference genome. In some embodiments, each respective nucleic acid decoy in the plurality of nucleic acid decoys comprises a nucleic acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% minimal identity to a reference genome.
In some embodiments, each respective nucleic acid decoy of the plurality of nucleic acid decoys hybridized to a transcript of the plurality of transcripts of the respective genes has a Tm relative to the transcript that is between the first threshold temperature and the second threshold temperature. In some such embodiments, the first threshold temperature is between 55 ℃ and 85 ℃, and the second threshold temperature is between 90 ℃ and 110 ℃ and in some cases hybridization occurs at 65 ℃. In some cases, hybridization occurs at 60 ℃.
In some embodiments, each respective nucleic acid decoy of the plurality of nucleic acid decoys hybridizes to a respective region of the gene that is at least a minimum threshold distance from any annotated start and/or end sites of the respective gene. In some embodiments, a respective nucleic acid decoy of a plurality of nucleic acid decoys mapped to a respective sequence read of a respective gene is located at least a minimum threshold distance from the 3' end of the respective sequence read. In some such embodiments, off-target hybridization of a corresponding nucleic acid decoy to a corresponding cDNA sequence may occur when there are unexpired poly a sites or poly a sequences in the genomic exons and/or mRNA sequences that result in mispriming of oligo dT. Thus, in some such embodiments, the optimal position for hybridization of the nucleic acid decoy to the corresponding cDNA sequence is located at least a minimum threshold distance from the 3' end. Non-limiting examples of minimum threshold distances are 100-200 base pairs (bp), 200-300bp, 300-400bp, 400-500bp, 500-600bp, 600-700bp, 700-800bp, 800-900bp, 900-1000bp, or more than 1000bp. In some embodiments, the percentage of cDNA sequences that preferentially hybridize to the nucleic acid decoy at least a minimum threshold distance from the 3' end is between 0% and 10%, between 10% and 20%, or between 20% and 30%. Furthermore, in some embodiments, nucleic acid decoys comprising unannotated poly-a sites or poly-a sequences in the mRNA sequence are removed from the plurality of nucleic acid decoys. In some cases, a modified decoy is designed if the sequence in the decoy oligonucleotide and the sequence in the genome match. To prepare a modified bait, the original sequence can be slid +/-40bp from the original position to identify potentially new decoy oligonucleotides. For each design, the new decoy oligonucleotides were tested against the genome. After all such candidates are classified, the decoy oligonucleotides that are ultimately included in one or more of the sets described herein are sorted (i.e., ordered) according to the decoy oligonucleotide length (i.e., longer decoys are prioritized), and then by distance from the original intended position (sequence that is closer to the original intended position is prioritized). However, if no bait meets the required criteria, then discarding the bait does not design the bait at that location.
In some embodiments, the decoy oligonucleotide sequence is 40 nucleotides in length. In some embodiments, the decoy oligonucleotide sequence is 40-160 nucleotides in length. In some embodiments, the decoy oligonucleotide sequence is 40-120 nucleotides in length. In some embodiments of the present invention, in some embodiments, the bait oligonucleotide sequences have a length of about 40, about 41, about 42, about 43, about 44, about 45, about 46, about 47, about 48, about 49, about 50, about 51, about 52, about 53, about 54, about 55, about 56, about 57, about 58, about 59, about 60, about 61, about 62, about 63, about 64, about 65, about 66, about 67, about 68, about 69, about 70, about 71, about 72, about 73, about 74, about 75, about 76, about 77, about 78, about 79, about 80, about 81, about 82, about 83, about 84, about 85, about 86, about 87, about 88, about 89, about 90, about 91, about 92, about 93, about 94, about 95, about 96, about 97, about 98, about 99, about 100, about 101, about about 102, about 103, about 104, about 105, about 106, about 107, about 108, about 109, about 110, about 111, about 112, about 113, about 114, about 115, about 116, about 117, about 118, about 119, about 120, about 121, about 122, about 123, about 124, about 125, about 126, about 127, about 128, about 129, about 130, about 131, about 132, about 133, about 134, about 135, about 136, about 137, about 138, about 139, about 140, about 141, about 142, about 143, about 144, about 145, about 146, about 147, about 148, about 149, about 150, about 151, about 152, about 153, about 154, about 155, about 156, about 157, about 158, about 159 or about 160 nucleotides. In some cases, the decoy oligonucleotide is a single-stranded 120 nucleotide long DNA oligonucleotide with a 5' biotin modification. In some cases, each decoy targets a unique library molecule. Decoy oligonucleotides can span all mature mRNA sequences, including UTRs and all annotated isoforms.
In some embodiments, the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise domains that specifically bind to all or part of the spatial barcode or its complement. In some embodiments, the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise domains that specifically bind all or part of the sequence of an analyte or complement thereof from a biological sample. In some embodiments, the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise domains that specifically bind all or part of the spatial barcode or its complement or all or part of the sequence of an analyte or its complement from a biological sample. In some embodiments, the domain of the decoy oligonucleotide hybridizes to the analyte of interest. In some embodiments, the domain of the decoy oligonucleotide specifically binds to the analyte of interest. In some embodiments, the domain of the decoy oligonucleotide specifically binds to all or part of the spatial barcode or its complement. In some embodiments, the domain of the decoy oligonucleotide specifically binds to all or part of the sequence of the analyte from the biological sample. In some embodiments, the domain of the decoy oligonucleotide specifically binds to a 3' portion of the sequence of the analyte or complement thereof from the biological sample. In some embodiments, the domain of the decoy oligonucleotide specifically binds to the 5' portion of the sequence of the analyte or complement thereof from the biological sample. In some embodiments, the domain of the decoy oligonucleotide specifically binds to an intron in the sequence of the analyte or complement thereof from the biological sample. In some embodiments, the domain of the decoy oligonucleotide specifically binds to an exon in the sequence of an analyte or complement thereof from a biological sample. In some embodiments, the domain of the decoy oligonucleotide specifically binds to a 3' untranslated region of an analyte or its complement from a biological sample. In some embodiments, the domain of the decoy oligonucleotide specifically binds to a 5' untranslated region of an analyte or its complement from a biological sample.
In some embodiments, the domain of the decoy oligonucleotide sequence is 40 nucleotides in length. In some embodiments, the domain of the decoy oligonucleotide sequence is 40-160 nucleotides in length. In some embodiments, the domain of the decoy oligonucleotide sequence is 40-120 nucleotides in length. In some embodiments of the present invention, in some embodiments, the domains of the bait oligonucleotide sequences have a length of about 40, about 41, about 42, about 43, about 44, about 45, about 46, about 47, about 48, about 49, about 50, about 51, about 52, about 53, about 54, about 55, about 56, about 57, about 58, about 59, about 60, about 61, about 62, about 63, about 64, about 65, about 66, about 67, about 68, about 69, about 70, about 71, about 72, about 73, about 74, about 75, about 76, about 77, about 78, about 79, about 80, about 81, about 82, about 83, about 84, about 85, about 86, about 87, about 88, about 89, about 90, about 91, about 92, about 93, about 94, about 95, about 96, about 97, about 98, about 99, about 100 about 101, about 102, about 103, about 104, about 105, about 106, about 107, about 108, about 109, about 110, about 111, about 112, about 113, about 114, about 115, about 116, about 117, about 118, about 119, about 120, about 121, about 122, about 123, about 124, about 125, about 126, about 127, about 128, about 129, about 130, about 131, about 132, about 133, about 134, about 135, about 136, about 137, about 138, about 139, about 140, about 141, about 142, about 143, about 144, about 145, about 146, about 147, about 148, about 149, about 150, about 151, about 152, about 153, about 154, about 155, about 156, about 157, about 158, about 159, or about 160 nucleotides.
In some embodiments, the analyte from the biological sample is associated with a disease or disorder. In some embodiments, the analyte from the biological sample comprises a mutation. In some embodiments, the analyte from the biological sample comprises a Single Nucleotide Polymorphism (SNP). In some embodiments, the analyte from the biological sample comprises a trinucleotide repeat.
In some embodiments, the domain of the decoy oligonucleotide hybridizes to a specific exon of a transcript (i.e., an mRNA molecule). For example, transcripts may be processed such that exons to be excised in a normal setting are contained in mature mRNA products in different environments (e.g., pathological environments such as cancer). In some embodiments, the domain of the decoy oligonucleotide recognizes (e.g., hybridizes to) one or more isoforms or analytes, but not others. In some embodiments, for example, decoy oligonucleotides hybridize to specific exons detected in a pathological environment (e.g., cancer).
In some embodiments, there is more than one decoy oligonucleotide for a particular analyte of interest in the set. For example, the analyte may undergo alternative splicing (e.g., as in an mRNA molecule). In some embodiments, different baits may detect and enhance a particular transcript, thereby detecting whether a particular exon is contained in the analyte. Thus, in some embodiments, for one analyte, a panel will include at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, or more decoy oligonucleotides.
In some embodiments, the decoy oligonucleotide is fully complementary (i.e., 100% complementary) to a portion of the target analyte. In some embodiments, the decoy oligonucleotide is partially complementary (i.e., less than 100% complementary) to a portion of the analyte of interest. In some embodiments, the decoy oligonucleotide has at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a portion of the analyte of interest.
In some embodiments, the decoy oligonucleotide is partially complementary (i.e., less than 100% complementary) to a portion of the analyte of interest. In some embodiments, the decoy oligonucleotide is partially complementary (i.e., less than 100% complementary) to a portion of the analyte of interest. In some embodiments, a portion of the decoy oligonucleotide hybridizes to a portion of the analyte of interest. In some embodiments, a portion of the decoy oligonucleotide has at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to a portion of the analyte of interest.
Decoy oligonucleotides are included in one or more groups (e.g., clusters). The panel includes a panel of decoy oligonucleotides targeted to a particular group of analytes. For example, a panel may include a collection of decoy oligonucleotides specific to a pathological environment. In some embodiments, more than one set (e.g., a pool of decoy oligonucleotides) may be used to enhance detection of the analyte. For example, in some embodiments, at least 1, at least 2, at least 3, at least 4, at least 5, or more groups (e.g., a collection of decoy oligonucleotides) may be used to enhance detection of an analyte. All sets of decoy oligonucleotides and target analytes disclosed are provided in the accompanying sequencing list disclosed herein.
In some embodiments, the panel includes decoy oligonucleotides that hybridize and enhance detection of an analyte of interest in a cancer (e.g., a cancer panel). In some embodiments, the cancer group is capable of quantifying analytes of aberrant expression in the cancer transcriptome while avoiding the additional costs and time associated with sequencing the entire exome. Decoy oligonucleotides and target analytes for an exemplary cancer panel are provided in table 1 and the accompanying sequencing list disclosed herein. In some embodiments, decoy oligonucleotides in a cancer group can include analytes that detect disorders associated with biological processes such as apoptosis, metabolism, cell cycle (e.g., checkpoint analytes), DNA damage and repair, hypoxia, and stress toxicity. More specifically, decoy oligonucleotides in the cancer group target cancer specific analytes, such as analytes that function in pathways including, but not limited to, myc pathway, hippo pathway, RTK/RAS pathway, TP53 and TP53 related pathway, tfgβ pathway, and Wnt pathway. Decoy oligonucleotides in the cancer group can also detect cancer hotspots, tumor suppressor analytes, and immune disorder analytes. Decoy oligonucleotides and target analytes for an exemplary cancer set are disclosed in U.S. patent application No. 62/970,066 (titled "Capture targeted genetic target using hybridization/Capture method)") and U.S. patent application No. 62/929,686 (titled "Capture targeted genetic target using hybridization/Capture method)", each of which is incorporated herein by reference in its entirety.
In some embodiments, the panel includes decoy oligonucleotides that hybridize and enhance detection of analytes associated with immune disorders (e.g., an immunological panel). In some embodiments, the immunology group is able to quantitatively analyze analytes associated with immune disorders while avoiding the additional costs and time associated with sequencing the entire exome. Exemplary immunological sets of decoy oligonucleotides and target analytes are provided in table 2 and the accompanying sequencing list disclosed herein. In some embodiments, decoy oligonucleotides in an immunological set may include detecting deregulated analytes associated with biological processes such as B cell function, T cell function, cell cycle, cell signaling, interleukin signaling, and metabolism. More specifically, decoy oligonucleotides target analytes, including but not limited to transcription factors, T cell activation markers, antigen presentation genes, metabolic genes, and SIRP family members. In some embodiments, the immunology group has applications in detecting immune disorders related to examining innate immunity, adaptive immunity, one or more inflammatory responses, detecting one or more infectious diseases (e.g., bacterial infection; viral infection), and immune responses of transplant recipients. More specifically, decoy oligonucleotides in the immunology group target immunospecific biomarkers including, but not limited to, lineage markers, tissue markers, and cancer markers. In some embodiments, the decoy oligonucleotide enhances detection of an analyte expressed in bone marrow, intestine, lung, salivary gland, intestine, lymph node, stem cells, or a combination thereof. Exemplary sets of decoy oligonucleotides and target analytes are disclosed in U.S. patent application No. 62/970,066 (titled "Capture targeted genetic target using hybridization/Capture method)", and U.S. patent application No. 62/929,686 (titled "Capture targeted genetic target using hybridization/Capture method)", each of which is incorporated herein by reference in its entirety.
In some embodiments, the panel includes decoy oligonucleotides that hybridize and enhance detection of analytes that detect a deregulated pathway (e.g., a pathway panel or a gene signature panel). In some embodiments, the set of pathways is capable of quantifying analytes associated with deregulation of pathways while avoiding the additional costs and time associated with sequencing the entire set of exons. Decoy oligonucleotides and target analytes for an exemplary set of pathways are provided in table 3 and the accompanying sequencing list disclosed herein. In some embodiments, decoy oligonucleotides in a set of pathways may include detecting deregulated analytes associated with complex signal transduction pathways. Target analytes include analytes specific for a disease or drug target; g protein-coupled receptors (GPCRs), one or more kinases; one or more epigenetic markers; or one or more checkpoint analytes. Analytes in the pathway group that are detected by decoy oligonucleotides include, but are not limited to, tissue markers of cancer, central nervous system, inflammatory, metabolic, cardiovascular, respiratory, and reproductive disorders. Exemplary sets of decoy oligonucleotides and target analytes are disclosed in U.S. patent application No. 62/970,066 (titled "Capture targeted genetic target using hybridization/Capture method)", and U.S. patent application No. 62/929,686 (titled "Capture targeted genetic target using hybridization/Capture method)", each of which is incorporated herein by reference in its entirety.
In some embodiments, the panel includes decoy oligonucleotides that hybridize and enhance detection of analytes that detect neurodevelopment and/or dysregulation. In some embodiments, the panel includes decoy oligonucleotides that hybridize and enhance detection of analytes that detect neurodevelopment. In some embodiments, the panel includes decoy oligonucleotides that hybridize and enhance detection of analytes that detect neurological disorders. In some embodiments, the panel includes decoy oligonucleotides that hybridize and enhance detection of analytes that detect neurodevelopment and dysregulation. In some embodiments, the neurones are able to quantitatively analyze analytes associated with deregulation of the pathway while avoiding the additional costs and time associated with sequencing the entire transcriptome. Exemplary neural sets of decoy oligonucleotides and target analytes are provided in table 4 and the accompanying sequencing list disclosed herein. In some embodiments, decoy oligonucleotides in the nerve group may include detection of deregulated analytes associated with axonal targeting, hypoxia, and glioblastoma. In some embodiments, decoy oligonucleotides in the nerve group may include detecting a deregulated analyte associated with axonal targeting. In some embodiments, decoy oligonucleotides in the nerve group may include detecting dysregulated analytes associated with hypoxia. In some embodiments, decoy oligonucleotides in the nerve group may include detecting a deregulated analyte associated with glioblastoma. In some embodiments, decoy oligonucleotides in the nerve group may include detecting a gene encoding a mitochondrial protein. In some embodiments, decoy oligonucleotides in the nerve group may include detecting genes encoding mitochondrial proteins to assess energy metabolism.
In some cases, additional (i.e., custom) baits may be added to each group. Furthermore, in some cases, fully customized groups may be prepared.
In some embodiments, any of the sets disclosed herein include decoy oligonucleotides that can enhance detection of at least about 100 analytes, about 200 analytes, about 300 analytes, about 400 analytes, about 500 analytes, about 600 analytes, about 700 analytes, about 800 analytes, about 900 analytes, about 1000 analytes, about 1100 analytes, about 1200 analytes, about 1300 analytes, about 1400 analytes, about 1500 analytes, about 1600 analytes, about 1700 analytes, about 1800 analytes, about 1900 analytes, about 2000 analytes, or more.
In some embodiments, the decoy oligonucleotide increases detection of the analyte of interest by about 1.5-fold, about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 20-fold, about 50-fold, about 100-fold, 500-fold, 1000-fold, or more as compared to an analyte not detected using the decoy oligonucleotide.
In some embodiments, the decoy oligonucleotide comprises a molecular tag. As disclosed herein, the molecular tag of the decoy oligonucleotide is attached to (e.g., coupled to) the nucleic acid sequence of the decoy oligonucleotide. In some embodiments, the molecular tag includes one or more moieties. In some embodiments, the moiety comprises a label as described herein. The label is capable of detecting hybridization of the decoy oligonucleotide to the analyte. In some embodiments, the label is directly associated (i.e., coupled) with the decoy oligonucleotide. The detectable label may be directly detectable by itself (e.g., a radioisotope label or a fluorescent label), or, in the case of an enzymatic label, may be indirectly detectable, e.g., by catalyzing a chemical change in a chemical substrate compound or composition that is directly detectable. The detectable label may be suitable for small scale detection and/or for high throughput screening. Thus, suitable detectable labels include, but are not limited to, radioisotopes, fluorophores, chemiluminescent compounds, bioluminescent compounds, and dyes.
In some embodiments, the molecular tag comprises a small molecule. In some embodiments, the molecular tag comprises a nucleic acid. In some embodiments, the target nucleic acid is single stranded. In some embodiments, the target nucleic acid is double stranded. In some embodiments, the nucleic acid is RNA. In some embodiments, the nucleic acid is DNA. In some embodiments, the molecular tag comprises a carbohydrate. In some embodiments, the molecular tag is located 5' to the domain in the decoy oligonucleotide. In some embodiments, the molecular tag is located 3' of the domain in the decoy oligonucleotide.
In some embodiments, the agent that specifically binds to the molecular tag comprises a protein. In some embodiments, the protein is an antibody. In some embodiments, the agent that specifically binds to the molecular tag comprises a nucleic acid. In some embodiments, the agent that specifically binds to the molecular tag comprises a small molecule. In some embodiments, an agent that specifically binds to a molecular tag is attached to a substrate. In some embodiments, the substrate is a bead. In some embodiments, the substrate is a well. In some embodiments, the substrate is a slide.
In some embodiments, the moiety is biotin. In some embodiments, the biotin molecule is directly associated (i.e., coupled) with the decoy oligonucleotide at the 3' end. In some embodiments, the biotin molecule is directly associated (i.e., coupled) with the decoy oligonucleotide at the 5' end. In some embodiments, and as disclosed below, a biotin molecule can be associated (e.g., coupled) with an avidin molecule, thereby enabling pull-down (pulldown) of the analyte. In some embodiments, and as disclosed below, a biotin molecule can be associated (e.g., coupled) with a streptavidin molecule, thereby enabling the pull-down of the analyte.
In some embodiments, the decoy oligonucleotide does not include a portion attached to the sequence (i.e., the decoy oligonucleotide is a naked decoy oligonucleotide).
(ii) Hybridization method
In some cases, nucleic acids from a single nucleic acid library (i.e., a single sample) are hybridized to an oligonucleotide decoy. In some cases, multiple samples may be pooled prior to hybridization. . In some cases, at least 2, 3, 4, 5, 6, 7, 8, or more samples are pooled. In some cases, each library is complexed with a barcode specific for a particular nucleic acid set. In some cases, the collection library is from the same cell or tissue type. In some cases, the collection library is from a different cell or tissue type.
In an arrangement where multiple samples from different points on one or more arrays are pooled, the pooling calculation takes into account the number of points covered by the tissue on the array. In some cases, the number of gene expression points covered by tissue may be estimated visually or more accurate measurements made by using an automated calculation method. In an arrangement having an array of about 5000 spots, the number of spots covered by a sample on the array can be calculated by multiplying the percentage of coverage by the total number of gene expression spots (e.g., about 5000).
In some cases, a universal blocker is added to the pool during the step of prehybridization pooling nucleic acids. In some embodiments, human Cot DNA is added to the collection. Human Cot DNA is rich in repetitive non-coding elements common in genomic DNA. These repeated sequences often result in non-specific binding during the hybridization reaction. The addition of Cot DNA to these reactions reduced non-specific binding associated with these repeated sequences, improving accuracy.
In some cases, the decoy oligonucleotide hybridization methods disclosed herein include a variety of buffers and components to aid in nucleic acid capture. In some cases, the method includes a hybridization enhancer, a wash buffer, an equilibration buffer, a hybridization buffer, or any combination thereof. Any buffer may be in concentrated form, which may be diluted to an appropriate concentration by one skilled in the art using, for example, nuclease-free water. In some cases, the component comprises streptavidin beads. In some cases, the component comprises avidin beads.
In some embodiments, wherein the RNA is a nucleic acid analyte, one or more RNA analytes of interest may be selectively enriched. For example, one or more RNAs of interest may be selected by adding one or more oligonucleotides to the sample. In some embodiments, the other oligonucleotides are sequences for initiating a reaction by a polymerase. For example, one or more primer sequences having sequence complementarity to one or more RNAs of interest may be used to amplify one or more RNAs of interest, thereby selectively enriching those RNAs. In some embodiments, an oligonucleotide (e.g., probe) having sequence complementarity to the complementary strand of the captured RNA (e.g., cDNA) can bind to the cDNA. For example, biotinylated oligonucleotides having sequences complementary to one or more cdnas of interest can be bound to the cdnas and can be selected using any of a variety of methods known in the art (e.g., streptavidin beads) using biotinylation-streptavidin affinity. He is known in the art as a non-nucleic acid affinity moiety, such as 2- (4-hydroxyphenylazo) benzoic acid (HABA) or a compound listed in Table 5.
Alternatively, any of a variety of methods may be used to downwardly select (e.g., remove) one or more RNAs. For example, probes can be administered to a sample that selectively hybridizes to ribosomal RNA (rRNA), thereby reducing the collection and concentration of rRNA in the sample. Subsequent application of the capture probe to the sample may result in improved capture of other types of RNA due to the reduction of non-specific RNA present in the sample.
(1) Direct hybridization of decoy oligonucleotides
After creating a plurality of nucleic acids or libraries generated using the same, one or more decoy oligonucleotides can be used to enrich for one or more nucleic acids of interest.
In some embodiments, decoy oligonucleotides are added to a plurality of nucleic acids (e.g., a nucleic acid library). In some embodiments, the plurality of nucleic acids includes a nucleic acid having a partial sequence of a spatial barcode or complement thereof, an analyte from a biological sample, or complement thereof. In some embodiments, the spatial barcode includes a sequence corresponding to a region of interest in the biological sample. In some embodiments, the spatial barcode is capable of detecting and correlating to a particular region of interest in the biological sample. In some embodiments, the methods disclosed herein include identifying a region of interest. In some embodiments, the spatial barcode provides information about the location of the analyte in the biological sample. In some embodiments, the methods disclosed herein comprise identifying the location and/or abundance of an analyte in a biological sample.
In some embodiments, decoy oligonucleotides are added to a plurality of nucleic acids. In some embodiments, the decoy oligonucleotide comprises a domain that specifically binds to all or part of a spatial barcode or complement thereof and/or all or part of the sequence of an analyte or complement thereof. In some embodiments, complexes of decoy oligonucleotides that specifically bind to nucleic acids may be enriched. For example, the decoy oligonucleotide may include a molecular tag, and a reagent that specifically binds to the molecular tag may be used to enrich for complexes of the decoy oligonucleotide that specifically bind to nucleic acids. In some embodiments, the molecular tag may be attached (directly or indirectly) to a substrate (e.g., a slide, well, or bead). In some embodiments, the molecular tag may include a protein, a nucleic acid, a carbohydrate, a small molecule, or any combination thereof. In some embodiments, the agent that specifically binds to the molecular tag may be a protein, a nucleic acid, a carbohydrate, a small molecule, or any combination thereof. In some embodiments, the molecular tag may be avidin and the agent that specifically binds to the molecular tag may be biotin. In some embodiments, the molecular tag may be streptavidin and the agent that specifically binds the molecular tag may be biotin.
In some embodiments, the molecular tag can be biotin and the agent that specifically binds to the molecular tag can be avidin or streptavidin (e.g., streptavidin attached to a bead). In some cases, the beads are prepared using methods known in the art. In some cases, a buffer (e.g., equilibration buffer) is used to suspend the beads. In some cases, the beads are separated from the buffer using methods known in the art (e.g., using a magnetic separator; centrifugation). In some cases, the beads are resuspended in buffer for capture of the decoy oligonucleotides.
In some embodiments where the molecular tag is biotin and the method includes using avidin or streptavidin beads to enrich for nucleic acid complexed with a decoy oligonucleotide, the streptavidin beads may be washed using any method known in the art. In some embodiments, streptavidin beads may be washed 1, 2, 3, 4, 5, 6, 7, or more times. In some embodiments, streptavidin beads may be washed strictly 1, 2, 3, 4, 5, 6, 7 or more times. In some embodiments, streptavidin bead washing can be at about 15 ℃, about 20 ℃, about 25 ℃, about 30 ℃, about 35 ℃, about 40 ℃, about 45 ℃, about 48 ℃, about 50 ℃, about 55 ℃, about 60 ℃, about 62 ℃, about 65 ℃, about 67 ℃, about 70 ℃, about 75 ℃, or more. In some embodiments, streptavidin beads may be washed at about 67 ℃. In some cases, the temperature of the avidin or streptavidin bead wash is the same as the temperature of the decoy oligonucleotide hybridization step (e.g., 60 ℃ or 65 ℃). In some embodiments, after one or more wash steps, one or more nucleic acids that hybridize to one or more decoy oligonucleotides can be recovered and enriched.
In some embodiments, the recovered nucleic acid may be released from one or more decoy oligonucleotides and purified to remove avidin or streptavidin and biotin (or any other molecular tag and reagent that specifically binds to the molecular tag).
In some embodiments, the molecular tag can be biotin and the agent that specifically binds to the molecular tag can be avidin (e.g., avidin attached to a bead). In some embodiments where the molecular tag is biotin and the method includes using the avidin beads to enrich for nucleic acids complexed with decoy oligonucleotides, the avidin beads may be washed using any method known in the art. In some embodiments, the avidin beads may be washed 1, 2, 3, 4, 5, 6, 7 or more times. In some embodiments, avidin beads may be washed strictly 1, 2, 3, 4, 5, 6, 7 or more times. In some embodiments, the avidin bead wash can be at about 15 ℃, about 20 ℃, about 25 ℃, about 30 ℃, about 35 ℃, about 40 ℃, about 45 ℃, about 48 ℃, about 50 ℃ or higher. In some embodiments, after one or more wash steps, one or more nucleic acids that hybridize to one or more decoy oligonucleotides can be recovered and enriched. In some embodiments, a plurality of decoy oligonucleotides can be used in any of the methods described herein to enrich for one or more nucleic acids of interest from a plurality of nucleic acids. In some embodiments, the plurality of decoy oligonucleotides are designed to enrich for one or more nucleic acids including all or part of the sequence of an analyte of interest (e.g., one or more genes that function or are abnormally expressed in a particular cellular state or pathway) or a complement thereof. For example, in some embodiments, a plurality of decoy oligonucleotides can be used to enrich for nucleic acids including all or a portion of the sequence of a cancer-associated transcript or complement thereof (e.g., U.S. patent application No. 62/970,066 (entitled "Capture of targeted genetic target using hybridization/Capture method)") and U.S. patent application No. 62/929,686 (entitled "Capture of targeted genetic target using hybridization/Capture method)"), each of which is incorporated herein by reference in its entirety. In some embodiments, a plurality of decoy oligonucleotides can be used to enrich for nucleic acids including all or a portion of the sequence of an immune-related transcript or its complement (e.g., as disclosed in U.S. patent application No. 62/970,066 (entitled "Capture of targeted genetic target using hybridization/Capture method)") and U.S. patent application No. 62/929,686 (entitled "Capture of targeted genetic target using hybridization/Capture method)", each of which is incorporated herein by reference in its entirety). In some embodiments, a plurality of decoy oligonucleotides can be used to enrich for nucleic acids including all or part of the sequence of a pathway-specific transcript or its complement (e.g., U.S. patent application No. 62/970,066 (entitled "Capture of targeted genetic target using hybridization/Capture method)") and U.S. patent application No. 62/929,686 (entitled "Capture of targeted genetic target using hybridization/Capture method)"), each of which is incorporated herein by reference in its entirety. In some embodiments, a plurality of decoy oligonucleotides can be used to enrich for nucleic acids comprising all or part of the sequence of a neurological-specific transcript or its complement, as set forth in table 4 and the accompanying sequence listing.
In some embodiments, after hybridization of one or more decoy oligonucleotides to their target nucleic acids, the hybridized nucleic acids are enriched, thereby producing a nucleic acid set enriched for the particular nucleic acid of interest. In some cases, after hybridization and binding to streptavidin or avidin beads, the unhybridized nucleic acid is washed out of the sample, thereby enriching the nucleic acid of interest. In some embodiments, after hybridization of one or more decoy oligonucleotides to their target nucleic acids, the unhybridized nucleic acids are degraded (e.g., by nucleases), thereby enriching for hybridized nucleic acids. In some embodiments, after hybridization of one or more decoy oligonucleotides, the hybridized nucleic acids are degraded (e.g., by nucleases), thereby enriching for unhybridized nucleic acids; for example, the techniques may be used to reduce the amount of high abundance nucleic acids that are not of interest. In some embodiments, the decoy oligonucleotide may not include a detectable moiety. In some embodiments, the enriched nucleic acid is purified. In some embodiments, the enriched nucleic acid is sample indexed (e.g., prior to sequencing).
In some embodiments, the decoy oligonucleotide hybridizes to the nucleic acid at about 40 ℃, about 45 ℃, about 50 ℃, about 55 ℃, about 60 ℃, about 65 ℃, about 70 ℃, about 75 ℃, about 80 ℃ or more. In some embodiments, the decoy oligonucleotide hybridizes to the nucleic acid for at least 15 minutes, at least 30 minutes, at least 45 minutes, at least 1 hour, at least 90 minutes, at least 2 hours, at least 3 hours, at least 4 hours, at least 5 hours, or more.
In some embodiments, the decoy oligonucleotide hybridizes to the nucleic acid for about 2 hours. In some cases, the decoy oligonucleotide hybridizes to the nucleic acid overnight (e.g., at least about 8 hours, about 9 hours, about 10 hours, about 11 hours, about 12 hours, or more). In some cases, the decoy oligonucleotide hybridizes to the nucleic acid for about 6 hours, about 7 hours, about 8 hours, about 9 hours, about 10 hours, about 11 hours, about 12 hours, or more. In some cases, after hybridization, the sample comprising decoy oligonucleotides hybridized to the nucleic acid of interest is washed. In some cases, washing is performed using any of the washing solutions described herein. In some cases, the washing occurs at 65 ℃. In some cases, the washing occurs at 60 ℃. In some cases, hybridization occurs overnight at 60 ℃, followed by a subsequent wash step at 60 ℃. In the case of nucleic acid molecules having an average length of less than 700 base pairs, hybridization should be extended to 60℃overnight. In this setup, the subsequent wash is performed at 60 ℃. In the case where the library set comprises nucleic acid molecules of different lengths (e.g., if the set comprises short (e.g., < 700 bp) and long (e.g., > 700 bp)), hybridization can be extended to 60 ℃ overnight, followed by washing at 60 ℃. For example, a percentage (e.g., about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%) of the nucleic acid fraction can be less than 700 base pairs, so the alternative method is overnight at 60 ℃, followed by washing at 60 ℃.
In some embodiments, one or more detectable moieties may be bound (e.g., directly or indirectly linked) to the decoy oligonucleotide. In some embodiments, one or more detectable moieties can be used to detect (or enhance detection of) decoy oligonucleotides (e.g., decoy oligonucleotides that hybridize to nucleic acids).
In some embodiments, the enriched nucleic acid may be amplified. After amplification, the enriched and amplified nucleic acids can be used to generate a nucleic acid library and sequenced using any method known in the art, including the exemplary sequencing methods described herein. In some embodiments, sequencing may include determining all or part of the sequence of a spatial barcode or its complement in a nucleic acid. In some embodiments, sequencing comprises determining all or part of the sequence of an analyte or its complement from a biological sample in a nucleic acid. In some embodiments, sequencing comprises high throughput sequencing.
In some cases, the amplification method includes a step that includes a composition that includes a buffer and a library primer set. Thermal cyclers can be used to amplify enriched nucleic acid libraries (e.g., cycling steps at 98 ℃, 67 ℃, and 72 ℃). It will be appreciated that one skilled in the art can determine temperature and run time parameters to amplify a nucleic acid library. In some cases, the total number of cycles of amplification may be varied to optimize detection of one or more decoy oligonucleotides. In some cases, the number of PCR cycles performed to amplify the library is at least 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more cycles. It will be appreciated that if the expected expression of a given target is lower, more cycles are required to detect the target. In some cases, amplification can be performed while the beads (e.g., affinity cords or streptavidin) are in the mixture. In some cases, one or more beads may be removed using methods known in the art (e.g., using a magnet).
In some cases, the average fragment size of the nucleic acids in the library may be determined after amplification. In some cases, an automated electrophoresis system (e.g., tapeStation from Agilent) may be run as a quality control for nucleic acid library samples. In some cases, the size, number, and integrity of the samples may be analyzed prior to downstream analysis (e.g., sequencing).
In some embodiments, targeted spatial gene expression profiling using one or more sets (e.g., cancer set, immune set, pathway set, or nerve set) as described herein can enrich for about 1-fold, about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 20-fold, about 50-fold, about 100-fold, or more target nucleic acids as compared to unbiased spatial profiling. In some embodiments, target nucleic acids may be enriched about 1-fold, about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 20-fold, about 50-fold, about 100-fold, or more of each gene using targeted spatial gene expression profiling of one or more groups (e.g., cancer group, immune group, pathway group, or nerve group) as described herein as compared to unbiased spatial profiling.
In some embodiments, using targeted spatial gene expression profiling of one or more groups (e.g., cancer group, immune group, pathway group, or nerve group) as described herein can increase the percentage of mid-target reads by about 1-fold, about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 20-fold, about 50-fold, about 100-fold, or more as compared to unbiased spatial profiling. In some embodiments, the number of mid-target reads using one or more sets of targeted spatial gene expression profiling as described herein is at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% compared to using unbiased spatial profiling. In some embodiments, using targeted spatial gene expression profiling of a group as described herein (e.g., cancer group, immune group, pathway group, or nerve group) can reduce the percent of off-target reads by about 1-fold, about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 20-fold, about 50-fold, about 100-fold, or more compared to an unbiased spatial profiling.
In some embodiments, detection of enriched target analytes is analyzed using targeted spatial gene expression profiles of one or more groups (e.g., cancer group, immune group, pathway group, or nerve group) as described herein. In some embodiments, detection of one or more analytes of interest is enriched using targeted spatial gene expression profiling analysis of one or more groups (e.g., cancer group, immune group, pathway group, or nerve group) as described herein. In some embodiments, enriching comprises increasing the number of sequencing reads detected when sequencing a plurality of nucleic acids and/or probes. In some embodiments, enriching comprises increasing the number of sequencing reads detected when sequencing a plurality of nucleic acids after avidin-biotin pull-down as described herein. In some embodiments, enriching comprises increasing the number of sequencing reads detected when sequencing a plurality of nucleic acids after streptavidin-biotin pulldown as described herein. In some embodiments, the number of enriched target analyte sequence reads is increased by about 50%, about 55%, about 60%, about 62%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 1-fold, about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 20-fold, about 50-fold, about 100-fold, or more compared to the number of sequence reads in the same target analyte when whole locus sequencing is performed.
In some embodiments, enriched target analyte sequence reads obtained from targeted spatial gene expression profiling using one or more sets (e.g., cancer set, immune set, pathway set, or nerve set) as described herein are highly correlated with sequence reads when performing unbiased (e.g., non-targeted) spatial gene expression profiling in the same biological sample. In some embodiments, at least 50%, at least 60%, at least 70%, at least 80%, or at least 90% umi detected from the enriched analyte sequence reads matches the sequence reads when subjected to unbiased (e.g., non-targeted) spatial gene expression profiling in the same biological sample. In some embodiments, the matched UMI count R2 is at least 0.95, at least 0.96, at least 0.97, at least 0.98, or at least 0.99.
In some embodiments, the number of reads (reads/points) from each point obtained from targeted spatial gene expression profiling using one or more sets (e.g., cancer set, immune set, pathway set, or nerve set) described herein is less than 20%, less than 15%, less than 10%, less than 9%, less than 8%, less than 7%, less than 6%, less than 5%, less than 4%, less than 3%, less than 2%, or less than 1% as compared to the number of reads (reads/points) from each point obtained from unbiased (e.g., non-targeted) spatial gene expression profiling. In some embodiments, the total reading obtained from targeted spatial gene expression profiling using one or more of the groups (e.g., cancer group, immune group, pathway group, or nerve group) described herein is less than 20%, less than 15%, less than 10%, less than 9%, less than 8%, less than 7%, less than 6%, less than 5%, less than 4%, less than 3%, less than 2%, or less than 1% as compared to the total reading obtained from unbiased (e.g., non-targeted) spatial gene expression profiling. In some embodiments, while the total reads and/or reads/spots are reduced by using one or more sets of targeted spatial gene expression profiling as described herein, the biological pattern (e.g., spatial pattern of specific gene expression in a biological sample) remains consistent compared to the results obtained from unbiased (e.g., non-targeted) spatial gene expression profiling.
In some embodiments, about 50%, about 55%, about 60%, about 62%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95% or more of the group gene recovery can be obtained using targeted spatial gene expression profiling of one or more groups (e.g., cancer group, immune group, pathway group, or nerve group) as described herein relative to unbiased spatial gene expression profiling.
In some embodiments, about 50%, about 55%, about 60%, about 62%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95% or more of the group UMI recovery and/or UMI retention can be obtained using targeted spatial gene expression profiling of one or more groups (e.g., cancer group, immune group, pathway group or nerve group) as described herein relative to unbiased (e.g., non-targeted) spatial gene expression profiling. In some embodiments, targeted spatial gene expression profiling using one or more groups (e.g., cancer group, immune group, pathway group, or nerve group) as described herein can reduce the complexity (or complexity) of gene or UMI targeting to about 90%, about 80%, about 70%, about 60%, about 50%, about 40%, about 30%, about 20% or less of an unbiased (e.g., non-targeted) spatial gene expression profiling.
In some embodiments, the double index library may be used for targeted spatial gene expression profiling. In some embodiments, the individually indexed libraries may be mixed by hybridization/capture prior to the enrichment step to generate targeted libraries for downstream high throughput sequencing. In some embodiments, the individually indexed libraries can be mixed by hybridization/capture after the enrichment step to generate targeted libraries for downstream high throughput sequencing. In some embodiments, the targeted spatial gene expression profile of one or more groups as described herein is used to provide comparable, even more detailed spatial gene profiling results as compared to annotated and/or unbiased (e.g., non-targeted) spatial gene profiling obtained from a pathologist.
(2) Direct hybridization of decoy oligonucleotides to target analytes
In some embodiments, decoy oligonucleotides (e.g., a collection of decoy oligonucleotides from one or more of the sets disclosed herein) can be added directly to an analyte. In some embodiments, the decoy oligonucleotide is directly associated (i.e., conjugated) at the 3' end with a detectable moiety or molecular tag. In some embodiments, the decoy oligonucleotide is directly associated (i.e., conjugated) at the 5' end with a detectable moiety or molecular tag. In some embodiments, the detectable moiety is a fluorophore or radioisotope.
In some embodiments, the target-specific reaction is performed in a biological sample. In some embodiments, the nucleic acid analyte may be denatured prior to contact with the decoy oligonucleotide to produce single stranded analytes. In some embodiments, the decoy oligonucleotide directly binds (e.g., hybridizes) to an analyte (e.g., a single-stranded analyte), and one or more imaging techniques can be used to identify the detectable moiety (e.g., a fluorophore). In some embodiments, the target-specific reaction comprises in situ hybridization of a decoy oligonucleotide to an analyte (e.g., a single-stranded analyte). In some embodiments, the in situ hybridization is Fluorescence In Situ Hybridization (FISH). In some embodiments, a fluorophore or molecular tag can be used as a marker to identify (e.g., enrich) an analyte of interest. In some embodiments, fluorescence microscopy can be used to locate fluorophore-labeled decoy oligonucleotides in a biological sample or elsewhere. In some embodiments, analytes that do not hybridize to decoy oligonucleotides may be washed away. In some embodiments, the washing method includes separating only those analytes that fluoresce. In some embodiments, the washing step comprises any of the washing steps provided herein.
In some embodiments, a fluorophore (e.g., any of the fluorophores disclosed herein) can be directly conjugated (e.g., coupled) to a decoy oligonucleotide. In some embodiments, the decoy oligonucleotide may include a non-fluorescent moiety that is directly bound (e.g., coupled) to the decoy oligonucleotide. In some embodiments, the fluorophore may bind to a non-fluorescent moiety, thereby enhancing detection of the analyte.
In some embodiments, the fluorescent in situ hybridization methods disclosed herein detect a specific set of transcripts. In some embodiments, the fluorescent in situ hybridization methods disclosed herein detect cancer-related transcripts (e.g., as disclosed in U.S. patent application No. 62/970,066 (titled "Capture targeted genetic target using hybridization/Capture method)") and U.S. patent application No. 62/929,686 (titled "Capture targeted genetic target using hybridization/Capture method)"), each of which is incorporated herein by reference in its entirety. In some embodiments, the fluorescent in situ hybridization methods disclosed herein detect immune-related transcripts (e.g., as disclosed in U.S. patent application No. 62/970,066 (entitled "Capture of targeted genetic target using hybridization/Capture method)") and U.S. patent application No. 62/929,686 (entitled "Capture of targeted genetic target using hybridization/Capture method)"), each of which is incorporated herein by reference in its entirety. In some embodiments, the fluorescent in situ hybridization methods disclosed herein detect pathway-specific transcripts (e.g., as disclosed in U.S. patent application No. 62/970,066 (entitled "Capture of targeted genetic target using hybridization/Capture method)") and U.S. patent application No. 62/929,686 (entitled "Capture of targeted genetic target using hybridization/Capture method)"), each of which is incorporated herein by reference in its entirety. In some embodiments, the fluorescent in situ hybridization methods disclosed herein detect a neurological-specific transcript as described herein.
In some embodiments, the hybridized analyte/decoy oligonucleotide complex can be cleaved from the substrate. In some embodiments, a spatial barcoded array populated with a plurality of capture probes may be contacted with a biological sample. The spatially barcoded capture probes can be lysed and then interact with cells within the biological sample. In some embodiments, the capture probes of the plurality of capture probes may optionally include at least one cleavage domain representing a capture probe moiety for reversibly attaching the capture probes to an array feature. Furthermore, one or more segments or regions of the capture probes may optionally be released from the array features by cleavage of the cleavage domain. For example, a spatial barcode may be released by cleavage of a cleavage domain. In some embodiments, the capture probe may further comprise a cleavage site (e.g., a cleavage recognition site for a restriction endonuclease), a photolabile bond, a thermo-bond, or a chemo-sensitive bond. In some embodiments, once the spatially barcoded capture probes are associated with a particular cell or analyte thereof, the biological sample may optionally be removed.
In some embodiments, the cleaved or enriched analyte/decoy oligonucleotide complexes may be purified prior to downstream steps (e.g., sequencing). Sequencing can be performed using any method known in the art, including the exemplary sequencing methods described herein. In some cases, the methods disclosed herein comprise sequencing of a single index library. In some cases, the methods disclosed herein comprise sequencing a double index library. In some cases, the library comprises a standard Illumina end-paired construct, including P5 and P7 sequences. For a single index library, an 8bp sample index sequence was included as the i7 sample index sequence. For the double index library, the 10bp sample index sequence was included as the i7 and i5 sample index sequences.
In some embodiments, nucleic acid sequencing comprises determining all or part of the sequence of a spatial barcode or its complement. In some embodiments, sequencing comprises determining all or part of the sequence of an analyte or its complement from a biological sample. In some embodiments, sequencing comprises high throughput sequencing. In some embodiments, sequencing comprises ligating or adding one or more adaptor sequences or complements thereof, poly (GI) sequences or complements thereof, template switching oligonucleotide sequences or complements thereof, primer binding sites or complements thereof to the nucleic acid.
In some embodiments, a plurality of decoy oligonucleotides (e.g., two or more decoy oligonucleotides) can be used to interrogate spatial gene expression in a biological sample through RNA templated ligation.
(3) Downstream application of decoy oligonucleotides after hybridization
In some embodiments, the methods disclosed herein comprise post-capture amplification. In some cases, hybridization library molecules that bind to streptavidin or avidin beads may be amplified prior to a downstream sequencing event. In some cases, primers (e.g., P5 adaptors and/or P7 adaptors) are added to the nucleic acid sequence or the enriched nucleic acid sequence. In some embodiments, amplification is performed prior to enrichment of the decoy oligonucleotide with the nucleic acid of interest. In some embodiments, amplification occurs after enrichment of the decoy oligonucleotide with the nucleic acid of interest.
In some embodiments of the methods provided herein, RNA templated ligation is used to interrogate spatial gene expression in a biological sample (e.g., FFPE tissue section). In some aspects, the step of RNA templated ligation comprises: (1) Hybridizing a decoy oligonucleotide pair to a nucleic acid (e.g., single stranded cDNA or RNA molecule); (2) in situ linking pairs of adjacent annealed probes; (3) Rnase H treatment that (i) releases RNA-templated ligation products from tissue (e.g., into solution) and (ii) breaks unwanted DNA-templated ligation products; and optionally, (4) amplifying the RNA-templated ligation product (e.g., by multiplex PCR).
In some aspects, RNA templated ligation is used to detect target analytes, determine sequence identity, and/or expression monitoring and transcript analysis. In some aspects, the RNA templated ligation allows detection of a specific change in nucleic acid (e.g., a mutation or Single Nucleotide Polymorphism (SNP)), detection or expression of a specific nucleic acid, or detection or expression of a specific set of nucleic acids (e.g., expression in a similar cellular pathway or in a specific pathology). In some embodiments, methods involving RNA templated ligation are used to analyze nucleic acids, e.g., by genotyping, DNA copy number or quantification of RNA transcripts, localization of specific transcripts within a sample, etc. In some aspects, the systems and methods provided herein, including RNA templated ligation, are capable of identifying Single Nucleotide Polymorphisms (SNPs). In some aspects, such systems and methods are capable of identifying mutations.
In some aspects, disclosed herein are methods of detecting RNA expression comprising contacting a first decoy oligonucleotide, a second decoy oligonucleotide, and a ligase (e.g., T4 RNA ligase). In some embodiments, the first decoy oligonucleotide and the second decoy oligonucleotide are designed to hybridize to a target sequence such that the 5 'end of the first decoy oligonucleotide and the 3' end of the second decoy oligonucleotide are adjacent and can be ligated. After hybridization, if the target sequence is present in the target sample, a ligase (e.g., T4 RNA ligase) ligates the first decoy oligonucleotide and the second decoy oligonucleotide, but if the target sequence is not present in the target sample, the ligase (e.g., T4 RNA ligase) does not ligate the first decoy oligonucleotide and the second decoy oligonucleotide. The presence or absence of a target sequence in a biological sample can be determined by determining whether the first decoy oligonucleotide and the second decoy oligonucleotide are ligated in the presence of a ligase.
In some aspects, two or more RNA analytes are analyzed using a method that includes RNA templated ligation. In some aspects, when two or more analytes are analyzed, a first decoy oligonucleotide and a second decoy oligonucleotide are used that are specific (e.g., specifically hybridize) to each RNA analyte.
In some aspects, three or more decoy oligonucleotides are used in the RNA templated ligation methods provided herein. In some embodiments, three or more decoy oligonucleotides are designed to hybridize to a target sequence such that the three or more decoy oligonucleotides hybridize adjacent to each other such that the 5 'and 3' ends of adjacent probes can be ligated. In some embodiments, the presence or absence of a target sequence in a biological sample can be determined by determining whether three or more decoy oligonucleotides are ligated in the presence of a ligase.
(e) Compositions and kits
Also provided herein are kits comprising components and instructions for performing the methods described herein. In some cases, the kit includes a collection of decoy oligonucleotides. In some cases, the pool of decoy oligonucleotides is specific for a particular pool of transcripts (e.g., a cancer group, an immunological group, a cell pathway group, or a neuroscience group as disclosed herein). In some embodiments, the kit comprises decoy oligonucleotides specific for a cancer group as described herein. In some embodiments, the kit comprises decoy oligonucleotides specific for the immune groups described herein. In some embodiments, the kit comprises decoy oligonucleotides specific for the set of cell pathways described herein. In some embodiments, the kit comprises decoy oligonucleotides specific for the neural groups described herein. It will also be appreciated that custom sets may be designed using the methods of designing decoy oligonucleotides as described herein. In some cases, the kit includes multiple sets of baits for the same set, so that multiple (e.g., double, triple) reactions can be performed simultaneously on the same slide. In some cases, other decoy oligonucleotides can be added to one of the groups described herein (e.g., cancer group, immune group, pathway group, or nerve group as disclosed herein). In some cases, the kit comprises at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or more reactions, wherein each reaction refers to testing one sample.
In some cases, a moiety (e.g., biotin) is attached to each decoy in the pool of decoy oligonucleotides. In some cases, the kit further comprises streptavidin beads. In some cases, the kit further comprises avidin beads. In some cases, the kit includes an array comprising a plurality of capture probes (i.e., as described herein). The capture probes on the array each comprise a spatial barcode and a capture domain.
The kit may also include reagents for performing the methods disclosed herein. For example, in some cases, the kit includes reagents for performing a reverse transcription reaction (e.g., reverse transcriptase), nucleic acid quantification as described herein, adding fragments to a nucleic acid sequence, and nucleic acid cleaning. In some cases, the kit includes reagents necessary for immobilizing, staining, and destaining the biological sample.
In some cases, the kit includes components capable of performing multiple reactions. In some cases, the kit includes components capable of target hybridization. In some cases, the components for target hybridization include one or more of a universal blocker, hybridization buffer, hybridization enhancer, equilibration buffer (concentrated or 1×), wash buffer (concentrated or 1×), primer, control sample, or any combination thereof. In some cases, the kit includes components for amplifying the nucleic acid library. In some cases, the components used to amplify the nucleic acid library include mixtures, primers, or any combination thereof that facilitate amplification.
In some cases, the kit further comprises instructions for library preparation and nucleic acid detection using the methods described herein.
Also provided herein are systems for analyzing one or more biological analytes present in a biological sample, comprising an array comprising a plurality of capture probes. Any of the capture probes described herein may be included in part of the system. In some cases, the capture probes each comprise a spatial barcode and a capture domain. In some cases, the system includes a composition (e.g., an enzyme) that can cleave the probes from the array. In some cases, cleavage of the probe occurs after hybridization of the probe to the analyte.
In some cases, the system includes a pool of decoy oligonucleotides specific for a target of interest. In some cases, the system includes decoy oligonucleotides specific for the cancer groups described herein. In some cases, the system includes decoy oligonucleotides specific for the immune groups described herein. In some cases, the system includes decoy oligonucleotides specific for the sets of pathways described herein. In some cases, the system includes decoy oligonucleotides specific for the neural groups described herein.
The system may also include reagents for performing the methods disclosed herein. For example, in some cases, the system includes reagents for performing reverse transcription reactions (e.g., reverse transcriptase), nucleic acid quantification as described herein, adding fragments to nucleic acid sequences, and nucleic acid cleaning. In some cases, the system includes reagents necessary to immobilize, stain, and decolorize the biological sample.
In some cases, the system includes a reagent delivery system. The reagent delivery system includes an instrument capable of delivering reagents to discrete portions of the biological sample, thereby preserving the integrity of the spatial pattern of the addressing scheme. In some cases, the reagent delivery system of the present assay system includes optional imaging means, reagent delivery hardware, and control software. Reagent delivery may be achieved in a number of different ways. . It should be noted that the reagent may be delivered to a variety of different biological samples at once. Individual tissue sections are exemplified herein; however, multiple biological samples may be simultaneously manipulated and analyzed. For example, successive slices of tissue samples may be analyzed in parallel and the data combined to construct a 3D map. In one exemplary aspect, the reagent delivery system may be a flow-based system. Flow-based systems for reagent delivery in the present invention may include instrumentation such as one or more pumps, valves, fluid containers, channels, and/or reagent storage units.
Also provided herein are compositions comprising decoy oligonucleotides bound to nucleic acids, wherein the nucleic acids comprise (i) a spatial barcode or complement thereof and (ii) a partial analyte binding moiety barcode or complement thereof. In some cases, the composition is an intermediate composition produced by the methods disclosed herein.
The decoy oligonucleotide may be any of the decoy oligonucleotides described herein. In some cases, the decoy oligonucleotide binds to the nucleic acid through a capture domain that specifically binds to (i) all or a portion of the nucleic acid and/or (ii) all or a portion of the analyte binding moiety barcode or its complement.
In some cases, the compositions described herein further comprise a molecular tag. In some cases, the composition further comprises an agent that specifically binds to the molecular tag. The molecular tag may be any suitable molecular tag described herein. In some cases, the molecular tag comprises a protein. In some cases, the protein is streptavidin, avidin, or biotin. In some cases, the molecular tag comprises a small molecule. In some cases, the molecular tag comprises a nucleic acid. In some cases, the molecular tag includes a carbohydrate.
In some cases, the molecular tag is located 5' to the domain in the decoy oligonucleotide. In some cases, the molecular tag is located 3' of the domain in the decoy oligonucleotide.
The agent that specifically binds to the molecular tag may be any suitable agent described herein. In some cases, the reagent comprises streptavidin, avidin, or biotin. In some cases, the molecular tag is biotin and the agent that specifically binds to the molecular tag comprises avidin. In some cases, the molecular tag comprises biotin and the agent that specifically binds to the molecular tag comprises streptavidin. In some cases, the molecular tag comprises avidin and the agent that specifically binds to the molecular tag comprises biotin. In some cases, the molecular tag comprises streptavidin and the agent that specifically binds the molecular tag comprises biotin.
In some cases, the agent that specifically binds to the molecule comprises a protein. In some cases, the protein is an antibody. In some cases, the agent that specifically binds to the molecular tag includes a nucleic acid. In some cases, the agent that specifically binds to the molecular tag comprises a small molecule.
In some cases, the compositions described herein further comprise a substrate. The substrate may be any suitable substrate described herein. In some embodiments, the agent that specifically binds to the molecular tag is immobilized to the substrate. In some embodiments, the molecular tag is immobilized to a substrate. In some cases, the substrate is a bead. In some cases, the substrate is a hole. In some cases, the substrate is a slide.
Exemplary embodiments
In some embodiments, disclosed herein are methods for identifying a location of an analyte of interest (e.g., a nucleic acid) in a biological sample, the method comprising (a) contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, wherein the nucleic acids of the plurality of nucleic acids comprise (i) a spatial barcode or complement thereof and (ii) a portion of an analyte binding moiety barcode or complement thereof; and the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise a domain that specifically binds to all or part of the sequence of interest in (i) the nucleic acid or its complement, and a molecular tag; (b) Capturing and/or isolating a complex of decoy oligonucleotides that specifically bind to nucleic acids using a substrate comprising an agent that specifically binds to a molecular tag; and (c) determining all or part of the sequence of (i) the spatial barcode or its complement and (ii) the sequence of the analyte binding moiety barcode, and using the sequences determined in (i) and (ii) to identify the location of the analyte in the biological sample.
In some embodiments, disclosed herein are methods for enriching a biological sample for an analyte of interest (e.g., a nucleic acid), the method comprising (a) contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, wherein the nucleic acids of the plurality of nucleic acids comprise (i) a spatial barcode or complement thereof and (ii) a portion of an analyte binding moiety barcode or complement thereof; and the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise a domain that specifically binds to all or part of the sequence of interest in (i) the nucleic acid or its complement, and a molecular tag; (b) Capturing and/or isolating a complex of decoy oligonucleotides that specifically bind to nucleic acids using a substrate comprising an agent that specifically binds to a molecular tag; thereby enriching the biological sample for the analyte (e.g., nucleic acid).
In some embodiments, the analyte from the biological sample is associated with a disease or disorder. In some embodiments, the domain of the decoy oligonucleotide comprises a total of about 10 nucleotides to about 300 nucleotides. In some embodiments, the molecular tag comprises a protein. In some embodiments, the protein is biotin. In some embodiments, the molecular tag comprises a small molecule. In some embodiments, the molecular tag comprises a nucleic acid. In some embodiments, the molecular tag comprises a carbohydrate. In some embodiments, the molecular tag is located 5' to the domain in the decoy oligonucleotide. In some embodiments, the molecular tag is located 3' of the domain in the decoy oligonucleotide. In some embodiments, the agent that specifically binds to the molecular tag comprises a protein. In some embodiments, the protein is an antibody. In some embodiments, the agent that specifically binds to the molecular tag comprises streptavidin or avidin. In some embodiments, an agent that specifically binds to a molecular tag is attached to a substrate. In some embodiments, the substrate is a bead, a slide, or a well. In some embodiments, the nucleic acid is DNA. In some embodiments, the nucleic acid further comprises a primer binding sequence or a complement thereof. In some embodiments, the nucleic acid further comprises a unique molecular sequence or complement thereof. In some embodiments, the nucleic acid further comprises other primer binding sequences or complements thereof.
In some embodiments, the biological sample is a tissue sample. In some embodiments, the tissue sample is a formalin fixed, paraffin embedded (FFPE) tissue sample or a frozen tissue sample. In some embodiments, the biological sample is pre-stained with a detectable label. In some embodiments, the biological sample is pre-stained with hematoxylin and eosin (H & E). In some embodiments, the biological sample is a permeabilized biological sample. In some embodiments, the permeabilized biological sample has been permeabilized with a permeabilizing reagent. In some embodiments, the permeabilizing agent is selected from the group consisting of organic solvents, cross-linking agents, detergents, and enzymes, or combinations thereof.
In some embodiments, the permeabilizing agent is selected from the group consisting of organic solvents, cross-linking agents, detergents, and enzymes, or combinations thereof. In some embodiments, the analyte is an RNA molecule. In some embodiments, the RNA molecule is an mRNA molecule. In some embodiments, the determining in step (c) comprises sequencing all or part of the sequence of (i) the spatial barcode or its complement and (ii) the analyte binding moiety barcode. In some embodiments, the sequencing is high throughput sequencing. In some embodiments, sequencing comprises ligating an adapter to the nucleic acid.
In some embodiments, the method further comprises generating a plurality of nucleic acids comprising: (a) Contacting a plurality of analyte capture agents with a biological sample disposed on a substrate, wherein the analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that specifically binds to an analyte, (ii) an analyte binding moiety barcode, and (iii) an analyte capture sequence; the substrate comprises a plurality of capture probes, wherein the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, wherein the capture domain specifically binds to an analyte capture sequence; and (b) extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and (c) amplifying the extended capture probe to produce nucleic acid. In some embodiments, the amplification is isothermal. In some embodiments, the nucleic acid produced is released from the extended capture probe. In some embodiments, the analyte is an analyte that is deregulated or differentially expressed in cancer cells. In some embodiments, the analyte is an analyte that is deregulated or differentially expressed in immune cells. In some embodiments, the analyte is an analyte that is deregulated in a cell signal transduction pathway. In some embodiments, the analyte is an analyte that is deregulated or differentially expressed in neural cells.
Examples
Example 1: targeting spatial gene expression
FIG. 7 shows an exemplary targeted spatial gene expression workflow. Typically, the slide with the tissue attached thereto is stored at-80℃for less than one week. After thawing, the biological samples were fixed in methanol and stained with H & E (hematoxylin and eosin), imaged using a bright field microscope and permeabilized.
After intracellular analytes (e.g., mRNA) are obtained by permeabilization, the analytes are captured by capture probes attached to a slide, in which case first and second strand cDNA synthesis is performed, wherein the second strand cDNA can be used as a surrogate for the captured analytes (e.g., mRNA). After cDNA synthesis, double stranded analytes (e.g., cDNA molecules) are denatured and transferred from slides and quantified by qPCR. After the cDNA is quantified, the cDNA analyte is fragmented and end repaired. The adaptors were ligated to the cDNA molecules and Sample Index (SI) PCR was performed. After SI-PCR, the library prepared as above may be dried.
A panel comprising decoy oligonucleotides specific for a cancer target, immune target, pathway target, or neurological target hybridizes to a cDNA analyte. Decoy oligonucleotides are associated with biotin. After hybridization of the cDNA analyte to the biotinylated decoy oligonucleotide, the biotinylated decoy is captured using avidin or streptavidin beads. After a washing step to remove unbound analyte, the retained library is reamplified into a new nucleic acid library. The purified targeted library set is then sequenced.
As shown in fig. 8A-8B, spatial libraries from multiple samples (i.e., heart, breast cancer and lymph) showed very high percentage of mid-target and map reads, enrichment and UMI complexity (relative to the parental samples) using two different sets (cancer set of 200 genes (i.e., hl1 k.200) and immune set of 400 genes). These data demonstrate that the set of methods disclosed herein for targeted hybridization enrichment is compatible with the prepared spatial library.
Example 2: targeted gene capture
In another example, the step of targeting oligonucleotide hybridization may be performed prior to the adaptor ligation and SI-PCR steps.
Fig. 9 shows an exemplary workflow. The workflow utilizes a plurality of spatially generated library molecules obtained from a gene expression library. Optionally, the library is amplified by PCR to increase the number of library molecules used for targeted enrichment. In this example, up to 8 different libraries were pooled prior to enrichment. Optionally, blocking oligonucleotides are hybridized to the library to prevent modification of the targeting sequence, thereby preventing downstream interference during sequencing applications. The pooled library was dried in SpeedVac for 2-4 hours, which allowed the sample to dry at low pressure without heating. Biotinylated decoy oligonucleotides were added and hybridized to pooled library samples for 2 hours at 65 ℃. Avidin or streptavidin coated beads were provided to capture biotinylated oligonucleotide/targeted capture complex for 5 min at 65 ℃. The avidin or streptavidin beads are washed to remove any unbound sequence reads and post-capture PCR is performed to re-amplify the library of interest (e.g., to amplify the target sequence hybridized to the biotinylated oligonucleotide decoys and then captured by the avidin or streptavidin beads), followed by SPRI clean-up for 15 minutes. The purified targeted library set is then sequenced.
Example 3: hybridization and Capture Using targeting groups
An exemplary workflow of a hybridization and capture method using a spatial gene expression library is shown in fig. 10.
In particular, biological samples (e.g., tissue sections) are obtained using the suitable methods described herein, such as by frozen sections of the tissue samples. The tissue sections are then stained, for example by H & E staining, and imaged using any suitable method described herein (e.g., bright field microscopy). Stained tissue sections are then destained using the appropriate methods described herein.
After imaging and decolorizing the tissue section, it is permeabilized and then contacted with an array comprising a plurality of capture probes. Each capture probe comprises a spatial barcode and a poly (T) sequence that hybridizes to a poly (a) tail of an mRNA analyte in a tissue section, thereby capturing a plurality of mRNA analytes on the array. The captured mRNA is then reverse transcribed into cDNA and amplified using the appropriate methods described herein. Thus, a cDNA library (final library from the GEX spatial array) was constructed for spatial gene expression analysis.
Hybridization and capture using the bait sets was then performed. The blocking oligonucleotides are hybridized to analytes in the library to prevent modification of sequencing-related positions, thereby preventing downstream interference during sequencing applications. The library was then dried in SpeedVac for 2-4 hours, which allowed the sample to dry at low pressure without heating. Biotinylated bait sets were added to hybridize to the library for 2 hours at 65 ℃. Avidin or streptavidin coated beads were provided to capture biotinylated bait set-sequence read complexes for 5 minutes at 65 ℃. The avidin or streptavidin beads were washed at 65℃to remove any unbound sequence reads. Alternatively, biotinylated bait sets are added to hybridize to library samples overnight (e.g., at least 8 hours) at 65 ℃ and the avidin or streptavidin beads are washed at 60 ℃ to remove any unbound sequence reads. The avidin or streptavidin beads are subjected to a 15 minute PCR to re-amplify the targeted library (e.g., amplify a sequence read that hybridizes to the biotinylated bait and is then captured by the avidin or streptavidin beads), followed by 15 minute SPRI purification. The purified targeted library set is then sequenced.
Example 4: mouse tissue and neurome space transcriptomics
The mouse glial group of decoy oligonucleotides was generated and targeted gene expression was performed on mouse brain tissue sections. Experiments were performed as described in the 10X genome-targeted gene expression-spatial user guidance protocol, except that mouse brain tissue was used instead of human tissue and the mouse glial cell line was used.
The complete transcriptome (control) was run on a Visium array against the targeted 65 neurogenic mouse glia group. As shown in fig. 11, the control (Visium whole transcriptome assay) has very high quality UMI recovery efficiency and spatial distribution retention compared to the targeted genome. FIGS. 12A-12B illustrate the spatial gene distribution of OLIG2 in a control library (FIG. 12A) compared to the OLIG2 gene in the targeted genome (FIG. 12B). For the control, approximately 72k reads/spot had 2211 UMI counts, while approximately 3.5k reads/spot had 3315 UMI counts for the targeted genome. OLIG2 gene encodes oligodendrocyte transcription factor 2, which is expressed in oligodendrocyte brain cells, and in humans, OLIG2 gene is expressed in brain oligodendrocyte tumors. The results demonstrate that the same targeted gene expression enrichment method can also be performed using mouse decoy oligonucleotides in mouse tissue samples.
Example 5: targeting genome applied to human tissue section
The targeted genome was used to target capture gene expression in different human tissues from a visual array. The targeted gene expression method as described in the 10X genome targeted gene expression space user guidance protocol was followed.
Figures 13A-13B show general data results when comparing target enrichment using four different genomes on human glioblastoma tissue sections compared to a conventional full transcriptome control visual (non-targeted) spatial array. For all four groups (i.e., pan-cancer, immunology, gene signature, and neuroscience-enriched genomes), the mid-target reads for the different genes were between 70% -90% regardless of the targeted group used (fig. 13A). As shown in the Visium control, approximately 3% -10% of the reads of the full transcriptome map corresponded to genes in the targeted group. Furthermore, when comparing the correlation between the Visium control and the pan-cancer targeted group (as a representative targeted group, but with evidence for all groups), a high degree of reproducibility and correlation of the expression profile was observed (R 2 0.98) (fig. 13B).
To demonstrate that multiple tissue types are compatible with the targeted gene enrichment workflow, twelve different tissue types for each of the four targeted genomes (fig. 14A-14B) were evaluated for the visual control whole transcriptome spatial experiment. The twelve tissue types tested were all from humans: breast IDC, breast ILC, triple negative breast cancer, cerebellum, colorectal cancer, cortex, glioblastoma, heart, kidney disease, lung, ovarian tumor and spinal cord. Regardless of tissue type or targeted gene enriched set, more than 90% of the targeted gene was recovered relative to the control Visium whole transcriptome assay (fig. 14A), and UMI matched more than 80% (fig. 14B). Thus, a variety of tissue types are compatible with the targeted gene enrichment workflow described herein.
To demonstrate the maintenance of clustering when performing the targeted gene enrichment methods described herein, a spatial array of Visium full transcriptome on human brain cortical tissue was compared to a spatial array of targeted gene enrichment by the neuroscience group. As shown in fig. 15A (visual full transcriptome control) and fig. 15B (neuroscience group), the clusters were comparable. In fact, for the neuroscience panel, only 5k raw sequencing reads/spot were needed to recapitulate the major biological pattern shown in the full transcriptome test (full transcriptome control was 50k reads/spot). Fig. 15C demonstrates this data, demonstrating excellent correlation between the two data sets of UMI plotted for each gene.
The next question is whether the targeted genome enrichment protocol can refine the subcategories of related genes in the tissue and how it will be compared to the pathologist's annotation of the sample. Thus, experiments were performed on human breast cancer tissue sections as described herein. Fig. 16A shows pathologist notes of different cells in breast cancer tissue: DCIS, adipose, fibrous tissue, immune cells, invasive cancer cells and normal glandular cells were identified. Specifically, the areas representing invasive cancer cells, fibrous tissue, and immune cells are circled with dashed lines, dotted lines, and solid lines, respectively. Fig. 16B shows the conversion of pathologist notes to visual full transcriptome data. FIG. 16C shows 196 breast oncogenes from the pan-oncogene enriched group, shrinking to the expression of ERBB2 genes from the pan-oncogene enriched group (FIG. 16D). As shown, targeting the pan-cancer group, and 196 breast cancer genes from the group, demonstrated targeted gene expression comparable to pathologist annotation, further suggesting that ERBB2 is most abundantly expressed in the invasive cancer compartment as expected. Furthermore, all data were obtained using only 5k reads/point compared to 50k reads/point for the visual full transcriptome. This efficiency requires a small portion of the sequencing cost of performing a full transcriptome spatial analysis, thus not only facilitating targeted gene expression, but also saving costs.
Thus, this data demonstrates the ability to use a targeted genome with spatial gene expression of Visium to identify gene expression information for specific gene targets associated with cancer and other diseases important to human health and disease for different targeted genomes as well as all targeted groups.
EXAMPLE 6 cancer group
FIGS. 17A-17D detail the sequence listing of each probe in the cancer group. Cancer group represents SEQ ID NO:1 to SEQ ID NO: 1041963. Each entry in the cancer group is the SEQ ID NO of the probe in the cancer group found in the sequence listing. Thus, entry 843 of the cancer group is SEQ ID NO:843. turning to SEQ ID NO:843, the name of the sequence can be seen to be probe ENSG00000002016_0. The base part ENSG00000002016 of probe ENSG00000002016_0 represents the gene ENSG00000002016 or RAD52 in the Ensembl version 101 database, i.e. the gene of probe ENSG00000002016_0 is designed to be pulled down during targeted sequencing. See, cunningham et al Ensembl 2019, pubMed PMID: 3049321, doi:10.1093/nar/gky1113, incorporated by reference, obtain details of the Ensembl database. Each sequence in the sequence listing provides the name of the entry in the Ensembl database for the corresponding gene for which the probe was designed. 17A-17D, when the cancer group includes a contiguous range of sequences, for example, SEQ ID NOs: 843-887, which are listed as 843-887, rather than listing the individual sequence listings (e.g., 843, 844, 845, …, 887). However, the cancer group includes each probe within each range provided in fig. 17A-17D. Genetic targets in the cancer group are associated with cancer. See, e.g., kandoth et al, 2013, "mutation profile and meaning of 12 major cancer types (Mutational landscape and significance across 12 major cancer types)," Nature 502 (7471), pages 333-339; hoadley et al, 2018, "molecular classification of 10,000 tumors for 33 cancers is dominated by Cell-origin pattern (Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer), cell 173 (2), pages 291-304; bailey et al, 2018, "comprehensive characterization of cancer driver genes and mutations (Comprehensive Characterization of Cancer Driver Genes and Mutations)," Cell 173 (2), pages 371-385, each of which is incorporated by reference. See also, cancer genome map study networks (Cancer Genome Atlas Research Network), illumina TruSight 500, nanostring nCounter pan cancer pathway, idtxgen pan cancer, and Agilent ClearSeq comprehensive cancer.
EXAMPLE 7 immunological Probe set
FIGS. 18A-18D detail the sequence listing of each probe in an immunological probe set. Immunological probe sets represent SEQ ID NOs: 1 to SEQ ID NO: 1041963. Each entry in the immunological probe set is the SEQ ID NO of the probe in the immunological probe set found in the sequence listing. Thus, entry 888 of the immunological probe set is SEQ ID NO:888. turning to SEQ ID NO:888, it can be seen that the name of the sequence is probe ENSG00000002549_0. The base part ENSG00000002549 of probe ENSG00000002549_0 represents the gene ENSG00000002549 or LAP3 in the Ensembl version 101 database, i.e. the gene the probe ENSG00000002549 is designed to pull down during targeted sequencing. See, cunningham et al Ensembl 2019, pubMed PMID: 3049321, doi:10.1093/nar/gky1113, incorporated by reference, obtain details of the Ensembl database. Each sequence in the sequence listing provides the name of the entry in the Ensembl database for the corresponding gene for which the probe was designed. For simplicity, in 18A-18D, when the immunological probe set comprises a contiguous range of sequences, for example SEQ ID NOs: 888-894, which is listed as 888-894, rather than listing the individual sequence listings (e.g., 888, 889, 890, …, 894). However, the immunological probe set includes each probe within each range provided in fig. 18A-18D. Genetic targets in the immunological probe set are immunologically relevant. See, e.g., thorsson et al, 2018, "immune profile of cancer (The Immune Landscape of Cancer)," Immunity 48 (4), pages 812-830, which are incorporated by reference. See also BD (T cells, immune response) and HTG (HTG EdgeSeq immunooncology assay), nanostring (pan cancer immunospectral analysis, adaptive immunity, innate immunity).
Example 8 sets of passages
FIGS. 19A-19D detail the sequence listing of each probe in the pathway set. The set of pathways represents SEQ ID NO:1 to SEQ ID NO: 1041963. Each entry in the pathway set is the SEQ ID NO of the probe in the pathway set found in the sequence listing. Thus, entry 1 of the pathway set is SEQ ID NO:1. turning to SEQ ID NO:1, it can be seen that the name of the sequence is probe ENSG00000000003_0. The base portion ENSG00000000003 of probe ENSG00000000003_0 represents the gene ENSG00000000003 or TSPAN6 in the Ensembl version 101 database, i.e., the gene that probe ENSG00000000003 is designed to pull down during targeted sequencing. See, cunningham et al Ensembl 2019, pubMed PMID: 3049321, doi:10.1093/nar/gky1113, incorporated by reference, obtain details of the Ensembl database. Each sequence in the sequence listing provides the name of the entry in the Ensembl database for the corresponding gene for which the probe was designed. For simplicity, in 19A-19D, when the set of pathways comprises a contiguous range of sequences, for example SEQ ID NO:1-37, which are listed as 1-37, rather than listing each sequence list (e.g., 1, 2, 3, …, 37). However, the set of channels includes probes within the ranges provided in FIGS. 19A-19D. Genetic targets in the pathway set are associated with biological pathways. See, e.g., behan et al, 2019, "priority for screening cancer therapeutic targets using CRISPR-Cas9 (Prioritization of cancer therapeutic targets using CRISPR-Cas9 screens), nature 568, pages 511-516; sanchez-Vega et al, 2018, "oncogenic signaling pathway in cancer genomic map (Oncogenic Signaling Pathways in The Cancer Genome Atlas)," Cell 173 (2), pages 321-337; and Fang et al 2019, "Genetics dominant approach defines a drug target profile of 30 immune-related traits (a Genetics-led approach defines the drug target landscape of 30 immune-related tracks)," Nature Genetics 51 (7); pages 1082-1091, each of which is incorporated by reference.
EXAMPLE 9 neurones
FIGS. 20A-20D detail the sequence listing of each probe in the nerve group. The nerve group represents SEQ ID NO:1 to SEQ ID NO: 1041963. Each entry in the neurone is the SEQ ID NO of the probe in the neurone found in the sequence listing. Thus, entry 632 of the nerve group is SEQ ID NO:632. turning to SEQ ID NO:632, the name of the sequence can be seen to be probe ENSG00000001626_0. The base part ENSG00000001626 of probe ENSG00000001626_0 represents the gene ENSG00000001626 or CFTR in the Ensembl version 101 database, i.e. the gene that this probe ENSG00000001626 is designed to pull down during targeted sequencing. See, cunningham et al Ensembl 2019, pubMed PMID: 3049321, doi:10.1093/nar/gky1113, incorporated by reference, obtain details of the Ensembl database. Each sequence in the sequence listing provides the name of the entry in the Ensembl database for the corresponding gene for which the probe was designed. For simplicity, in FIGS. 20A-20D, when the nerve group includes a contiguous range of sequences, for example, SEQ ID NO:632-705, which are listed as 632-705, rather than listing each sequence list (e.g., 632, 633, 634, …, 705). However, the nerve group includes probes within the ranges provided in FIGS. 20A-20D. Genetic targets in the neurogroup are related to neurobiology.
Genes represented in the nerve group are ABAT, ABCA1, ABCA7, ABCB1, ABCG2, ABI3, ABL1, ACA 1, ACTG1, ACTN1, ACVRL1, ADAM10, ADAM15, ADAM17, ADRB1, ADCY5, ADCY8, ADCY9, ADCYAP1, ADGRG1, ADM, ADORA1, ADORA 22 21, ADRB2, 1, AIF1, AK3, AKT1S1, AKT2, AKT3, ALB, ALDH1A1, ALDH1L1, ALDH7A1, ALK, ALOX12, ALS2, AMIGO1, 2, ANO3, ANXA11, AP1S1, AP2A2, AP2B1, AP3M 2AP 3S1, AP4S1, APAF1, APBB1, APC, APEX1, APOA1, AQP4, 44, ARHGEF10, ARMC10, ARRB2, 1, ASCL2, 4, ATF6, ATG5, ATM, ATP13A2, ATP1A3, ATP2B3, ATP6AP2, ATP6V0 6V0D1, ATP6V0D1 ATP6V0E1, ATP6V0E2, ATP6V1 6V1E1, ATP6V1G2, ATP6V 17A 2, ATXN3, ATXN7, AVP, B4GALT6, BACE1, BACE2, BAD, BAI1, BAX, BCAS1, BCAS2, BCHE, BCL2L1, BDNF, BECN1, BID BIN1, BIRC5, BMI1, BNIP3, BRAF, BRMS 12, C21ORF2, C3, C5, C6, C9ORF72, CA1, CA2, CAB39, CACNA 11 11 11 11 12, CANB 4, CADM3, CADPS, CALB1, CALB2, CALCA, CALM1, CALML5, CAMK 22 24, CAPN1, CASK, CASP1, CASP3, CASP6, CASP7, CASP8, CASP9, CASS4, CAST, CAV1, 2, CCL5, CCND1, CCNH, CCR2, CCR5, CCS, CD11, CD163, CD2AP, CD31, CD33, CD34, CD39, CD4, CD40, CD44, CD68, CD8, CD9 CDC27, CDC40, CDC6, CDH1, CDH2, CDK5R1, CDK5RAP2, CDK5RAP3, CDK7, CDKL5, CDKN 11 21, CELF1, CEND1, CENPJ, CEP135, CEP41, CER1, CERS2, CERS4, CERS6, 10, CHCHD2, CHD4, CHGA, CHI3L1, CHL1, CHMP 12, CHRM3, CHRM5, CHRNA1, CHRNA3, CHRNA4, CHRNA5, CHRNA7, CHRNB2, CKB, CLCN6, CLDN15, CLDN5, CLN3, CLN5, CLN8, CLU, CNR2, CNP, CNR1, CNR2, device for detecting and controlling temperature of water, 1 3-DRA 1, HLA-DRB1, HLA-a, HMGB1, HMOX1, HNRNPA2B1, HNRNPM, HOMER1, HOXA2, HPCAL1, 3, HSPA6, HSPB1, HTR 12 22 34, HTR5 2, HTT, ICAM1, ICAM2, IDE, IDH1, IDH2, IDO1, IFNG, IGF 12, IGFBP2, IKBKB, IL10RA, IL13RA1, IL15RA, IL18, IL1R1, IL1RN, IL2, IL 46, IL6 4 5 5 5, IRF8 ISLR2, ITGA5, ITGA7, 3, ITM 21, ITPR2, ITPR3, JAM3, JUN, KATNA1, KCNA1, KCNB1, KCNC3, KCND3, KCNJ10, KCNK9, KCNMA1, KCNN3, KCNQ2, KCNQ3, KDELR2, KDR, KEAP1, KEL, KIAA1161, KIF 35 6, KNL1, KRAS, L1CAM, LAMA2, LAMB2, LAMP1, LARGE1, LCLAT1, 3, LGI1, LIF, LINGO1, LMNA, LMNB1, LOX, LPAR1, LPL LRP1, LRRC25, LRRC4, LRRC61, LRRK2, LSM7, 1, MAG, MAGEE1, MAL, MAN2B1, MAP2K1, MAP2K2, MAPK1, MAPK10, MAPK3, MAPK8, MAPK9, MAPKAPK2, MARK4, ENSG, MBP, MDM2, MEAF6, MECP2, MED10, MEF2, MFSD8, 67, MKS1, MME, MMP12, MMP14, MMP16, MMP19, MMP2, MMP24, MMP3, MMP9, MMRN2, MMP3, MMP9, MMP MNAT1, 4A6, MSI1, MSN, MT-ATP6, MT-ND1, MT1, MTA2, 1, MYD88, MYH10, 1, NCF1, NEK1, NELFA, NELL2, NEO1, NFE2L2, NFKB1, 2, NKX2-1, NKX6-2, NLGN4 3, NMB, NME5, NME8, NMNAT2, NOG, NOL3, NOS1, NOS2, NOS3, 1, NOTCH2, NOTCH3, NOTCH4, NOVA1, NPAS4, NPC1, NPC2, NPHP1, NPPB, NPR1, NR3C2, NR4A2, NRG1, NRP2, NRXN1, NSF, NTF3, NTN1, NTNG1, NTRK2, NTRK3, NTS, NWD1, OCLN, OLFM3, OLIG2, OLR1, ONECUT2, OPA1, OPHN1, OPRK1, OPRM1, OPTN, ORC4, ORC6, OSMR, OTUD4, OXR1, OXTRR, P2RX4, P2RX7, P2RY12, P4HA1, PADI2, PAFAH1B1, PAH, PAK1, PALM, PANK2, PARK5, PARK7, PARP1, PAX2, PAX3, PAX6, PCDH19, PARK 22, PDE 14 1, PECAM1, PFN1, PGAM1, PGK1, PHF19, PHF2, PHF21 3CA, PIK3CB, PIK3R1, PINK1, PKN1, PLA2G16, PLA2G 4G 6, 1, PLCB2, PLCB3, PLCB4 PLCG2, PLCL2, PLEKHG4, PLEKHO2, PLLP, PLOD2, PLP1, PLS1, PLXNB3, PLXNC1, PMCH, PMP22, PNKD, POC 11, POLG, POLR 22 22 21, POU5F1, PPARG, PPARGC1 1R 12 CA, PPP2R5 CA, PPP3CB PPP3CC, PPT1, PQBP1, PRF1, PRKAA2, 11, PRPF3, PRPF31, PRPH, PRRT2, PSEN1, PSEN2, PSMB8, PSMB9, PTDSS1, PTDSS2, PTK2, QKI, RAB2 38, RAB39 3 3L1, RABGEF1, RAC1, PTDSS2, PTK2, QKI, RAB38, RAB39 3L1, RABGEF1, RAC1, RAG 1, PSK 2, PSP RAD23 1, RAN, RAGEF 2, RARS2, RASGRP1, RB1, RBBP8, RBFOX3, RCAN1, RIN3, RING1, RIT2, RNF216, ROBO1, RPL39 25, RRAS, RTN4, RYR1, RYR2, RYR3, S100 1, SART1, SCAMP2, SCARB2, RABO 1, RPL39 25, RRAS, RYR2, RYR3, S100 1, SART1, SCAMP2, SCARB2, RABO 1, RAGG 2, RASGR 1, RASGR 2, RA SCN 12 5 8 9 3 6, SERPINE1, SERPINI1, SETX, SF3A2, SF3B4, SGK1, SGPL1, SGTA, SH3TC2, SHANK2, SHH, sigma R1, SIRT2, SIRT7, SIX3, SLA, SLC11A1, SLC12A5, SLC17A6, SLC17A7, SLC18A2, SLC18A3, SLC1A1, SLC1A2, SLC1A3, SLC24A4, SLC25a19, SLC2A1, SLC2A3, SLC32A1, SLC4a10, SLC6A3, SLC6A4, SLC6A6, SLC8A1, SLC9A6, SLIT1, SLIT2, SLIT3, SLU7, SMARCB1, SMN1, SMPD4, SMYD1, SNAP91, SNCA, SNCAIP, SNCB, SNRPA, SOD1, SOD2, SORCS3, SORL1, SOX10, SOX2, SOX9, SP1, SP100, SPAG4, SPARC, SPAST, SPG, SPI1, SPP1, SPTBN2, SQSTM1, SRC, SRGAP1, SRGAP2, SRGAP3, SRI, SRSF4, SRY, SST, ST3GAL3, ST3GAL5, starb 1, starbpl 1, STAT3, STC1 STC2, STIL, STX1A, STX1B, STX2, SUCLA2, SYN1, SYNJ1, SYP, SYT1, SYT13, SYT4, SYT7, TAC1, TACR1, TAF10, TAF4B, TAF6L, TAF9, TARDBP, TAU, TAZ, TBC D24, TBK1, TBP, TBPL1, TBR1, TCERG1, TCIRG1, TENM2, TERT, TF, TFAM, TGFB1, TGFB2, TGFBR2, TGIF1, TGM2, TH, THAP1, THY1, TIA1, TIE1 TLR2, TLR3, TLR4, TMEM106B, TMEM119, TMEM216, TMEM230, TMEM237, TMEM67, B, TMEM 10B, TMEM 10B, TMEM 10B, TMEM 11B, TMEM 12B, TMEM 1B, TMEM10, TNNI3, TNR, TOMM40, TMEM67, TMEM TOR 1B, TMEM, TP53INP1, TP73, TPH1, TPH2, TPM1, TPP1, TRADD, TREM1, TREM2, TRIM28, TRIM37, TRIP4, TRPA1, TRPM2, TRPV1, TRPV4, TRPV1, TREM2 TSC1, TSC2, TSEN34, TSPAN13, TSPO, TUBA 4B, TMEM 8, TUBB3, B, TMEM1, TYROBP, U2AF2, UBB, UBE 2B, TMEM 2B, TMEM 3B, TMEM1, UBQLN2, UCHL1, UGCG, UGT8, UNC 13B, TMEM21, USP30, B, TMEM 13B, TMEM 13B, TMEM 35, WDR62, WFDC2, WFS1, WNT 10B, TMEM6, WT1, XAB2, XBP1, B, TMEM CL, ZCWPW1, ZEB2, ZIC2 and ZNF24.
Claims (47)
1. A method for identifying an abundance and location of an analyte in a biological sample, the method comprising:
(a) Contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, wherein
The plurality of nucleic acids comprises an extended nucleic acid comprising (i) a spatial barcode or complement thereof, and (ii) all or part of the sequence of the analyte, analyte derivative, or complement thereof; and
the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise:
(i) A capture domain that hybridizes to all or a portion of the sequence of the analyte, analyte derivative, or complement thereof, and
(ii) A molecular tag;
(b) Capturing decoy oligonucleotides bound to the extended nucleic acids using a substrate comprising an agent bound to the molecular tag; and
(c) Determining (i) all or part of the sequence of the spatial barcode or its complement, and (ii) all or part of the sequence of the extended nucleic acid, and using the sequences determined in (i) and (ii) to identify the abundance and location of the analyte in the biological sample.
2. The method of claim 1, wherein the analyte is a nucleic acid.
3. The method of claim 2, wherein the method further comprises generating a plurality of nucleic acids comprising:
(a) Contacting the biological sample with a substrate comprising a plurality of attached capture probes, wherein the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that binds to a sequence present in the nucleic acid;
(b) Hybridizing the capture probes to nucleic acids;
(c) Extending the 3' end of the capture probe using the nucleic acid bound to the capture domain as a template to generate an extended capture probe; and
(d) Amplifying the extended capture probe to produce an extended nucleic acid.
4. The method of claim 3, wherein the extended nucleic acid is released from the extended capture probe.
5. The method of claim 1, wherein the analyte is a protein.
6. The method of claim 1, wherein the analyte derivative is an oligonucleotide comprising an analyte binding moiety barcode, and an analyte capture sequence.
7. The method of claim 6, wherein the method further comprises generating a plurality of nucleic acids, the method comprising:
(a) Contacting a plurality of analyte capture agents with the biological sample, wherein:
the analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that binds to a protein and (ii) an oligonucleotide comprising an analyte binding moiety barcode and an analyte capture sequence;
(b) Contacting the plurality of analyte capture agents with a substrate comprising a plurality of capture probes, wherein the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, wherein the capture domain binds to an analyte capture sequence;
(c) Extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and
(d) Amplifying the extended capture probe to produce an extended nucleic acid.
8. The method of claim 7, wherein the extended nucleic acid is released from the extended capture probe.
9. The method of claim 7, wherein step (a) and step (b) are performed substantially simultaneously.
10. A method for enriching a biological sample for an analyte or analyte derivative, the method comprising:
(a) Contacting a plurality of nucleic acids with a plurality of decoy oligonucleotides, wherein
The plurality of nucleic acids comprises an extended nucleic acid comprising (i) a spatial barcode or complement thereof, and (ii) all or part of the sequence of the analyte, analyte derivative, or complement thereof; and
the decoy oligonucleotides of the plurality of decoy oligonucleotides comprise:
(i) A capture domain that hybridizes to all or a portion of the sequence of the analyte, analyte derivative, or complement thereof, and
(ii) A molecular tag;
(b) Capturing a complex of decoy oligonucleotides bound to an extended nucleic acid using a substrate comprising an agent bound to the molecular tag; and
(c) Isolating the complex of decoy oligonucleotides bound to the extended nucleic acids, thereby enriching the biological sample for the analyte or analyte derivative.
11. The method of claim 10, wherein the analyte is a nucleic acid.
12. The method of claim 11, wherein the method further comprises generating a plurality of nucleic acids comprising:
(a) Contacting the biological sample with a substrate comprising a plurality of attached capture probes, wherein the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that binds to a sequence present in the nucleic acid;
(b) Hybridizing the capture probes to nucleic acids;
(c) Extending the 3' end of the capture probe using the nucleic acid bound to the capture domain as a template to generate an extended capture probe; and
(d) Amplifying the extended capture probe to produce an extended nucleic acid.
13. The method of claim 12, wherein the extended nucleic acid is released from the extended capture probe.
14. The method of claim 10, wherein the analyte is a protein.
15. The method of claim 10 or 14, wherein the analyte derivative is an oligonucleotide comprising an analyte binding moiety barcode, and an analyte capture sequence.
16. The method of claim 15, wherein the method further comprises generating a plurality of nucleic acids, the method comprising:
(a) Contacting a plurality of analyte capture agents with the biological sample, wherein:
the analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that binds to a protein and (ii) an oligonucleotide comprising an analyte binding moiety barcode and an analyte capture sequence;
(b) Contacting the plurality of analyte capture agents with a substrate comprising a plurality of capture probes, wherein the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, wherein the capture domain binds to an analyte capture sequence;
(c) Extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe; and
(d) Amplifying the extended capture probe to produce an extended nucleic acid.
17. The method of claim 16, wherein the extended nucleic acid is released from the extended capture probe.
18. The method of claim 16, wherein step (a) and step (b) are performed substantially simultaneously.
19. The method of any one of claims 1-18, wherein the analyte from the biological sample is associated with a disease or disorder.
20. The method of any one of claims 1-19, wherein the capture domain of the decoy oligonucleotide binds to a 3 'portion, a 5' portion, an intron, an exon, a 3 'untranslated region, or a 5' untranslated region of the sequence of the analyte, analyte derivative, or complement thereof.
21. The method of any one of claims 1-20, wherein the capture domain of the decoy oligonucleotide comprises a total of about 10 nucleotides to about 300 nucleotides.
22. The method of any one of claims 1-21, wherein the molecular tag comprises a protein, a small molecule, a nucleic acid, or a carbohydrate.
23. The method of any one of claims 1-21, wherein the molecular tag is streptavidin, avidin, biotin, or a fluorophore.
24. The method of any one of claims 1-23, wherein the agent that binds to a molecular tag comprises a protein (e.g., an antibody), a nucleic acid, or a small molecule.
25. The method of any one of claims 1-24, wherein the molecular tag is biotin and the agent bound to the molecular tag is avidin or streptavidin.
26. The method of any one of claims 1-25, wherein the agent that specifically binds to a molecular tag is attached to the substrate.
27. The method of claim 26, wherein the substrate is a bead, a slide, or a well.
28. The method of any one of claims 1-27, wherein the extended nucleic acid is a DNA molecule (e.g., a cDNA molecule).
29. The method of any one of claims 1-28, wherein the extended nucleic acid further comprises a primer sequence or complement thereof; a unique molecular sequence or its complement; or other primer binding sequences or complements thereof.
30. The method of any one of claims 1-29, wherein the biological sample is a tissue sample selected from a formalin fixed, paraffin embedded (FFPE) tissue sample or a frozen tissue sample.
31. The method of any one of claims 1-30, wherein the biological sample is pre-stained with a detectable label, hematoxylin and eosin (H & E) dye, immunofluorescence, or immunohistochemistry.
32. The method of any one of claims 1-31, wherein the biological sample is a permeabilized biological sample.
33. The method of any one of claims 1-9 and 19-32, wherein the determining step comprises sequencing (i) all or part of the sequence of the spatial barcode or its complement and (ii) all or part of the sequence of nucleic acid from the biological sample.
34. The method of claim 33, wherein the sequencing is high throughput sequencing.
35. The method of any one of claims 1-34, wherein the analyte is deregulated or differentially expressed in cancer cells, immune cells, cell signaling pathways, or neural cells.
36. A method for identifying nucleic acid abundance and location in a biological sample, the method comprising:
(a) Contacting the biological sample with a substrate comprising a plurality of attached capture probes, wherein the capture probes of the plurality of capture probes comprise (i) a spatial barcode and (ii) a capture domain that binds to a sequence present in the nucleic acid;
(b) Hybridizing the capture probes to nucleic acids;
(c) Extending the 3' end of the capture probe using the nucleic acid bound to the capture domain as a template to generate an extended capture probe; and
(d) Amplifying the extended capture probes to produce extended nucleic acids; wherein the extended nucleic acid comprises all or part of the sequence of (i) a spatial barcode or complement thereof and (ii) a nucleic acid or complement thereof;
(e) Releasing the extended nucleic acid from the extended capture probe;
(f) Contacting a plurality of released nucleic acids with a plurality of decoy oligonucleotides, the released nucleic acids comprising the extended nucleic acids from step (e), wherein
The decoy oligonucleotides of the plurality of decoy oligonucleotides comprise:
(i) A capture domain that hybridizes to all or a portion of the sequence of the nucleic acid or complement thereof, and
(ii) A molecular tag;
(g) Capturing decoy oligonucleotides bound to the extended nucleic acids using a substrate comprising an agent bound to the molecular tag; and
(h) Determining (i) all or part of the sequence of the spatial barcode or its complement, and (ii) all or part of the sequence of the extended nucleic acid, and using the sequences determined in (i) and (ii) to identify the abundance and location of nucleic acid in the biological sample.
37. A method for identifying protein abundance and location in a biological sample, the method comprising:
(a) Contacting a plurality of analyte capture agents with the biological sample, wherein:
The analyte capture agents of the plurality of analyte capture agents comprise (i) an analyte binding moiety that binds to a protein and (ii) an oligonucleotide comprising an analyte binding moiety barcode and an analyte capture sequence;
(b) Contacting the plurality of analyte capture agents with a substrate comprising a plurality of capture probes, wherein the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain, wherein the capture domain binds to an analyte capture sequence;
(c) Extending the 3' end of the capture probe using the analyte binding moiety barcode as a template to generate an extended capture probe;
(d) Amplifying the extended capture probes to produce extended nucleic acids; wherein the extended nucleic acid comprises all or part of the sequence of (i) a spatial barcode or complement thereof and (ii) an oligonucleotide or complement thereof;
(e) Releasing the extended nucleic acid from the extended capture probe;
(f) Contacting a plurality of released nucleic acids with a plurality of decoy oligonucleotides, the released nucleic acids comprising the extended nucleic acids from step (e), wherein
The decoy oligonucleotides of the plurality of decoy oligonucleotides comprise:
(i) A capture domain that hybridizes to all or a portion of the nucleotide or complement thereof, and
(ii) A molecular tag;
(g) Capturing decoy oligonucleotides bound to the extended nucleic acids using a substrate comprising an agent bound to the molecular tag; and
(h) Determining (i) all or part of the sequence of the spatial barcode or its complement, and (ii) all or part of the sequence of the extended nucleic acid, and using the sequences determined in (i) and (ii) to identify the abundance and location of the protein in the biological sample.
38. A composition comprising a decoy oligonucleotide and an extended nucleic acid, wherein the decoy oligonucleotide is bound to the extended nucleic acid,
wherein the extended nucleic acid comprises (i) a spatial barcode or complement thereof, and (ii) all or part of the sequence of the analyte, analyte derivative, or complement thereof; and
wherein the decoy oligonucleotide comprises a molecular tag, wherein the molecular tag is selected from the group consisting of streptavidin, avidin, biotin, or a fluorophore.
39. The composition of claim 38, wherein the decoy oligonucleotide binds to the extended nucleic acid through a capture domain that hybridizes to all or a portion of the sequence of the analyte, analyte derivative, or complement thereof.
40. The composition of claim 38 or 39, comprising an agent that binds to a molecular tag.
41. The composition of claim 40, wherein the agent that binds to a molecular tag comprises a protein (e.g., an antibody), a nucleic acid, or a small molecule.
42. The composition of claim 40, wherein the molecular tag is biotin and the agent that binds to the molecular tag is avidin or streptavidin.
43. The composition of any one of claims 38-42, further comprising a substrate, wherein the agent that specifically binds to a molecular tag is attached to the substrate.
44. The composition of claim 43, wherein the substrate is a bead, well or slide.
45. A kit, comprising:
an array comprising a plurality of capture probes, wherein the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain;
a plurality of decoy oligonucleotides; and
instructions for carrying out the method of any one of claims 1-4, 10-13 and 19-36.
46. A kit, comprising:
a plurality of analyte capture agents, wherein the analyte capture agents of the plurality of analyte capture agents comprise an analyte binding moiety, an analyte binding moiety barcode, and an analyte capture sequence;
an array comprising a plurality of capture probes, wherein the capture probes of the plurality of capture probes comprise a spatial barcode and a capture domain;
A plurality of decoy oligonucleotides; and
instructions for carrying out the method of any one of claims 1, 5-9, 10, 14-35 and 37.
47. The kit of claim 45 or 46, further comprising reagents and/or enzymes for performing the method.
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202062979652P | 2020-02-21 | 2020-02-21 | |
US202062980124P | 2020-02-21 | 2020-02-21 | |
US62/980,124 | 2020-02-21 | ||
US62/979,652 | 2020-02-21 | ||
US202063077019P | 2020-09-11 | 2020-09-11 | |
US63/077,019 | 2020-09-11 | ||
PCT/US2021/018795 WO2021168261A1 (en) | 2020-02-21 | 2021-02-19 | Capturing genetic targets using a hybridization approach |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116157533A true CN116157533A (en) | 2023-05-23 |
Family
ID=74875299
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202180031221.3A Pending CN116157533A (en) | 2020-02-21 | 2021-02-19 | Capturing genetic targets using hybridization methods |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP4107282A1 (en) |
CN (1) | CN116157533A (en) |
AU (1) | AU2021224760A1 (en) |
WO (1) | WO2021168261A1 (en) |
Families Citing this family (53)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190300945A1 (en) | 2010-04-05 | 2019-10-03 | Prognosys Biosciences, Inc. | Spatially Encoded Biological Assays |
US10787701B2 (en) | 2010-04-05 | 2020-09-29 | Prognosys Biosciences, Inc. | Spatially encoded biological assays |
GB201106254D0 (en) | 2011-04-13 | 2011-05-25 | Frisen Jonas | Method and product |
GB2513024B (en) | 2012-02-27 | 2016-08-31 | Cellular Res Inc | A clonal amplification method |
DK3013983T3 (en) | 2013-06-25 | 2023-03-06 | Prognosys Biosciences Inc | SPATIALLY ENCODED BIOLOGICAL ASSAYS USING A MICROFLUIDIC DEVICE |
GB2546833B (en) | 2013-08-28 | 2018-04-18 | Cellular Res Inc | Microwell for single cell analysis comprising single cell and single bead oligonucleotide capture labels |
JP6828007B2 (en) | 2015-04-10 | 2021-02-10 | スペーシャル トランスクリプトミクス アクチボラグ | Spatial-identified multiplex nucleic acid analysis of biological samples |
US10301677B2 (en) | 2016-05-25 | 2019-05-28 | Cellular Research, Inc. | Normalization of nucleic acid libraries |
JP7091348B2 (en) | 2016-09-26 | 2022-06-27 | ベクトン・ディキンソン・アンド・カンパニー | Measurement of protein expression using reagents with barcoded oligonucleotide sequences |
US11519033B2 (en) | 2018-08-28 | 2022-12-06 | 10X Genomics, Inc. | Method for transposase-mediated spatial tagging and analyzing genomic DNA in a biological sample |
EP3861134A1 (en) | 2018-10-01 | 2021-08-11 | Becton, Dickinson and Company | Determining 5' transcript sequences |
EP3894586A2 (en) | 2018-12-10 | 2021-10-20 | 10X Genomics, Inc. | Methods for determining a location of a biological analyte in a biological sample |
US11926867B2 (en) | 2019-01-06 | 2024-03-12 | 10X Genomics, Inc. | Generating capture probes for spatial analysis |
US11649485B2 (en) | 2019-01-06 | 2023-05-16 | 10X Genomics, Inc. | Generating capture probes for spatial analysis |
EP3976820A1 (en) | 2019-05-30 | 2022-04-06 | 10X Genomics, Inc. | Methods of detecting spatial heterogeneity of a biological sample |
CN114051534A (en) | 2019-07-22 | 2022-02-15 | 贝克顿迪金森公司 | Single cell chromatin immunoprecipitation sequencing assay |
JP2023500679A (en) | 2019-11-08 | 2023-01-10 | ベクトン・ディキンソン・アンド・カンパニー | Using random priming to obtain full-length V(D)J information for immune repertoire sequencing |
WO2021091611A1 (en) | 2019-11-08 | 2021-05-14 | 10X Genomics, Inc. | Spatially-tagged analyte capture agents for analyte multiplexing |
WO2021092433A2 (en) | 2019-11-08 | 2021-05-14 | 10X Genomics, Inc. | Enhancing specificity of analyte binding |
EP4069255A4 (en) * | 2019-12-04 | 2024-02-21 | Athma A Pai | Identifying non-productive splice sites |
EP3891300B1 (en) | 2019-12-23 | 2023-03-29 | 10X Genomics, Inc. | Methods for spatial analysis using rna-templated ligation |
CN115244184A (en) | 2020-01-13 | 2022-10-25 | 贝克顿迪金森公司 | Methods and compositions for quantifying protein and RNA |
US11732299B2 (en) | 2020-01-21 | 2023-08-22 | 10X Genomics, Inc. | Spatial assays with perturbed cells |
US11702693B2 (en) | 2020-01-21 | 2023-07-18 | 10X Genomics, Inc. | Methods for printing cells and generating arrays of barcoded cells |
US11821035B1 (en) | 2020-01-29 | 2023-11-21 | 10X Genomics, Inc. | Compositions and methods of making gene expression libraries |
US11898205B2 (en) | 2020-02-03 | 2024-02-13 | 10X Genomics, Inc. | Increasing capture efficiency of spatial assays |
US11835462B2 (en) | 2020-02-11 | 2023-12-05 | 10X Genomics, Inc. | Methods and compositions for partitioning a biological sample |
US11891654B2 (en) | 2020-02-24 | 2024-02-06 | 10X Genomics, Inc. | Methods of making gene expression libraries |
US11926863B1 (en) | 2020-02-27 | 2024-03-12 | 10X Genomics, Inc. | Solid state single cell method for analyzing fixed biological cells |
US11768175B1 (en) | 2020-03-04 | 2023-09-26 | 10X Genomics, Inc. | Electrophoretic methods for spatial analysis |
EP4242325A3 (en) | 2020-04-22 | 2023-10-04 | 10X Genomics, Inc. | Methods for spatial analysis using targeted rna depletion |
EP4153775A1 (en) | 2020-05-22 | 2023-03-29 | 10X Genomics, Inc. | Simultaneous spatio-temporal measurement of gene expression and cellular activity |
AU2021275906A1 (en) | 2020-05-22 | 2022-12-22 | 10X Genomics, Inc. | Spatial analysis to detect sequence variants |
WO2021242834A1 (en) | 2020-05-26 | 2021-12-02 | 10X Genomics, Inc. | Method for resetting an array |
AU2021283184A1 (en) | 2020-06-02 | 2023-01-05 | 10X Genomics, Inc. | Spatial transcriptomics for antigen-receptors |
WO2021247543A2 (en) | 2020-06-02 | 2021-12-09 | 10X Genomics, Inc. | Nucleic acid library methods |
EP4162074B1 (en) | 2020-06-08 | 2024-04-24 | 10X Genomics, Inc. | Methods of determining a surgical margin and methods of use thereof |
WO2021252591A1 (en) | 2020-06-10 | 2021-12-16 | 10X Genomics, Inc. | Methods for determining a location of an analyte in a biological sample |
WO2021263111A1 (en) | 2020-06-25 | 2021-12-30 | 10X Genomics, Inc. | Spatial analysis of dna methylation |
US11761038B1 (en) | 2020-07-06 | 2023-09-19 | 10X Genomics, Inc. | Methods for identifying a location of an RNA in a biological sample |
US11932901B2 (en) | 2020-07-13 | 2024-03-19 | Becton, Dickinson And Company | Target enrichment using nucleic acid probes for scRNAseq |
US11926822B1 (en) | 2020-09-23 | 2024-03-12 | 10X Genomics, Inc. | Three-dimensional spatial analysis |
US11827935B1 (en) | 2020-11-19 | 2023-11-28 | 10X Genomics, Inc. | Methods for spatial analysis using rolling circle amplification and detection probes |
EP4247967A1 (en) | 2020-11-20 | 2023-09-27 | Becton, Dickinson and Company | Profiling of highly expressed and lowly expressed proteins |
AU2021409136A1 (en) | 2020-12-21 | 2023-06-29 | 10X Genomics, Inc. | Methods, compositions, and systems for capturing probes and/or barcodes |
AU2022238446A1 (en) | 2021-03-18 | 2023-09-07 | 10X Genomics, Inc. | Multiplex capture of gene and protein expression from a biological sample |
EP4196605A1 (en) | 2021-09-01 | 2023-06-21 | 10X Genomics, Inc. | Methods, compositions, and kits for blocking a capture probe on a spatial array |
WO2023034872A1 (en) * | 2021-09-01 | 2023-03-09 | Becton, Dickinson And Company | Spatial multiomics using in situ reverse transcription |
WO2023172783A2 (en) * | 2022-01-07 | 2023-09-14 | The Government Of The United States, As Represented By The Secretary Of The Army | Hemostatic composition containing recombinant human clotting factors, and method of producing |
CN114958763B (en) * | 2022-01-14 | 2023-02-14 | 江苏省农业科学院 | Method for constructing Vero cell strain with DBN1 gene knocked out and application thereof |
WO2023194586A2 (en) * | 2022-04-08 | 2023-10-12 | Secarna Pharmaceuticals Gmbh & Co. Kg | Oligonucleotide for reducing the expression of leucine-rich repeat kinase 2 (lrrk2) and its use for preventing and/or treating human diseases |
WO2023201235A2 (en) * | 2022-04-12 | 2023-10-19 | 10X Genomics, Inc. | Compositions and methods for generating and characterizing recombinant antigen binding molecules |
WO2023197075A1 (en) * | 2022-04-12 | 2023-10-19 | Concordia University | Aptamer-based electrochemical drug detection assay |
Family Cites Families (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104673905B (en) | 2005-06-20 | 2016-09-07 | 领先细胞医疗诊断有限公司 | Nucleic acid in detection individual cells and the method for rare cells in the heterogeneous maxicell group of qualification |
EP2529030B1 (en) | 2010-01-29 | 2019-03-13 | Advanced Cell Diagnostics, Inc. | Methods of in situ detection of nucleic acids |
ES2555106T3 (en) | 2010-04-05 | 2015-12-29 | Prognosys Biosciences, Inc. | Spatially coded biological assays |
GB201106254D0 (en) | 2011-04-13 | 2011-05-25 | Frisen Jonas | Method and product |
US9783841B2 (en) | 2012-10-04 | 2017-10-10 | The Board Of Trustees Of The Leland Stanford Junior University | Detection of target nucleic acids in a cellular sample |
CA2886974C (en) | 2012-10-17 | 2021-06-29 | Spatial Transcriptomics Ab | Methods and product for optimising localised or spatial detection of gene expression in a tissue sample |
EP2971184B1 (en) | 2013-03-12 | 2019-04-17 | President and Fellows of Harvard College | Method of generating a three-dimensional nucleic acid containing matrix |
DK3013983T3 (en) | 2013-06-25 | 2023-03-06 | Prognosys Biosciences Inc | SPATIALLY ENCODED BIOLOGICAL ASSAYS USING A MICROFLUIDIC DEVICE |
US20150000854A1 (en) | 2013-06-27 | 2015-01-01 | The Procter & Gamble Company | Sheet products bearing designs that vary among successive sheets, and apparatus and methods for producing the same |
US10041949B2 (en) | 2013-09-13 | 2018-08-07 | The Board Of Trustees Of The Leland Stanford Junior University | Multiplexed imaging of tissues using mass tags and secondary ion mass spectrometry |
US20150218620A1 (en) * | 2014-02-03 | 2015-08-06 | Integrated Dna Technologies, Inc. | Methods to capture and/or remove highly abundant rnas from a heterogenous rna sample |
WO2015161173A1 (en) | 2014-04-18 | 2015-10-22 | William Marsh Rice University | Competitive compositions of nucleic acid molecules for enrichment of rare-allele-bearing species |
WO2016007839A1 (en) | 2014-07-11 | 2016-01-14 | President And Fellows Of Harvard College | Methods for high-throughput labelling and detection of biological features in situ using microscopy |
US20160108458A1 (en) | 2014-10-06 | 2016-04-21 | The Board Of Trustees Of The Leland Stanford Junior University | Multiplexed detection and quantification of nucleic acids in single-cells |
KR20170073667A (en) * | 2014-10-29 | 2017-06-28 | 10엑스 제노믹스, 인크. | Methods and compositions for targeted nucleic acid sequencing |
CN107208158B (en) | 2015-02-27 | 2022-01-28 | 贝克顿迪金森公司 | Spatially addressable molecular barcode |
JP6828007B2 (en) | 2015-04-10 | 2021-02-10 | スペーシャル トランスクリプトミクス アクチボラグ | Spatial-identified multiplex nucleic acid analysis of biological samples |
US10724078B2 (en) | 2015-04-14 | 2020-07-28 | Koninklijke Philips N.V. | Spatial mapping of molecular profiles of biological tissue samples |
US10059990B2 (en) | 2015-04-14 | 2018-08-28 | Massachusetts Institute Of Technology | In situ nucleic acid sequencing of expanded biological samples |
CA2992492A1 (en) | 2015-07-17 | 2017-01-26 | Nanostring Technologies, Inc. | Simultaneous quantification of gene expression in a user-defined region of a cross-sectioned tissue |
DK3329012T3 (en) | 2015-07-27 | 2021-10-11 | Illumina Inc | Spatial mapping of nucleic acid sequence information |
EP3332029B1 (en) | 2015-08-07 | 2021-10-06 | Massachusetts Institute of Technology | Nanoscale imaging of proteins and nucleic acids via expansion microscopy |
CN108139408B (en) | 2015-08-07 | 2020-08-28 | 麻省理工学院 | Protein retention expansion microscopy |
US20170241911A1 (en) | 2016-02-22 | 2017-08-24 | Miltenyi Biotec Gmbh | Automated analysis tool for biological specimens |
EP3420110B1 (en) | 2016-02-26 | 2021-09-22 | The Board of Trustees of the Leland Stanford Junior University | Multiplexed single molecule rna visualization with a two-probe proximity ligation system |
SG11201809649WA (en) * | 2016-05-02 | 2018-11-29 | Encodia Inc | Macromolecule analysis employing nucleic acid encoding |
US11352667B2 (en) | 2016-06-21 | 2022-06-07 | 10X Genomics, Inc. | Nucleic acid sequencing |
WO2018022809A1 (en) | 2016-07-27 | 2018-02-01 | The Board Of Trustees Of The Leland Stanford Junior University | Highly-multiplexed fluorescent imaging |
WO2018045181A1 (en) | 2016-08-31 | 2018-03-08 | President And Fellows Of Harvard College | Methods of generating libraries of nucleic acid sequences for detection via fluorescent in situ sequencing |
JP7057348B2 (en) | 2016-08-31 | 2022-04-19 | プレジデント アンド フェローズ オブ ハーバード カレッジ | A method of combining biomolecule detection with a single assay using fluorescent in situ sequencing |
US11505819B2 (en) | 2016-09-22 | 2022-11-22 | William Marsh Rice University | Molecular hybridization probes for complex sequence capture and analysis |
GB201619458D0 (en) | 2016-11-17 | 2017-01-04 | Spatial Transcriptomics Ab | Method for spatial tagging and analysing nucleic acids in a biological specimen |
US10656144B2 (en) | 2016-12-02 | 2020-05-19 | The Charlotte Mecklenburg Hospital Authority | Immune profiling and minimal residue disease following stem cell transplantation in multiple myeloma |
CA3043639A1 (en) | 2016-12-09 | 2018-06-14 | Ultivue, Inc. | Improved methods for multiplex imaging using labeled nucleic acid imaging agents |
WO2018136856A1 (en) | 2017-01-23 | 2018-07-26 | Massachusetts Institute Of Technology | Multiplexed signal amplified fish via splinted ligation amplification and sequencing |
CN111263819A (en) | 2017-10-06 | 2020-06-09 | 卡特阿纳公司 | RNA templated ligation |
WO2019075091A1 (en) | 2017-10-11 | 2019-04-18 | Expansion Technologies | Multiplexed in situ hybridization of tissue sections for spatially resolved transcriptomics with expansion microscopy |
EP3788146A4 (en) * | 2018-05-02 | 2022-06-01 | The General Hospital Corporation | High-resolution spatial macromolecule abundance assessment |
US20230323447A1 (en) * | 2018-08-28 | 2023-10-12 | 10X Genomics, Inc. | Method for transposase-mediated spatial tagging and analyzing genomic dna in a biological sample |
US11429718B2 (en) | 2018-09-17 | 2022-08-30 | Schneider Electric Systems Usa, Inc. | Industrial system event detection and corresponding response |
US11694779B2 (en) | 2018-09-17 | 2023-07-04 | Labsavvy Health, Llc | Systems and methods for automated reporting and education for laboratory test results |
EP3894586A2 (en) | 2018-12-10 | 2021-10-20 | 10X Genomics, Inc. | Methods for determining a location of a biological analyte in a biological sample |
WO2020176788A1 (en) | 2019-02-28 | 2020-09-03 | 10X Genomics, Inc. | Profiling of biological analytes with spatially barcoded oligonucleotide arrays |
-
2021
- 2021-02-19 CN CN202180031221.3A patent/CN116157533A/en active Pending
- 2021-02-19 EP EP21712302.5A patent/EP4107282A1/en active Pending
- 2021-02-19 WO PCT/US2021/018795 patent/WO2021168261A1/en unknown
- 2021-02-19 AU AU2021224760A patent/AU2021224760A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
AU2021224760A1 (en) | 2022-09-15 |
EP4107282A1 (en) | 2022-12-28 |
WO2021168261A1 (en) | 2021-08-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN116157533A (en) | Capturing genetic targets using hybridization methods | |
JP6977014B2 (en) | Transfer to Natural Chromatin for Individual Epigenomics | |
JP7256748B2 (en) | Methods for targeted nucleic acid sequence enrichment with application to error-corrected nucleic acid sequencing | |
WO2022271820A1 (en) | Spatial detection of sars-cov-2 using templated ligation | |
CN115916999A (en) | Methods for spatial analysis using targeted RNA depletion | |
KR102398465B1 (en) | In-depth sequencing profiling of tumors | |
CN107699957B (en) | DNA-based fusion gene quantitative sequencing library construction, detection method and application thereof | |
CN108368554B (en) | Method for subtype typing diffuse large B-cell lymphoma (DLBCL) | |
KR102512168B1 (en) | Method for Quantitatively Analyzing Protein Population Using Next Generation Sequencing and Use Thereof | |
Singh et al. | In situ 10-cell RNA sequencing in tissue and tumor biopsy samples | |
JP2023547394A (en) | Nucleic acid detection method by oligohybridization and PCR-based amplification | |
KR20180041331A (en) | The method and kit of the selection of Molecule-Binding Nucleic Acids and the identification of the targets, and their use | |
Strom | Fundamentals of RNA analysis on biobanked specimens | |
TW202305143A (en) | Highly sensitive methods for accurate parallel quantification of nucleic acids | |
WO2018213642A1 (en) | System and method for isolation and qualification of nucleic acids | |
DNAse | Supplementary Figure 1. Data related to ChRIP. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |