EP4073263A1 - Verfahren zur stabilen komplexbildung und entsprechende kits - Google Patents
Verfahren zur stabilen komplexbildung und entsprechende kitsInfo
- Publication number
- EP4073263A1 EP4073263A1 EP21737980.9A EP21737980A EP4073263A1 EP 4073263 A1 EP4073263 A1 EP 4073263A1 EP 21737980 A EP21737980 A EP 21737980A EP 4073263 A1 EP4073263 A1 EP 4073263A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- binding agent
- binding
- tag
- macromolecule
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 188
- 230000009918 complex formation Effects 0.000 title description 3
- 239000011230 binding agent Substances 0.000 claims abstract description 686
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 448
- 229920002521 macromolecule Polymers 0.000 claims abstract description 323
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 288
- 229920001184 polypeptide Polymers 0.000 claims abstract description 256
- 239000012082 adaptor molecule Substances 0.000 claims abstract description 209
- 238000012546 transfer Methods 0.000 claims abstract description 77
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 28
- 238000009739 binding Methods 0.000 claims description 261
- 230000000087 stabilizing effect Effects 0.000 claims description 254
- 230000027455 binding Effects 0.000 claims description 203
- 238000009396 hybridization Methods 0.000 claims description 155
- 230000000295 complement effect Effects 0.000 claims description 102
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 claims description 101
- 239000003795 chemical substances by application Substances 0.000 claims description 98
- 239000003153 chemical reaction reagent Substances 0.000 claims description 57
- 238000006243 chemical reaction Methods 0.000 claims description 54
- 102000040430 polynucleotide Human genes 0.000 claims description 51
- 108091033319 polynucleotide Proteins 0.000 claims description 51
- 239000002157 polynucleotide Substances 0.000 claims description 51
- 239000000126 substance Substances 0.000 claims description 16
- 238000001514 detection method Methods 0.000 claims description 15
- 125000000539 amino acid group Chemical group 0.000 claims description 12
- 230000000368 destabilizing effect Effects 0.000 claims description 6
- 239000003607 modifier Substances 0.000 claims description 5
- 238000003203 nucleic acid sequencing method Methods 0.000 claims description 2
- 102000004169 proteins and genes Human genes 0.000 abstract description 168
- 108090000623 proteins and genes Proteins 0.000 abstract description 167
- 150000007523 nucleic acids Chemical class 0.000 abstract description 138
- 102000039446 nucleic acids Human genes 0.000 abstract description 135
- 108020004707 nucleic acids Proteins 0.000 abstract description 134
- 239000002585 base Substances 0.000 description 227
- 229940024606 amino acid Drugs 0.000 description 217
- 235000001014 amino acid Nutrition 0.000 description 216
- 150000001413 amino acids Chemical class 0.000 description 168
- 235000018102 proteins Nutrition 0.000 description 158
- 125000006850 spacer group Chemical group 0.000 description 100
- 239000011324 bead Substances 0.000 description 99
- 239000000523 sample Substances 0.000 description 77
- 108020004414 DNA Proteins 0.000 description 67
- 238000004458 analytical method Methods 0.000 description 65
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 59
- 238000012163 sequencing technique Methods 0.000 description 52
- 238000003556 assay Methods 0.000 description 50
- 125000003275 alpha amino acid group Chemical group 0.000 description 47
- 230000003993 interaction Effects 0.000 description 46
- -1 e.g. Polymers 0.000 description 38
- 230000037452 priming Effects 0.000 description 38
- 239000007787 solid Substances 0.000 description 36
- 230000004481 post-translational protein modification Effects 0.000 description 35
- 210000004027 cell Anatomy 0.000 description 34
- 235000020958 biotin Nutrition 0.000 description 30
- 229960002685 biotin Drugs 0.000 description 30
- 239000011616 biotin Substances 0.000 description 30
- 230000004048 modification Effects 0.000 description 24
- 238000012986 modification Methods 0.000 description 24
- 108091034117 Oligonucleotide Proteins 0.000 description 23
- 230000002441 reversible effect Effects 0.000 description 23
- 229920000642 polymer Polymers 0.000 description 21
- 239000000758 substrate Substances 0.000 description 20
- 239000003446 ligand Substances 0.000 description 19
- 230000006641 stabilisation Effects 0.000 description 19
- 238000011105 stabilization Methods 0.000 description 19
- 125000003729 nucleotide group Chemical group 0.000 description 18
- 108010026552 Proteome Proteins 0.000 description 17
- 239000000203 mixture Substances 0.000 description 17
- 239000002773 nucleotide Substances 0.000 description 17
- 239000012634 fragment Substances 0.000 description 16
- AUTOLBMXDDTRRT-JGVFFNPUSA-N (4R,5S)-dethiobiotin Chemical compound C[C@@H]1NC(=O)N[C@@H]1CCCCCC(O)=O AUTOLBMXDDTRRT-JGVFFNPUSA-N 0.000 description 15
- 238000002372 labelling Methods 0.000 description 15
- 210000001519 tissue Anatomy 0.000 description 15
- 102000004190 Enzymes Human genes 0.000 description 14
- 108090000790 Enzymes Proteins 0.000 description 14
- 238000013459 approach Methods 0.000 description 14
- 229940088598 enzyme Drugs 0.000 description 14
- 239000012528 membrane Substances 0.000 description 14
- 238000005192 partition Methods 0.000 description 14
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 14
- LOTKRQAVGJMPNV-UHFFFAOYSA-N 1-fluoro-2,4-dinitrobenzene Chemical compound [O-][N+](=O)C1=CC=C(F)C([N+]([O-])=O)=C1 LOTKRQAVGJMPNV-UHFFFAOYSA-N 0.000 description 13
- 108091023037 Aptamer Proteins 0.000 description 13
- 108091093037 Peptide nucleic acid Proteins 0.000 description 13
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 13
- 239000002105 nanoparticle Substances 0.000 description 13
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 11
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 11
- 239000000020 Nitrocellulose Substances 0.000 description 11
- 239000012472 biological sample Substances 0.000 description 11
- 229910021645 metal ion Inorganic materials 0.000 description 11
- 238000007481 next generation sequencing Methods 0.000 description 11
- 229920001220 nitrocellulos Polymers 0.000 description 11
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 11
- 230000009870 specific binding Effects 0.000 description 11
- 241000282414 Homo sapiens Species 0.000 description 10
- 102000035195 Peptidases Human genes 0.000 description 10
- 108091005804 Peptidases Proteins 0.000 description 10
- 108010090804 Streptavidin Proteins 0.000 description 10
- 210000001124 body fluid Anatomy 0.000 description 10
- 210000004899 c-terminal region Anatomy 0.000 description 10
- 238000005859 coupling reaction Methods 0.000 description 10
- 239000011521 glass Substances 0.000 description 10
- QKFJKGMPGYROCL-UHFFFAOYSA-N phenyl isothiocyanate Chemical compound S=C=NC1=CC=CC=C1 QKFJKGMPGYROCL-UHFFFAOYSA-N 0.000 description 10
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 10
- 229920001223 polyethylene glycol Polymers 0.000 description 10
- 238000005406 washing Methods 0.000 description 10
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 9
- 239000002202 Polyethylene glycol Substances 0.000 description 9
- 230000008901 benefit Effects 0.000 description 9
- 230000001413 cellular effect Effects 0.000 description 9
- 230000008878 coupling Effects 0.000 description 9
- 238000010168 coupling process Methods 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 230000013595 glycosylation Effects 0.000 description 9
- 238000006206 glycosylation reaction Methods 0.000 description 9
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 9
- 102000005367 Carboxypeptidases Human genes 0.000 description 8
- 108010006303 Carboxypeptidases Proteins 0.000 description 8
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 8
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 8
- 229920002307 Dextran Polymers 0.000 description 8
- 102000005593 Endopeptidases Human genes 0.000 description 8
- 108010059378 Endopeptidases Proteins 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 8
- 108060003951 Immunoglobulin Proteins 0.000 description 8
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 8
- 239000004372 Polyvinyl alcohol Substances 0.000 description 8
- 239000004365 Protease Substances 0.000 description 8
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 8
- 125000003277 amino group Chemical group 0.000 description 8
- 230000003321 amplification Effects 0.000 description 8
- 150000001540 azides Chemical class 0.000 description 8
- 150000001875 compounds Chemical class 0.000 description 8
- 102000018358 immunoglobulin Human genes 0.000 description 8
- 230000011987 methylation Effects 0.000 description 8
- 238000007069 methylation reaction Methods 0.000 description 8
- 238000003199 nucleic acid amplification method Methods 0.000 description 8
- 230000036961 partial effect Effects 0.000 description 8
- 229920002451 polyvinyl alcohol Polymers 0.000 description 8
- 229940068984 polyvinyl alcohol Drugs 0.000 description 8
- 235000019422 polyvinyl alcohol Nutrition 0.000 description 8
- 229920000936 Agarose Polymers 0.000 description 7
- 108090000915 Aminopeptidases Proteins 0.000 description 7
- 102000004400 Aminopeptidases Human genes 0.000 description 7
- 238000002965 ELISA Methods 0.000 description 7
- 108091093094 Glycol nucleic acid Proteins 0.000 description 7
- 239000004793 Polystyrene Substances 0.000 description 7
- 238000006640 acetylation reaction Methods 0.000 description 7
- 150000001345 alkine derivatives Chemical class 0.000 description 7
- 230000008859 change Effects 0.000 description 7
- 238000003776 cleavage reaction Methods 0.000 description 7
- 125000004122 cyclic group Chemical group 0.000 description 7
- 239000000839 emulsion Substances 0.000 description 7
- 230000002255 enzymatic effect Effects 0.000 description 7
- 230000003100 immobilizing effect Effects 0.000 description 7
- 230000026731 phosphorylation Effects 0.000 description 7
- 238000006366 phosphorylation reaction Methods 0.000 description 7
- 229920002223 polystyrene Polymers 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 7
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 6
- 102000012410 DNA Ligases Human genes 0.000 description 6
- 108010061982 DNA Ligases Proteins 0.000 description 6
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 6
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 6
- 239000004677 Nylon Substances 0.000 description 6
- 108010003723 Single-Domain Antibodies Proteins 0.000 description 6
- 108091046915 Threose nucleic acid Proteins 0.000 description 6
- 230000021736 acetylation Effects 0.000 description 6
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 6
- 125000000217 alkyl group Chemical group 0.000 description 6
- 150000001408 amides Chemical class 0.000 description 6
- 238000000137 annealing Methods 0.000 description 6
- DMLAVOWQYNRWNQ-UHFFFAOYSA-N azobenzene Chemical compound C1=CC=CC=C1N=NC1=CC=CC=C1 DMLAVOWQYNRWNQ-UHFFFAOYSA-N 0.000 description 6
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 239000010839 body fluid Substances 0.000 description 6
- 150000001720 carbohydrates Chemical class 0.000 description 6
- 235000014633 carbohydrates Nutrition 0.000 description 6
- 229920002678 cellulose Polymers 0.000 description 6
- 239000001913 cellulose Substances 0.000 description 6
- 238000012512 characterization method Methods 0.000 description 6
- 230000029087 digestion Effects 0.000 description 6
- 238000006073 displacement reaction Methods 0.000 description 6
- 239000000975 dye Substances 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000029226 lipidation Effects 0.000 description 6
- 239000000463 material Substances 0.000 description 6
- 239000004005 microsphere Substances 0.000 description 6
- 229920001778 nylon Polymers 0.000 description 6
- 229940117953 phenylisothiocyanate Drugs 0.000 description 6
- 210000002381 plasma Anatomy 0.000 description 6
- 229920000058 polyacrylate Polymers 0.000 description 6
- 235000019419 proteases Nutrition 0.000 description 6
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 description 5
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 description 5
- 239000004475 Arginine Substances 0.000 description 5
- 102000053602 DNA Human genes 0.000 description 5
- 102100031780 Endonuclease Human genes 0.000 description 5
- 108010067770 Endopeptidase K Proteins 0.000 description 5
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 5
- 230000004913 activation Effects 0.000 description 5
- 235000004279 alanine Nutrition 0.000 description 5
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 5
- 150000001299 aldehydes Chemical class 0.000 description 5
- 150000001336 alkenes Chemical class 0.000 description 5
- 239000000427 antigen Substances 0.000 description 5
- 108091007433 antigens Proteins 0.000 description 5
- 102000036639 antigens Human genes 0.000 description 5
- 230000015556 catabolic process Effects 0.000 description 5
- 230000021615 conjugation Effects 0.000 description 5
- 238000006731 degradation reaction Methods 0.000 description 5
- 238000010348 incorporation Methods 0.000 description 5
- 238000005304 joining Methods 0.000 description 5
- 239000011159 matrix material Substances 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 5
- 239000004810 polytetrafluoroethylene Substances 0.000 description 5
- 239000011148 porous material Substances 0.000 description 5
- 125000006239 protecting group Chemical group 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 239000000377 silicon dioxide Substances 0.000 description 5
- 150000003384 small molecules Chemical class 0.000 description 5
- 238000012421 spiking Methods 0.000 description 5
- 230000034512 ubiquitination Effects 0.000 description 5
- 238000010798 ubiquitination Methods 0.000 description 5
- ROFVEXUMMXZLPA-UHFFFAOYSA-N Bipyridyl Chemical group N1=CC=CC=C1C1=CC=CC=N1 ROFVEXUMMXZLPA-UHFFFAOYSA-N 0.000 description 4
- 238000005698 Diels-Alder reaction Methods 0.000 description 4
- 108010016626 Dipeptides Proteins 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- 108060004795 Methyltransferase Proteins 0.000 description 4
- 230000006295 S-nitrosylation Effects 0.000 description 4
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 4
- 101710126859 Single-stranded DNA-binding protein Proteins 0.000 description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 4
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 4
- 230000029936 alkylation Effects 0.000 description 4
- 238000005804 alkylation reaction Methods 0.000 description 4
- 230000009435 amidation Effects 0.000 description 4
- 238000007112 amidation reaction Methods 0.000 description 4
- 239000012491 analyte Substances 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 238000003491 array Methods 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000006287 biotinylation Effects 0.000 description 4
- 238000007413 biotinylation Methods 0.000 description 4
- 229910052799 carbon Inorganic materials 0.000 description 4
- 230000001276 controlling effect Effects 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 238000010494 dissociation reaction Methods 0.000 description 4
- 230000005593 dissociations Effects 0.000 description 4
- 230000008030 elimination Effects 0.000 description 4
- 238000003379 elimination reaction Methods 0.000 description 4
- 239000000835 fiber Substances 0.000 description 4
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 4
- 238000007672 fourth generation sequencing Methods 0.000 description 4
- 238000013467 fragmentation Methods 0.000 description 4
- 238000006062 fragmentation reaction Methods 0.000 description 4
- 108020001507 fusion proteins Proteins 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- 229910052739 hydrogen Inorganic materials 0.000 description 4
- 239000001257 hydrogen Substances 0.000 description 4
- 230000002209 hydrophobic effect Effects 0.000 description 4
- 150000002632 lipids Chemical class 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108010087904 neutravidin Proteins 0.000 description 4
- 230000009871 nonspecific binding Effects 0.000 description 4
- 210000003463 organelle Anatomy 0.000 description 4
- 230000003647 oxidation Effects 0.000 description 4
- 238000007254 oxidation reaction Methods 0.000 description 4
- 238000002823 phage display Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 239000011347 resin Substances 0.000 description 4
- 229920005989 resin Polymers 0.000 description 4
- 210000003296 saliva Anatomy 0.000 description 4
- 210000002966 serum Anatomy 0.000 description 4
- 239000010703 silicon Substances 0.000 description 4
- 229910052710 silicon Inorganic materials 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 238000000638 solvent extraction Methods 0.000 description 4
- 230000035322 succinylation Effects 0.000 description 4
- 238000010613 succinylation reaction Methods 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 229940035893 uracil Drugs 0.000 description 4
- 210000002700 urine Anatomy 0.000 description 4
- QGKMIGUHVLGJBR-UHFFFAOYSA-M (4z)-1-(3-methylbutyl)-4-[[1-(3-methylbutyl)quinolin-1-ium-4-yl]methylidene]quinoline;iodide Chemical compound [I-].C12=CC=CC=C2N(CCC(C)C)C=CC1=CC1=CC=[N+](CCC(C)C)C2=CC=CC=C12 QGKMIGUHVLGJBR-UHFFFAOYSA-M 0.000 description 3
- BAILAVMHWMOTOK-UHFFFAOYSA-N 2-fluoro-1-nitro-5-sulfonylcyclohexa-1,3-diene Chemical compound S(=O)(=O)=C1CC(=C(C=C1)F)[N+](=O)[O-] BAILAVMHWMOTOK-UHFFFAOYSA-N 0.000 description 3
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 3
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 3
- 102000008186 Collagen Human genes 0.000 description 3
- 108010035532 Collagen Proteins 0.000 description 3
- 150000008574 D-amino acids Chemical class 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 3
- 102000018389 Exopeptidases Human genes 0.000 description 3
- 108010091443 Exopeptidases Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- 229920002683 Glycosaminoglycan Polymers 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- 150000008575 L-amino acids Chemical class 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- 108090001090 Lectins Proteins 0.000 description 3
- 102000004856 Lectins Human genes 0.000 description 3
- 102000003960 Ligases Human genes 0.000 description 3
- 108090000364 Ligases Proteins 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- VEQPNABPJHWNSG-UHFFFAOYSA-N Nickel(2+) Chemical compound [Ni+2] VEQPNABPJHWNSG-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 102000007079 Peptide Fragments Human genes 0.000 description 3
- 108010033276 Peptide Fragments Proteins 0.000 description 3
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 3
- 229920002732 Polyanhydride Polymers 0.000 description 3
- 239000004698 Polyethylene Substances 0.000 description 3
- 229920000954 Polyglycolide Polymers 0.000 description 3
- 229920001710 Polyorthoester Polymers 0.000 description 3
- 239000004743 Polypropylene Substances 0.000 description 3
- 206010036790 Productive cough Diseases 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 3
- 230000006297 S-sulfenylation Effects 0.000 description 3
- BLRPTPMANUNPDV-UHFFFAOYSA-N Silane Chemical compound [SiH4] BLRPTPMANUNPDV-UHFFFAOYSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 239000004809 Teflon Substances 0.000 description 3
- 229920006362 Teflon® Polymers 0.000 description 3
- DPOPAJRDYZGTIR-UHFFFAOYSA-N Tetrazine Chemical compound C1=CN=NN=N1 DPOPAJRDYZGTIR-UHFFFAOYSA-N 0.000 description 3
- 101710183280 Topoisomerase Proteins 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 230000003213 activating effect Effects 0.000 description 3
- 230000010933 acylation Effects 0.000 description 3
- 238000005917 acylation reaction Methods 0.000 description 3
- 102000035181 adaptor proteins Human genes 0.000 description 3
- 108091005764 adaptor proteins Proteins 0.000 description 3
- 229960005305 adenosine Drugs 0.000 description 3
- 150000001412 amines Chemical class 0.000 description 3
- 235000009582 asparagine Nutrition 0.000 description 3
- 229960001230 asparagine Drugs 0.000 description 3
- HSDAJNMJOMSNEV-UHFFFAOYSA-N benzyl chloroformate Chemical compound ClC(=O)OCC1=CC=CC=C1 HSDAJNMJOMSNEV-UHFFFAOYSA-N 0.000 description 3
- 125000001584 benzyloxycarbonyl group Chemical group C(=O)(OCC1=CC=CC=C1)* 0.000 description 3
- 230000006242 butyrylation Effects 0.000 description 3
- 238000010514 butyrylation reaction Methods 0.000 description 3
- 238000005251 capillar electrophoresis Methods 0.000 description 3
- 230000021235 carbamoylation Effects 0.000 description 3
- 230000006315 carbonylation Effects 0.000 description 3
- 238000005810 carbonylation reaction Methods 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 3
- 229920001436 collagen Polymers 0.000 description 3
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 230000006240 deamidation Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- FOOBQHKMWYGHCE-UHFFFAOYSA-N diphthamide Chemical compound C[N+](C)(C)C(C(N)=O)CCC1=NC=C(CC(N)C([O-])=O)N1 FOOBQHKMWYGHCE-UHFFFAOYSA-N 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 230000006330 eliminylation Effects 0.000 description 3
- 150000002118 epoxides Chemical class 0.000 description 3
- 150000002148 esters Chemical class 0.000 description 3
- 210000000416 exudates and transudate Anatomy 0.000 description 3
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 3
- 230000022244 formylation Effects 0.000 description 3
- 238000006170 formylation reaction Methods 0.000 description 3
- 238000005194 fractionation Methods 0.000 description 3
- 230000006251 gamma-carboxylation Effects 0.000 description 3
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 3
- 230000035430 glutathionylation Effects 0.000 description 3
- 230000006238 glycylation Effects 0.000 description 3
- 230000006095 glypiation Effects 0.000 description 3
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 3
- 239000010931 gold Substances 0.000 description 3
- 229910052737 gold Inorganic materials 0.000 description 3
- 230000006149 hemylation Effects 0.000 description 3
- 230000033444 hydroxylation Effects 0.000 description 3
- 238000005805 hydroxylation reaction Methods 0.000 description 3
- 230000006164 hypusine formation Effects 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 238000003018 immunoassay Methods 0.000 description 3
- 229940072221 immunoglobulins Drugs 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 238000005305 interferometry Methods 0.000 description 3
- 230000026045 iodination Effects 0.000 description 3
- 238000006192 iodination reaction Methods 0.000 description 3
- 238000006317 isomerization reaction Methods 0.000 description 3
- 239000002523 lectin Substances 0.000 description 3
- 230000006144 lipoylation Effects 0.000 description 3
- 210000002751 lymph Anatomy 0.000 description 3
- 235000018977 lysine Nutrition 0.000 description 3
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 3
- 150000002678 macrocyclic compounds Chemical class 0.000 description 3
- 230000017538 malonylation Effects 0.000 description 3
- WDWDWGRYHDPSDS-UHFFFAOYSA-N methanimine Chemical compound N=C WDWDWGRYHDPSDS-UHFFFAOYSA-N 0.000 description 3
- 239000011325 microbead Substances 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 210000003470 mitochondria Anatomy 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 3
- 210000004940 nucleus Anatomy 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 230000026792 palmitoylation Effects 0.000 description 3
- 230000005298 paramagnetic effect Effects 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 230000006320 pegylation Effects 0.000 description 3
- 230000005261 phosphopantetheinylation Effects 0.000 description 3
- 150000008300 phosphoramidites Chemical class 0.000 description 3
- 238000007699 photoisomerization reaction Methods 0.000 description 3
- 229920003023 plastic Polymers 0.000 description 3
- 239000004033 plastic Substances 0.000 description 3
- 229920001308 poly(aminoacid) Polymers 0.000 description 3
- 229920000747 poly(lactic acid) Polymers 0.000 description 3
- 239000004417 polycarbonate Substances 0.000 description 3
- 229920000515 polycarbonate Polymers 0.000 description 3
- 229920000728 polyester Polymers 0.000 description 3
- 229920000573 polyethylene Polymers 0.000 description 3
- 239000004633 polyglycolic acid Substances 0.000 description 3
- 239000004626 polylactic acid Substances 0.000 description 3
- 229920000193 polymethacrylate Polymers 0.000 description 3
- 229920001155 polypropylene Polymers 0.000 description 3
- 229920001299 polypropylene fumarate Polymers 0.000 description 3
- 239000004800 polyvinyl chloride Substances 0.000 description 3
- 229920000915 polyvinyl chloride Polymers 0.000 description 3
- 230000013823 prenylation Effects 0.000 description 3
- 235000013930 proline Nutrition 0.000 description 3
- 229960002429 proline Drugs 0.000 description 3
- 230000006289 propionylation Effects 0.000 description 3
- 238000010515 propionylation reaction Methods 0.000 description 3
- 235000019833 protease Nutrition 0.000 description 3
- 150000003212 purines Chemical class 0.000 description 3
- 239000010453 quartz Substances 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000006159 retinylidene Schiff base formation Effects 0.000 description 3
- 210000003705 ribosome Anatomy 0.000 description 3
- 238000005922 selenation reaction Methods 0.000 description 3
- 229910000077 silane Inorganic materials 0.000 description 3
- 229920002379 silicone rubber Polymers 0.000 description 3
- 238000009987 spinning Methods 0.000 description 3
- 210000003802 sputum Anatomy 0.000 description 3
- 208000024794 sputum Diseases 0.000 description 3
- 235000000346 sugar Nutrition 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- URYYVOIYTNXXBN-OWOJBTEDSA-N trans-cyclooctene Chemical compound C1CCC\C=C\CC1 URYYVOIYTNXXBN-OWOJBTEDSA-N 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- 125000004178 (C1-C4) alkyl group Chemical group 0.000 description 2
- OKUWOEKJQRUMBW-UHFFFAOYSA-N 2-[2-(2-methoxyethoxy)ethoxy]ethanamine Chemical compound COCCOCCOCCN OKUWOEKJQRUMBW-UHFFFAOYSA-N 0.000 description 2
- SMBSZJBWYCGCJP-UHFFFAOYSA-N 3-(diethylamino)chromen-2-one Chemical compound C1=CC=C2OC(=O)C(N(CC)CC)=CC2=C1 SMBSZJBWYCGCJP-UHFFFAOYSA-N 0.000 description 2
- JLBJTVDPSNHSKJ-UHFFFAOYSA-N 4-Methylstyrene Chemical compound CC1=CC=C(C=C)C=C1 JLBJTVDPSNHSKJ-UHFFFAOYSA-N 0.000 description 2
- QBHQECQPBHQTKE-UHFFFAOYSA-N 5-phenyltetrazine Chemical compound C1=CC=CC=C1C1=CN=NN=N1 QBHQECQPBHQTKE-UHFFFAOYSA-N 0.000 description 2
- YRKUVKRSQWSHRX-UHFFFAOYSA-N 6-nitro-5-sulfonylcyclohexa-1,3-dien-1-ol Chemical group S(=O)(=O)=C1C(C(=CC=C1)O)[N+](=O)[O-] YRKUVKRSQWSHRX-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 101800001415 Bri23 peptide Proteins 0.000 description 2
- 101800000655 C-terminal peptide Proteins 0.000 description 2
- 102400000107 C-terminal peptide Human genes 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- 108091064528 ClpS family Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 2
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 2
- 101150026402 DBP gene Proteins 0.000 description 2
- 238000000018 DNA microarray Methods 0.000 description 2
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 2
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 2
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 2
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 2
- XPDXVDYUQZHFPV-UHFFFAOYSA-N Dansyl Chloride Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(Cl)(=O)=O XPDXVDYUQZHFPV-UHFFFAOYSA-N 0.000 description 2
- 241000701867 Enterobacteria phage T7 Species 0.000 description 2
- 241000283086 Equidae Species 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- 108020005004 Guide RNA Proteins 0.000 description 2
- OAKJQQAXSVQMHS-UHFFFAOYSA-N Hydrazine Chemical compound NN OAKJQQAXSVQMHS-UHFFFAOYSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 description 2
- ZKZBPNGNEQAJSX-REOHCLBHSA-N L-selenocysteine Chemical compound [SeH]C[C@H](N)C(O)=O ZKZBPNGNEQAJSX-REOHCLBHSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 2
- 108010085220 Multiprotein Complexes Proteins 0.000 description 2
- 102000007474 Multiprotein Complexes Human genes 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- PYUSHNKNPOHWEZ-YFKPBYRVSA-N N-formyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC=O PYUSHNKNPOHWEZ-YFKPBYRVSA-N 0.000 description 2
- UGJBHEZMOKVTIM-UHFFFAOYSA-N N-formylglycine Chemical compound OC(=O)CNC=O UGJBHEZMOKVTIM-UHFFFAOYSA-N 0.000 description 2
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 108091081548 Palindromic sequence Proteins 0.000 description 2
- 241001494479 Pecora Species 0.000 description 2
- 108090000284 Pepsin A Proteins 0.000 description 2
- 102000057297 Pepsin A Human genes 0.000 description 2
- 108010079855 Peptide Aptamers Proteins 0.000 description 2
- 101710093976 Plasmid-derived single-stranded DNA-binding protein Proteins 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 2
- 108091008103 RNA aptamers Proteins 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 101710176276 SSB protein Proteins 0.000 description 2
- 108010092505 SpyTag peptide Proteins 0.000 description 2
- 238000003800 Staudinger reaction Methods 0.000 description 2
- 101150104425 T4 gene Proteins 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 238000005411 Van der Waals force Methods 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 150000001294 alanine derivatives Chemical class 0.000 description 2
- 125000005907 alkyl ester group Chemical group 0.000 description 2
- ZOJBYZNEUISWFT-UHFFFAOYSA-N allyl isothiocyanate Chemical compound C=CCN=C=S ZOJBYZNEUISWFT-UHFFFAOYSA-N 0.000 description 2
- 210000004381 amniotic fluid Anatomy 0.000 description 2
- 210000001742 aqueous humor Anatomy 0.000 description 2
- 210000003567 ascitic fluid Anatomy 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
- 239000012965 benzophenone Substances 0.000 description 2
- MJSHDCCLFGOEIK-UHFFFAOYSA-N benzyl (2,5-dioxopyrrolidin-1-yl) carbonate Chemical compound O=C1CCC(=O)N1OC(=O)OCC1=CC=CC=C1 MJSHDCCLFGOEIK-UHFFFAOYSA-N 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- WHGYBXFWUBPSRW-FOUAGVGXSA-N beta-cyclodextrin Chemical compound OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CO)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO WHGYBXFWUBPSRW-FOUAGVGXSA-N 0.000 description 2
- 108091008324 binding proteins Proteins 0.000 description 2
- 239000013060 biological fluid Substances 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 210000001185 bone marrow Anatomy 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 238000000533 capillary isoelectric focusing Methods 0.000 description 2
- 238000001649 capillary isotachophoresis Methods 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 239000000919 ceramic Substances 0.000 description 2
- 210000002939 cerumen Anatomy 0.000 description 2
- 238000012412 chemical coupling Methods 0.000 description 2
- 238000007385 chemical modification Methods 0.000 description 2
- 238000007697 cis-trans-isomerization reaction Methods 0.000 description 2
- 101150038575 clpS gene Proteins 0.000 description 2
- 229910052802 copper Inorganic materials 0.000 description 2
- 239000010949 copper Substances 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- LIKFHECYJZWXFJ-UHFFFAOYSA-N dimethyldichlorosilane Chemical compound C[Si](C)(Cl)Cl LIKFHECYJZWXFJ-UHFFFAOYSA-N 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000005538 encapsulation Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 230000009144 enzymatic modification Effects 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 210000000981 epithelium Anatomy 0.000 description 2
- 210000003722 extracellular fluid Anatomy 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 125000000524 functional group Chemical group 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000001879 gelation Methods 0.000 description 2
- KWIUHFFTVRNATP-UHFFFAOYSA-N glycine betaine Chemical compound C[N+](C)(C)CC([O-])=O KWIUHFFTVRNATP-UHFFFAOYSA-N 0.000 description 2
- 150000002332 glycine derivatives Chemical class 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 125000000623 heterocyclic group Chemical group 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 238000005342 ion exchange Methods 0.000 description 2
- 230000002427 irreversible effect Effects 0.000 description 2
- 150000002605 large molecules Chemical class 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 235000019689 luncheon sausage Nutrition 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 125000000250 methylamino group Chemical group [H]N(*)C([H])([H])[H] 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000011859 microparticle Substances 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 210000003097 mucus Anatomy 0.000 description 2
- 210000003205 muscle Anatomy 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 210000000944 nerve tissue Anatomy 0.000 description 2
- 108091008104 nucleic acid aptamers Proteins 0.000 description 2
- 239000012038 nucleophile Substances 0.000 description 2
- 150000002894 organic compounds Chemical class 0.000 description 2
- 229940111202 pepsin Drugs 0.000 description 2
- 150000002994 phenylalanines Chemical class 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 229920001296 polysiloxane Polymers 0.000 description 2
- 230000001323 posttranslational effect Effects 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000007639 printing Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000000734 protein sequencing Methods 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 150000004728 pyruvic acid derivatives Chemical class 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 2
- 235000016491 selenocysteine Nutrition 0.000 description 2
- 229940055619 selenocysteine Drugs 0.000 description 2
- 210000000582 semen Anatomy 0.000 description 2
- 238000007841 sequencing by ligation Methods 0.000 description 2
- 239000011343 solid material Substances 0.000 description 2
- 108010068698 spleen exonuclease Proteins 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 210000001179 synovial fluid Anatomy 0.000 description 2
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 150000003536 tetrazoles Chemical class 0.000 description 2
- 239000010409 thin film Substances 0.000 description 2
- 238000006449 thioacetylation reaction Methods 0.000 description 2
- 238000005636 thioacylation reaction Methods 0.000 description 2
- 150000003573 thiols Chemical class 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 150000003667 tyrosine derivatives Chemical class 0.000 description 2
- ZKAOVABYLXQUTI-UHFFFAOYSA-N (2-acetylphenyl)boronic acid Chemical compound CC(=O)C1=CC=CC=C1B(O)O ZKAOVABYLXQUTI-UHFFFAOYSA-N 0.000 description 1
- DGUWACLYDSWXRZ-UHFFFAOYSA-N (2-formylphenyl)boronic acid Chemical compound OB(O)C1=CC=CC=C1C=O DGUWACLYDSWXRZ-UHFFFAOYSA-N 0.000 description 1
- MUVQIIBPDFTEKM-IMJSIDKUSA-N (2s,3s)-2-aminobutane-1,3-diol Chemical compound C[C@H](O)[C@@H](N)CO MUVQIIBPDFTEKM-IMJSIDKUSA-N 0.000 description 1
- DMLAVOWQYNRWNQ-BUHFOSPRSA-N (E)-azobenzene Chemical compound C1=CC=CC=C1\N=N\C1=CC=CC=C1 DMLAVOWQYNRWNQ-BUHFOSPRSA-N 0.000 description 1
- GOXVPASJCHROBY-UHFFFAOYSA-N 1,1,1-trifluoro-2-isothiocyanatoethane Chemical compound FC(F)(F)CN=C=S GOXVPASJCHROBY-UHFFFAOYSA-N 0.000 description 1
- NGNKMRBGZPDABE-UHFFFAOYSA-N 1,2,3,4,5-pentafluoro-6-isothiocyanatobenzene Chemical compound FC1=C(F)C(F)=C(N=C=S)C(F)=C1F NGNKMRBGZPDABE-UHFFFAOYSA-N 0.000 description 1
- JGRQVOJYPHEEEL-UHFFFAOYSA-N 1-acetyl-3,1-benzoxazine-2,4-dione Chemical compound C1=CC=C2C(=O)OC(=O)N(C(=O)C)C2=C1 JGRQVOJYPHEEEL-UHFFFAOYSA-N 0.000 description 1
- NJMWOUFKYKNWDW-UHFFFAOYSA-N 1-ethyl-3-methylimidazolium Chemical compound CCN1C=C[N+](C)=C1 NJMWOUFKYKNWDW-UHFFFAOYSA-N 0.000 description 1
- GFEPANUKFYVALF-UHFFFAOYSA-N 1-isothiocyanato-3-(trifluoromethyl)benzene Chemical compound FC(F)(F)C1=CC=CC(N=C=S)=C1 GFEPANUKFYVALF-UHFFFAOYSA-N 0.000 description 1
- JKSZUQPHKOPVHF-UHFFFAOYSA-N 1-isothiocyanato-4-(trifluoromethoxy)benzene Chemical compound FC(F)(F)OC1=CC=C(N=C=S)C=C1 JKSZUQPHKOPVHF-UHFFFAOYSA-N 0.000 description 1
- DQEVDFQAYLIBRD-UHFFFAOYSA-N 1-isothiocyanato-4-(trifluoromethyl)benzene Chemical compound FC(F)(F)C1=CC=C(N=C=S)C=C1 DQEVDFQAYLIBRD-UHFFFAOYSA-N 0.000 description 1
- JBDOSUUXMYMWQH-UHFFFAOYSA-N 1-naphthyl isothiocyanate Chemical compound C1=CC=C2C(N=C=S)=CC=CC2=C1 JBDOSUUXMYMWQH-UHFFFAOYSA-N 0.000 description 1
- RLOQBKJCOAXOLR-UHFFFAOYSA-N 1h-pyrrole-2-carboxamide Chemical class NC(=O)C1=CC=CN1 RLOQBKJCOAXOLR-UHFFFAOYSA-N 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- YKBGVTZYEHREMT-UHFFFAOYSA-N 2'-deoxyguanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1CC(O)C(CO)O1 YKBGVTZYEHREMT-UHFFFAOYSA-N 0.000 description 1
- YQTCQNIPQMJNTI-UHFFFAOYSA-N 2,2-dimethylpropan-1-one Chemical group CC(C)(C)[C]=O YQTCQNIPQMJNTI-UHFFFAOYSA-N 0.000 description 1
- FALRKNHUBBKYCC-UHFFFAOYSA-N 2-(chloromethyl)pyridine-3-carbonitrile Chemical compound ClCC1=NC=CC=C1C#N FALRKNHUBBKYCC-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- LAXVMANLDGWYJP-UHFFFAOYSA-N 2-amino-5-(2-aminoethyl)naphthalene-1-sulfonic acid Chemical compound NC1=CC=C2C(CCN)=CC=CC2=C1S(O)(=O)=O LAXVMANLDGWYJP-UHFFFAOYSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- CSDSSGBPEUDDEE-UHFFFAOYSA-N 2-formylpyridine Chemical compound O=CC1=CC=CC=N1 CSDSSGBPEUDDEE-UHFFFAOYSA-N 0.000 description 1
- FTBBGQKRYUTLMP-UHFFFAOYSA-N 2-nitro-1h-pyrrole Chemical class [O-][N+](=O)C1=CC=CN1 FTBBGQKRYUTLMP-UHFFFAOYSA-N 0.000 description 1
- NQUNIMFHIWQQGJ-UHFFFAOYSA-N 2-nitro-5-thiocyanatobenzoic acid Chemical compound OC(=O)C1=CC(SC#N)=CC=C1[N+]([O-])=O NQUNIMFHIWQQGJ-UHFFFAOYSA-N 0.000 description 1
- HAGRZCJZAKVSTR-UHFFFAOYSA-N 3-methyl-2-(2-nitrophenyl)sulfanyl-1h-indole Chemical compound N1C2=CC=CC=C2C(C)=C1SC1=CC=CC=C1[N+]([O-])=O HAGRZCJZAKVSTR-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 125000004042 4-aminobutyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])N([H])[H] 0.000 description 1
- IGHBXJSNZCFXNK-UHFFFAOYSA-N 4-chloro-7-nitrobenzofurazan Chemical compound [O-][N+](=O)C1=CC=C(Cl)C2=NON=C12 IGHBXJSNZCFXNK-UHFFFAOYSA-N 0.000 description 1
- SJQRQOKXQKVJGJ-UHFFFAOYSA-N 5-(2-aminoethylamino)naphthalene-1-sulfonic acid Chemical compound C1=CC=C2C(NCCN)=CC=CC2=C1S(O)(=O)=O SJQRQOKXQKVJGJ-UHFFFAOYSA-N 0.000 description 1
- 150000005697 5-halopyrimidines Chemical class 0.000 description 1
- ZLAQATDNGLKIEV-UHFFFAOYSA-N 5-methyl-2-sulfanylidene-1h-pyrimidin-4-one Chemical compound CC1=CNC(=S)NC1=O ZLAQATDNGLKIEV-UHFFFAOYSA-N 0.000 description 1
- MSSXOMSJDRHRMC-UHFFFAOYSA-N 9H-purine-2,6-diamine Chemical compound NC1=NC(N)=C2NC=NC2=N1 MSSXOMSJDRHRMC-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 108700015125 Adenovirus DBP Proteins 0.000 description 1
- 101100022253 Aedes aegypti MAL1 gene Proteins 0.000 description 1
- 102100027211 Albumin Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 101710201712 Amino acid binding protein Proteins 0.000 description 1
- 102000008102 Ankyrins Human genes 0.000 description 1
- 108010049777 Ankyrins Proteins 0.000 description 1
- 101100107610 Arabidopsis thaliana ABCF4 gene Proteins 0.000 description 1
- 206010053555 Arthritis bacterial Diseases 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- BXTVQNYQYUTQAZ-UHFFFAOYSA-N BNPS-skatole Chemical compound N=1C2=CC=CC=C2C(C)(Br)C=1SC1=CC=CC=C1[N+]([O-])=O BXTVQNYQYUTQAZ-UHFFFAOYSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 108090000342 C-Type Lectins Proteins 0.000 description 1
- 102000003930 C-Type Lectins Human genes 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 102000003670 Carboxypeptidase B Human genes 0.000 description 1
- 108090000087 Carboxypeptidase B Proteins 0.000 description 1
- 206010050337 Cerumen impaction Diseases 0.000 description 1
- 108010023798 Charybdotoxin Proteins 0.000 description 1
- 241000251730 Chondrichthyes Species 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 235000016795 Cola Nutrition 0.000 description 1
- 244000228088 Cola acuminata Species 0.000 description 1
- 235000011824 Cola pachycarpa Nutrition 0.000 description 1
- 108060002063 Cyclotide Proteins 0.000 description 1
- 108010025905 Cystine-Knot Miniproteins Proteins 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010092681 DNA Primase Proteins 0.000 description 1
- 102000016559 DNA Primase Human genes 0.000 description 1
- 108091008102 DNA aptamers Proteins 0.000 description 1
- 101710134178 DNA polymerase processivity factor BMRF1 Proteins 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 108700022150 Designed Ankyrin Repeat Proteins Proteins 0.000 description 1
- 108090000194 Dipeptidyl-peptidases and tripeptidyl-peptidases Proteins 0.000 description 1
- 102000003779 Dipeptidyl-peptidases and tripeptidyl-peptidases Human genes 0.000 description 1
- 108010049959 Discoidins Proteins 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- 102100037240 E3 ubiquitin-protein ligase UBR2 Human genes 0.000 description 1
- 101000609473 Ecballium elaterium Trypsin inhibitor 2 Proteins 0.000 description 1
- 101710194146 Ecotin Proteins 0.000 description 1
- 108090000860 Endopeptidase Clp Proteins 0.000 description 1
- 101100070287 Enterobacteria phage T4 41 gene Proteins 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 101000643395 Escherichia phage T7 Single-stranded DNA-binding protein Proteins 0.000 description 1
- 241000701533 Escherichia virus T4 Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 108090001126 Furin Proteins 0.000 description 1
- 102100035233 Furin Human genes 0.000 description 1
- 108010001496 Galectin 2 Proteins 0.000 description 1
- 108010001517 Galectin 3 Proteins 0.000 description 1
- 102100021735 Galectin-2 Human genes 0.000 description 1
- 102100039558 Galectin-3 Human genes 0.000 description 1
- 101710121810 Galectin-9 Proteins 0.000 description 1
- 102100031351 Galectin-9 Human genes 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 101000930822 Giardia intestinalis Dipeptidyl-peptidase 4 Proteins 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 201000005569 Gout Diseases 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 101001069938 Griffithsia sp. (strain Q66D336) Griffithsin Proteins 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 108010004901 Haloalkane dehalogenase Proteins 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 206010062717 Increased upper airway secretion Diseases 0.000 description 1
- 208000004575 Infectious Arthritis Diseases 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 102000000853 LDL receptors Human genes 0.000 description 1
- 108010001831 LDL receptors Proteins 0.000 description 1
- 235000014647 Lens culinaris subsp culinaris Nutrition 0.000 description 1
- 244000043158 Lens esculenta Species 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000019298 Lipocalin Human genes 0.000 description 1
- 108050006654 Lipocalin Proteins 0.000 description 1
- 102100037750 Malectin Human genes 0.000 description 1
- 101710084935 Malectin Proteins 0.000 description 1
- 241001575980 Mendoza Species 0.000 description 1
- 108010006035 Metalloproteases Proteins 0.000 description 1
- 102000005741 Metalloproteases Human genes 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 108010059724 Micrococcal Nuclease Proteins 0.000 description 1
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 1
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 1
- 101800000135 N-terminal protein Proteins 0.000 description 1
- MRPHEOOGUCXXIJ-UHFFFAOYSA-N N=C=S.CN(C)C1=CC=CC=C1N=NC1=CC=CC=C1 Chemical compound N=C=S.CN(C)C1=CC=CC=C1N=NC1=CC=CC=C1 MRPHEOOGUCXXIJ-UHFFFAOYSA-N 0.000 description 1
- 240000002853 Nelumbo nucifera Species 0.000 description 1
- 235000006508 Nelumbo nucifera Nutrition 0.000 description 1
- 235000006510 Nelumbo pentapetala Nutrition 0.000 description 1
- 101710204212 Neocarzinostatin Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 101800001452 P1 proteinase Proteins 0.000 description 1
- 102000000470 PDZ domains Human genes 0.000 description 1
- 108050008994 PDZ domains Proteins 0.000 description 1
- 108010067372 Pancreatic elastase Proteins 0.000 description 1
- 102000016387 Pancreatic elastase Human genes 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 108010043958 Peptoids Proteins 0.000 description 1
- 208000005228 Pericardial Effusion Diseases 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 101710193132 Pre-hexon-linking protein VIII Proteins 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 102000029797 Prion Human genes 0.000 description 1
- 108091000054 Prion Proteins 0.000 description 1
- 108090000690 Pseudomonas adhesin Proteins 0.000 description 1
- 101001113927 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) PA-I galactophilic lectin Proteins 0.000 description 1
- 108010065868 RNA polymerase SP6 Proteins 0.000 description 1
- 101150085800 RPA2 gene Proteins 0.000 description 1
- 101150030992 RPA3 gene Proteins 0.000 description 1
- 102000018780 Replication Protein A Human genes 0.000 description 1
- 108010027643 Replication Protein A Proteins 0.000 description 1
- 102100034372 Replication protein A 14 kDa subunit Human genes 0.000 description 1
- 102100035525 Replication protein A 32 kDa subunit Human genes 0.000 description 1
- 241000219061 Rheum Species 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 101100068078 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GCN4 gene Proteins 0.000 description 1
- 101100528939 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) RPA14 gene Proteins 0.000 description 1
- 101100042881 Sambucus nigra SNA-I gene Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 108090000787 Subtilisin Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 208000033809 Suppuration Diseases 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 108010076818 TEV protease Proteins 0.000 description 1
- 102100024554 Tetranectin Human genes 0.000 description 1
- 101100388071 Thermococcus sp. (strain GE8) pol gene Proteins 0.000 description 1
- 108090001109 Thermolysin Proteins 0.000 description 1
- 102100036407 Thioredoxin Human genes 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 102000003441 UBR1 Human genes 0.000 description 1
- 101150118716 UBR1 gene Proteins 0.000 description 1
- 101150056689 UBR2 gene Proteins 0.000 description 1
- 101150008036 UL29 gene Proteins 0.000 description 1
- 101001089018 Ulex europaeus Anti-H(O) lectin 1 Proteins 0.000 description 1
- 101001023076 Ulex europaeus Anti-H(O) lectin 2 Proteins 0.000 description 1
- 101000755508 Urtica dioica Lectin/endochitinase 1 Proteins 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 101100033868 Xenopus laevis rpa2-a gene Proteins 0.000 description 1
- 101100033871 Xenopus laevis rpa2-b gene Proteins 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 101710185494 Zinc finger protein Proteins 0.000 description 1
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 1
- 238000010958 [3+2] cycloaddition reaction Methods 0.000 description 1
- ZVQOOHYFBIDMTQ-UHFFFAOYSA-N [methyl(oxido){1-[6-(trifluoromethyl)pyridin-3-yl]ethyl}-lambda(6)-sulfanylidene]cyanamide Chemical compound N#CN=S(C)(=O)C(C)C1=CC=C(C(F)(F)F)N=C1 ZVQOOHYFBIDMTQ-UHFFFAOYSA-N 0.000 description 1
- 206010000269 abscess Diseases 0.000 description 1
- 230000000397 acetylating effect Effects 0.000 description 1
- ORILYTVJVMAKLC-UHFFFAOYSA-N adamantane Chemical class C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 108091008108 affimer Proteins 0.000 description 1
- 230000008856 allosteric binding Effects 0.000 description 1
- 230000003281 allosteric effect Effects 0.000 description 1
- 235000016720 allyl isothiocyanate Nutrition 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 102000021052 amino acid binding proteins Human genes 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 210000001367 artery Anatomy 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 108010086186 avian pancreatic polypeptide Proteins 0.000 description 1
- IVRMZWNICZWHMI-UHFFFAOYSA-N azide group Chemical group [N-]=[N+]=[N-] IVRMZWNICZWHMI-UHFFFAOYSA-N 0.000 description 1
- 238000010462 azide-alkyne Huisgen cycloaddition reaction Methods 0.000 description 1
- 238000010461 azide-alkyne cycloaddition reaction Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- RWCCWEUUXYIKHB-UHFFFAOYSA-N benzophenone Chemical compound C=1C=CC=CC=1C(=O)C1=CC=CC=C1 RWCCWEUUXYIKHB-UHFFFAOYSA-N 0.000 description 1
- KFEUJDWYNGMDBV-RPHKZZMBSA-N beta-D-Galp-(1->4)-D-GlcpNAc Chemical compound O[C@@H]1[C@@H](NC(=O)C)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 KFEUJDWYNGMDBV-RPHKZZMBSA-N 0.000 description 1
- 229960003237 betaine Drugs 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- LRESCJAINPKJTO-UHFFFAOYSA-N bis(trifluoromethylsulfonyl)azanide;1-ethyl-3-methylimidazol-3-ium Chemical compound CCN1C=C[N+](C)=C1.FC(F)(F)S(=O)(=O)[N-]S(=O)(=O)C(F)(F)F LRESCJAINPKJTO-UHFFFAOYSA-N 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 125000001314 canonical amino-acid group Chemical group 0.000 description 1
- 108091008400 carbohydrate binding proteins Proteins 0.000 description 1
- 102000023852 carbohydrate binding proteins Human genes 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 125000002843 carboxylic acid group Chemical group 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000012292 cell migration Effects 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 210000003756 cervix mucus Anatomy 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 210000001268 chyle Anatomy 0.000 description 1
- 210000004913 chyme Anatomy 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 238000012650 click reaction Methods 0.000 description 1
- 230000008045 co-localization Effects 0.000 description 1
- 210000003022 colostrum Anatomy 0.000 description 1
- 235000021277 colostrum Nutrition 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000007822 coupling agent Substances 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- CNVQLPPZGABUCM-LIGYZCPXSA-N ctx toxin Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H]3CSSC[C@@H](C(N[C@@H](CC=4C5=CC=CC=C5NC=4)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CCCNC(N)=N)NC3=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CO)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3NC=NC=3)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N2)C(C)C)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H]([C@@H](C)O)NC1=O)=O)CCSC)C(C)C)[C@@H](C)O)NC(=O)[C@H]1NC(=O)CC1)C1=CC=CC=C1 CNVQLPPZGABUCM-LIGYZCPXSA-N 0.000 description 1
- ZDOBFUIMGBWEAB-XGFHMVPTSA-N cucurbit[7]uril Chemical compound N1([C@H]2[C@H]3N(C1=O)CN1[C@H]4[C@H]5N(C1=O)CN1[C@H]6[C@H]7N(C1=O)CN1[C@H]8[C@H]9N(C1=O)CN1[C@H]%10[C@H]%11N(C1=O)CN([C@@H]1N(C%12=O)CN%11C(=O)N%10CN9C(=O)N8CN7C(=O)N6CN5C(=O)N4CN3C(=O)N2C2)C3=O)CN4C(=O)N5[C@H]6[C@@H]4N2C(=O)N6CN%12[C@@H]1N3C5 ZDOBFUIMGBWEAB-XGFHMVPTSA-N 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 238000006352 cycloaddition reaction Methods 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 125000001295 dansyl group Chemical group [H]C1=C([H])C(N(C([H])([H])[H])C([H])([H])[H])=C2C([H])=C([H])C([H])=C(C2=C1[H])S(*)(=O)=O 0.000 description 1
- 101150047356 dec-1 gene Proteins 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000012350 deep sequencing Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- VGONTNSXDCQUGY-UHFFFAOYSA-N desoxyinosine Natural products C1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 VGONTNSXDCQUGY-UHFFFAOYSA-N 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 1
- 150000001988 diarylethenes Chemical class 0.000 description 1
- 150000001993 dienes Chemical class 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 230000005750 disease progression Effects 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 229940052761 dopaminergic adamantane derivative Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 210000003060 endolymph Anatomy 0.000 description 1
- 229940066758 endopeptidases Drugs 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 108010014507 erythroagglutinating phytohemagglutinin Proteins 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- KTWOOEGAPBSYNW-UHFFFAOYSA-N ferrocene Chemical compound [Fe+2].C=1C=C[CH-]C=1.C=1C=C[CH-]C=1 KTWOOEGAPBSYNW-UHFFFAOYSA-N 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 238000002189 fluorescence spectrum Methods 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000001997 free-flow electrophoresis Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 210000000232 gallbladder Anatomy 0.000 description 1
- 210000004211 gastric acid Anatomy 0.000 description 1
- 210000004051 gastric juice Anatomy 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 210000004907 gland Anatomy 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 210000002288 golgi apparatus Anatomy 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 235000020256 human milk Nutrition 0.000 description 1
- 210000004251 human milk Anatomy 0.000 description 1
- 239000000017 hydrogel Substances 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 229940099472 immunoglobulin a Drugs 0.000 description 1
- 229940027941 immunoglobulin g Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000007641 inkjet printing Methods 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 239000002608 ionic liquid Substances 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- VYFOAVADNIHPTR-UHFFFAOYSA-N isatoic anhydride Chemical compound NC1=CC=CC=C1CO VYFOAVADNIHPTR-UHFFFAOYSA-N 0.000 description 1
- 239000012948 isocyanate Substances 0.000 description 1
- 150000002513 isocyanates Chemical class 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 230000006122 isoprenylation Effects 0.000 description 1
- VDBNYAPERZTOOF-UHFFFAOYSA-N isoquinolin-1(2H)-one Chemical class C1=CC=C2C(=O)NC=CC2=C1 VDBNYAPERZTOOF-UHFFFAOYSA-N 0.000 description 1
- 108010084553 jacalin Proteins 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 238000002824 mRNA display Methods 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 229920012128 methyl methacrylate acrylonitrile butadiene styrene Polymers 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 210000002500 microbody Anatomy 0.000 description 1
- 230000009149 molecular binding Effects 0.000 description 1
- RAHPKSGJRPHILH-UHFFFAOYSA-N n'-nitroimidazole-1-carboximidamide Chemical compound [O-][N+](=O)N=C(N)N1C=CN=C1 RAHPKSGJRPHILH-UHFFFAOYSA-N 0.000 description 1
- 229950006780 n-acetylglucosamine Drugs 0.000 description 1
- 239000011807 nanoball Substances 0.000 description 1
- 239000007908 nanoemulsion Substances 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- QZGIWPZCWHMVQL-UIYAJPBUSA-N neocarzinostatin chromophore Chemical compound O1[C@H](C)[C@H](O)[C@H](O)[C@@H](NC)[C@H]1O[C@@H]1C/2=C/C#C[C@H]3O[C@@]3([C@@H]3OC(=O)OC3)C#CC\2=C[C@H]1OC(=O)C1=C(O)C=CC2=C(C)C=C(OC)C=C12 QZGIWPZCWHMVQL-UIYAJPBUSA-N 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- VOFUROIFQGPCGE-UHFFFAOYSA-N nile red Chemical compound C1=CC=C2C3=NC4=CC=C(N(CC)CC)C=C4OC3=CC(=O)C2=C1 VOFUROIFQGPCGE-UHFFFAOYSA-N 0.000 description 1
- MGFYIUFZLHCRTH-UHFFFAOYSA-N nitrilotriacetic acid Chemical compound OC(=O)CN(CC(O)=O)CC(O)=O MGFYIUFZLHCRTH-UHFFFAOYSA-N 0.000 description 1
- 125000006502 nitrobenzyl group Chemical group 0.000 description 1
- 230000009635 nitrosylation Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 230000000269 nucleophilic effect Effects 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- IFPHDUVGLXEIOQ-UHFFFAOYSA-N ortho-iodosylbenzoic acid Chemical compound OC(=O)C1=CC=CC=C1I=O IFPHDUVGLXEIOQ-UHFFFAOYSA-N 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 108700041181 parathymosin alpha Proteins 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 210000004912 pericardial fluid Anatomy 0.000 description 1
- 210000004049 perilymph Anatomy 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N phenol group Chemical group C1(=CC=CC=C1)O ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- UYWQUFXKFGHYNT-UHFFFAOYSA-N phenylmethyl ester of formic acid Natural products O=COCC1=CC=CC=C1 UYWQUFXKFGHYNT-UHFFFAOYSA-N 0.000 description 1
- 208000026435 phlegm Diseases 0.000 description 1
- NMHMNPHRMNGLLB-UHFFFAOYSA-N phloretic acid Chemical compound OC(=O)CCC1=CC=C(O)C=C1 NMHMNPHRMNGLLB-UHFFFAOYSA-N 0.000 description 1
- 150000003003 phosphines Chemical class 0.000 description 1
- 150000004713 phosphodiesters Chemical class 0.000 description 1
- UEZVMMHDMIWARA-UHFFFAOYSA-M phosphonate Chemical compound [O-]P(=O)=O UEZVMMHDMIWARA-UHFFFAOYSA-M 0.000 description 1
- 230000002186 photoactivation Effects 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 108010086652 phytohemagglutinin-P Proteins 0.000 description 1
- 210000004910 pleural fluid Anatomy 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920001992 poloxamer 407 Polymers 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 238000003498 protein array Methods 0.000 description 1
- 238000002731 protein assay Methods 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 230000030634 protein phosphate-linked glycosylation Effects 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 239000002213 purine nucleotide Substances 0.000 description 1
- 210000004915 pus Anatomy 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 238000004451 qualitative analysis Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 101150079601 recA gene Proteins 0.000 description 1
- 210000000664 rectum Anatomy 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 239000001022 rhodamine dye Substances 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 210000002374 sebum Anatomy 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000009758 senescence Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 201000001223 septic arthritis Diseases 0.000 description 1
- 125000005629 sialic acid group Chemical group 0.000 description 1
- 150000004760 silicates Chemical class 0.000 description 1
- 238000004557 single molecule detection Methods 0.000 description 1
- 108700014590 single-stranded DNA binding proteins Proteins 0.000 description 1
- RPENMORRBUTCPR-UHFFFAOYSA-M sodium;1-hydroxy-2,5-dioxopyrrolidine-3-sulfonate Chemical compound [Na+].ON1C(=O)CC(S([O-])(=O)=O)C1=O RPENMORRBUTCPR-UHFFFAOYSA-M 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 150000001629 stilbenes Chemical class 0.000 description 1
- 235000021286 stilbenes Nutrition 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 229940014800 succinic anhydride Drugs 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 125000001273 sulfonato group Chemical group [O-]S(*)(=O)=O 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 210000004243 sweat Anatomy 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 239000004557 technical material Substances 0.000 description 1
- 229950001790 tendamistat Drugs 0.000 description 1
- 108010037401 tendamistate Proteins 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 1
- 108010013645 tetranectin Proteins 0.000 description 1
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 1
- 150000007970 thio esters Chemical group 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 238000007671 third-generation sequencing Methods 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- SBUXRMKDJWEXRL-ZWKOTPCHSA-N trans-body Chemical compound O=C([C@@H]1N(C2=O)[C@H](C3=C(C4=CC=CC=C4N3)C1)CC)N2C1=CC=C(F)C=C1 SBUXRMKDJWEXRL-ZWKOTPCHSA-N 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- PIEPQKCYPFFYMG-UHFFFAOYSA-N tris acetate Chemical compound CC(O)=O.OCC(N)(CO)CO PIEPQKCYPFFYMG-UHFFFAOYSA-N 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 229960001322 trypsin Drugs 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 210000000605 viral structure Anatomy 0.000 description 1
- 210000004127 vitreous body Anatomy 0.000 description 1
- 210000004916 vomit Anatomy 0.000 description 1
- 230000008673 vomiting Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 229950009268 zinostatin Drugs 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/543—Immunoassay; Biospecific binding assay; Materials therefor with an insoluble carrier for immobilising immunochemicals
- G01N33/54306—Solid-phase reaction mechanisms
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/58—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances
- G01N33/582—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances with fluorescent label
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B20/00—Methods specially adapted for identifying library members
- C40B20/04—Identifying library members by means of a tag, label, or other readable or detectable entity associated with the library members, e.g. decoding processes
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B70/00—Tags or labels specially adapted for combinatorial chemistry or libraries, e.g. fluorescent tags or bar codes
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/5308—Immunoassay; Biospecific binding assay; Materials therefor for analytes not provided for elsewhere, e.g. nucleic acids, uric acid, worms, mites
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/543—Immunoassay; Biospecific binding assay; Materials therefor with an insoluble carrier for immobilising immunochemicals
- G01N33/54353—Immunoassay; Biospecific binding assay; Materials therefor with an insoluble carrier for immobilising immunochemicals with ligand attached to the carrier via a chemical coupling agent
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
Definitions
- the present disclosure relates to methods and kits for analyzing a macromolecule including information transfer between molecules, such as transfer of identifying information between nucleic acid molecules. Also, methods and kits for forming a stable complex comprising a binding agent and a target (e.g a macromolecule, a polypeptide) to be analyzed are disclosed. Such methods employs barcoding and nucleic acid encoding of molecular recognition events, and/or detectable labels. Also provided herein is a programmable system for information transfer comprising, using or involving one or more adaptor molecules.
- Recognition and binding of molecular targets using binding agents can be useful for characterization and/or detection of target biomolecules.
- Some analysis methods involve non-covalently associated complexes of proteins or proteins with other molecules (see e.g., Xing et al., Plant Physiol. (2016)171(2): 727-758).
- molecular recognition and characterization of a protein or peptide macromolecule can be performed using an immunoassay.
- immunoassay formats including ELISA, multiplex ELISA (e.g., spotted antibody arrays, liquid particle ELISA arrays), digital ELISA, reverse phase protein arrays (RPPA), and others.
- crosslinking reagents and methods exist for applications involving binding agents for targets. It may be preferred that binding agents and detection assays are performed in a manner that allows specificity and stability in a controllable manner that allows processing of a plurality of binding agents and targets. Additionally, speed and reversibility may also be a desired feature for the binding reaction. However, current reagents and techniques are somewhat limited in some of these aspects.
- the present invention provides novel methods and compositions which may be utilized in a wide variety of nucleic acid-based and/or protein (e.g., binding agent)-based procedures, and further provides other related advantages.
- the provided methods for performing a binding reaction are compatible with information transfer, such as between nucleic acids associated with the binding agent and the target, with applications to macromolecule sequencing and/or analysis (e.g, protein sequencing and/or analysis).
- the information transfer is between a nucleic acid tag associated with the binding agent and a nucleic acid tag associated with the target ( e.g ., by extension or ligation).
- a binding reaction comprising contacting a binding agent with a target, wherein the binding agent and the target each comprises or is associated with a stabilizing component; allowing the binding agent to interact with a binding site located on the target; and linking the stabilizing components to form a stable complex comprising the binding agent, the target and the stabilizing components.
- a method for analyzing a macromolecule comprising the steps of: (a) providing a macromolecule joined to a support, wherein the macromolecule comprises or is associated with a first stabilizing component; (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises or is associated with a second stabilizing component; (c) after binding of the binding agent to the macromolecule, linking the first and second stabilizing components together to form a stable complex comprising the binding agent, the macromolecule and the stabilizing components; (d) analyzing the macromolecule by obtaining information about the binding agent bound to the macromolecule.
- the stabilizing components are linked upon introduction of a linking agent, and no covalent bonds are formed during formation of the stable complex. In other embodiments, the stabilizing components are linked upon introduction to light.
- the linking agent comprises a polypeptide.
- the first or second stabilizing component comprises a polynucleotide, and the linking agent comprises a linking polynucleotide that hybridizes to the polynucleotide of one of the stabilizing components.
- the first stabilizing component is the same as the second stabilizing component. In some other embodiments, the first stabilizing component has a lower affinity to the linking agent in comparison to an affinity of the second stabilizing component to the linking agent.
- the method comprises contacting a plurality of binding agents with a single macromolecule, or contacting a plurality of binding agents with a plurality of macromolecules, and wherein at least one binding agent of the plurality of binding agents is capable of binding to the macromolecule and each binding agent of the plurality of binding agents comprises or is associated with the second stabilizing component.
- the binding agent is fluorescently labeled to enable detection of the contact between the macromolecule and the binding agent; and analyzing the macromolecule comprises detecting fluorescence from the binding agent after contacting the macromolecule.
- the macromolecule comprises a polypeptide and the binding agent or a binding agent from the plurality of binding agents is capable of binding to a N-terminal amino acid (NTAA) of the polypeptide or to a modified NTAA of the polypeptide.
- NTAA N-terminal amino acid
- analyzing the macromolecule comprises identifying at least one amino acid residue of the polypeptide.
- providing a macromolecule comprises providing the polypeptide associated with a recording tag; the binding agent or each binding agent from the plurality of binding agents comprises or is associated with a coding tag with identifying information regarding the binding agent; obtaining an information about the binding agent comprises transferring an information from the coding tag to the recording tag after binding of the binding agent to the macromolecule to generate an extended recording tag; and identifying at least one amino acid residue of the polypeptide comprises analyzing the extended recording tag.
- the method further comprises: providing an adaptor molecule comprising a first hybridization sequence and a secondary tag, wherein the first hybridization sequence is substantially complementary to at least a portion of the coding tag, to allow hybridization between the first hybridization sequence and the coding tag; and transferring information of the secondary tag to the recording tag to generate an extended recording tag, wherein the information of the secondary tag is transferred from the adaptor molecule to the recording tag after the coding tag associated with the binding agent hybridizes with the first hybridization sequence on the adaptor molecule.
- transferring information of the coding tag to the recording tag is performed after the stabilizing components are linked together.
- transferring information comprises contacting the coding tag with a reagent for transferring the identifying information, the reagent comprising a reagent for primer extension reaction, a chemical ligation reagent or a biological ligation reagent.
- the stable complex is disrupted after the transfer of information from the coding tag to the recording tag by removing the linking agent from the stable complex or by introducing a destabilizing agent.
- the method further comprises contacting the polypeptide with a N-terminal modifier agent prior to binding of the binding agent to the polypeptide to form the modified NTAA of the polypeptide.
- the method further comprises removing the modified
- the method includes treating the protein or peptide with a reagent for modifying a terminal amino acid of the protein or peptide.
- the method further comprises repeating at least one more time prior to analyzing the extended recording tag steps of: contacting the polypeptide with a N-terminal modifier agent to form the modified NTAA of the polypeptide; contacting the polypeptide with a binding agent capable of binding to the modified NTAA of the polypeptide or with a plurality of binding agents wherein at least one binding agent of the plurality of binding agents is capable of binding to the modified NTAA of the polypeptide, wherein each binding agent of the plurality of binding agents comprises or is associated with the second stabilizing component; linking the first and second stabilizing components together to form a stable complex comprising the binding agent, the macromolecule and the stabilizing components; optionally, removing the modified NTAA of the polypeptide.
- the extended recording tag is analyzed using a nucleic acid sequencing method.
- the stabilizing components are attached to or associated with the binding agent and the target, respectively, at a site different from the binding site between the binding agent and the target.
- the method further includes adding a universal priming site to the extended recording tag.
- the method includes repeating some of the provided steps one or more times in a cyclic manner.
- a macromolecule comprising the steps of: providing a macromolecule and an associated recording tag joined to a support; contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, to allow binding between the macromolecule and the binding agent; providing an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag (or portion thereof), and a secondary tag, to allow hybridization between the adaptor molecule (or the first hybridization sequence) and the coding tag (or the portion of the coding tag); transferring the information of the secondary tag to the recording tag to generate an extended recording tag; and analyzing the extended recording tag.
- kits for analyzing a macromolecule comprising: a binding agent comprising a coding tag, which comprises identifying information regarding the binding agent, wherein the binding agent is configured to bind a macromolecule associated with a first stabilizing component and with a recording tag joined to a support, and wherein the binding agent is associated with a second stabilizing component; the recording tag associated with the first stabilizing component; a linking agent configured to linking the first and second stabilizing components together after binding of the binding agent to the macromolecule to form a stable complex comprising the binding agent, the macromolecule and the stabilizing components.
- the kit comprises a plurality of binding agents and wherein at least one binding agent of the plurality of binding agents is capable of binding to the macromolecule and each binding agent of the plurality of binding agents comprises or is associated with the second stabilizing component.
- the macromolecule comprises a polypeptide.
- the kit further comprises: an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag, and a secondary tag, wherein an information of the secondary tag is configured for transfer from the adaptor molecule to the recording tag to generate an extended recording tag after the coding tag associated with the binding agent hybridizes with the first hybridization sequence on the adaptor molecule.
- the coding tag and/or the recording tag comprises a unique molecular identifier (UMI) or a barcode sequence.
- UMI unique molecular identifier
- kits for analyzing a macromolecule comprising: a binding agent comprising a coding tag, which comprises identifying information regarding the binding agent; an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag, and a secondary tag; wherein the binding agent is configured to bind a macromolecule associated with a recording tag; and wherein information from the secondary tag is configured for transfer from the adaptor molecule to the recording tag.
- the kit comprises a plurality of binding agents and a plurality of adaptor molecules.
- the plurality of adaptor molecules includes at least one adaptor molecule capable of hybridizing to at least one coding tag associated with the binding agent.
- multiple coding tags associated with the binding agent are configured to hybridize to adaptor molecules comprising the same secondary tag.
- the adaptor molecule further comprises a second hybridization sequence substantially complementary to a portion of the recording tag.
- FIG. 1A-1D depicts an exemplary binding reaction and formation of a stable complex for information transfer.
- a target joined to a recording tag is associated with a first stabilizing component.
- the target is contacted with a binding agent associated with a second stabilizing component and the binding agent interacts with the target.
- a linking agent is introduced and the two stabilizing components are linked and a stable complex containing the binding agent and the target is formed.
- information is transferred from the coding tag to the recording tag ( e.g . by extension). While the schematic depicts a stabilizing component and linking agent each as one molecule, it is within the scope of the invention that each stabilizing component or the linking agent may contain sub-components or comprise two or more molecules.
- FIG. 2A-2D depicts an exemplary binding reaction and formation of a stable complex for information transfer with stabilizing components associated with nucleic acids.
- FIG. 2A shows a target joined to a recording tag.
- the target is associated with a first stabilizing component containing a nucleic acid that hybridizes to a portion of the recording tag, and the target is contacted with a binding agent associated with a second stabilizing component via hybridization of the joined nucleic acid.
- a linking agent is introduced and the stabilizing components are linked, forming a stable complex containing the binding agent and the target.
- information is transferred from the coding tag to the recording tag (e.g. by extension).
- FIG. 3A-3D depict an exemplary binding reaction and formation of a stable complex for information transfer (using interactions between a binding pair, e.g., biotin and streptavidin or neutravidin).
- FIG. 3A shows a peptide target joined to a recording tag with a biotin molecule attached at the 5’ end and a binding agent configured to bind an N-terminal phenylalanine (“F” binding agent) and associated nucleic acid components (including a coding tag).
- F N-terminal phenylalanine
- FIG. 3B the peptide is contacted with the binding agent associated with a second biotin via hybridization of the joined nucleic acid.
- FIG. 3A shows a peptide target joined to a recording tag with a biotin molecule attached at the 5’ end and a binding agent configured to bind an N-terminal phenylalanine (“F” binding agent) and associated nucleic acid components (including a coding tag).
- F N-terminal
- streptavidin or neutravidin is introduced and interacts with the biotin molecules, forming a stable complex containing the binding agent and the peptide-recording tag conjugate, and information transfer by extension occurs.
- FIG. 3D depicts the extended recording tag containing information transferred from the coding tag.
- Fig. 4. depicts exemplary results from a polypeptide analysis assay (ProteoCode assay) performed which included forming a stable complex using interactions between biotin (stabilizing component) and streptavidin or neutravidin (linking agent). For comparison, a negative control where no linking agent was provided (PBST) and a encoding control was performed. The results show binding and encoding (transfer of information from a coding tag to recording tag) with a binding agent that recognizes the amino acid residue, phenylalanine.
- ProteoCode assay polypeptide analysis assay
- Fig. 5A and Fig. 5B show a bipartite clamping oligo, which anneals to an A region on the DNA-polypeptide chimera and a B region on the binding agent’s coding tag. After binding and washing, the A’-B’ clamping oligo is exposed to the system to stabilize the binding agentDNA-polypeptide complex. Other types of bipartite affinity agents can also be employed for this stabilizing effect.
- Fig. 5C and Fig. 5D show that at high concentrations, stabilizing components can form a hindered clamping structure preventing efficient stabilization.
- FIG. 6A-FIG. 6D depicts an exemplary macromolecule analysis assay involving information transfer using an adaptor molecule with a first hybridization sequence and a secondary tag.
- a peptide to be analyzed is joined to a recording tag immobilized on a support.
- the peptide is contacted with a binding agent associated with a coding tag and the binding agent interacts with the peptide to be analyzed.
- an adaptor molecule comprising a first hybridization sequence (1 st hyb sequence) and secondary tag is introduced.
- the first hybridization sequence contains a sequence complementary to the coding tag associated with the binding agent.
- the adaptor molecule Once the adaptor molecule is in place (via hybridization to the coding tag), information from the secondary tag on the adaptor molecule containing identifying information regarding the binding agent is transferred from the adaptor molecule to the recording tag via ligation, thereby generating an extended recording tag.
- the adaptor molecule portion besides the secondary tag
- the binding agent may be optionally removed, as shown in FIG. 6D. A cycle of steps shown in FIG. 6B-6D may be repeated one or more times to further extend the recording tag.
- FIG. 7A-FIG. 7D depicts an exemplary macromolecule analysis assay involving information transfer using an adaptor molecule with a first hybridization sequence, a secondary tag, and a second hybridization sequence.
- a peptide to be analyzed is joined to a recording tag immobilized on a support.
- the peptide is contacted with a binding agent associated with a coding tag and the binding agent interacts with the peptide to be analyzed.
- an adaptor molecule comprising a first hybridization sequence (1 st hyb sequence), secondary tag, and second hybridization sequence (2 nd hyb sequence) is introduced.
- the first hybridization sequence contains a sequence complementary to the coding tag associated with the binding agent.
- the second hybridization sequence contains a sequence complementary to a portion of the recording tag.
- FIG. 8 depicts a model or exemplary assay system for information transfer using a splint adaptor molecule containing a first hybridization sequence complementary to a region on the coding tag and a second hybridization sequence complementary to a region on the recording tag.
- Fig. 9 depicts an exemplary embodiment for information transfer based on hybridization and extension reactions.
- Fig. 9A shows a splint adaptor molecule containing a first hybridization sequence (Payload seq 1, PL1) complementary to a region on the coding tag (PLE), followed by a PEG-based linker, a spacer sequence (Sp’), a barcode sequence (BC’) and another spacer sequence (Sp’) complementary to a region on the recording tag (Sp).
- Fig. 9B shows encoding yield in the information transfer assay utilizing an engineered F-binder.
- the encoding assay was performed with the F-binder conjugated with two different coding tags that contain oligonucleotides complementary to hybridization sequences Payload seq 1 (PL1) and Payload seq 1 (PL2); splint adaptor molecules as shown in Fig. 9A containing from 5’ end to 3’ end a first hybridization sequence (either PL1 or PL2), a secondary tag (Sp’ and BC’) and a second hybridization sequence (Sp’) were added to allow hybridization between the splint adaptor molecule, the coding tag and the recording tag. Coding tags were fused to 3 peptides (AA-PA, AFA-PA or FA-PA, SEQ ID NOs: 3-5) or used without peptide.
- Fig. 10A A binder (shown as cylinder) fused with a coding tag binds to the target peptide (shown as several circles connected together). This interaction is stabilized with Clamp oligo having terminal sequences complementary to portions of the coding tag and recording tag and serving as stabilizing components (stabilization through hybridization).
- Encoding oligo (adaptor molecule) is annealed to the portion of the coding tag. Encoding oligo can then be directly ligated to the recording tag, or can be first ligated to Clamp oligo and then ligated to the recording tag.
- Fig. 10B Encoding oligo from Fig. 10A is shown ligated to Clamp oligo and is ready for ligation to the recording tag. After ligation to the recording tag and removal of the binder, USER enzyme can be used to cleave the U residue and remove the extra sequence from the ligated product. The encoding cycle can be repeated with a new binder and a new Encoding oligo to produce a further extended recording tag.
- Fig. IOC a new binder and a new Encoding oligo to produce a further extended recording tag.
- the extended recording tag produced from several cycles of encoding using the spacer-less ligation approach (ssDNA ligase is used for ligation).
- Fig. 10D After final cycle of encoding, Capping oligonucleotide is introduced to add a priming site to the extended recording tag for further analysis by NGS.
- Fig.11 depicts an exemplary arrangement of interacting components in the encoding assay.
- Target peptide fused to the recording tag is immobilized on a solid support via interaction with a hairpin DNA associated with the first stabilizing component (DSB).
- DSB hairpin DNA associated with the first stabilizing component
- a binding agent fused to the coding tag and labeled with biotin interacts with the target peptide. This interaction is stabilized upon addition of the linking agent.
- Fig.12 shows dependence of encoding efficiencies on stabilization with a linking agent (SA) during stringent washing conditions.
- SA linking agent
- Fig. 12A no SA was added (no stabilization)
- Fig. 12B 50 nM of streptavidin (SA) was added to connect DSB on the recording tag and biotin associated with the binding agent (stabilization).
- SA streptavidin
- Fig.13 shows dependence of encoding efficiencies on the encoding temperature.
- Fig.l3A encoding efficiencies measured at two temperatures (25 °C and 37 °C) are shown for the setup where recording tags did not contain a DSB molecule at its 5’ end (no stabilization during encoding), whereas in Fig.l3B DSB was attached to the recording tags (stabilization during encoding).
- Fig. 14 Exemplary embodiments of controllable hybridization of two polynucleotides used as stabilizing components.
- FIG. 14A shows a binder (shown as cylinder) fused with a coding tag that binds to the target peptide (shown as several circles connected together) immobilized on a support with a recording tag.
- SI and S2 are two complementary polynucleotides that serve as the first and second stabilizing components (protected from hybridization before binding).
- introduction of light or a linking agent can trigger hybridization or association of the stabilizing components by a variety of ways shown in the Fig. 14B.
- Light or a linking agent can trigger isomerization, uncaging or structural transformation of one of the components.
- kits for performing a binding reaction comprising contacting a binding agent with a target, wherein the binding agent and the target each comprises or is associated with a stabilizing component; allowing the binding agent to interact with a binding site located on the target; and linking the stabilizing components to form a stable complex.
- each of the stabilizing components is attached to or associated with the binding agent and the target, respectively, at a site different from the binding site between the binding agent and the target.
- the target in the binding reaction is a macromolecule, e.g., a polypeptide.
- the binding reaction is performed with a plurality of binding agents and a plurality of macromolecules, e.g., polypeptides.
- the provided methods for performing a binding reaction is performed in an assay for sequencing or analysis of the polypeptides.
- the analysis employs barcoding and nucleic acid encoding of molecular recognition events, and/or detectable labels.
- the stable complex formed comprising the binding agent and target is compatible with steps in the analysis including transferring information between nucleic acid tags (e.g., a DNA tag or a DNA recording tag).
- the target macromolecules are digested prior to performing the binding reaction.
- kits containing components and/or reagents for performing the provided binding reactions are also include instructions for performing any of the methods provided for performing the binding reaction and for macromolecule sequencing and/or analysis.
- binding agents can be useful for characterization and/or detection of target biomolecules. It may be desired for performing some assays that a binding agent binds to the target and forms a stable complex for downstream steps in the assay or analysis to take place. There remains a need for improved techniques relating to performing binding reactions, as well as to products, methods and kits for accomplishing the same.
- the present invention provides novel methods and compositions which may be utilized in a wide variety of nucleic acid-based and/or protein (e.g ., binding agent)-based procedures, and further provides other related advantages.
- the provided methods for performing a binding reaction include providing components which form a “clamp” that stabilizes a binding reaction or stabilizes a complex containing a binding agent and a target (e.g., a polypeptide).
- the provided methods are useful for performing a binding reaction and maintaining interaction between a binding agent and a target.
- the complex may be less stable and/or the binding agent and target may not remain bound for a desired amount of time.
- the present disclosure provides, in part, methods for performing a binding reaction for use with or as part of a method for highly-parallel, high throughput digital macromolecule (e.g., polypeptide) characterization and quantitation, with direct applications to protein and peptide characterization and sequencing.
- the provided methods are for forming a stable complex comprising a binding agent and a target including linking the associated stabilizing components, wherein the stabilizing components are linked directly or indirectly to the binding agent and the target, respectively.
- the target comprises macromolecules, e.g., a plurality of macromolecules obtained from a sample.
- the sample is obtained from a subject.
- the provided methods for performing a binding reaction are compatible with information transfer, with applications to macromolecule sequencing and/or analysis (e.g, protein sequencing and/or analysis), such as between nucleic acids associated with the binding agent and the target.
- the information transfer is between a nucleic acid tag associated with the binding agent and a nucleic acid tag associated with the target (e.g, by extension or ligation).
- the information transferred comprises identifying information regarding a binding agent that is configured to bind to the macromolecule.
- the information transfer can be achieved by any suitable means such as by extension or ligation, and can be between nucleic acid molecules, e.g., between a nucleic acid tag associated with the binding agent and a secondary tag on an adaptor molecule.
- a method for analyzing a macromolecule comprising the steps of: (a) providing a macromolecule and an associated recording tag joined to a support; (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, to allow binding between the macromolecule and the binding agent; (c) transferring information of the coding tag to the recording tag to generate an extended recording tag, wherein the information of the coding tag is transferred to the recording tag after the binding agent is bound to the macromolecule; and (d) analyzing the extended recording tag.
- a method for analyzing a macromolecule comprising the steps of: (a) providing a macromolecule and an associated recording tag joined to a support; (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, to allow binding between the macromolecule and the binding agent; (c) providing an adaptor molecule comprising a first hybridization sequence and a secondary tag, wherein the first hybridization sequence is substantially complementary to at least a portion of the coding tag, to allow hybridization between the first hybridization sequence and the coding tag, wherein step (c) is performed before, after or simultaneously with step (b); (d) transferring information of the secondary tag to the recording tag to generate an extended recording tag, wherein the information of the secondary tag is transferred from the adaptor molecule to the recording tag after the coding tag associated with the binding agent hybridizes with the first hybridization sequence on the adaptor molecule; and (e) analyzing the extended recording tag.
- the challenge with encoding information about the binding agent bound to the macromolecule into the extended recording tag in the abovementioned examples is that after binding and washing, the binding agent can quickly dissociate before the information about the binding agent is recorded into the recording tag.
- the stabilization approach described herein enables higher temperature and longer time duration encoding, since linking the stabilizing components minimizes prolonged dissociation of binding agent from the target polypeptide. Higher temperatures and longer encoding times are beneficial for slower encoding methods (such as ligation), reduce non-specific primer-primer interactions, enable use of enzymes that have a higher operating temperature, such as CircLigase, and enable the use of stringent annealing conditions when using adaptor molecules for encoding.
- the combination of the described stabilization approach and ssDNA ligation can enable “spacerless” encoding paving the way for effective targeted enrichment.
- antibody herein is used in the broadest sense and includes polyclonal and monoclonal antibodies, including intact antibodies and functional (antigen-binding) antibody fragments, including fragment antigen binding (Fab) fragments, F(ab') 2 fragments, Fab' fragments, Fv fragments, recombinant IgG (rlgG) fragments, single chain antibody fragments, including single chain variable fragments (scFv), and single domain antibodies (e.g ., sdAb, sdFv, nanobody) fragments.
- Fab fragment antigen binding
- rlgG recombinant IgG
- scFv single chain variable fragments
- single domain antibodies e.g ., sdAb, sdFv, nanobody
- the term encompasses genetically engineered and/or otherwise modified forms of immunoglobulins, such as intrabodies, peptibodies, chimeric antibodies, fully human antibodies, humanized antibodies, and heteroconjugate antibodies, multispecific, e.g., bispecific, antibodies, diabodies, triabodies, and tetrabodies, tandem di-scFv, tandem tri-scFv.
- antibody should be understood to encompass functional antibody fragments thereof.
- the term also encompasses intact or full-length antibodies, including antibodies of any class or sub-class, including IgG and sub-classes thereof, IgM, IgE, IgA, and IgD.
- An “individual” or “subject” includes a mammal. Mammals include, but are not limited to, domesticated animals (e.g., cows, sheep, cats, dogs, and horses), primates (e.g., humans and non-human primates such as monkeys), rabbits, and rodents (e.g., mice and rats).
- An “individual” or “subject” may include birds such as chickens, vertebrates such as fish and mammals such as mice, rats, rabbits, cats, dogs, pigs, cows, ox, sheep, goats, horses, monkeys and other non-human primates. In certain embodiments, the individual or subject is a human.
- sample refers to anything which may contain an analyte for which an analyte assay is desired.
- a “sample” can be a solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or any combination thereof.
- the sample may be a biological sample, such as a biological fluid or a biological tissue.
- biological fluids include urine, blood, plasma, serum, saliva, semen, stool, sputum, cerebral spinal fluid, tears, mucus, amniotic fluid or the like.
- Biological tissues are aggregate of cells, usually of a particular kind together with their intercellular substance that form one of the structural materials of a human, animal, plant, bacterial, fungal or viral structure, including connective, epithelium, muscle and nerve tissues. Examples of biological tissues also include organs, tumors, lymph nodes, arteries and individual cell(s).
- the sample is a biological sample.
- a biological sample of the present disclosure encompasses a sample in the form of a solution, a suspension, a liquid, a powder, a paste, an aqueous sample, or a non-aqueous sample.
- a “biological sample” includes any sample obtained from a living or viral (or prion) source or other source of macromolecules and biomolecules, and includes any cell type or tissue of a subject from which nucleic acid, protein and/or other macromolecule can be obtained.
- the biological sample can be a sample obtained directly from a biological source or a sample that is processed. For example, isolated nucleic acids that are amplified constitute a biological sample.
- Biological samples include, but are not limited to, body fluids, such as blood, plasma, serum, cerebrospinal fluid, synovial fluid, urine and sweat, tissue and organ samples from animals and plants and processed samples derived therefrom.
- the sample can be derived from a tissue or a body fluid, for example, a connective, epithelium, muscle or nerve tissue; a tissue selected from the group consisting of brain, lung, liver, spleen, bone marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, stomach, intestine, testis, ovary, uterus, rectum, nervous system, gland, and internal blood vessels; or a body fluid selected from the group consisting of blood, urine, saliva, bone marrow, sperm, an ascitic fluid, and subfractions thereof, e.g ., serum or plasma.
- level or “levels” are used to refer to the presence and/or amount of a target, e.g. , a substance or an organism that is part of the etiology of a disease or disorder, and can be determined qualitatively or quantitatively.
- a “qualitative” change in the target level refers to the appearance or disappearance of a target that is not detectable or is present in samples obtained from normal controls.
- a “quantitative” change in the levels of one or more targets refers to a measurable increase or decrease in the target levels when compared to a healthy control.
- macromolecule encompasses large molecules composed of smaller subunits.
- macromolecules include, but are not limited to peptides, polypeptides, proteins, nucleic acids, carbohydrates, lipids, macrocycles, or a combination or complex thereof.
- a macromolecule also includes a chimeric macromolecule composed of a combination of two or more types of macromolecules, covalently linked together (e.g ., a peptide linked to a nucleic acid).
- a macromolecule may also include a “macromolecule assembly”, which is composed of non-covalent complexes of two or more macromolecules.
- a macromolecule assembly may be composed of the same type of macromolecule (e.g., protein-protein) or of two or more different types of macromolecules (e.g., protein-DNA).
- polypeptide encompasses peptides and proteins, and refers to a molecule comprising a chain of two or more amino acids joined by peptide bonds.
- a polypeptide comprises 2 to 50 amino acids, e.g, having more than 20-30 amino acids.
- a peptide does not comprise a secondary, tertiary, or higher structure.
- the polypeptide is a protein.
- a protein comprises 30 or more amino acids, e.g. having more than 50 amino acids.
- a protein in addition to a primary structure, a protein comprises a secondary, tertiary, or higher structure.
- the amino acids of the polypeptides are most typically L-amino acids, but may also be D-amino acids, modified amino acids, amino acid analogs, amino acid mimetics, or any combination thereof.
- Polypeptides may be naturally occurring, synthetically produced, or recombinantly expressed. Polypeptides may be synthetically produced, isolated, recombinantly expressed, or be produced by a combination of methodologies as described above. Polypeptides may also comprise additional groups modifying the amino acid chain, for example, functional groups added via post-translational modification.
- the polymer may be linear or branched, it may comprise modified amino acids, and it may be interrupted by non-amino acids.
- the term also encompasses an amino acid polymer that has been modified naturally or by intervention; for example, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation or modification, such as conjugation with a labeling component.
- amino acid refers to an organic compound comprising an amine group, a carboxylic acid group, and a side-chain specific to each amino acid, which serve as a monomeric subunit of a peptide.
- An amino acid includes the 20 standard, naturally occurring or canonical amino acids as well as non-standard amino acids.
- the standard, naturally-occurring amino acids include Alanine (A or Ala), Cysteine (C or Cys), Aspartic Acid (D or Asp), Glutamic Acid (E or Glu), Phenylalanine (F or Phe), Glycine (G or Gly), Histidine (H or His), Isoleucine (I or lie), Lysine (K or Lys), Leucine (L or Leu), Methionine (M or Met), Asparagine (N or Asn), Proline (P or Pro), Glutamine (Q or Gin), Arginine (R or Arg), Serine (S or Ser), Threonine (T or Thr), Valine (V or Val), Tryptophan (W or Trp), and Tyrosine (Y or Tyr).
- An amino acid may be an L-amino acid or a D-amino acid.
- Non-standard amino acids may be modified amino acids, amino acid analogs, amino acid mimetics, non-standard proteinogenic amino acids, or non-proteinogenic amino acids that occur naturally or are chemically synthesized.
- Examples of non-standard amino acids include, but are not limited to, selenocysteine, pyrrolysine, and N-formylmethionine, b- amino acids, Homo-amino acids, Proline and Pyruvic acid derivatives, 3-substituted alanine derivatives, glycine derivatives, ring-substituted phenylalanine and tyrosine derivatives, linear core amino acids, N-methyl amino acids.
- post-translational modification refers to modifications that occur on a peptide after its translation, e.g ., translation by ribosomes, is complete.
- a post-translational modification may be a covalent chemical modification or enzymatic modification.
- post-translation modifications include, but are not limited to, acylation, acetylation, alkylation (including methylation), biotinylation, butyrylation, carbamylation, carbonylation, deamidation, deiminiation, diphthamide formation, disulfide bridge formation, eliminylation, flavin attachment, formylation, gamma-carboxylation, glutamyl ati on, glycylation, glycosylation, glypiation, heme C attachment, hydroxylation, hypusine formation, iodination, isoprenyl ati on, lipidation, lipoylation, malonylation, methylation, myristolylation, oxidation, palmitoylation, pegylation, phosphopantetheinylation, phosphorylation, prenylation, propionylation, retinylidene Schiff base formation, S-glutathionylation, S-nitrosylation, S-sulfenylation,
- a post-translational modification includes modifications of the amino terminus and/or the carboxyl terminus of a peptide.
- Modifications of the terminal amino group include, but are not limited to, des- amino, N-lower alkyl, N-di-lower alkyl, and N-acyl modifications.
- Modifications of the terminal carboxy group include, but are not limited to, amide, lower alkyl amide, dialkyl amide, and lower alkyl ester modifications ( e.g ., wherein lower alkyl is C1-C4 alkyl).
- a post-translational modification also includes modifications, such as but not limited to those described above, of amino acids falling between the amino and carboxy termini.
- the term post-translational modification can also include peptide modifications that include one or more detectable labels.
- binding agent refers to a nucleic acid molecule, a peptide, a polypeptide, a protein, carbohydrate, or a small molecule that binds to, associates, unites with, recognizes, or combines with a binding target, e.g., a polypeptide or a component or feature of a polypeptide.
- a binding agent may form a covalent association or non-covalent association with the polypeptide or component or feature of a polypeptide.
- a binding agent may also be a chimeric binding agent, composed of two or more types of molecules, such as a nucleic acid molecule-peptide chimeric binding agent or a carbohydrate-peptide chimeric binding agent.
- a binding agent may be a naturally occurring, synthetically produced, or recombinantly expressed molecule.
- a binding agent may bind to a single monomer or subunit of a polypeptide (e.g., a single amino acid of a polypeptide) or bind to a plurality of linked subunits of a polypeptide (e.g., a di-peptide , tri -peptide, or higher order peptide of a longer peptide, polypeptide, or protein molecule).
- a binding agent may bind to a linear molecule or a molecule having a three-dimensional structure (also referred to as conformation).
- an antibody binding agent may bind to linear peptide, polypeptide, or protein, or bind to a conformational peptide, polypeptide, or protein.
- a binding agent may bind to an N-terminal peptide, a C-terminal peptide, or an intervening peptide of a peptide, polypeptide, or protein molecule.
- a binding agent may bind to an N- terminal amino acid, C-terminal amino acid, or an intervening amino acid of a peptide molecule.
- a binding agent may preferably bind to a chemically modified or labeled amino acid (e.g., an amino acid that has been labeled by a chemical reagent) over a non-modified or unlabeled amino acid.
- a binding agent may preferably bind to an amino acid that has been labeled or modified over an amino acid that is unlabeled or unmodified.
- a binding agent may bind to a post-translational modification of a peptide molecule.
- a binding agent may exhibit selective binding to a component or feature of a polypeptide (e.g ., a binding agent may selectively bind to one of the 20 possible natural amino acid residues and bind with very low affinity or not at all to the other 19 natural amino acid residues).
- a binding agent may exhibit less selective binding, where the binding agent is capable of binding or configured to bind to a plurality of components or features of a polypeptide (e.g., a binding agent may bind with similar affinity to two or more different amino acid residues).
- a binding agent may comprise a coding tag, which may be joined to the binding agent by a linker.
- linker refers to one or more of a nucleotide, a nucleotide analog, an amino acid, a peptide, a polypeptide, a polymer, or a non-nucleotide chemical moiety that is used to join two molecules.
- a linker may be used to join a binding agent with a coding tag, a recording tag with a polypeptide, a polypeptide with a support, a recording tag with a solid support, etc.
- a linker joins two molecules via enzymatic reaction or chemistry reaction (e.g, click chemistry).
- ligand refers to any molecule or moiety connected to the compounds described herein.
- Ligand may refer to one or more ligands attached to a compound.
- the ligand is a pendant group or binding site (e.g, the site to which the binding agent binds).
- proteome can include the entire set of proteins, polypeptides, or peptides (including conjugates or complexes thereof) expressed by a genome, cell, tissue, or organism at a certain time, of any organism. In one aspect, it is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. Proteomics is the study of the proteome. For example, a “cellular proteome” may include the collection of proteins found in a particular cell type under a particular set of environmental conditions, such as exposure to hormone stimulation. An organism’s complete proteome may include the complete set of proteins from all of the various cellular proteomes. A proteome may also include the collection of proteins in certain sub-cellular biological systems.
- proteome include subsets of a proteome, including but not limited to a kinome; a secretome; a receptome (e.g., GPCRome); an immunoproteome; a nutriproteome; a proteome subset defined by a post-translational modification (e.g., phosphorylation, ubiquitination, methylation, acetylation, glycosylation, oxidation, lipidation, and/or nitrosylation), such as a phosphoproteome (e.g., phosphotyrosine- proteome, tyrosine-kinome, and tyrosine-phosphatome), a glycoproteome, etc.; a proteome subset associated with a tissue or organ, a developmental stage, or a physiological or pathological condition; a proteome subset associated a cellular process, such as cell cycle, differentiation (or
- proteomics refers to qualitative or quantitative analysis of the proteome within cells, tissues, and bodily fluids, and the corresponding spatial distribution of the proteome within the cell and within tissues. Additionally, proteomics studies include the dynamic state of the proteome, continually changing in time as a function of biology and defined biological or chemical stimuli.
- N-terminal amino acid N-terminal amino acid
- C-terminal amino acid C-terminal amino acid
- the amino acids making up a peptide may be numbered in order, with the peptide being “w” amino acids in length.
- NTAA is considered the n th amino acid (also referred to herein as the “w NTAA”).
- the next amino acid is the n-1 amino acid, then the n-2 amino acid, and so on down the length of the peptide from the N-terminal end to C-terminal end.
- an NTAA, CTAA, or both may be modified or labeled with a moiety or a chemical moiety.
- barcode refers to a nucleic acid molecule of about 2 to about 30 bases (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 bases) providing a unique identifier tag or origin information for a polypeptide, a binding agent, a set of binding agents from a binding cycle, a sample polypeptides, a set of samples, polypeptides within a compartment (e.g., droplet, bead, or separated location), polypeptides within a set of compartments, a fraction of polypeptides, a set of polypeptide fractions, a spatial region or set of spatial regions, a library of polypeptides, or a library of binding agents.
- bases e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 bases
- a barcode can be an artificial sequence or a naturally occurring sequence.
- each barcode within a population of barcodes is different.
- a portion of barcodes in a population of barcodes is different, e.g., at least about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, or 99% of the barcodes in a population of barcodes is different.
- a population of barcodes may be randomly generated or non-randomly generated.
- a population of barcodes are error- correcting or error-tolerant barcodes.
- Barcodes can be used to computationally deconvolute the multiplexed sequencing data and identify sequence reads derived from an individual polypeptide, sample, library, etc.
- a barcode can also be used for deconvolution of a collection of polypeptides that have been distributed into small compartments for enhanced mapping. For example, rather than mapping a peptide back to the proteome, the peptide is mapped back to its originating protein molecule or protein complex.
- coding tag refers to a polynucleotide with any suitable length, e.g ., a nucleic acid molecule of about 2 bases to about 100 bases, including any integer including 2 and 100 and in between, that comprises identifying information for its associated binding agent.
- a “coding tag” may also be made from a “sequenceable polymer” (see, e.g., Niu et ah, 2013, Nat. Chem. 5:282-292; Roy et ah, 2015, Nat. Commun. 6:7237; Lutz, 2015, Macromolecules 48:4759-4767; each of which are incorporated by reference in its entirety).
- a coding tag may comprise an encoder sequence, which is optionally flanked by one spacer on one side or optionally flanked by a spacer on each side.
- a coding tag may also be comprised of an optional UMI and/or an optional binding cycle-specific barcode.
- a coding tag may be single stranded or double stranded.
- a double stranded coding tag may comprise blunt ends, overhanging ends, or both.
- a coding tag may refer to the coding tag that is directly attached to a binding agent, to a complementary sequence hybridized to the coding tag directly attached to a binding agent (e.g., for double stranded coding tags), or to coding tag information present in an extended recording tag.
- a coding tag may further comprise a binding cycle specific spacer or barcode, a unique molecular identifier, a universal priming site, or any combination thereof.
- spacer refers to a nucleic acid molecule of about
- a spacer sequence flanks a secondary tag or an encoder sequence of a coding tag on one end or both ends. Following binding of a binding agent to a polypeptide, annealing between complementary spacer sequences on their associated coding tag (or adaptor molecule) and recording tag, respectively, allows transfer of binding information through a primer extension reaction or ligation to the recording tag, coding tag, or a di-tag construct.
- Sp refers to spacer sequence complementary to Sp.
- spacer sequences within a library of binding agents possess the same number of bases.
- a common (shared or identical) spacer may be used in a library of binding agents.
- a spacer sequence may have a “cycle specific” sequence in order to track binding agents used in a particular binding cycle.
- the spacer sequence (Sp) can be constant across all binding cycles, be specific for a particular class of polypeptides, or be binding cycle number specific.
- Polypeptide class-specific spacers permit annealing of a cognate binding agent’s coding tag information present in an extended recording tag from a completed binding/extension cycle to the coding tag of another binding agent recognizing the same class of polypeptides in a subsequent binding cycle via the class-specific spacers.
- a spacer sequence may comprise sufficient number of bases to anneal to a complementary spacer sequence in a recording tag to initiate a primer extension (also referred to as polymerase extension) reaction, or provide a “splint” for a ligation reaction, or mediate a “sticky end” ligation reaction.
- a spacer sequence may comprise a fewer number of bases than the encoder sequence within a coding tag.
- recording tag refers to a moiety, e.g ., a chemical coupling moiety, a nucleic acid molecule, or a sequenceable polymer molecule (see, e.g. ,
- identifying information of a coding tag can be transferred, either directly or indirectly (e.g, via an adaptor molecule).
- information from a secondary tag of an adaptor molecule e.g, as a proxy, representation or correlation of the information of a coding tag
- identifying information about the macromolecule e.g, UMI information
- UMI information UMI information
- Identifying information can comprise any information characterizing a molecule such as information pertaining to sample, fraction, partition, spatial location, interacting neighboring molecule(s), cycle number, etc. Additionally, the presence of UMI information can also be classified as identifying information.
- information from a coding tag linked to a binding agent can be transferred to the recording tag associated with the polypeptide while the binding agent is bound to the polypeptide.
- information from a recording tag associated with the polypeptide can be transferred to the coding tag linked to the binding agent while the binding agent is bound to the polypeptide.
- a recording tag may be directly linked to a polypeptide, linked to a polypeptide via a multifunctional linker, or associated with a polypeptide by virtue of its proximity (or co-localization) on a support.
- a recording tag may be linked via its 5’ end or 3’ end or at an internal site, as long as the linkage is compatible with the method used to transfer coding tag information to the recording tag or vice versa.
- a recording tag may further comprise other functional components, e.g ., a universal priming site, unique molecular identifier, a barcode (e.g, a sample barcode, a fraction barcode, spatial barcode, a compartment tag, etc.), a spacer sequence that is complementary to a spacer sequence of a coding tag, or any combination thereof.
- the spacer sequence of a recording tag is preferably at the 3’ -end of the recording tag in embodiments where polymerase extension is used to transfer coding tag information to the recording tag.
- primer extension also referred to as “polymerase extension” refers to a reaction catalyzed by a nucleic acid polymerase (e.g., DNA polymerase) whereby a nucleic acid molecule (e.g, oligonucleotide primer, spacer sequence) that anneals to a complementary strand is extended by the polymerase, using the complementary strand as template.
- a nucleic acid polymerase e.g., DNA polymerase
- a nucleic acid molecule e.g, oligonucleotide primer, spacer sequence
- UMI unique molecular identifier
- a polypeptide UMI can be used to computationally deconvolute sequencing data from a plurality of extended recording tags to identify extended recording tags that originated from an individual polypeptide.
- a polypeptide UMI can be used to accurately count originating polypeptide molecules by collapsing NGS reads to unique UMIs.
- a binding agent UMI can be used to identify each individual molecular binding agent that binds to a particular polypeptide. For example, a UMI can be used to identify the number of individual binding events for a binding agent specific for a single amino acid that occurs for a particular peptide molecule. It is understood that when UMI and barcode are both referenced in the context of a binding agent or polypeptide, that the barcode refers to identifying information other that the UMI for the individual binding agent or polypeptide (e.g ., sample barcode, compartment barcode, binding cycle barcode).
- universal priming sequence refers to a nucleic acid molecule, which may be used for library amplification and/or for sequencing reactions.
- a universal priming site may include, but is not limited to, a priming site (primer sequence) for PCR amplification, flow cell adaptor sequences that anneal to complementary oligonucleotides on flow cell surfaces enabling bridge amplification in some next generation sequencing platforms, a sequencing priming site, or a combination thereof.
- Universal priming sites can be used for other types of amplification, including those commonly used in conjunction with next generation digital sequencing.
- extended recording tag molecules may be circularized and a universal priming site used for rolling circle amplification to form DNA nanoballs that can be used as sequencing templates (Drmanac et al., 2009, Science 327:78-81).
- recording tag molecules may be circularized and sequenced directly by polymerase extension from universal priming sites (Korlach et al., 2008, Proc. Natl. Acad. Sci.
- the term “forward” when used in context with a “universal priming site” or “universal primer” may also be referred to as “5”’ or “sense”.
- the term “reverse” when used in context with a “universal priming site” or “universal primer” may also be referred to as “3”’ or “antisense”.
- extended recording tag refers to a recording tag to which information of at least one binding agent’s coding tag (or its complementary sequence) has been transferred following binding of the binding agent to a polypeptide.
- Information of the coding tag may be transferred to the recording tag directly (e.g., ligation) or indirectly (e.g., primer extension).
- Information may be transferred to the recording tag directly (e.g., ligation) or indirectly (e.g., primer extension) from a secondary tag of an adaptor molecule.
- Information of a coding tag may be transferred to the recording tag enzymatically or chemically.
- An extended recording tag may comprise binding agent information of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175, 200 or more coding tags.
- the base sequence of an extended recording tag may reflect the temporal and sequential order of binding of the binding agents identified by their coding tags, may reflect a partial sequential order of binding of the binding agents identified by the coding tags, or may not reflect any order of binding of the binding agents identified by the coding tags.
- the coding tag information present in the extended recording tag represents with at least 25%, 30%, 35% , 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity the polypeptide sequence being analyzed.
- errors may be due to off-target binding by a binding agent, or to a “missed” binding cycle ( e.g ., because a binding agent fails to bind to a polypeptide during a binding cycle, because of a failed primer extension reaction), or both.
- solid support As used herein, the term “solid support”, “solid surface”, or “solid substrate”, or
- sampling substrate refers to any solid material, including porous and non- porous materials, to which a polypeptide can be associated directly or indirectly, by any means known in the art, including covalent and non-covalent interactions, or any combination thereof.
- a solid support may be two-dimensional (e.g., planar surface) or three- dimensional (e.g., gel matrix or bead).
- a solid support can be any support surface including, but not limited to, a bead, a microbead, an array, a glass surface, a silicon surface, a plastic surface, a filter, a membrane, a PTFE membrane, a PTFE membrane, a nitrocellulose membrane, a nitrocellulose-based polymer surface, nylon, a silicon wafer chip, a flow through chip, a flow cell, a biochip including signal transducing electronics, a channel, a microtiter well, an ELISA plate, a spinning interferometry disc, a nitrocellulose membrane, a nitrocellulose-based polymer surface, a polymer matrix, a nanoparticle, or a microsphere.
- Materials for a solid support include but are not limited to acrylamide, agarose, cellulose, dextran, nitrocellulose, glass, gold, quartz, polystyrene, polyethylene vinyl acetate, polypropylene, polyester, polymethacrylate, polyacrylate, polyethylene, polyethylene oxide, poly silicates, polycarbonates, poly vinyl alcohol (PVA), Teflon, fluorocarbons, nylon, silicon rubber, polyanhydrides, polyglycolic acid, polyvinylchloride, polylactic acid, polyorthoesters, functionalized silane, polypropylfumerate, collagen, glycosaminoglycans, polyamino acids, dextran, or any combination thereof.
- Solid supports further include thin film, membrane, bottles, dishes, fibers, woven fibers, shaped polymers such as tubes, particles, beads, microspheres, microparticles, or any combination thereof.
- the bead can include, but is not limited to, a ceramic bead, a polystyrene bead, a polymer bead, a polyacrylate bead, a methylstyrene bead, an agarose bead, a cellulose bead, a dextran bead, an acrylamide bead, a solid core bead, a porous bead, a paramagnetic bead, a glass bead, a controlled pore bead, a silica-based bead, or any combinations thereof.
- a bead may be spherical or an irregularly shaped.
- a bead or support may be porous.
- a bead’s size may range from nanometers, e.g., 100 nm, to millimeters, e.g., 1 mm.
- beads range in size from about 0.2 micron to about 200 microns, or from about 0.5 micron to about 5 micron.
- beads can be about 1, 1.5, 2, 2.5, 2.8, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, 10, 10.5, 15, or 20 pm in diameter.
- “a bead” solid support may refer to an individual bead or a plurality of beads.
- the solid surface is a nanoparticle.
- the nanoparticles range in size from about 1 nm to about 500 nm in diameter, for example, between about 1 nm and about 20 nm, between about 1 nm and about 50 nm, between about 1 nm and about 100 nm, between about 10 nm and about 50 nm, between about 10 nm and about 100 nm, between about 10 nm and about 200 nm, between about 50 nm and about 100 nm, between about 50 nm and about 150, between about 50 nm and about 200 nm, between about 100 nm and about 200 nm, or between about 200 nm and about 500 nm in diameter.
- the nanoparticles can be about 10 nm, about 50 nm, about 100 nm, about 150 nm, about 200 nm, about 300 nm, or about 500 nm in diameter. In some embodiments, the nanoparticles are less than about 200 nm in diameter.
- nucleic acid molecule refers to a single- or double-stranded polynucleotide containing deoxyribonucleotides or ribonucleotides that are linked by 3 ’-5’ phosphodiester bonds, as well as polynucleotide analogs.
- a nucleic acid molecule includes, but is not limited to, DNA, RNA, and cDNA.
- a polynucleotide analog may possess a backbone other than a standard phosphodiester linkage found in natural polynucleotides and, optionally, a modified sugar moiety or moieties other than ribose or deoxyribose.
- Polynucleotide analogs contain bases capable of hydrogen bonding by Watson-Crick base pairing to standard polynucleotide bases, where the analog backbone presents the bases in a manner to permit such hydrogen bonding in a sequence- specific fashion between the oligonucleotide analog molecule and bases in a standard polynucleotide.
- polynucleotide analogs include, but are not limited to xeno nucleic acid (XNA), bridged nucleic acid (BNA), glycol nucleic acid (GNA), peptide nucleic acids (PNAs), yPNAs, morpholino polynucleotides, locked nucleic acids (LNAs), threose nucleic acid (TNA), 2’-0-Methyl polynucleotides, 2'-0-alkyl ribosyl substituted polynucleotides, phosphor othioate polynucleotides, and boronophosphate polynucleotides.
- XNA xeno nucleic acid
- BNA bridged nucleic acid
- GAA glycol nucleic acid
- PNAs peptide nucleic acids
- yPNAs yPNAs
- morpholino polynucleotides include locked nucleic acids (LNAs),
- a polynucleotide analog may possess purine or pyrimidine analogs, including for example, 7-deaza purine analogs, 8-halopurine analogs, 5-halopyrimidine analogs, or universal base analogs that can pair with any base, including hypoxanthine, nitroazoles, isocarbostyril analogues, azole carboxamides, and aromatic triazole analogues, or base analogs with additional functionality, such as a biotin moiety for affinity binding.
- the nucleic acid molecule or oligonucleotide is a modified oligonucleotide.
- the nucleic acid molecule or oligonucleotide is a DNA with pseudo complementary bases, a DNA with protected bases, an RNA molecule, a BNA molecule, an XNA molecule, a LNA molecule, a PNA molecule, a gRNA molecule, or a morpholino DNA, or a combination thereof.
- the nucleic acid molecule or oligonucleotide is backbone modified, sugar modified, or nucleobase modified.
- the nucleic acid molecule or oligonucleotide has nucleobase protecting groups such as Alloc, electrophilic protecting groups such as thiranes, acetyl protecting groups, nitrobenzyl protecting groups, sulfonate protecting groups, or traditional base-labile protecting groups.
- nucleic acid sequencing means the determination of the order of nucleotides in a nucleic acid molecule or a sample of nucleic acid molecules.
- next generation sequencing refers to high-throughput sequencing methods that allow the sequencing of millions to billions of molecules in parallel.
- next generation sequencing methods include sequencing by synthesis, sequencing by ligation, sequencing by hybridization, polony sequencing, ion semiconductor sequencing, and pyrosequencing.
- primers By attaching primers to a solid substrate and a complementary sequence to a nucleic acid molecule, a nucleic acid molecule can be hybridized to the solid substrate via the primer and then multiple copies can be generated in a discrete area on the solid substrate by using polymerase to amplify (these groupings are sometimes referred to as polymerase colonies or polonies).
- a nucleotide at a particular position can be sequenced multiple times e.g ., hundreds or thousands of times) - this depth of coverage is referred to as "deep sequencing.”
- Examples of high throughput nucleic acid sequencing technology include platforms provided by Illumina, BGI, Qiagen, Thermo-Fisher, and Roche, including formats such as parallel bead arrays, sequencing by synthesis, sequencing by ligation, capillary electrophoresis, electronic microchips, “biochips,” microarrays, parallel microchips, and single-molecule arrays (See e.g., Service, Science (2006) 311:1544-1546).
- single molecule sequencing or “third generation sequencing” refers to next-generation sequencing methods wherein reads from single molecule sequencing instruments are generated by sequencing of a single molecule of DNA. Unlike next generation sequencing methods that rely on amplification to clone many DNA molecules in parallel for sequencing in a phased approach, single molecule sequencing interrogates single molecules of DNA and does not require amplification or synchronization. Single molecule sequencing includes methods that need to pause the sequencing reaction after each base incorporation ('wash-and-scan' cycle) and methods which do not need to halt between read steps. Examples of single molecule sequencing methods include single molecule real-time sequencing (Pacific Biosciences), nanopore-based sequencing (Oxford Nanopore), duplex interrupted nanopore sequencing, and direct imaging of DNA using advanced microscopy.
- analyzing means to identify, detect, quantify, characterize, distinguish, or a combination thereof, all or a portion of the components of the polypeptide.
- analyzing a peptide, polypeptide, or protein includes determining all or a portion of the amino acid sequence (contiguous or non-continuous) of the peptide.
- Analyzing a polypeptide also includes partial identification of a component of the polypeptide. For example, partial identification of amino acids in the polypeptide protein sequence can identify an amino acid in the protein as belonging to a subset of possible amino acids.
- Analysis typically begins with analysis of the n NTAA, and then proceeds to the next amino acid of the peptide (i.e., n-1 , n-2 , n-3 , and so forth). This is accomplished by elimination of the n NTAA, thereby converting the n-1 amino acid of the peptide to an N- terminal amino acid (referred to herein as the “ n-1 NTAA”). Analyzing the peptide may also include determining the presence and frequency of post-translational modifications on the peptide, which may or may not include information regarding the sequential order of the post-translational modifications on the peptide.
- Analyzing the peptide may also include determining the presence and frequency of epitopes in the peptide, which may or may not include information regarding the sequential order or location of the epitopes within the peptide. Analyzing the peptide may include combining different types of analysis, for example obtaining epitope information, amino acid sequence information, post-translational modification information, or any combination thereof.
- a stabilizing component may be directly or indirectly associated with or joined to the target.
- the stabilizing component may be directly or indirectly associated with or joined to the binding agent.
- each of the stabilizing components is attached to or associated with the binding agent and the target, respectively, at a site different from the binding site between the binding agent and the target.
- the stabilizing component and the binding moiety of the binding agent are separate.
- the target in the binding reaction is a macromolecule, e.g., a peptide, polypeptide, and protein.
- the binding reaction is performed with a plurality of binding agents and a plurality of macromolecules, e.g., peptides, polypeptides, and proteins.
- the plurality of binding agents may include a mixture of binding agents.
- the provided methods for performing a binding reaction is performed in an assay for sequencing or analysis of the peptides, polypeptides, and proteins. Before or after performing a binding reaction, other steps of an assay for analysis of the target may be performed (see e.g. , FIG. 1A-1D and FIG. 2A-2D).
- the provided methods for performing a binding reaction are compatible with a further information transfer step, such as information transfer between nucleic acids associated with the binding agent and the target.
- the information transfer is between a nucleic acid tag associated with the binding agent and a nucleic acid tag associated with the target (e.g, by extension or ligation).
- a binding agent is contacted with a target, and the binding agent and the target each comprises, is joined to, or is associated with a stabilizing component.
- the binding agent is allowed to interact with the target, then the stabilizing components are linked to form a stable complex.
- the linking of the stabilizing components can be controlled and/or inducible.
- the linking of the stabilizing components does not occur until the stabilizing components are “activated”.
- the stabilizing components are linked upon introduction to light.
- the stabilizing components are linked upon introduction to a linking agent.
- the linking agent comprises a chemical reagent, a non-biological reagent, a biological reagent, or a combination thereof.
- the linking agent comprises a protein or a polypeptide. In some examples, the linking agent comprises metal ions. Once activated, the linking of the stabilizing components, either directly with each other or indirectly via a linker or other components, allows formation of a stable complex with the binding agent and target.
- the stable complex comprises a binding agent and a target, wherein the binding agent and the target is each associated with or joined to a stabilizing component.
- the binding agent is configured to bind to the target at a binding site located on the target.
- the method comprises linking the stabilizing components associated with the binding agent and the target, thereby forming a stable complex.
- the linking of the stabilizing components forms a complex adequately or sufficiently stable for performing other steps or analysis of the target.
- the complex containing the binding agent and the target is adequately or sufficiently stable for information transfer to occur.
- the interaction between the binding agent and the target is maintained.
- the method for performing a binding reaction is reversible, where the stable complex is formed and then can be disassembled.
- the method for performing a binding reaction is temporally controlled.
- the linking of the stabilizing components is inducible.
- the method for performing a binding reaction includes an activation step for the stable complex to form.
- the method for performing a binding reaction includes an activation step for linking the stabilizing components.
- the linking of the stabilizing components can involve photosensitive step ( e.g . photoisomerization) or can involve hybridization-based interactions.
- the stabilizing components comprise caged compounds or caged molecules, such as small organic molecules.
- the stabilizing component is a photosensitive caged molecule.
- the stable complex may form quickly, e.g. the stabilizing components are linked quickly once activated.
- the provided methods may provide the advantage of providing specificity and stability in forming the complex comprising the binding agent and the target.
- specificity is provided by first contacting the binding agent with the target. After the binding agent interacts with the target, the stabilizing component associated with the binding agent is linked to the stabilizing component associated with the target, thereby forming a stable complex.
- the method is performed with a mixture of binding agents and a mixture of targets, and each binding agent in the mixture is configured to exhibit at least partial specificity towards some particular target(s).
- the binding agent is allowed to bind to the appropriate target before the stabilizing components are linked.
- the binding agents and stabilizing components with the desired binding affinity are selected and used for the methods provided herein for the binding reaction.
- the stabilizing components and linking agents with the desired binding affinity are selected and used for the methods provided herein for the binding reaction.
- the relative affinity of stabilizing components to each other and/or to the linking agent is at least as high as the affinity of the binding agent to the target.
- the method includes a wash step after allowing the binding agent to interact with the binding site located on the target. The wash step may remove non-specific binding of binding agents to non-target molecules.
- the linking agent for linking the stabilizing components is provided and introduced after the wash step.
- the binding reaction can be accomplished by a number of different ways depending on the design of the components in the complex.
- the binding agent may be joined to a stabilizing component and a nucleic acid molecule by a linker of various lengths and the distance between the components may vary.
- the target is associated or joined to a stabilizing component via a linker of various lengths based on the interaction of the components in the complex.
- the methods for performing the binding reaction includes a binding agent associated with, joined to, attached to, or comprising a stabilizing component and a target associated with, joined to, attached to, or comprising a stabilizing component.
- the binding agent and the target are each associated with a stabilizing component.
- the stabilizing component itself comprises one or more sub-components.
- the binding agent is associated with a first stabilizing component and the target is associated with a second stabilizing component.
- the first and second stabilizing components are the same or different.
- the binding agent and target may each be associated with one or more stabilizing components.
- the binding agent may be directly associated with, joined to, attached to the stabilizing component s).
- the binding agent may be indirectly associated with, joined to, attached to the stabilizing component s), such as via a linker.
- the binding agent can be joined to the stabilizing component via any suitable linker, such as of various lengths and flexibility.
- the stabilizing component and the binding agent is joined via a flexible linker (e.g ., PEG linker).
- the binding agent is joined to a nucleic acid molecule
- a linker e.g., PEG linker
- the target may be directly associated with, joined to, attached to the stabilizing component s).
- the target may be indirectly associated with, joined to, attached to the stabilizing component(s), such as via a linker.
- the target is joined to the stabilizing component via any suitable linker, such as of various lengths and flexibility.
- the stabilizing component and the target is joined via a flexible linker (e.g ., PEG linker).
- the target is joined to a nucleic acid molecule (e.g., a capture nucleic acid molecule) that is joined the stabilizing component via a linker (e.g,
- the target is joined to a bait nucleic acid molecule which hybridizes with at least a portion of the capture nucleic acid molecule that is immobilized on a solid support and the capture nucleic acid molecule is joined to the stabilizing component.
- a linker joins two molecules (binding agent and stabilizing component or target and stabilizing component) via enzymatic reaction or chemistry reaction (e.g, click chemistry).
- the stabilizing components are joined to the target or binding agent via a functional moiety, such as a click chemistry moiety, an aldehyde, an azide/alkyne, or a maleimide/thiol, or an epoxide/nucleophile, an inverse electron demand Diels-Alder (iEDDA) group, or a moiety for a Staudinger reaction.
- a functional moiety such as a click chemistry moiety, an aldehyde, an azide/alkyne, or a maleimide/thiol, or an epoxide/nucleophile, an inverse electron demand Diels-Alder (iEDDA) group, or a moiety for a Staudinger reaction.
- the stabilizing components are joined to the target or binding agent via hybridization of attached nucleic acid molecules or oligonucleotides.
- a stabilizing component is joined or attached (directly or indirectly via a linker) to a nucleic acid molecule or oligonucleotide.
- the nucleic acid molecule or oligonucleotide joined or associated with the stabilizing component is configured for hybridization to a complementary nucleic acid molecule or oligonucleotide.
- the complementary nucleic acid molecule or oligonucleotide is associated or joined to a binding molecule or a binding pair member such as a biotin.
- a recording tag is joined to a binding pair member, e.g, a biotin molecule (or similar molecule) at the 5’ end.
- a recording tag is joined to a stabilizing component DNA and the stabilizing component can be associated with its complementary stabilizing component nucleic acid which is joined to a binding pair member, e.g, a biotin (or similar molecule).
- a binding agent is joined via a linker to a coding tag which is joined via a linker to a biotin molecule.
- a binding agent is joined via a linker to a coding tag (nucleic acid hairpin) which is joined via a linker to a biotin or similar molecule (in the following order: binding agent - linker - hairpin coding tag - linker - biotin).
- a binding agent is joined via a linker to a stabilizing component DNA which is joined via a linker to a coding tag (in the following order: binding agent - linker - stabilizing component DNA - linker - coding tag) (FIG. 3A).
- the stabilizing component can be associated with its complementary stabilizing component nucleic acid which is joined to a biotin (or similar molecule) (FIG. 3B).
- the linking of the stabilizing components to form the stable complex includes interaction of the stabilizing component associated with the binding agent with the stabilizing component associated with the target.
- the linking of the stabilizing components include interaction of the stabilizing components with a linking agent.
- the linking of the stabilizing components include interaction of nucleic acid molecules associated with each of the stabilizing components.
- the interaction of the stabilizing components with each other or with the linking agent is covalent or non-covalent.
- binding partners or pairs are known to those of skill in the art and may be used in the subject binding reactions to stabilize the interaction of the binding agent and target (e.g ., as stabilizing components).
- the stabilizing components can be joined to the binding agent or target using standard conjugation chemistries (Hermanson, Bioconjugate Techniques, (2013) Academic Press). Selection of the stabilizing component may be based on affinity of the stabilizing components to each other or for the linking agent, speed of interaction, strength of the interaction, reversibility of the interaction, etc.
- the stabilizing components each comprises a biological molecule, a chemical molecule, a small molecule or a combination thereof.
- the stabilizing components comprises any appropriate binding partners, host-guest molecules or motifs, other interacting molecules, or portions thereof (see e.g. Liu et ah, Chem Soc Rev. (2017); 46(9): 2391-2403; Mantooth et ah, Macromol Biosci. (2019) 19(l):el800281).
- the stabilizing component comprises an organic molecule or a synthetic molecule.
- the stabilizing component is or comprises a small molecule, a compound, a protein, a protein complex, polypeptide, peptide, nucleic acid molecule, carbohydrate, lipid, macrocycle, a chimeric macromolecule, a synthetic host, or any combinations thereof.
- the stabilizing component is or comprises an antibody, a catalytic antibody, an antigen, an enzyme, an inhibitor, a ligand, a protein, a substrate, or an organic compound.
- the stabilizing component is or comprises a hapten.
- a hapten molecule may be attached at different positions in the hapten molecule to the binding agent or the target (or an associated polynucleotide or nucleic acid molecule).
- the stabilizing components may comprise a photosensitive molecule (e.g . photolabile or photoisomerization).
- the stabilizing components are configured for nucleic acid hybridization-based interactions.
- the stabilizing components comprise or are associated with caged compounds or caged molecules, such as small organic molecules.
- the stabilizing components comprise or are associated with one or more components of a known host-guest interaction.
- the introduction of the light, activating the stabilizing component, or providing the linking agent provides temporal control over the linking of the stabilizing components.
- the stabilizing components are linked to each other (directly or indirectly) upon introduction of a linking agent or light.
- the stabilizing components remain inactive, or are generally not linked to each other or to a linking agent until activated.
- activation may refer to the introduction of a molecule, photoactivation (e.g., introduction of light, for example, UV or blue light), change in pH of the reaction, change in condition of the reaction (e.g, change in temperature), or destruction or removal of inhibition (e.g, uncaging of a molecule).
- one or more of the stabilizing components upon activation, undergoes a conformational change.
- one or more of the stabilizing components is under allosteric control and upon activation (e.g, by binding to a linking agent), the stabilizing component is made available for interactions/binding.
- the light or linking agent induces uncaging of one or both of the stabilizing components, deblocking of one or both of the stabilizing components, isomerization of the stabilizing components, hybridization of the stabilizing components, and/or binding of the stabilizing components.
- the linking of the stabilizing components occurs in less than about 10 seconds, less than about 30 seconds, less than about 60 seconds, less than about 80 seconds, less than about 100 seconds, less than about 2 minutes, less than about 5 minutes, less than about 10 minutes, or less than about 15 minutes. It may be desirable to select stabilizing components that may be linked in an amount of time less than the time for the binding agent to dissociate from the target, to maintain specificity of the binding agent with the target.
- linking of the stabilizing components is specific or occurs within the complex between a stabilizing component associated with the binding agent and a stabilizing component associated with the target bound by said binding agent.
- the method is performed such that linking of stabilizing components is not intermolecular, e.g., between stabilizing components of different complexes. It may be preferred that linking does not occur between a stabilizing component of a binding agent and a target bound by a different binding agent.
- linking of intramolecular stabilizing components within a complex can be achieved by titrating or controlling the density of target macromolecules on a support or within the volume of a substrate. In some cases, the control of density of the target macromolecules is performed by controlling the density of functional coupling groups for attaching the targets or by spiking a competitor or “dummy” reactive molecule when immobilizing the targets to the support.
- the linking agent comprises a chemical reagent, a non- biological reagent, a biological reagent, or a combination thereof.
- the linking agent comprises one or more proteins.
- the linking agent comprises metal ions.
- the stabilizing components are linked upon a change in pH of the reaction or reaction mixture or environment.
- the linking agent comprises at least one polynucleotide or nucleic acid comprising a sequence which hybridizes to at least one of the stabilizing components.
- the linking agent is a polynucleotide or nucleic acid comprising two hybridization regions: one region for hybridizing to a nucleic acid joined to a target and one region for hybridizing to a nucleic acid joined to the binding agent.
- one stabilizing component is associated with the binding agent via hybridization of a polynucleotide or nucleic acid joined to the stabilizing component to a nucleic acid joined to the binding agent.
- one stabilizing component is associated with the target via hybridization of a polynucleotide or nucleic acid joined to the stabilizing component to a nucleic acid joined to the recording tag joined to the target.
- the stabilizing component is or comprises a biotin or an analog thereof (e.g . desthiobiotin) and the linking agent is or comprises an avidin (e.g., streptavidin or neutravidin).
- the first stabilizing component is or comprises a first antibody or an antigen-recognizing fragment thereof;
- the second stabilizing component is or comprises a second antibody or an antigen recognizing fragment thereof recognizing a different epitope from the first antibody;
- the linking agent comprises two epitopes recognized by the first and second antibodies, so after introduction of the linking agent a stable complex forms comprising the first and second antibodies (or antigen-recognizing fragments thereof) and the linking agent.
- the binding agent and the target remains bound.
- the binding agent and the target are released from each other, remaining in the vicinity of each other by virtue of the linked stabilizing components. In this case, when the stabilizing components remain linked, the process of information (encoding) transfer can occur.
- the described stabilization approach operates by transiently “cross-linking” the binding agent and the target on a support after binding event forming a stable complex.
- stabilizing components can be employed, but in a preferred embodiment the stabilization methods rely on a rapid means of reversibly coupling the DNA-target polypeptide complex to the binding agent after it binds to the target polypeptide.
- the following embodiment illustrates an exemplary workflow including a binding reaction: a large collection of polypeptides (e.g, 50 million - 1 billion or more) from a proteolytic digest are immobilized randomly on a substrate (e.g, beads) at an appropriate intramolecular spacing with nucleic acid capture molecules; the targets are joined to nucleic acid capture molecules which are each joined to a desthiobiotin molecule (the first stabilizing component); binding agents each joined to a biotin molecule (the second stabilizing component) and an associated nucleic acid molecule containing information regarding the binding agent are contacted with the targets and allowed to interact; a wash is preformed to remove non-specific binding; streptavidin is added to the reaction as a linking agent and associates with the biotin and desthiobiotin; a streptavidin molecule binds a biotin joined to the binding agent and a desthiobiotin joined to the target, thereby forming a stable complex containing the binding agent and target.
- biotin-nucleic acid conjugates can be added after the binding agent interacts with the target.
- each of the biotin or desthiobiotin may use any similar molecule or analog, depending on desired strength of the interaction.
- the first stabilizing component is the same as the second stabilizing component.
- a biotin molecule can be used instead of desthiobiotin molecule, and two biotin molecules will interact with the linking agent and form the stable complex.
- the first stabilizing component has a lower affinity to the linking agent in comparison to an affinity of the second stabilizing component to the linking agent as shown in the exemplary workflow from the previous paragraph.
- it will be preferrable to use this combination of different stabilizing components such as desthiobiotin (DSB) and biotin.
- a rapid high-affinity stabilizing component on the binding agent (biotin) and a lower affinity stabilizing component (DSB) associated with a target polypeptide provides for both rapid formation of the stable complex and controllable release (disruption of the stable complex) at the target polypeptide side, for example, by elution with biotin, which opens the target polypeptide for the next binding cycle.
- formation of the stable complex is reversible and no covalent bonds are formed during formation of the stable complex.
- the disrupting is conducted by removing the linking agent.
- the disrupting is conducted by introducing a destabilizing agent.
- the destabilizing agent comprises heat, a denaturing agent, an enzyme, a competitor molecule, or a combination thereof.
- the competitor molecule is a competitor for binding of or to the binding agent, the linking agent, and/or the stabilizing components.
- reversible covalent bonds can be formed during formation of the stable complex.
- the method for performing the binding reaction further comprises disrupting or destabilizing the stable complex comprising the binding agent, target, stabilizing components and, optionally, the linking agent.
- the disrupting allows the stabilizing component (e.g associated with the recording tag) to become available for interacting.
- the method includes a repeated cycle of forming a stable complex and disrupting the stable complex such that the binding agent is released from the target, allowing the target to be available for other reactions or treatments.
- the first stabilizing component associated with a target peptide or macromolecule has a lower affinity to the linking agent in comparison to an affinity of the second stabilizing component to the linking agent. This setup allows for efficient disruption of the stable complex and binder dissociation.
- stabilizing components can be utilized in this setup.
- One particular type includes using dethiobiotin (DSB) and biotin linked via streptavidin during the stable complex formation, and then using biotin for dissociation.
- Other linking agents can also be used, preferably ones that have affinity sites for two different interacting partners. These partners can be included as stabilizing components and will be linked together upon introduction of the linking agent.
- one or more of the stabilizing components are cleavable.
- two different cleavable stabilizing components are attached to the target and binding agent respectively, directly or indirectly via a nucleic acid molecule.
- Specific cleaving agents e.g. chemical reagent for cleaving
- the method may include using linking the stabilizing components to form a stable complex comprising the binding agent, the target and the stabilizing components, then cleaving the stabilizing component associated with the binding agent while the stabilizing component remains associated with the target.
- the first or second stabilizing component comprises a polynucleotide
- the linking agent comprises a linking polynucleotide that hybridizes to the polynucleotide of one of the stabilizing components.
- known approaches can be used to generate controllable hybridization of two polynucleotides that will result in formation of the stable complex containing binding agent and target.
- Fig. 13 Several potential embodiments of controllable hybridization of two polynucleotides (used as stabilizing components) are illustrated in Fig. 13.
- photoisomerization or uncaging can trigger hybridization, as disclosed in Szymahski W, et ak, Reversible photocontrol of biological systems by the incorporation of molecular photoswitches. Chem Rev. 2013 Aug 14; 113(8):6114-78; Asanuma H, et ak, Synthesis of azobenzene-tethered DNA for reversible photo-regulation of DNA functions: hybridization and transcription. Nat Protoc. 2007;2(1):203-12; Yunqi Yan et al., Photocontrolled DNA hybridization stringency with fluorescence detection in heterogeneous assays, ACS Sens.
- “Caged” compounds have inactivating groups bonded to bioactive molecules that can be readily removed in an orthogonal manner, for example, by UV light or visible light photoirradiation. By using light to turn on activity, high spatial and temporal control of polynucleotide hybridization can be attained.
- the following embodiment illustrates another exemplary workflow including a binding reaction: a large collection of polypeptides (e.g ., 50 million - 1 billion or more) from a proteolytic digest are immobilized randomly on a substrate (e.g., beads) at an appropriate intramolecular spacing with nucleic acid capture molecules; the target polypeptides are joined to nucleic acid capture molecules which are each joined to a hybridizable polynucleotide (the first stabilizing component); binding agents each joined to a complementary hybridizable polynucleotide (the second stabilizing component) and an associated coding tag containing information regarding the binding agent are contacted with the target polypeptides and allowed to interact; a wash is preformed to remove non-specific binding.
- a binding reaction a large collection of polypeptides (e.g ., 50 million - 1 billion or more) from a proteolytic digest are immobilized randomly on a substrate (e.g., beads) at an appropriate intramolecular spacing with nucleic
- the hybridizable polynucleotide is modified by introducing photoswitchable nucleotides or caged nucleotides to prevent hybridization with its complementary polynucleotide.
- Light of a certain wavelength is introduced to the reaction as a linking agent, inducing uncaging of nucleotides and allowing hybridization and formation of a stable complex containing the binding agent and target polypeptide.
- caged or modified nucleotide variants can be used.
- diethylaminocoumarin (DEACM) as a photoremovable protecting group for 2’ -deoxyguanosine can be used, and light with 405 nm wavelength can be used for uncaging as disclosed in Menge C, Heckel A.
- azobenzene moieties can be introduced into certain DNA nucleotides on a conventional DNA synthesizer using a phosphoramidite monomer bearing an azobenzene synthesized from D-threoninol as disclosed in Asanuma H, et al., Synthesis of azobenzene- tethered DNA for reversible photo-regulation of DNA functions: hybridization and transcription. NatProtoc. 2007;2(1):203-12.
- Hybridization of a polynucleotide having azobenzene-modified DNA can be reversibly photo-controlled by controlling cis-trans isomerization of the azobenzene.
- the hybridization can be photo-induced by cis-trans isomerization of the azobenzene moiety by irradiation of a visible light (wavelength is more than 400 nm).
- a visible light wavelength is more than 400 nm.
- a stable duplex can be formed with a complementary strand.
- hybridization is reversible and can be disrupted by UV light irradiation (wavelength between 300 nm and 400 nm), which induces isomerization of the trans-azobenzene to its cis-form.
- Photoswitchable units can be introduced to nucleotide monophosphates in nucleic acid oligomers via two methods: alkylation of a thiophosphate-modified backbone and amidation of the ribose moiety on a 2'- aminodeoxyuridylate analog as disclosed in Szymahski W, et al., Reversible photocontrol of biological systems by the incorporation of molecular photoswitches. Chem Rev. 2013 Aug 14; 113(8):6114-78 and references therein.
- the linking agent comprises a metal ion that links two stabilizing components together.
- a metal ion that links two stabilizing components together.
- divalent metal ions Fe 2+ , Co 2+ , Ni 2+ , Cu 2+ , Zn 2+
- Metal ions can bring together spartially separated metal-chelating or metal-coordinating groups to form a stable complex having a metal ion in its center.
- a solid support contains a N-tert-butyl (tBu)-modified capture DNA for immobilizing a target macromolecule (e.g. polypeptide).
- a binding agent is added that comprises b-cyclodextrin (bCD) blocked by 2,2’-bipyridyl (bpy).
- bCD b-cyclodextrin
- bpy 2,2’-bipyridyl
- a metal ion Fe 2+ , Co 2+ , Ni 2+ , Cu 2+ or Zn 2+
- 2,2’-bipyridyl (bpy) is further added.
- a Ni-NTA/HisTag interaction can be utilize for the stabilization reaction.
- a solid support contains N-terminal protected 6*His-tagged capture DNA for immobilizing a target macromolecule (e.g. polypeptide).
- a binding agent is added that comprises chelating ligand nitrilotriacetic acid (NTA).
- NTA chelating ligand nitrilotriacetic acid
- a metal ion Ni 2+
- imidazole is further added.
- an Azide/ Alkyne linkage can be utilize for the stabilization reaction.
- a solid support contains an aldehyde-modified capture DNA for immobilizing a target macromolecule (e.g. polypeptide).
- a binding agent is added that comprises TMS or TIPS -protected Aldehyde- Azide.
- a metal ion copper is added to trigger Azide/ Aldehyde click reaction.
- deprotection of TMS or TIPS is used to generate new aldehyde on the capture DNA.
- kits for analysis of macromolecules e.g., peptides, polypeptides, and proteins, which includes a step of transferring information to a recording tag.
- the analysis employs barcoding and nucleic acid encoding of molecular recognition events, and/or detectable labels.
- the information transferred comprises identifying information regarding a binding agent that is configured to bind to the macromolecule.
- the information transfer can be achieved by any suitable means such as by extension or ligation, and can be between nucleic acid molecules, e.g, between a nucleic acid tag associated with the macromolecule for analysis and a secondary tag on an adaptor molecule.
- the provided method for information transfer comprises: (a) providing a macromolecule and an associated recording tag joined to a support; (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, to allow binding between the macromolecule and the binding agent; (c) providing an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag (or portion thereof), and a secondary tag, to allow hybridization between the adaptor molecule (or the first hybridization sequence) and the coding tag (or the portion of the coding tag); (d) transferring the information of the secondary tag to the recording tag to generate an extended recording tag; and analyzing the extended recording tag.
- the methods provided herein may include providing a plurality of binding agents and a plurality of macromolecules and allowing the binding agents and macromolecules to interact.
- a plurality of adaptor molecules are provided.
- the present methods comprise contacting a single macromolecule with a single binding agent, contacting a plurality of macromolecules with a single binding agent, or contacting a plurality of macromolecules with a plurality of binding agents.
- macromolecule comprises a polypeptide.
- the present disclosure provides, in part, methods for analyzing a macromolecule which includes information transfer, with direct applications to protein and peptide characterization, quantitation, and/or sequencing.
- methods for transferring information from a secondary tag of an adaptor molecule to a recording tag associated with the macromolecule (e.g ., polypeptide) bound by the binding agent are provided herein.
- Transfer of information may be performed via ligation, extension or other methods known in the art.
- the information transferred from the secondary tag of an adaptor molecule includes identifying information regarding the identity of the binding agent, the macromolecule or portion thereof bound by the binding agent.
- the identifying information may comprise information regarding the identity of the one or more amino acid(s) on the peptide bound by the binding agent ( see 6A-6D and FIG. 7A-7D).
- the information regarding the identity of the macromolecule bound by the binding agent is from the coding tag associated with said binding agent, and transferred to the recording tag via the hybridized adaptor molecule.
- the macromolecule analysis assay may include one or more cycles of transferring identifying information of a binding agent to a recording tag associated with the macromolecule to be analyzed.
- the extended recording tag associated with the macromolecule for analysis can comprise the information from one or more secondary tags. If multiple cycles are performed, the resulting extended recording tag then contains information built up from a series of binding events and multiple information transfer events using adaptor molecules comprising secondary tags. In general, improvements for the transfer of information may provide certain benefits to the macromolecule analysis assay.
- the adaptor molecules provided in step (c) used in this method for analyzing macromolecules provides certain advantages to the overall design of the assay.
- the adaptor molecule serves as an intermediate between the information on the coding tag associated with the binding agent and the transferred information on the recording tag.
- the adaptor molecules comprise a first hybridization sequence and a secondary tag, wherein the first hybridization sequence or portion thereof is substantially complementary or complementary to the coding tag or a region therein.
- the first hybridization sequence allows each coding tag to be associated with an adaptor molecule and its contained secondary tag.
- the use of the adaptor molecules provides the ability to adjust the information transferred to the recording tag (via the secondary tag) quickly and conveniently, by obviating the need to remake binding agent-coding tag conjugates, which may be a time consuming process.
- the use of the adaptor molecule provides some flexibility such as the ability to collapse information at the level of the secondary tag.
- the adaptor molecules may be designed to contain suitable barcodes (e.g. as part of the secondary tag) based on the sequencing system used for the readout.
- the barcode may be suitable for less accurate NGS such as nanopore sequencing (e.g, more error correction).
- multiple cycle of transferring information from a binding agent-fused coding tag to the polypeptide- associated recording tag occur, similar to the cycles shown in Figs. 1, 2, 6 and 7.
- the terminal amino acid of the polypeptide gets cleaved off, so the next amino acid of the polypeptide becomes a new terminal amino acid and a target for binding agents on the next cycle.
- a secondary tag of an adaptor molecule to a recording tag associated with the macromolecule (e.g., polypeptide) bound by the binding agent. Transfer of information may be performed via ligation, extension or other methods known in the art.
- the information transferred from the secondary tag of an adaptor molecule includes identifying information regarding the identity of the binding agent, the macromolecule or portion thereof bound by the binding agent.
- the identifying information may comprise information regarding the identity of the one or more amino acid(s) on the peptide bound by the binding agent [0123]
- the use of cycle-specific adapter molecules rather than cycle-specific DNA-tagged binding agents recognizing a particular NTAA greatly decreases the manufacturing burden of creating and maintaining a set of 20 or more binders (this set would recognize all 20 NTAA and/or post-translationally modified NTAA) multiply by 15+ cycles. For 20 binders and 15 cycles, this equates to a set of 300 binders.
- a new pool of cycle-specific adapter molecules can be used to translate NTAA binding information into cycle-specific NTAA binding information.
- a single universal set of 15-20 DNA-conjugated binding agents would be employed in each cycle, and cycle-specific information would be conferred by the use of cycle-specific pools of adapter molecules.
- the analysis assay includes the use of a plurality of binding agents and each binding agent is associated with a coding tag containing identifying information regarding the identity of the macromolecule.
- An example of collapsing information from multiple binding agents is as follows: the macromolecule “X” may be bound by two binding agents (such as at different motifs on the molecule) with corresponding coding tags B 1 and B2, two adaptor molecules which contain a secondary tag and BG and B2’ as the first hybridization region, respectively, can both be associated with “X”. Thus the information from two binding agents is collapsed at the level of the secondary tag and transferred as the same information to the recording tag.
- the option also remains to switch in adaptor molecules that retain the information of the B1 and B2 binding agents while using the same binding agents and associated coding tags, simply by using different adaptor molecules.
- the adaptor molecules can be exchanged or modified without the need for modifying the binding agent and associated coding tags.
- a spacer is added to the end of the recording tag with the secondary tag, and the spacer comprises a sequence that is capable of hybridizing with a sequence on the adaptor molecule to facilitate transfer of the identifying information.
- the adaptor molecule further comprises a second hybridization sequence substantially complementary or complementary to a portion of the recording tag.
- the second hybridization sequence on the adaptor molecule may be substantially complementary or complementary to a sequence at the 3’ terminus of the recording tag, such as a region on the recording tag generated from a previous information transfer of the secondary tag from the adaptor molecule to the recording tag.
- information transfer of the secondary tag from the adaptor molecule to the recording tag occurs if both the first hybridization sequence on the adaptor molecule hybridizes to the coding tag of a binding agent and the second hybridization sequence of the adaptor molecule hybridizes to a portion of the recording tag.
- a set of adaptor molecules is used such that combinations of the first hybridization sequence and the second hybridization sequence covers all combinations needed to hybridize to potential complementary sequences on the recording tags and coding tags.
- the provided method for information transfer using the adaptor molecule comprising a second hybridization sequence enables a “spacer-less” approach of transferring information from an adaptor molecule to a recording tag. For example, this may remove the need for inserting a spacer sequence for hybridization purposes into the extended recording tag.
- a spacer-less approach provides the benefit that the extended recording tag length can be reduced.
- Identifying information associated with a specific binding agent may be transferred to a recording tag using a variety of methods.
- the transfer in the methods provided herein are from the secondary tag of an adaptor molecule to the recording tag to generate an extended recording tag.
- the transfer of identifying information can be accomplished by ligation e.g ., an enzymatic or chemical ligation, a splint ligation, a sticky end ligation, a single strand (ss) ligation such as a ssDNA ligation, or any combination thereof), a polymerase- mediated reaction (e.g., primer extension of single-stranded nucleic acid or double-stranded nucleic acid), or any combination thereof.
- ligation e.g ., an enzymatic or chemical ligation, a splint ligation, a sticky end ligation, a single strand (ss) ligation such as a ssDNA ligation, or any combination thereof
- Identifying information associated with a specific binding agent may be transferred to a nucleic acid on the recording tag associated with the immobilized macromolecule via ligation (FIG. 6A-6D).
- Ligation may be a blunt end ligation or sticky end ligation.
- Ligation may be an enzymatic ligation reaction. Examples of ligases include, but are not limited to CV DNA ligase, T4 DNA ligase, T7 DNA ligase, T3 DNA ligase, Taq DNA ligase, E. coli DNA ligase, 9°N DNA ligase ( See e.g., U.S. Patent Publication No. US20140378315).
- the adaptor molecule comprises one or more uracil bases, which can be targeted for digestion with a uracil-specific excision reagent (e.g ., USER TM ).
- a uracil-specific excision reagent e.g ., USER TM
- transfer of PNAs can be accomplished with chemical ligation using published techniques.
- the structure of PNA is such that it has a 5’ N-terminal amine group and an unreactive 3’ C-terminal amide.
- Chemical ligation of PNA requires that the termini be modified to be chemically active. This is typically done by derivatizing the 5’ N-terminus with a cysteinyl moiety and the 3’ C-terminus with a thioester moiety.
- Such modified PNAs easily couple using standard native chemical ligation conditions (Roloff et al., (2013) Bioorgan. Med. Chem. 21:3458-3464).
- identifying information from a secondary tag can be transferred to a recording tag using topoisomerase.
- Topoisomerase can be used be used to ligate a topo-charged 3’ phosphate on the recording tag (or extensions thereof or any nucleic acids attached) to the 5’ end of the coding tag, or complement thereof (Shuman et al., 1994,
- information is transferred to a recording tag via primer extension (Chan et al. (2015) Curr Opin Chem Biol 26: 55-61).
- a sequence on the 3’- terminus of a recording tag or an extended recording tag anneals with complementary sequence on the 3’ terminus of an adaptor molecule and a polymerase (e.g., strand-displacing polymerase) extends the recording tag sequence, using the secondary tag of the annealed adaptor molecule as a template (FIG. 7).
- the complementary sequence on the 3’ terminus of an adaptor molecule may be a spacer sequence.
- the complementary sequence on the 3’ terminus of an adaptor molecule may be the second hybridization sequence.
- the adaptor molecule comprises a spacer or linker to stop extension after transfer of information from the secondary tag to the recording tag.
- a DNA polymerase that is used for primer extension possesses strand-displacement activity and has limited or is devoid of 3’-5 exonuclease activity.
- Several of many examples of such polymerases include Klenow exo- (Klenow fragment of DNA Pol 1), T4 DNA polymerase exo-, T7 DNA polymerase exo (Sequenase 2.0), Pfu exo-, Vent exo-, Deep Vent exo-, Bst DNA polymerase large fragment exo-, Bca Pol, 9°N Pol, and Phi29 Pol exo-.
- the DNA polymerase is active at room temperature and up to 45°C.
- thermophilic polymerase in another embodiment, a “warm start” version of a thermophilic polymerase is employed such that the polymerase is activated and is used at about 40°C-50°C.
- An exemplary warm start polymerase is Bst 2.0 Warm Start DNA Polymerase (New England Biolabs).
- Additives useful in strand-displacement replication include any of a number of single-stranded DNA binding proteins (SSB proteins) of bacterial, viral, or eukaryotic origin, such as SSB protein of E. coli, phage T4 gene 32 product, phage T7 gene 2.5 protein, phage Pfi SSB, replication protein A RPA32 and RPA14 subunits (Wold, Annu. Rev. Biochem.
- DNA binding proteins such as adenovirus DNA-binding protein, herpes simplex protein ICP8, BMRFl polymerase accessory subunit, herpes virus UL29 S SB-like protein; any of a number of replication complex proteins known to participate in DNA replication, such as phage T7 helicase/primase, phage T4 gene 41 helicase, E. coli Rep helicase, E. coli recBCD helicase, recA, E. coli and eukaryotic topoisomerases (Annu Rev Biochem. (2001) 70:369-413).
- Mis-priming or self-priming events such as when the terminal spacer sequence of the recording tag primes extension self-extension may be minimized by inclusion of single stranded binding proteins (T4 gene 32, E. coli SSB, etc.), DMSO (1-10%), formamide (1-10%), BSA( 10-100 ug/ml), TMAC1 (1-5 mM), ammonium sulfate (10-50 mM), betaine (1-3 M), glycerol (5-40%), or ethylene glycol (5-40%), in the primer extension reaction.
- Most type A polymerases devoid of 3’ exonuclease activity endogenous or engineered removal
- Klenow exo- preferably an adenosine base (to lesser degree a G base, dependent on sequence context)
- Taq polymerase catalyze non-templated addition of a nucleotide, preferably an adenosine base (to lesser degree a G base, dependent on sequence context) to the 3’ blunt end of a duplex extension product.
- a 3’ pyrimidine (OT) minimizes non- templated adenosine addition
- a 3’ purine nucleotide G>A
- non-templated base can be reduced by employing a mutant polymerase (mesophilic or thermophilic) in which non-templated terminal transferase activity has been greatly reduced by one or more point mutations, especially in the O-helix region (see Ei.S. Patent 7,501,237) (Yang et ah, Nucleic Acids Res. (2002) 30(19): 4314-4320).
- Pfu exo- which is 3’ exonuclease deficient and has strand- displacing ability, also does not have non-templated terminal transferase activity.
- polymerase extension buffers are comprised of 40-120 mM buffering agent such as Tris-Acetate, Tris-HCl, HEPES, etc. at a pH of 6-9.
- buffering agent such as Tris-Acetate, Tris-HCl, HEPES, etc. at a pH of 6-9.
- Self-priming/mis-priming events initiated by self-annealing of the terminal spacer sequence of the extended recording tag with internal regions of the extended recording tag may be minimized by including pseudo-complementary bases in the recording/extended recording tag (Lahoud, Timoshchuk et al. 2008), (Hoshika, Chen et al. 2010). Pseudo complementary bases show significantly reduced hybridization affinities for the formation of duplexes with each other due the presence of chemical modification.
- pseudo-complementary modified bases can form strong base pairs with natural DNA or RNA sequences.
- the coding tag spacer sequence is comprised of multiple A and T bases, and commercially available pseudo-complementary bases 2- aminoadenine and 2-thiothymine are incorporated in the recording tag using phosphoramidite oligonucleotide synthesis. Additional pseudocomplementary bases can be incorporated into the extended recording tag during primer extension by adding pseudo complementary nucleotides to the reaction (Gamper, Arar et al. 2006).
- the binding event information of the binding agent to the macromolecule is transferred from a secondary tag of an adaptor molecule to the recording tag associated with the immobilized macromolecule in a cyclic fashion.
- steps repeated one or more times include: (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent; (c) providing an adaptor molecule comprising a first hybridization sequence substantially complementary or complementary to at least a portion of the coding tag or the entire coding tag, and a secondary tag; and (d) transferring the information of the secondary tag to the recording tag to generate an extended recording tag.
- the method further includes one or more wash steps between any of steps (b), (c), and (d).
- the method further includes removing the binding agent and/or the adaptor molecule.
- a set of adaptor molecules used in a cycle of binding and information transfer may include cycle information, such as using cycle specific sequences.
- the adaptor molecules comprise binding cycle-specific sequences.
- the secondary tags comprise binding cycle-specific sequences. Binding cycle-specific sequences may be accomplished either via the use of completely unique binding cycle barcodes or through unique combinations of sub-barcodes. In some aspects, embedding binding cycle information directly in the secondary tag sequence may allow the total length of the coding tag to be minimized when employing error-correcting barcodes.
- error-tolerant barcodes allows highly accurate barcode identification using sequencing platforms and approaches that are more error-prone, but have other advantages such as rapid speed of analysis, lower cost, and/or more portable instrumentation.
- the provided methods also allow the use of adaptor molecules with secondary tags that are of preferred lengths, such as a length suitable for a particular sequencing method.
- adaptor molecules may comprise secondary tags containing error-tolerant barcodes.
- various libraries or sets of adaptor molecules can be designed to be compatible with a particular sequencing method and switched interchangeably if another sequencing method is preferred for the downstream analysis step.
- the methods provided may include preparing, selecting, and providing a single adaptor molecule or a plurality of adaptor molecules.
- the library or set of adaptor molecules used in the provided methods comprises at least one adaptor molecule configured to hybridize to at least one coding tag associated with the binding agents used for analyzing the macromolecule (or a portion of the coding tag).
- the adaptor molecule further comprises a second hybridization sequence.
- the first hybridization sequence comprises a sequence substantially complementary or complementary to particular coding tags, allowing adaptor molecules to hybridize to corresponding coding tags and associate the secondary tag with the corresponding binding agents.
- the information from the secondary tag can be transferred from the adaptor molecule to the recording tag after the coding tag associated with the binding agent hybridizes with the first hybridization sequence on the adaptor molecule.
- the adaptor molecule may comprise any suitable nucleic acid molecule including a DNA molecule, an RNA molecule, a PNA molecule, a BNA molecule, an XNA, molecule, an LNA molecule, a gRNA molecule, or a combination thereof.
- the adaptor molecule comprises a spacer or linker.
- the linker or spacer is provided to stop extension after transfer of information from the secondary tag to the recording tag.
- a linker may be
- the adaptor molecule may further comprise a universal priming site, a binding cycle specific spacer, binding cycle-specific barcode, a UMI or any combination thereof.
- multiple coding tags associated with a binding agent is configured to hybridize to adaptor molecules comprising the same secondary tag.
- two binding agents that provide the same identifying information regarding a binding agent and the corresponding cognate target may be designed such that each binding agent is associated with the same secondary tag.
- a binding agent that binds peptides with a terminal alanine (Pl)-alanine (P2) and a binding agent that binds peptides with a terminal alanine (Pl)-arginine (P2) both provide information that a peptide has a NTAA (PI) that is alanine, regardless of the amino acid at the penultimate position (P2).
- the coding tag associated with both binding agents hybridize with adaptor molecules which share the same secondary tag providing information of the terminal alanine on the peptide, thus collapsing the information of multiple binders into one piece of information at the level of the secondary tag.
- the first hybridization sequence (1 st Hyb Sequence in FIG.
- the adaptor molecule comprises a single stranded region for hybridizing to the coding tag (or region therein) associated with the binding agent.
- the first hybridization sequence comprises at least one nucleic acid region which is substantially complementary to a coding tag or portion thereof.
- the first hybridization sequence in the adaptor molecule is complementary to a portion of the coding tag.
- the first hybridization sequence in the adaptor molecule is complementary to the entire coding tag.
- the first hybridization sequence comprises a sequence of nucleotides that binds selectively to the coding tag sequence or portion thereof.
- the first hybridization sequence comprises a single stranded region which is substantially complementary to the coding tag sequence.
- substantially complementary refers to sequences that are capable of hybridizing to a target nucleic acid sequence under the conditions employed.
- a “substantially complementary" single-stranded region is exactly complementary to a target nucleic acid sequence.
- the single-stranded region of the first hybridization sequence complementary to the coding tag may have at least 4 bases, at least 5 bases, at least 6 bases, at least 7 bases, at least 8 bases, at least 9 bases, at least 10 bases, at least 12 bases, at least 14 bases, at least 16 bases, at least 20 bases, at least 24 bases, at least 30 bases, or at least 34 bases.
- the single-stranded region of the first hybridization sequence complementary to the coding tag has fewer than 40 bases, fewer than 30 bases, or fewer than 25 bases.
- One skilled in the art may select complementary regions with number of bases that is sufficient for forming stable hybridization regions between the first hybridization sequence and the coding tag.
- the first hybridization sequence for hybridizing to the coding tag is located at the 3' or the 5' terminus of the adaptor molecule. In some specific embodiments, the first hybridization sequence for hybridizing to the coding tag is located at the 5' terminus of the adaptor molecule. In some examples, the first hybridization sequence is 5’ to the secondary tag in the adaptor molecule.
- FIG. 7C) of the adaptor molecule comprises a single stranded region for hybridizing to a portion of the recording tag associated with the macromolecule for analysis.
- the second hybridization sequence on the adaptor molecule is substantially complementary or complementary to a region on the recording tag generated from a previous information transfer of the secondary tag from the adaptor molecule to the recording tag.
- the second hybridization sequence on the adaptor molecule comprises the secondary tag information or portion thereof on a different adaptor molecule.
- the second hybridization sequence comprises at least one nucleic acid region which is substantially complementary to a portion of the recording tag or portion thereof.
- the second hybridization sequence is substantially complementary or complementary to a sequence at the 3’ terminus of the recording tag.
- the second hybridization sequence comprises a sequence of nucleotides that binds selectively to a portion of the recording tag associated with the macromolecule for analysis.
- the second hybridization sequence comprises a single stranded region which is substantially complementary to a portion of the recording tag associated with the macromolecule for analysis.
- the single-stranded region of the second hybridization sequence complementary to the recording tag or portion thereof may have at least 4 bases, at least 5 bases, at least 6 bases, at least 7 bases, at least 8 bases, at least 9 bases, at least 10 bases, at least 12 bases, at least 14 bases, at least 16 bases, at least 20 bases, at least 24 bases, at least 30 bases, or at least 34 bases.
- the single- stranded region of the second hybridization sequence complementary to the recording tag or portion thereof has fewer than 40 bases, fewer than 30 bases, or fewer than 25 bases.
- One skilled in the art may select complementary regions with number of bases that is sufficient for forming stable hybridization regions between the second hybridization sequence and the portion of the recording tag.
- the second hybridization sequence for hybridizing to the portion of the recording tag is located at the 3' or the 5' terminus of the adaptor molecule. In some specific embodiments, the second hybridization sequence is located at the 3’ terminus of the adaptor molecule.
- the secondary tag of an adaptor molecule can be a polynucleotide of any suitable length, e.g ., a nucleic acid molecule of about 2 bases to about 100 bases, including any integer including 2 and 100 and in between, that comprises identifying information for a binding agent.
- Information of a secondary tag can be associated to a binding agent via the first hybridization sequence (which is substantially complementary or complementary to the coding tag (or portion thereof) associated with a binding agent) comprised by the same adaptor molecule.
- a secondary tag can made from a sequenceable polymer. The secondary tag may be optionally flanked by one spacer on one side or optionally flanked by a spacer on each side.
- a secondary tag may also be comprised of an optional UMI and/or an optional binding cycle-specific barcode.
- a secondary tag may be flanked by a binding cycle specific spacer.
- a secondary tag may be single stranded or double stranded.
- a double stranded secondary tag may comprise blunt ends, overhanging ends, or both.
- a secondary tag may refer to the secondary tag that is part of the adaptor molecule or to a complementary sequence that is capable of hybridizing to the secondary tag, or to the information present in an extended recording tag transferred from the secondary tag.
- a secondary tag may further comprise a binding cycle specific barcode, a unique molecular identifier, or both.
- the secondary tag is a binding cycle specific sequence. For example, one set or plurality of adaptor molecules is used with a first cycle and a second set or plurality of adaptor molecules is used with a second cycle, etc.
- a secondary tag comprises a sequence that provides identifying information regarding the binding agent associated via the first hybridization region on the adaptor molecule.
- the sequence providing identifying information is about 3 bases to about 30 bases, about 3 bases to about 20 bases, about 3 bases to about 10 bases, or about 3 bases to about 8 bases.
- an sequence providing identifying information is about 3 bases, 4 bases, 5 bases, 6 bases, 7 bases, 8 bases, 9 bases, 10 bases, 11 bases, 12 bases, 13 bases, 14 bases, 15 bases, 20 bases, 25 bases, or 30 bases in length.
- the length of the sequence providing identifying information may be adjusted based on the size of the binding agent library used.
- each unique binding agent within a library of binding agents is associated with a sequence providing identifying information regarding the binding agent.
- the secondary tag comprises this sequence providing identifying information regarding the binding agent.
- 20 unique sequences may be used as secondary tags for providing identifying information for a library of 20 binding agents that bind to the 20 standard amino acids. Additional sequences may be used to identify modified amino acids (e.g ., post-translationally modified amino acids).
- 30 unique sequence may be used as secondary tags for a library of 30 binding agents that bind to the 20 standard amino acids and 10 post-translational modified amino acids (e.g., phosphorylated amino acids, acetylated amino acids, methylated amino acids).
- two or more different binding agents may be associated with the same sequence (secondary tags) providing identifying information regarding the binding agent.
- 20 unique sequences may be used as secondary tags for a library of 30 binding agents that bind to the 20 standard amino acids in an overlapping manner.
- the secondary tag sequence can be optimized for a particular sequencing analysis platform.
- the sequencing platform is nanopore sequencing.
- the sequencing platform has a per base error rate of > 1%, > 5%, > 10%, >15%, > 20%, > 25%, or > 30%.
- the barcode sequences e.g., sequences comprising information from the secondary tag
- the length of the barcode sequences is optimally designed to implement an appropriate level of error detection and/or correction.
- the adaptor molecule further comprises a spacer sequence at one end or both ends. In some embodiments, the adaptor molecule comprises a spacer at the 3’ terminus.
- a spacer sequence is about 1 base to about 20 bases, about 1 base to about 10 bases, about 5 bases to about 9 bases, or about 4 bases to about 8 bases. In some embodiments, a spacer is about 1 base, 2 bases, 3 bases, 4 bases, 5 bases, 6 bases, 7 bases, 8 bases, 9 bases, 10 bases, 11 bases, 12 bases, 13 bases, 14 bases, 15 bases or 20 bases in length.
- a spacer is shorter than the sequence providing identifying information comprised in the secondary tag, e.g., at least 1 base, 2, bases, 3 bases, 4 bases, 5 bases, 6, bases, 7 bases, 8 bases, 9 bases, 10 bases, 11 bases, 12 bases, 13 bases, 14 bases, 15 bases, 20 bases, or 25 bases shorter than the sequence providing identifying information comprised in the secondary tag.
- a spacer is the same length as the sequence providing identifying information comprised in the secondary tag.
- the spacer is specific to the adaptor molecule. In some cases, a spacer is designed such that a spacer from a previous binding cycle only interacts with a spacer from the appropriate adaptor molecule in a current binding cycle.
- a spacer sequence may be used as the primer annealing site for a primer extension reaction, or a splint or sticky end in a ligation reaction.
- a 5’ spacer on an adaptor molecule may optionally contain pseudo complementary bases to a 3’ spacer on the recording tag to increase TM (Lehoud et al., 2008, Nucleic Acids Res. 36:3409-3419).
- the adaptor molecules do not have a binding cycle specific spacer sequence. In some embodiments, the adaptor molecules do not have a spacer sequence.
- a set (e.g. a library) or plurality of adaptor molecules share a common spacer sequence used in an assay (e.g. the entire library of adaptor molecules used in a multiple binding cycle method possess a common spacer).
- the adaptor molecules are comprised of a binding cycle tags, identifying a particular binding cycle.
- the adaptor molecules within a library or set of adaptor molecules have a binding cycle specific spacer sequence.
- adaptor molecules comprises one binding cycle specific spacer sequence.
- adaptor molecules used in the first binding cycle comprise a “cycle 1” specific spacer sequence
- adaptor molecules used in the second binding cycle comprise a “cycle 2” specific spacer sequence
- so on up to “n” binding cycles a spacer sequence comprises a sufficient number of bases to anneal to a complementary spacer sequence in a recording tag or extended recording tag to initiate a primer extension reaction or sticky end ligation reaction.
- adaptor molecules used to bind in an alternating cycles comprises different binding cycle specific spacer sequences.
- adaptor molecules used in the first binding cycle comprise a “cycle 1” specific spacer sequence
- adaptor molecules used in the second binding cycle comprise a “cycle 2” specific spacer sequence
- adaptor molecules used in the third binding cycle also comprises the “cycle 1” specific spacer sequence
- adaptor molecules used in the fourth binding cycle comprises the “cycle 2” specific spacer sequence.
- the adaptor molecules may also be designed to contain palindromic sequences. Inclusion of a palindromic sequence into the adaptor molecule allows a nascent, growing, extended recording tag to fold upon itself as information is transferred from the secondary tag. The extended recording tag is folded into a more compact structure, effectively decreasing undesired inter-molecular binding and primer extension events.
- the adaptor molecules may include a terminator nucleotide incorporated at the 3’ end of the 3’ spacer sequence. After a binding agent binds to a polypeptide, an adaptor molecule hybridizes, and their corresponding hybridization sequences and recording tags anneal via complementary spacer sequences, it is possible for primer extension to transfer information from the secondary tag to the recording tag. Addition of a terminator nucleotide on the 3’ end of the secondary tag prevents transfer of recording tag information to the secondary tag.
- the adaptor molecule comprises from 5’ to 3’ direction the secondary tag and the first hybridization sequence, optionally with a linker in between said components. In some specific embodiments, the adaptor molecule comprises from 5’ to 3’ direction: the first hybridization sequence, the secondary tag, and the second hybridization sequence. In some specific embodiments, the adaptor molecule comprises from 5’ to 3’ direction: the first hybridization sequence, the secondary tag, and a spacer sequence. In some specific embodiments, the adaptor molecule comprises from 5’ to 3’ direction: the first hybridization sequence, a spacer or linker to stop extension, the secondary tag, and the second hybridization sequence. In some specific embodiments, the adaptor molecule comprises from 5’ to 3’ direction: the first hybridization sequence, a spacer or linker to stop extension, the secondary tag, and a spacer sequence.
- the method further includes adding a universal priming site from an adaptor molecule to the extended recording tag, prior to or during the last binding cycle.
- the universal reverse priming site is added to the recording tag (e.g extended recording tag) from an adaptor molecule.
- the adaptor molecules used in the final binding cycle may comprise a universal priming site. After transfer of the final secondary tag information to the extended recording tag, the tag can be capped by addition of a universal reverse priming site via ligation, primer extension or other methods known in the art.
- a capping reaction may be performed in any binding cycle to add a capping sequence (e.g ., a universal priming site) to recording tags that did not extend with information from a secondary tag in that cycle.
- a capping sequence e.g ., a universal priming site
- such a step is useful to indicate that an information transfer event did not happen in a particular binding cycle.
- the macromolecule for analysis may be labeled with a nucleic acid molecule or an oligonucleotide (e.g, DNA recording tag).
- a plurality of macromolecules in the sample is provided with recording tags.
- the recording tags may be associated or attached, directly or indirectly to the macromolecules using any suitable means.
- a macromolecule may be associated with one or more recording tags.
- the recording tag may be any suitable sequenceable moiety to which identifying information can be transferred (e.g, information from one or more secondary tags).
- the recording tags may be associated or attached, directly or indirectly to the macromolecules prior to contacting with a binding agent.
- At least one recording tag is associated or co-localized directly or indirectly with the macromolecule (e.g, polypeptide).
- a single recording tag is attached to a polypeptide, such as via the attachment to a N- or C-terminal amino acid.
- multiple recording tags are attached to the polypeptide, such as to the lysine residues or peptide backbone.
- a polypeptide labeled with multiple recording tags is fragmented or digested into smaller peptides, with each peptide labeled on average with one recording tag.
- a recording tag may comprise DNA, RNA, or polynucleotide analogs including
- a recording tag may be single stranded, or partially or completely double stranded.
- a recording tag may have a blunt end or overhanging end.
- all or a substantial amount of the macromolecules e.g., at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%
- a recording tag e.g., a subset of macromolecules within a sample are labeled with recording tags.
- a subset of macromolecules from a sample undergo targeted (analyte specific) labeling with recording tags.
- targeted recording tag labeling of proteins may be achieved using target protein-specific binding agents (e.g ., antibodies, aptamers, etc.).
- target protein-specific binding agents e.g ., antibodies, aptamers, etc.
- the recording tags are attached to the macromolecules prior to providing the sample on a support.
- the recording tags are attached to the macromolecules after providing the sample on the support.
- the recording tag may comprise other nucleic acid components.
- the recording tag may comprise a unique molecular identifier, a compartment tag, a partition barcode, sample barcode, a fraction barcode, a spacer sequence, a universal priming site, or any combination thereof.
- the recording tag may comprise a blocking group, such as at the 3’ -terminus of the recording tag. In some cases, the 3’-terminus of the recording tag is blocked to prevent extension of the recording tag by a polymerase.
- the recording tag can include a sample identifying barcode.
- a sample barcode is useful in the multiplexed analysis of a set of samples in a single reaction vessel or immobilized to a single solid substrate or collection of solid substrates (e.g., a planar slide, population of beads contained in a single tube or vessel, etc.).
- macromolecules from many different samples can be labeled with recording tags with sample-specific barcodes, and then all the samples pooled together prior to immobilization to a support, cyclic binding of the binding agent, and recording tag analysis.
- samples can be kept separate until after creation of a DNA- encoded library, and sample barcodes attached during PCR amplification of the DNA- encoded library, and then mixed together prior to sequencing.
- This approach could be useful when assaying analytes (e.g., proteins) of different abundance classes.
- a recording tag comprises an optional, unique molecular identifier (UMI), which provides a unique identifier tag for each macromolecules (e.g, polypeptide) to which the UMI is associated with.
- UMI can be about 3 to about 40 bases, about 3 to about 30 bases, about 3 to about 20 bases, or about 3 to about 10 bases, or about 3 to about 8 bases.
- a UMI is about 3 bases, 4 bases, 5 bases, 6 bases, 7 bases, 8 bases, 9 bases, 10 bases, 11 bases, 12 bases, 13 bases, 14 bases, 15 bases, 16 bases, 17 bases, 18 bases, 19 bases, 20 bases, 25 bases, 30 bases, 35 bases, or 40 bases in length.
- a UMI can be used to de-convolute sequencing data from a plurality of extended recording tags to identify sequence reads from individual macromolecules.
- each macromolecule is associated with a single recording tag, with each recording tag comprising a unique UMI.
- multiple copies of a recording tag are associated with a single macromolecule, with each copy of the recording tag comprising the same UMI.
- a UMI has a different base sequence than the spacer or secondary tags to facilitate distinguishing these components during sequence analysis.
- the UMI may provide function as a location identifier and also provide information in the macromolecule analysis assay. For example, the UMI may be used to identify molecules that are identical by descent, and therefore originated from the same initial molecule. In some aspects, this information can be used to correct for variations in amplification, and to detect and correct sequencing errors.
- the recording tag comprises a spacer polymer.
- a recording tag comprises a spacer at its terminus, e.g., 3’ end.
- reference to a spacer sequence in the context of a recording tag includes a spacer sequence that is identical to the spacer sequence associated with its cognate binding agent, or a spacer sequence that is complementary to the spacer sequence associated with its cognate binding agent.
- the terminal, e.g., 3’, spacer on the recording tag permits transfer of identifying information of a cognate binding agent from a secondary tag to the recording tag during the first binding cycle (e.g., via annealing of complementary spacer sequences for primer extension or sticky end ligation).
- the spacer sequence is about 1-20 bases in length, about 2-12 bases in length, or 5-10 bases in length. The length of the spacer may depend on factors such as the temperature and reaction conditions of the primer extension reaction for transferring secondary tag information to the recording tag.
- the recording tags associated with a library of polypeptides share a common spacer sequence.
- the recording tags associated with a library of polypeptides have binding cycle specific spacer sequences that are complementary to the binding cycle specific spacer sequences of adaptor molecules.
- the spacer sequence in the recording tag is designed to have minimal complementarity to other regions in the recording tag; likewise, the spacer sequence in the adaptor molecules should have minimal complementarity to other regions in the adaptor molecule.
- the spacer sequence of the recording tags and adaptor molecules should have minimal sequence complementarity to components such unique molecular identifiers, barcodes ( e.g ., compartment, partition, sample, spatial location), universal primer sequences, secondary tag sequences, cycle specific sequences, etc. present in the tags.
- a recording tag comprises a universal priming site, e.g., a forward or 5’ universal priming site.
- a universal priming site is a nucleic acid sequence that may be used for priming a library amplification reaction and/or for sequencing.
- a universal priming site may include, but is not limited to, a priming site for PCR amplification, flow cell adaptor sequences that anneal to complementary oligonucleotides on flow cell surfaces (e.g., Illumina next generation sequencing), a sequencing priming site, or a combination thereof.
- a universal priming site can be about 10 bases to about 60 bases.
- a universal priming site comprises an Illumina P5 primer (5’- AATGATACGGCGACCACCGA-3 ’ - SEQ ID NO: 1) or an Illumina P7 primer (5’- CAAGCAGAAGACGGCATACGAGAT - 3’ - SEQ ID NO:2).
- a recording tag comprises a compartment tag.
- the compartment tag is a component within a recording tag.
- the recording tag can also include a barcode which represents a compartment tag in which a compartment, such as a droplet, microwell, physical region on a support, etc. is assigned a unique barcode.
- the association of a compartment with a specific barcode can be achieved in any number of ways such as by encapsulating a single barcoded bead in a compartment, e.g, by direct merging or adding a barcoded droplet to a compartment, by directly printing or injecting a barcode reagents to a compartment, etc.
- the barcode reagents within a compartment are used to add compartment-specific barcodes to the macromolecule or fragments thereof within the compartment. Applied to protein partitioning into compartments, the barcodes can be used to map analyzed peptides back to their originating protein molecules in the compartment. This can greatly facilitate protein identification. Compartment barcodes can also be used to identify protein complexes. In other embodiments, multiple compartments that represent a subset of a population of compartments may be assigned a unique barcode representing the subset. In some embodiments, the recording tag comprises fraction barcode which contains identifying information for the macromolecules within a fraction.
- one or more of the tags further comprise a functional moiety capable of reacting with an internal amino acid, the peptide backbone, or N-terminal amino acid on the plurality of protein complexes, proteins, or polypeptides.
- the functional moiety is a click chemistry moiety, an aldehyde, an azide/alkyne, or a maleimide/thiol, or an epoxide/nucleophile, an inverse electron demand Diels- Alder (iEDDA) group, or a moiety for a Staudinger reaction.
- a plurality of compartment tags is formed by printing, spotting, ink-jetting the compartment tags into the compartment, or a combination thereof.
- the tag is attached to a polypeptide to link the tag to the macromolecule via a polypeptide-polypeptide linkage.
- the tag-attached polypeptide comprises a protein ligase recognition sequence.
- a peptide or polypeptide macromolecule can be immobilized to a support by an affinity capture reagent (and optionally covalently crosslinked), wherein the recording tag is associated with the affinity capture reagent directly, or alternatively, the macromolecule can be directly immobilized to the support with a recording tag.
- the macromolecule is attached to a bait nucleic acid which hybridizes to a capture nucleic acid and is ligated to a capture nucleic acid which comprises a reactive coupling moiety for attaching to the support.
- the bait or capture nucleic acid may serve as a recording tag to which information regarding the polypeptide can be transferred.
- the macromolecule is attached to a bait nucleic acid to form a nucleic acid-macromolecule chimera.
- the immobilization methods comprise bringing the nucleic acid-macromolecule chimera into proximity with a support by hybridizing the bait nucleic acid to a capture nucleic acid attached to the support, and covalently coupling the nucleic acid-macromolecule chimera to the solid support.
- the nucleic acid-macromolecule chimera is coupled indirectly to the solid support, such as via a linker.
- a plurality of the nucleic acid-macromolecule chimeras is coupled on the solid support and any adjacently coupled nucleic acid-macromolecule chimeras are spaced apart from each other at an average distance of about 50 nm or greater.
- the density or number of macromolecules provided with a recording tag is controlled or titrated.
- the desired spacing, density, and/or amount of recording tags in the sample may be titrated by providing a diluted or controlled number of recording tags.
- the desired spacing, density, and/or amount of recording tags may be achieved by spiking a competitor or “dummy” competitor molecule when providing, associating, and/or attaching the recording tags.
- the “dummy” competitor molecule reacts in the same way as a recording tag being associated or attached to a macromolecule in the sample but the competitor molecule does not function as a recording tag.
- a desired density is 1 functional recording tag per 1,000 available sites for attachment in the sample
- spiking in 1 functional recording tag for every 1,000 “dummy” competitor molecules is used to achieve the desired spacing.
- the ratio of functional recording tags is adjusted based on the reaction rate of the functional recording tags compared to the reaction rate of the competitor molecules.
- the labeling of the macromolecule with a recording tag is performed using standard amine coupling chemistries.
- the e-amino group e.g ., of lysine residues
- the N-terminal amino group may be susceptible to labeling with amine-reactive coupling agents, depending on the pH of the reaction (Mendoza et ah, Mass Spectrom Rev (2009) 28(5): 785-815).
- the recording tag comprises a reactive moiety (e.g., for conjugation to a solid surface, a multifunctional linker, or a macromolecule), a linker, a universal priming sequence, a barcode (e.g., compartment tag, partition barcode, sample barcode, fraction barcode, or any combination thereof), an optional UMI, and a spacer (Sp) sequence for facilitating information transfer.
- the protein can be first labeled with a universal DNA tag, and the barcode-Sp sequence (representing a sample, a compartment, a physical location on a slide, etc.) are attached to the protein later through and enzymatic or chemical coupling step.
- a universal DNA tag comprises a short sequence of nucleotides that are used to label a protein or polypeptide macromolecule and can be used as point of attachment for a barcode (e.g., compartment tag, recording tag, etc.).
- a recording tag may comprise at its terminus a sequence complementary to the universal DNA tag.
- a universal DNA tag is a universal priming sequence.
- the annealed universal DNA tag may be extended via primer extension, transferring the recording tag information to the DNA tagged protein.
- the protein is labeled with a universal DNA tag prior to proteinase digestion into peptides.
- the universal DNA tags on the labeled peptides from the digest can then be converted into an informative and effective recording tag.
- the recording tags may comprise a reactive moiety for a cognate reactive moiety present on the macromolecule, e.g ., protein, (e.g., click chemistry labeling, photoaffmity labeling).
- recording tags may comprise an azide moiety for interacting with alkyne-derivatized proteins, or recording tags may comprise a benzophenone for interacting with native proteins, etc.
- the DNA capture probe may be designed to contain uracil bases, which are then targeted for digestion with a uracil-specific excision reagent (e.g, USER TM ), and the target-protein specific binding agent may be dissociated from the target protein.
- a uracil-specific excision reagent e.g, USER TM
- the target-protein specific binding agent may be dissociated from the target protein.
- other types of linkages besides hybridization can be used to link the recording tag to a macromolecule.
- a suitable linker can be attached to various positions of the recording tag, such as the 3’ end, at an internal position, or within the linker attached to the 5’ end of the recording tag.
- the spacer sequence in the recording is designed to have minimal complementarity to other regions in the recording tag.
- the spacer sequence of the recording tags and adaptor molecules should have minimal sequence complementarity to components such unique molecular identifiers, barcodes (e.g, compartment, partition, sample, spatial location), universal primer sequences, secondary tag sequences, cycle specific sequences, etc. present in the recording tags, adaptor molecules, and/or coding tags.
- an extended recording tag comprises from 5’ to 3’ direction: a universal forward (or 5’) priming sequence, information transferred from one or more secondary tag(s), and a spacer sequence.
- an extended recording tag comprises from 5’ to 3’ direction: a universal forward (or 5’) priming sequence, information transferred from one or more secondary tag(s), optionally other barcodes (e.g, sample barcode, partition barcode, compartment barcode, or any combination thereof), a spacer sequence, and a universal reverse (or 3’) priming sequence.
- a recording tag comprises from 5’ to 3’ direction: a universal forward (or 5’) priming sequence, information transferred from one or more secondary tag(s), optionally other barcodes (e.g ., sample barcode, partition barcode, compartment barcode, or any combination thereof), an optional UMI, a spacer sequence, and a universal reverse (or 3’) priming sequence.
- a universal forward (or 5’) priming sequence information transferred from one or more secondary tag(s), optionally other barcodes (e.g ., sample barcode, partition barcode, compartment barcode, or any combination thereof), an optional UMI, a spacer sequence, and a universal reverse (or 3’) priming sequence.
- the methods described herein use a binding agent configured for interacting with the macromolecules to be analyzed (e.g., polypeptides, peptides, proteins).
- the assay can include contacting a plurality of binding agents to a plurality of macromolecules, or a plurality of targets.
- the present methods comprise contacting a single macromolecule with a single binding agent, contacting a plurality of macromolecules (a plurality of targets) with a single binding agent, or contacting a plurality of macromolecules with a plurality of binding agents.
- the plurality of binding agents includes a mixture of binding agents.
- a binding agent can be any molecule (e.g., peptide, polypeptide, protein, nucleic acid, carbohydrate, small molecule, and the like) capable of binding to a component or feature of a polypeptide.
- a binding agent can be a naturally occurring, synthetically produced, or recombinantly expressed molecule.
- the scaffold used to engineer a binding agent can be from any species, e.g., human, non-human, transgenic.
- a binding agent may bind to a portion of a target macromolecule or a motif.
- a binding agent may bind to a single monomer or subunit of a polypeptide (e.g., a single amino acid) or bind to multiple linked subunits of a polypeptide (e.g., dipeptide, tripeptide, or higher order peptide of a longer polypeptide molecule).
- a polypeptide e.g., a single amino acid
- multiple linked subunits of a polypeptide e.g., dipeptide, tripeptide, or higher order peptide of a longer polypeptide molecule.
- the binding agent comprises an antibody, an antigen-binding antibody fragment, a single-domain antibody (sdAb), a recombinant heavy-chain-only antibody (VHH), a single-chain antibody (scFv), a shark-derived variable domain (vNARs), a Fv, a Fab, a Fab', a F(ab')2, a linear antibody, a diabody, an aptamer, a peptide mimetic molecule, a fusion protein, a reactive or non-reactive small molecule, or a synthetic molecule.
- a binding agent may be designed to bind covalently.
- Covalent binding can be designed to be conditional or favored upon binding to the correct moiety.
- an target and its cognate binding agent may each be modified with a reactive group such that once the target-specific binding agent is bound to the target, a coupling reaction is carried out to create a covalent linkage between the two. Non-specific binding of the binding agent to other locations that lack the cognate reactive group would not result in covalent attachment.
- the target comprises a ligand that is capable of forming a covalent bond to a binding agent.
- the target comprises a ligand group that is capable of covalent binding to a binding agent.
- Covalent binding between a binding agent and its target may allow for more stringent washing to be used to remove binding agents that are non-specifically bound, thus increasing the specificity of the assay.
- the method includes a wash step after contacting the binding agent to the macromolecule to remove non-specifically bound binding agents.
- the stringency of the wash step may be tuned depending on the affinity of the binding agent to the target and/or the strength and stability of the complex formed.
- the binding agents are configured to provide specificity for binding of the binding agent to the macromolecule.
- a binding agent may be a selective binding agent.
- selective binding refers to the ability of the binding agent to preferentially bind to a specific ligand (e.g ., amino acid or class of amino acids) relative to binding to a different ligand (e.g., amino acid or class of amino acids). Selectivity is commonly referred to as the equilibrium constant for the reaction of displacement of one ligand by another ligand in a complex with a binding agent.
- selectivity is associated with the spatial geometry of the ligand and/or the manner and degree by which the ligand binds to a binding agent, such as by hydrogen bonding, hydrophobic binding, and Van der Waals forces (non-covalent interactions) or by reversible or non-reversible covalent attachment to the binding agent. It should also be understood that selectivity may be relative, and as opposed to absolute, and that different factors can affect the same, including ligand concentration.
- a binding agent selectively binds one of the twenty standard amino acids.
- a binding agent binds to an N-terminal amino acid residue, a C-terminal amino acid residue, or an internal amino acid residue.
- the binding agent is partially specific or selective. In some aspects, the binding agent preferentially binds one or more amino acids. In some examples, a binding agent may bind to or is capable of binding to two or more of the twenty standard amino acids. For example, a binding agent may preferentially bind the amino acids A, C, and G over other amino acids. In some other examples, the binding agent may selectively or specifically bind more than one amino acid. In some aspects, the binding agent may also have a preference for one or more amino acids at the second, third, fourth, fifth, etc. positions from the terminal amino acid. In some cases, the binding agent preferentially binds to a specific terminal amino acid and a penultimate amino acid.
- a binding agent may preferentially bind AA, AC, and AG or a binding agent may preferentially bind AA, CA, and GA.
- binding agents with different specificities can be associated with coding tags that hybridize to adaptor molecules comprising the same secondary tag.
- a binding agent may exhibit flexibility and variability in target binding preference in some or all of the positions of the targets.
- a binding agent may have a preference for one or more specific target terminal amino acids and have a flexible preference for a target at the penultimate position.
- a binding agent may have a preference for one or more specific target amino acids in the penultimate amino acid position and have a flexible preference for a target at the terminal amino acid position.
- a binding agent is selective for a target comprising a terminal amino acid and other components of a macromolecule. In some examples, a binding agent is selective for a target comprising a terminal amino acid and at least a portion of the peptide backbone. In some particular examples, a binding agent is selective for a target comprising a terminal amino acid and an amide peptide backbone. In some cases, the peptide backbone comprises a natural peptide backbone or a post-translational modification. In some embodiments, the binding agent exhibits allosteric binding.
- the method comprises contacting a mixture of binding agents with a mixture of macromolecules and selectivity need only be relative to the other binding agents to which the target is exposed. It should also be understood that selectivity of a binding agent need not be absolute to a specific molecule but could be to a portion of a molecule. In some examples, selectivity of a binding agent need not be absolute to a specific amino acid, but could be selective to a class of amino acids, such as amino acids with polar or non-polar side chains, or with electrically (positively or negatively) charged side chains, or with aromatic side chains, or some specific class or size of side chains, and the like.
- the ability of a binding agent to selectively bind a feature or component of a macromolecule is characterized by comparing binding abilities of binding agents.
- the binding ability of a binding agent to the target can be compared to the binding ability of a binding agent which binds to a different target, for example, comparing a binding agent selective for a class of amino acids to a binding agent selective for a different class of amino acids.
- a binding agent selective for non-polar side chains is compared to a binding agent selective for polar side chains.
- a binding agent selective for a feature, component of a peptide, or one or more amino acid exhibits at least IX, at least 2X, at least 5X, at least 10X, at least 50X, at least 100X, or at least 500X more binding compared to a binding agent selective for a different feature, component of a peptide, or one or more amino acid.
- the binding agent has a high affinity and high selectivity for the macromolecule, e.g., the polypeptide, of interest.
- a high binding affinity with a low off-rate may be efficacious for hybridization of the adaptor molecule to the coding tag.
- a binding agent has a Kd of about ⁇
- nM 500 nM, ⁇ 200 nM, ⁇ 100 nM, ⁇ 50 nM, ⁇ 10 nM, ⁇ 5 nM, ⁇ 1 nM, ⁇ 0.5 nM, or ⁇ 0.1 nM.
- the binding agent is added to the polypeptide at a concentration >1X, >5X, >10X, >100X, or >1000X its Kd to drive binding to completion.
- concentration >1X, >5X, >10X, >100X, or >1000X its Kd to drive binding to completion.
- a binding agent may bind to a terminal amino acid of a peptide, an intervening amino acid, dipeptide (sequence of two amino acids), tripeptide (sequence of three amino acids), or higher order peptide of a peptide molecule.
- each binding agent in a library of binding agents selectively binds to a particular amino acid, for example one of the twenty standard naturally occurring amino acids.
- the binding agent binds to an unmodified or native (e.g, natural) amino acid.
- the binding agent binds to an unmodified or native dipeptide (sequence of two amino acids), tripeptide (sequence of three amino acids), or higher order peptide of a peptide molecule.
- a binding agent may be engineered for high affinity for a native or unmodified N-terminal amino acid (NTAA), high specificity for a native or unmodified NTAA, or both.
- binding agents can be developed through directed evolution of promising affinity scaffolds using phage display.
- a binding agent may bind to a post-translational modification of an amino acid.
- a peptide comprises one or more post-translational modifications, which may be the same of different.
- the NTAA, CTAA, an intervening amino acid, or a combination thereof of a peptide may be post-translationally modified.
- Post-translational modifications to amino acids include acylation, acetylation, alkylation (including methylation), biotinylation, butyrylation, carbamylation, carbonylation, deamidation, deiminiation, diphthamide formation, disulfide bridge formation, eliminylation, flavin attachment, formylation, gamma-carboxylation, glutamyl ati on, glycylation, glycosylation, glypiation, heme C attachment, hydroxylation, hypusine formation, iodination, isoprenyl ati on, lipidation, lipoylation, malonylation, methylation, myristolylation, oxidation, palmitoylation, pegylation, phosphopantetheinylation, phosphorylation, prenylation, propionylation, retinylidene Schiff base formation, S- glutathionylation, S-nitrosylation, S-sulfenylation, selenation
- a lectin is used as a binding agent for detecting the glycosylation state of a protein, polypeptide, or peptide.
- Lectins are carbohydrate-binding proteins that can selectively recognize glycan epitopes of free carbohydrates or glycoproteins.
- a list of lectins recognizing various glycosylation states include: A, AAA, AAL, ABA, ACA, ACG, ACL, AOL, ASA, BanLec, BC2L-A, BC2LCN, BPA, BPL, Calsepa, CGL2, CNL, Con, ConA, DBA, Discoidin, DSA, EC A, EEL, F17AG, Gall, Gall- S, Gal2, Gal 3, Gal3C-S, Gal7-S, Gal9, GNA, GRFT, GS-I, GS-II, GSL-I, GSL-II, HHL, HIHA, HP A, I, II, Jacalin, LB A, LCA, LEA, LEL, Lentil, Lotus, LSL-N, LTL, MAA, MAH
- a binding agent may bind to a native or unmodified or unlabeled terminal amino acid. Moreover, in some cases, these natural amino acid binders don’t recognize N-terminal labels. Directed evolution of aaRS scaffolds can be used to generate higher affinity, higher specificity binding agents that recognized the N-terminal amino acids in the context of an N-terminal label. In another example, Havranak et al. (U.S. Patent Publication No. US 2014/0273004) describes engineering aminoacyl tRNA synthetases (aaRSs) as specific NTAA binders.
- aaRSs engineering aminoacyl tRNA synthetases
- the amino acid binding pocket of the aaRSs has an intrinsic ability to bind cognate amino acids, but generally exhibits poor binding affinity and specificity. Moreover, these natural amino acid binders don’t recognize N- terminal labels. Directed evolution of aaRS scaffolds can be used to generate higher affinity, higher specificity binding agents that recognized the N-terminal amino acids in the context of an N-terminal label.
- a binding agent may bind to a modified or labeled terminal amino acid (e.g ., an NTAA that has been functionalized or modified). In some embodiments, a binding agent may bind to a chemically or enzymatically modified terminal amino acid.
- a modified or labeled NTAA can be one that is functionalized with phenylisothiocyanate, PITC, l-fluoro-2, 4-dinitrobenzene (Sanger’s reagent, DNFB), benzyloxycarbonyl chloride or carbobenzoxy chloride (Cbz-Cl), N- (Benzyloxycarbonyloxy)succinimide (Cbz-OSu or Cbz-O-NHS), dansyl chloride (DNS-C1, or l-dimethylaminonaphthalene-5-sulfonyl chloride), 4-sulfonyl-2-nitrofluorobenzene (SNFB), N-Acetyl-Isatoic Anhydride, Isatoic Anhydride, 2-Pyridinecarboxaldehyde, 2- Formylphenylboronic acid, 2-Acetylphenylboronic acid, l-Fluoro-2, 4-dinitrobenzene
- the binding agent binds an amino acid labeled by contacting with a reagent or using a method as described in International Patent Publication No. WO 2019/089846. In some cases, the binding agent binds an amino acid labeled by an amine modifying reagent.
- a binding agent may bind to an N-terminal peptide, a C-terminal peptide, or an intervening peptide of a peptide, polypeptide, or protein molecule.
- a binding agent may bind to an N-terminal amino acid, C-terminal amino acid, or an intervening amino acid of a peptide molecule.
- a binding agent may bind to an N-terminal or C-terminal diamino acid moiety.
- An N-terminal diamino acid is comprised of the N-terminal amino acid and the penultimate N-terminal amino acid.
- a C-terminal diamino acid is similarly defined for the C-terminus.
- the binding agent binds to a chemically modified N- terminal amino acid residue or a chemically modified C-terminal amino acid residue.
- the NTAA may be modified with an “immunogenic” hapten, such as dinitrophenol (DNP).
- DNP dinitrophenol
- NTAA nM 1 + nM 2 + nM 2 + nM 3 + nM 3 + nM 3 + nM 4 + nM 3 + nM 4 + nM 4 + nM 4 + nM 6 + nM 6 + nM 6 + nM 6 + nM 6 + nM 6 + nM 6 + nM 6 + nM 6 + nM 6 + nM 6 Y-DNP-2-N-9367
- SNP sulfonyl nitrophenol
- SNFB 4-sulfonyl-2-nitrofluorobenzene
- Similar affinity enhancements may also be achieved with alternative NTAA modifiers, such as an acetyl group or an amidinyl (guanidinyl) group.
- alternative NTAA modifiers such as an acetyl group or an amidinyl (guanidinyl) group.
- a binding agent can be an aptamer (e.g ., peptide aptamer,
- DNA aptamer, or RNA aptamer a peptoid, an antibody or a specific binding fragment thereof, an amino acid binding protein or enzyme, an antibody binding fragment, an antibody mimetic, a peptide, a peptidomimetic, a protein, or a polynucleotide (e.g., DNA, RNA, peptide nucleic acid (PNA), a gPNA, bridged nucleic acid (BNA), xeno nucleic acid (XNA), glycerol nucleic acid (GNA), or threose nucleic acid (TNA), or a variant thereof).
- PNA peptide nucleic acid
- BNA bridged nucleic acid
- XNA xeno nucleic acid
- GNA glycerol nucleic acid
- TAA threose nucleic acid
- antibody and antibodies are used in a broad sense, to include not only intact antibody molecules, for example but not limited to immunoglobulin A, immunoglobulin G, immunoglobulin D, immunoglobulin E, and immunoglobulin M, but also any immunoreactive component(s) of an antibody molecule or portion thereof that immuno-specifically bind to at least one epitope.
- An antibody may be naturally occurring, synthetically produced, or recombinantly expressed.
- An antibody may be a fusion protein.
- An antibody may be an antibody mimetic.
- antibodies include but are not limited to, Fab fragments, Fab' fragments, F(ab') fragments, single chain antibody fragments (scFv), miniantibodies, nanobodies, diabodies, crosslinked antibody fragments, AffibodyTM, nanobodies, single domain antibodies, DVD-Ig molecules, alphabodies, affimers, affitins, cyclotides, molecules, and the like.
- Immunoreactive products derived using antibody engineering or protein engineering techniques are also expressly within the meaning of the term antibodies. Detailed descriptions of antibody and/or protein engineering, including relevant protocols, can be found in, among other places, J. Maynard and G. Georgiou, 2000, Ann. Rev. Biomed. Eng.
- nucleic acid and peptide aptamers that specifically recognize a macromolecule can be produced using known methods.
- Aptamers bind target molecules in a highly specific, conformation-dependent manner, typically with very high affinity, although aptamers with lower binding affinity can be selected if desired.
- Aptamers have been shown to distinguish between targets based on very small structural differences such as the presence or absence of a methyl or hydroxyl group and certain aptamers can distinguish between D- and L-enantiomers.
- Aptamers have been obtained that bind small molecular targets, including drugs, metal ions, and organic dyes, peptides, biotin, and proteins, including but not limited to streptavidin, VEGF, and viral proteins. Aptamers have been shown to retain functional activity after biotinylation, fluorescein labeling, and when attached to glass surfaces and microspheres (see, e.g. , Jayasena, 1999, Clin Chem 45:1628-50; Kusser2000, J. Biotechnol. 74: 27-39; Colas, 2000, Curr Opin Chem Biol 4:54-9).
- RNA aptamers which specifically bind arginine and AMP have been described as well (see, Patel and Suri, 2000, J. Biotech. 74:39-60). Oligonucleotide aptamers that bind to a specific amino acid have been disclosed in Gold et al. (1995, Ann. Rev. Biochem. 64:763-97). RNA aptamers that bind amino acids have also been described (Ames and Breaker, 2011, RNA Biol. 8; 82-89; Mannironi et al., 2000, RNA 6:520-27; Famulok, 1994, J. Am. Chem. Soc. 116:1698-1706).
- a binding agent can be made by modifying naturally-occurring or synthetically- produced proteins by genetic engineering to introduce one or more mutations in the amino acid sequence to produce engineered proteins that bind to a specific component or feature of a polypeptide (e.g., NTAA, CTAA, or post-translationally modified amino acid or a peptide).
- a polypeptide e.g., NTAA, CTAA, or post-translationally modified amino acid or a peptide.
- exopeptidases e.g., aminopeptidases, carboxypeptidases, dipeptidyl peptidase, dipeptidyl aminopeptidase
- exoproteases e.g., aminopeptidases, carboxypeptidases, dipeptidyl peptidase, dipeptidyl aminopeptidase
- exoproteases e.g., mutated exoproteases, mutated anticalins, mutated ClpSs, antibodies, or tRNA synthetases
- carboxypeptidases can be modified to create a binding agent that selectively binds to a particular CTAA.
- a binding agent can also be designed or modified, and utilized, to specifically bind a modified NTAA or modified CTAA, for example one that has a post- translational modification (e.g ., phosphorylated NTAA or phosphorylated CTAA) or one that has been modified with a label (e.g., PTC, l-fluoro-2, 4-dinitrobenzene (using Sanger’s reagent, DNFB), dansyl chloride (using DNS-C1, or l-dimethylaminonaphthalene-5-sulfonyl chloride), or using a thioacylation reagent, a thioacetylation reagent, an acetylation reagent, an amidination (guanidinylation) reagent, or a thiobenzylation reagent).
- a post- translational modification e.g ., phosphorylated NTAA or phosphorylated CTAA
- a label e.g., PTC,
- a binding agent that selectively binds to a labeled or functionalized NTAA can be utilized.
- the NTAA may be reacted with phenylisothiocyanate (PITC) to form a phenylthiocarbamoyl-NTAA derivative.
- PITC phenylisothiocyanate
- the binding agent may be fashioned to selectively bind both the phenyl group of the phenylthiocarbamoyl moiety as well as the alpha-carbon R group of the NTAA.
- Use of PITC in this manner allows for subsequent elimination of the NTAA by Edman degradation as discussed below.
- the NTAA may be reacted with Sanger’s reagent (DNFB), to generate a DNP-labeled NTAA.
- DNFB Sanger’s reagent
- an ionic liquid such as 1 -ethyl-3 -methylimidazolium bis[(trifluoromethyl)sulfonyl]imide ([emim][Tf2N]), in which DNFB is highly soluble.
- the binding agent may be engineered to selectively bind the combination of the DNP and the R group on the NTAA.
- the addition of the DNP moiety provides a larger “handle” for the interaction of the binding agent with the NTAA, and should lead to a higher affinity interaction.
- a binding agent may be a modified aminopeptidase.
- the binding agent may be a modified aminopeptidase that has been engineered to recognize the DNP-labeled NTAA providing cyclic control of aminopeptidase degradation of the peptide.
- the aminopeptidase is a monomeric metallo-protease, such an aminopeptidase activated by zinc (Calcagno et al., Appl Microbiol Biotechnol. (2016) 100(16):7091-7102).
- a binding agent may selectively bind to an NTAA that is modified with sulfonyl nitrophenol (SNP), e.g., by using 4-sulfonyl-2- nitrofluorobenzene (SNFB).
- SNP sulfonyl nitrophenol
- SNFB 4-sulfonyl-2- nitrofluorobenzene
- Other reagents that may be used to functionalize the NTAA include trifluoroethyl isothiocyanate, allyl isothiocyanate, and dimethylaminoazobenzene isothiocyanate, or a reagent as described in International Patent Publication No. WO 2019/089846.
- the binding agent binds to an unmodified or native amino acid. In some examples, the binding agent binds to an unmodified or native dipeptide (sequence of two amino acids), tripeptide (sequence of three amino acids), or higher order peptide of a peptide molecule.
- a binding agent may be engineered for high affinity for a modified NTAA, high specificity for a modified NTAA, or both. In some embodiments, binding agents can be developed through directed evolution of promising affinity scaffolds using phage display.
- the binding moiety of the binding agent comprises a member of the evolutionarily conserved ClpS family of adaptor proteins involved in natural N-terminal protein recognition and binding or a variant thereof.
- the amino acid residues corresponding to the ClpS hydrophobic binding pocket identified in Schuenemann et al. are modified in order to generate a binding moiety with the desired selectivity.
- the binding moiety comprises a member of the UBR box recognition sequence family, or a variant of the UBR box recognition sequence family.
- UBR recognition boxes are described in Tasaki et al., (2009), JBC 284(3): 1884-95.
- the binding moiety may comprise UBR1, UBR2, or a mutant, variant, or homologue thereof.
- the binding agent further comprises one or more detectable labels such as fluorescent labels, in addition to the binding moiety.
- the binding agent does not comprise a polynucleotide such as a coding tag.
- the binding agent comprises a synthetic or natural antibody.
- the binding agent comprises an aptamer.
- the binding agent comprises a polypeptide, such as a modified member of the ClpS family of adaptor proteins, such as a variant of an E. coli ClpS binding polypeptide, and a detectable label.
- the detectable label is optically detectable.
- the detectable label comprises a fluorescently moiety, a color-coded nanoparticle, a quantum dot or any combination thereof.
- the label comprises a polystyrene dye encompassing a core dye molecule such as a FluoSphereTM, Nile Red, fluorescein, rhodamine, derivatized rhodamine dyes, such as TAMRA, phosphor, polymethadine dye, fluorescent phosphoramidite, TEXAS RED, green fluorescent protein, acridine, cyanine, cyanine 5 dye, cyanine 3 dye, 5-(2'-aminoethyl)-aminonaphthalene-l -sulfonic acid (EDANS), BODIPY, 120 ALEXA or a derivative or modification of any of the foregoing.
- the detectable label is resistant to photobleaching while producing lots of signal (such as photons) at a unique and easily detectable wavelength, with high signal-to-noi
- the binding agents are fluorescently labeled with a fluorophore such that when a binding agent binds to a macromolecule, fluorescence emitted by the fluorophore can be detected by an appropriate detector.
- the fluorophores can be coupled to free amine groups (e.g., lysine side chains) of the binding agents.
- binding agents are nucleic acid aptamers
- the fluorophores can be coupled to nucleotides by methods known in the art such that the fluorophores would not disrupt 3D structure of an aptamer. Fluorescence can suitably be detected by detectors known in the art.
- single molecule detection of bound fluorescently labeled binding agent can be done by total internal reflection fluorescence (TIRF) microscopy, as shown in US 9435810 B2, incorporated herein. Based on the fluorescence detected from a specific binding agent, information about the macromolecule can be collected and/or recorded, and, sometimes, a portion of the macromolecule can be identified (for example, the N-terminal amino acid of the polypeptide).
- TIRF total internal reflection fluorescence
- the contacting step comprises contacting a macromolecule with more than one binding agents (a mixture of two or more binding agents) capable of binding to the macromolecule, each of the binding agents is suitably labeled with different fluorophores having different fluorescence emission spectra.
- anticalins are engineered for both high affinity and high specificity to labeled NTAAs (e.g . PTC, modified-PTC, Cbz, DNP, SNP, acetyl, guanidinyl, amino guanidinyl, heterocyclic methanimine, etc.).
- Certain varieties of anticalin scaffolds have suitable shape for binding single amino acids, by virtue of their beta barrel structure.
- An N-terminal amino acid (either with or without modification) can potentially fit and be recognized in this “beta barrel” bucket.
- High affinity anticalins with engineered novel binding activities have been described (reviewed by Skerra, 2008, FEBS J. 275: 2677- 2683).
- anticalins with high affinity binding (low nM) to fluorescein and digoxygenin have been engineered (Gebauer et ah, 2012, Methods Enzymol 503: 157-188.).
- Engineering of alternative scaffolds for new binding functions has also been reviewed by Banta et al. (2013, Annu. Rev. Biomed. Eng. 15:93-113).
- the functional affinity (avidity) of a given monovalent binding agent may be increased by at least an order of magnitude by using a bivalent or higher order multimer of the monovalent binding agent (Vauquelin et al., 2013, Br J Pharmacol 168(8): 1771-1785. 2013).
- Avidity refers to the accumulated strength of multiple, simultaneous, non-covalent binding interactions. An individual binding interaction may be easily dissociated. However, when multiple binding interactions are present at the same time, transient dissociation of a single binding interaction does not allow the binding protein to diffuse away and the binding interaction is likely to be restored.
- An alternative method for increasing avidity of a binding agent is to include complementary sequences in the coding tag attached to the binding agent and the recording tag associated with the polypeptide.
- the binding agent is linked, directly or indirectly, to a multimerization domain.
- monomeric, dimeric, and higher order (e.g., 3, 4, 5, or more) multimeric polypeptides comprising one or more binding agents are provided herein.
- the binding agent is dimeric.
- two polypeptides of the invention can be covalently or non-covalently attached to each other to form a dimer.
- the binding agent is derived from a biological, naturally occurring, non-naturally occurring, or synthetic source.
- the binding agent is derived from de novo protein design (Huang et al., (2016) 537(7620):320-327).
- the binding agent has a structure, sequence, and/or activity designed from first principles.
- a binding agent can be utilized that selectively binds a modified C-terminal amino acid (CTAA).
- CAA C-terminal amino acid
- Carboxypeptidases are proteases that cleave/eliminate terminal amino acids containing a free carboxyl group.
- a number of carboxypeptidases exhibit amino acid preferences, e.g., carboxypeptidase B preferentially cleaves at basic amino acids, such as arginine and lysine.
- a carboxypeptidase can be modified to create a binding agent that selectively binds to particular amino acid.
- the carboxypeptidase may be engineered to selectively bind both the modification moiety as well as the alpha-carbon R group of the CTAA.
- engineered carboxypeptidases may specifically recognize 20 different CTAAs representing the standard amino acids in the context of a C-terminal label. Control of the stepwise degradation from the C-terminus of the peptide is achieved by using engineered carboxypeptidases that are only active (e.g., binding activity or catalytic activity) in the presence of the label.
- the CTAA may be modified by a para-Nitroanilide or 7-amino-4- methylcoumarinyl group.
- Other potential scaffolds that can be engineered to generate binding agents for use in the methods described herein include: an anticalin, a lipocalin, an amino acid tRNA synthetase (aaRS), ClpS, an Affilin , an AdnectinTM, a T cell receptor, a zinc finger protein, a thioredoxin, GST Al-1, DARPin, an affimer, an affitin, an alphabody, an avimer, a monobody, an antibody, a single domain antibody, a nanobody, EETI-II, HPSTI, intrabody, PHD-fmger, V(NAR) LDTI, evibody, Ig(NAR), knottin, maxibody, microbody, neocarzinostatin, pVIII, tendamistat, VLR, protein A scaffold, MTI-II, ecotin, GCN4, Im9, kunitz domain, PBP, trans-body, tetranect
- Ldl receptor domain A Min-23, PDZ-domain, avian pancreatic polypeptide, charybdotoxin/10Fn3, domain antibody (Dab), a2p8 ankyrin repeat, insect defensing A peptide, Designed AR protein, C-type lectin domain, staphylococcal nuclease, Src homology domain 3 (SH3), or Src homology domain 2 (SH2). See e.g., El-Gebali et al., (2019)
- a binding agent is derived from an enzyme which binds one or more amino acids (e.g ., an aminopeptidase).
- a binding agent can be derived from an anticalin or a Clp protease adaptor protein (ClpS).
- a binding agent may preferably bind to a modified or labeled amino acid, by chemical or enzymatic means, (e.g., an amino acid that has been functionalized by a reagent (e.g., a compound)) over a non-modified or unlabeled amino acid.
- a binding agent may preferably bind to an amino acid that has been functionalized with an acetyl moiety, Cbz moiety, guanyl moiety, dansyl moiety, PTC moiety, DNP moiety, SNP moiety, diheterocyclic methanimine moiety, etc., over an amino acid that does not possess said moiety.
- a binding agent may preferably bind to an amino acid that has been functionalized or modified as described in International Patent Publication No. WO 2019/089846. In some cases, a binding agent may bind to a post-translationally modified amino acid.
- an extended nucleic acid comprises coding tag information relating to amino acid sequence and post-translational modifications of the polypeptide.
- a peptide is contacted with binding agents for PTM modifications, and associated coding tag information are transferred to the recording tag associated with the immobilized peptide. Once the detection and transfer of coding tag information relating to amino acid modifications is complete, the PTM modifying groups can be removed before detection and transfer of coding tag information for the primary amino acid sequence using N-terminal or C-terminal degradation methods.
- the N-terminal amino acid (or labeled N-terminal amino acid, e.g, PITC-NTAA, Cbz-NTAA, DNP-NTAA, SNP-NTAA, acetyl-NTAA, guanidinylated- NTAA, heterocyclic methanimine-NTAA) of each immobilized peptide is bound by a cognate NTAA binding agent which is attached to a coding tag.
- detection of internal post-translationally modified amino acids is be accomplished prior to detection and elimination of terminal amino acids (e.g., NTAA or CTAA).
- a peptide is contacted with binding agents for PTM modifications, and information from a corresponding secondary tag is transferred to the recording tag associated with the immobilized peptide.
- the PTM modifying groups can be removed before detection and transfer of secondary tag information for the primary amino acid sequence using N-terminal or C- terminal degradation methods.
- resulting extended nucleic acids indicate the presence of post-translational modifications in a peptide sequence, though not the sequential order, along with primary amino acid sequence information.
- detection of internal post-translationally modified amino acids may occur concurrently with detection of primary amino acid sequence.
- an NTAA or CTAA
- a binding agent specific for a post- translationally modified amino acid either alone or as part of a library of binding agents (e.g ., library composed of binding agents for the 20 standard amino acids and selected post- translational modified amino acids).
- a binding agent or library of binding agents
- a macromolecule e.g., a polypeptide
- a non-cognate binding agent is referring to a binding agent that is selective for a different target (e.g. polypeptide feature or component) than the particular target being considered.
- the binding agent selective for phenylalanine would be first binding agent capable of selectively binding to the n- NTAA (i.e., phenylalanine), while the other two binding agents would be non-cognate binding agents for that peptide (since they are selective for NTAAs other than phenylalanine).
- the tyrosine and asparagine binding agents may, however, be cognate binding agents for other peptides in the sample.
- the binding agent selective for tyrosine would be second binding agent capable of selectively binding to the n- 1 NTAA (i.e., tyrosine), while the other two binding agents would be non-cognate binding agents (since they are selective for NTAAs other than tyrosine).
- an agent is a binding agent or a non cognate binding agent will depend on the nature of the particular polypeptide feature or component currently available for binding. Also, if multiple polypeptides are analyzed in a multiplexed reaction, a binding agent for one polypeptide may be a non-cognate binding agent for another, and vice versa. According, it should be understood that the following description concerning binding agents is applicable to any type of binding agent described herein (i.e., both cognate and non-cognate binding agents).
- the concentration of the binding agents in a solution is controlled to reduce background and/or false positive results of the assay.
- the concentration of a binding agent can be at any suitable concentration, e.g ., at about 0.0001 nM, about 0.001 nM, about 0.01 nM, about 0.1 nM, about 1 nM, about 2 nM, about 5 nM, about 10 nM, about 20 nM, about 50 nM, about 100 nM, about 200 nM, about 500 nM, or about 1,000 nM.
- the concentration of a soluble conjugate used in the assay is between about 0.0001 nM and about 0.001 nM, between about 0.001 nM and about 0.01 nM, between about 0.01 nM and about 0.1 nM, between about 0.1 nM and about 1 nM, between about 1 nM and about 2 nM, between about 2 nM and about 5 nM, between about 5 nM and about 10 nM, between about 10 nM and about 20 nM, between about 20 nM and about 50 nM, between about 50 nM and about 100 nM, between about 100 nM and about 200 nM, between about 200 nM and about 500 nM, between about 500 nM and about 1000 nM, or more than about 1,000 nM.
- the coding tag associated with the binding agent is or comprises a polynucleotide with any suitable length, e.g ., a nucleic acid molecule of about 2 bases to about 100 bases, including any integer including 2 and 100 and in between, that comprises identifying information for its associated binding agent.
- a coding tag comprises a sequence for hybridizing to a first hybridization sequence of an adaptor molecule.
- a “coding tag” may also be made from a “sequenceable polymer” (see, e.g., Niu et al., 2013, Nat. Chem. 5:282- 292; Roy et al., 2015, Nat. Commun.
- a coding tag may comprise an encoder sequence or a sequence with identifying information.
- the coding tag or portion thereof may comprise a sequence substantially complementary or complementary to a first hybridization sequence of an adaptor molecule, thus the coding tag can be associated with the secondary tag which contains identifying information regarding the binding agent.
- the first hybridization sequence on an adaptor molecule comprises a substantially complementary or complementary sequence configured to hybridize to the coding tag (or portion thereof) directly attached to a binding agent.
- the coding tag is used in the provided methods to hybridize to said first hybridization on the adaptor molecule and localize the appropriate adaptor molecule for information transfer.
- a coding tag may be a single stranded molecule, a double stranded molecule, or a partially double stranded.
- a coding tag may comprise blunt ends, overhanging ends, or one of each.
- a coding tag is partially double stranded.
- the coding tag may comprise a hairpin.
- the hairpin comprises mutually complementary nucleic acid regions are connected through a nucleic acid strand.
- the nucleic acid hairpin can also further comprise 3' and/or 5' single-stranded region(s) extending from the double-stranded stem segment.
- the hairpin comprises a single strand of nucleic acid.
- a binding agent described comprises a coding tag containing identifying information regarding (e.g, representing or correlating to) the binding agent.
- the identifying information from the secondary tag corresponds to the coding tag and comprises information regarding the identity of the target bound by the binding agent.
- the identifying information from the coding tag comprises or is associated with information regarding the identity of the one or more amino acid(s) on the peptide bound by the binding agent.
- a coding tag is a nucleic acid molecule of about 3 bases to about 100 bases that provides unique identifying information for its associated binding agent.
- a coding tag may comprise about 3 to about 90 bases, about 3 to about 80 bases, about 3 to about 70 bases, about 3 to about 60 bases, about 3 bases to about 50 bases, about 3 bases to about 40 bases, about 3 bases to about 30 bases, about 3 bases to about 20 bases, about 3 bases to about 10 bases, or about 3 bases to about 8 bases.
- a coding tag is about 3 bases, 4 bases, 5 bases, 6 bases, 7 bases, 8 bases, 9 bases, 10 bases, 11 bases, 12 bases, 13 bases, 14 bases, 15 bases, 16 bases, 17 bases, 18 bases, 19 bases, 20 bases, 25 bases, 30 bases, 35 bases, 40 bases, 55 bases, 60 bases, 65 bases, 70 bases, 75 bases, 80 bases, 85 bases, 90 bases, 95 bases, or 100 bases in length.
- a coding tag may be composed of DNA, RNA, polynucleotide analogs, or a combination thereof.
- Polynucleotide analogs include PNA, gPNA, BNA, GNA, TNA, LNA, morpholino polynucleotides, 2’-0-Methyl polynucleotides, alkyl ribosyl substituted polynucleotides, phosphorothioate polynucleotides, and 7-deaza purine analogs.
- a coding tag can be joined to a binding agent directly or indirectly, by any means known in the art, including covalent and non-covalent interactions.
- a coding tag may be joined to binding agent enzymatically or chemically.
- a coding tag may be joined to a binding agent via ligation.
- a coding tag is joined to a binding agent via affinity binding pairs (e.g ., biotin and streptavidin).
- a coding tag may be joined to a binding agent to an unnatural amino acid, such as via a covalent interaction with an unnatural amino acid.
- a binding agent is joined to a coding tag via SpyCatcher-
- the SpyTag peptide forms an irreversible covalent bond to the SpyCatcher protein via a spontaneous isopeptide linkage, thereby offering a genetically encoded way to create peptide interactions that resist force and harsh conditions (Zakeri et ak, 2012, Proc. Natl. Acad. Sci. 109:E690-697; Li et ak, 2014, J. Mol. Biol. 426:309- 317).
- a binding agent may be expressed as a fusion protein comprising the SpyCatcher protein.
- the SpyCatcher protein is appended on the N-terminus or C- terminus of the binding agent.
- the SpyTag peptide can be coupled to the coding tag using standard conjugation chemistries (Hermanson, Bioconjugate Techniques, (2013) Academic Press).
- an enzyme-based strategy is used to join the binding agent to a coding tag.
- the binding agent may be joined to a coding tag using a formylglycine (FGly)-generating enzyme (FGE).
- FGE formylglycine
- FGly formylglycine
- a protein e.g., SpyLigase, is used to join the binding agent to the coding tag (Fierer et al., Proc Natl Acad Sci U S A. 2014; 111(13): El 176-El 181).
- a binding agent is joined to a coding tag via SnoopTag-
- the SnoopTag peptide forms an isopeptide bond with the SnoopCatcher protein (Veggiani et al., Proc. Natl. Acad. Sci. USA, 2016, 113:1202- 1207).
- a binding agent may be expressed as a fusion protein comprising the SnoopCatcher protein.
- the SnoopCatcher protein is appended on the N-terminus or C-terminus of the binding agent.
- the SnoopTag peptide can be coupled to the coding tag using standard conjugation chemistries.
- a binding agent is joined to a coding tag via the
- HaloTag® protein fusion tag and its chemical ligand is a modified haloalkane dehalogenase designed to covalently bind to synthetic ligands (HaloTag ligands) (Los et al., 2008, ACS Chem. Biol. 3:373-382).
- the synthetic ligands comprise a chloroalkane linker attached to a variety of useful molecules. A covalent bond forms between the HaloTag and the chloroalkane linker that is highly specific, occurs rapidly under physiological conditions, and is essentially irreversible.
- a binding agent is joined to a coding tag by attaching
- conjugating using an enzyme, such as sortase-mediated labeling (See e.g., Antos et al.,
- the sortase enzyme catalyzes a transpeptidation reaction ( See e.g, Falck et al, Antibodies (2016) 7(4): 1-19).
- the binding agent is modified with or attached to one or more N-terminal or C-terminal glycine residues.
- a binding agent is joined to a coding tag using a cysteine bioconjugation method.
- a binding agent is joined to a coding tag using p-clamp-mediated cysteine bioconjugation ( See e.g., Zhang et al., Nat Chem. (2016) 8(2): 120-128).
- a binding agent is joined to a coding tag using 3- arylpropiolonitriles (APN)-mediated tagging (e.g. Koniev et al., Bioconjug Chem. 2014; 25(2):202-206).
- APN arylpropiolonitriles
- the provided methods for analysis of macromolecules may include additional steps, treatments, and reactions.
- the macromolecule analysis assay is a next generation protein assay (NGPA) using multiple binding agents and enzymatically-mediated sequential information transfer.
- the analysis assay is performed on immobilized target molecules bound a cognate binding agent (e.g, antibody) and forming a stable complex, then hybridizing an adaptor molecule comprising a sequence that is complementary to the coding tag and a secondary tag, and transferring information from the secondary tags of bound antibodies to the recording tag associated with the macromolecule.
- NGPA next generation protein assay
- the analysis assay is performed on immobilized macromolecules bound by two or more cognate binding agents (e.g, antibodies). After a cognate antibody binding event, an adaptor molecule hybridizes, and a combined primer extension and DNA nicking step is used to transfer information from the secondary tags of the adaptor molecules to the recording tag.
- cognate binding agents e.g, antibodies
- an adaptor molecule hybridizes, and a combined primer extension and DNA nicking step is used to transfer information from the secondary tags of the adaptor molecules to the recording tag.
- polyclonal antibodies (or mixed population of monoclonal antibody) to multivalent epitopes on a protein can be used for the assay. See e.g., International Patent Publication No. WO 2017/192633.
- the macromolecule is a polypeptide and a polypeptide analysis assay is performed.
- the sequence (or a portion of the sequence thereof) and/or the identity of a target protein is determined using a polypeptide analysis assay.
- the polypeptide analysis assay includes assessing at least a partial sequence or identity of the polypeptide using suitable techniques or procedures. For example, at least a partial sequence of the polypeptide can be assessed by /V-terminal amino acid analysis or C-terminal amino acid analysis. In some embodiments, at least a partial sequence of the polypeptide can be assessed using a ProteoCode assay.
- At least a partial sequence of the polypeptide can be assessed by the techniques or procedures disclosed and/or claimed in U.S. Provisional Patent Application Nos. 62/330,841, 62/339,071, 62/376,886, 62/579,844, 62/582,312, 62/583,448, 62/579,870, 62/579,840, and 62/582,916, and International Patent Publication Nos. WO 2017/192633, and WO/2019/089836, and WO 2019/089851.
- the polypeptide analysis assay includes performing an assay which utilizes the recording tag associated with the macromolecule, e.g ., the polypeptide.
- the recording tag is used to record information gathered from one or more binding events between a binding agent and the macromolecule to be analyzed.
- a method for analyzing a macromolecule comprising the steps of: (a) providing a macromolecule and an associated recording tag joined to a support; (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, to allow binding between the macromolecule and the binding agent; (c) providing an adaptor molecule comprising a first hybridization sequence substantially complementary or complementary to at least a portion of the coding tag, and a secondary tag, to allow hybridization between the adaptor molecule (or the first hybridization sequence) and the coding tag (or the portion of the coding tag); (d) transferring the information of the secondary tag to the recording tag to generate an extended recording tag; and analyzing the extended recording tag.
- the binding agent is removed after step (d).
- the method further includes adding a universal priming site to the extended recording tag, prior to analyzing the extended recording tag.
- step (a) is performed before steps (b), (c), and (d).
- step (b) is performed before step (c) and step (d).
- step (c) is performed before step (d).
- the steps are performed in the order: (a), (b), (c), and (d), optionally repeating steps (b), (c), and (d) one or more times.
- the method is performed with one or more macromolecules and the method further includes step (e) removing the terminal amino acid (e.g, N-terminal amino acid (NTAA)) of the polypeptide, protein or peptide to expose a new terminal amino acid of the polypeptide, protein or peptide.
- a cycle of steps (b), (c), (d) and (e) is repeated one or more times prior to analyzing the extended recording tag.
- the steps are performed in the order: (a), (b), (c), (d) and (e), optionally repeating steps (b), (c), (d) and (e) one or more times.
- the method includes treating the target polypeptide, protein or peptide with a reagent for modifying a terminal amino acid of the polypeptide, protein or peptide.
- the reagent for modifying a terminal amino acid of a polypeptide comprises a chemical agent or an enzymatic agent.
- the target polypeptide, protein or peptide is contacted with the reagent for modifying a terminal amino acid before step (b).
- the target polypeptide, protein or peptide is contacted with the reagent for modifying a terminal amino acid before removing the terminal amino acid.
- the method further includes removing the binding agent after transferring information from the secondary tag to the recording tag.
- the binding agent is removed after step (d).
- the binding agent is removed before step (e).
- removing the binding agent is performed after transferring information from the secondary tag of the adaptor molecule to the recording tag associated with the target.
- the provided methods for analysis of macromolecules can be used in combination with a method for performing a binding reaction that forms a stable complex.
- the step of contacting the macromolecule with a binding agent capable of binding to the macromolecule further includes components and/or steps for stabilizing the complex.
- the binding reaction comprises contacting a binding agent with a target macromolecule, wherein the binding agent and the target each comprises or is associated with a stabilizing component, allowing the binding agent to interact with the target, and allowing linking of the stabilizing components to form a stable complex.
- the stable complex may include the binding agent and associated coding tag, the adaptor molecule, the stabilizing component associated with the binding agent, the target and associated stabilizing component, and optionally a linking agent.
- the stabilizing component associated with the binding agent is joined to the coding tag associated with the binding agent.
- the linking of the stabilizing components either directly with each other or indirectly via a linker (e.g ., a linking agent) or other components, allows formation of a stable complex with the binding agent and target.
- a linker e.g ., a linking agent
- a stabilizing component is joined or attached (directly or indirectly via a linker) to a nucleic acid molecule or oligonucleotide.
- the linking agent comprises an oligonucleotide comprising a sequence complementary to a nucleic acid joined to the binding agent (e.g., stabilizing component associated with the coding tag) and the adaptor molecule comprises a first hybridization region complementary to the coding tag.
- a stable complex is formed that includes a recording tag joined to the macromolecule target, the macromolecule target bound by the binding agent associated with a coding tag, the coding tag hybridized to both the adaptor molecule and the linking agent (hybridized to the stabilizing component), and the linking agent is also hybridized to the stabilizing component of the recording tag.
- information is transferred from the coding tag to the linking agent, and this information is subsequently transferred to the recording tag.
- information is transferred from an adaptor molecule by ligation, wherein the adaptor molecule is part of the linking agent joining the stabilizing component associated with the recording tag with the stabilizing component associated with the binding agent.
- a linking agent comprises two DNA pieces joined by a linker (a first DNA for hybridizing to the stabilizing component associated with the recording tag and a second DNA comprising the adaptor molecule with a secondary tag for transferring information to the recording tag).
- the secondary tag of the adaptor molecule is ligated to the recording tag, USER enzyme can be used to cleave the U residue and remove the remaining portion of the linking agent from the extended recording tag. In some cases, this approach may be useful for reducing or removing the requirement for spacers in the information transfer step. In some embodiments, single stranded DNA ligation is used for information transfer.
- the macromolecule analysis assay comprises: providing a macromolecule and an associated recording tag joined to a support, wherein the macromolecule is also associated with a stabilizing component; contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, wherein the binding agent is also associated with a stabilizing component; providing a linking agent and linking the stabilizing components to form a stable complex comprising the macromolecule, the binding agent and the stabilizing components; providing an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag, and a secondary tag, to allow hybridization between the adaptor molecule (or the first hybridization sequence) and the coding tag (or the portion of the coding tag); and transferring the information of the secondary tag to the recording tag to generate an extended recording tag; and analyzing the extended recording tag.
- information is transferred from the secondary tag of the adaptor molecule to the recording tag.
- information is transferred from the secondary tag of the adaptor molecule
- the methods disclosed herein can be used for analysis, including detection, identification, quantitation and/or sequencing, of a plurality of macromolecules simultaneously (multiplexing).
- Multiplexing refers to analysis of a plurality of macromolecules (e.g. polypeptides) in the same assay.
- the plurality of macromolecules can be derived from the same sample or different samples.
- the plurality of macromolecules can be derived from the same subject or different subjects.
- the plurality of macromolecules that are analyzed can be different macromolecules, or the same macromolecule derived from different samples.
- a plurality of macromolecules includes 2 or more macromolecules, 5 or more macromolecules, 10 or more macromolecules, 50 or more macromolecules, 100 or more macromolecules, 500 or more macromolecules, 1000 or more macromolecules, 5,000 or more macromolecules, 10,000 or more macromolecules, 50,000 or more macromolecules, 100,000 or more macromolecules, 500,000 or more macromolecules, or 1,000,000 or more macromolecules.
- a target Prior to performing the binding reaction, a target may be obtained from a source and treated in various ways to prepare the target for the binding reaction, such as by joining to a stabilizing component.
- the binding reaction may be performed on a plurality of targets.
- the target is immobilized on a support.
- a target is a macromolecule or portion thereof.
- the targets are molecules (e.g, macromolecules) obtained from a sample and are of unknown identity.
- the targets are molecules (e.g, macromolecules) from a mixture of molecules obtained from a sample.
- a macromolecule can be a large molecule composed of smaller subunits.
- a macromolecule is a protein, a protein complex, polypeptide, peptide, nucleic acid molecule, carbohydrate, lipid, macrocycle, or a chimeric macromolecule.
- a macromolecule (e.g., protein, polypeptide, peptide) in the methods disclosed herein may be obtained from any suitable source or sample.
- the target is or comprises macromolecules (e.g., proteins, polypeptides, or peptides) obtained from a sample that is a biological sample.
- the sample comprises but is not limited to, mammalian or human cells, yeast cells, and/or bacterial cells.
- the sample contains cells that are from a sample obtained from a multicellular organism.
- the sample may be isolated from an individual.
- the sample may comprise a single cell type or multiple cell types.
- the sample may be obtained from a mammalian organism or a human, for example by puncture, or other collecting or sampling procedures.
- the sample comprises two or more cells.
- the biological sample may contain whole cells and/or live cells and/or cell debris.
- a suitable source or sample may include but is not limited to: biological samples, such as biopsy samples, cell cultures, cells (both primary cells and cultured cell lines), sample comprising cell organelles or vesicles, tissues and tissue extracts; of virtually any organism.
- a suitable source or sample may include but is not limited to: biopsy; fecal matter; bodily fluids (such as blood, whole blood, serum, plasma, urine, lymph, bile, aqueous humor, breast milk, cerumen (earwax), chyle, chyme, endolymph, perilymph, exudates, cerebrospinal fluid, interstitial fluid, aqueous or vitreous humor, colostrum, sputum, amniotic fluid, saliva, anal and vaginal secretions, gastric acid, gastric juice, lymph, mucus (including nasal drainage and phlegm), pericardial fluid, peritoneal fluid, pleural fluid, pus, rheum, saliva, sebum (skin oil), sputum, synovial fluid, perspiration and semen, a transudate, vomit and mixtures of one or more thereof, an exudate (e.g., fluid obtained from an abscess or any other site of infection
- the macromolecules may be obtained and prepared from a single cell type or multiple cell types.
- the sample comprises a population of cells.
- the macromolecules e.g., proteins, polypeptides, or peptides
- the macromolecules are from a cellular or subcellular component, an extracellular vesicle, an organelle, or an organized subcomponent thereof.
- the macromolecules e.g., proteins, polypeptides, or peptides
- the sample may include but are not limited to cellular organelles, (e.g., nucleus, golgi apparatus, ribosomes, mitochondria, endoplasmic reticulum, chloroplast, cell membrane, vesicles, etc.).
- organelles e.g., nucleus, golgi apparatus, ribosomes, mitochondria, endoplasmic reticulum, chloroplast, cell membrane, vesicles, etc.
- the target comprises a macromolecule that is a protein, a protein complex, a polypeptide, or peptide.
- Amino acid sequence information and post- translational modifications of a peptide, polypeptide, or protein are transduced into a nucleic acid encoded library that can be analyzed via next generation sequencing methods.
- a peptide may comprise L-amino acids, D-amino acids, or both.
- a peptide, polypeptide, protein, or protein complex may comprise a standard, naturally occurring amino acid, a modified amino acid (e.g., post-translational modification), an amino acid analog, an amino acid mimetic, or any combination thereof.
- a peptide, polypeptide, or protein is naturally occurring, synthetically produced, or recombinantly expressed.
- a peptide, polypeptide, protein, or protein complex may further comprise a post-translational modification.
- Non-standard amino acids include selenocysteine, pyrrolysine, and N-formylmethionine, b-amino acids, homo-amino acids, Proline and Pyruvic acid derivatives, 3 -substituted Alanine derivatives, Glycine derivatives, ring-substituted Phenylalanine and Tyrosine Derivatives, linear core amino acids, and N-methyl amino acids.
- a post-translational modification (PTM) of a peptide, polypeptide, or protein may be a covalent modification or enzymatic modification.
- post-translation modifications include, but are not limited to, acylation, acetylation, alkylation (including methylation), biotinylation, butyrylation, carbamylation, carbonylation, deamidation, deiminiation, diphthamide formation, disulfide bridge formation, eliminylation, flavin attachment, formylation, gamma-carboxylation, glutamyl ati on, glycylation, glycosylation ( e.g ., N-linked, O-linked, C-linked, phosphoglycosylation), glypiation, heme C attachment, hydroxylation, hypusine formation, iodination, isoprenylation, lipidation, lipoylation, malonylation, methylation, myristolylation, oxidation, palmitoylation,
- a post-translational modification includes modifications of the amino terminus and/or the carboxyl terminus of a peptide, polypeptide, or protein.
- Modifications of the terminal amino group include, but are not limited to, des-amino, N-lower alkyl, N-di-lower alkyl, and N-acyl modifications.
- Modifications of the terminal carboxy group include, but are not limited to, amide, lower alkyl amide, dialkyl amide, and lower alkyl ester modifications (e.g., wherein lower alkyl is C1-C4 alkyl).
- a post-translational modification also includes modifications, such as but not limited to those described above, of amino acids falling between the amino and carboxy termini of a peptide, polypeptide, or protein.
- Post-translational modification can regulate a protein’s “biology” within a cell, e.g., its activity, structure, stability, or localization. For example, phosphorylation plays an important role in regulation of protein, particularly in cell signaling (Prabakaran et al., 2012, Wiley Interdiscip Rev Syst Biol Med 4: 565-583).
- the addition of sugars to proteins, such as glycosylation has been shown to promote protein folding, improve stability, and modify regulatory function and the attachment of lipids to proteins enables targeting to the cell membrane.
- a post-translational modification can also include peptide, polypeptide, or protein modifications to include one or more detectable labels.
- a peptide, polypeptide, or protein can be fragmented.
- Peptides, polypeptides, or proteins can be fragmented by any means known in the art, including fragmentation by a protease or endopeptidase.
- fragmentation of a peptide, polypeptide, or protein is targeted by use of a specific protease or endopeptidase.
- a specific protease or endopeptidase binds and cleaves at a specific consensus sequence (e.g., TEV protease).
- fragmentation of a peptide, polypeptide, or protein is non-targeted or random by use of a non-specific protease or endopeptidase.
- a non-specific protease may bind and cleave at a specific amino acid residue rather than a consensus sequence (e.g ., proteinase K is a non-specific serine protease).
- proteinases and endopeptidases can be used to cleave a protein or polypeptide into smaller peptide fragments include proteinase K, trypsin, chymotrypsin, pepsin, thermolysin, thrombin, Factor Xa, furin, endopeptidase, papain, pepsin, subtilisin, elastase, enterokinase, GenenaseTM I, Endoproteinase LysC, Endoproteinase AspN, Endoproteinase GluC, etc. (Granvogl et ah, 2007, Anal Bioanal Chem 389: 991-1002).
- a peptide, polypeptide, or protein is fragmented by proteinase K, or optionally, a thermolabile version of proteinase K to enable rapid inactivation.
- Proteinase K is stable in denaturing reagents, such as urea and SDS, and enables digestion of completely denatured proteins. Protein and polypeptide fragmentation into peptides can be performed before or after attachment of a DNA tag or DNA recording tag.
- Chemical reagents can also be used to digest proteins into peptide fragments.
- a chemical reagent may cleave at a specific amino acid residue (e.g., cyanogen bromide hydrolyzes peptide bonds at the C-terminus of methionine residues).
- Chemical reagents for fragmenting polypeptides or proteins into smaller peptides include cyanogen bromide (CNBr), hydroxylamine, hydrazine, formic acid, BNPS-skatole [2-(2-nitrophenylsulfenyl)-3- methylindole], iodosobenzoic acid, vNTCB +Ni (2-nitro-5-thiocyanobenzoic acid), etc.
- the resulting peptide fragments are approximately the same desired length, e.g., from about 10 amino acids to about 70 amino acids, from about 10 amino acids to about 60 amino acids, from about 10 amino acids to about 50 amino acids, about 10 to about 40 amino acids, from about 10 to about 30 amino acids, from about 20 amino acids to about 70 amino acids, from about 20 amino acids to about 60 amino acids, from about 20 amino acids to about 50 amino acids, about 20 to about 40 amino acids, from about 20 to about 30 amino acids, from about 30 amino acids to about 70 amino acids, from about 30 amino acids to about 60 amino acids, from about 30 amino acids to about 50 amino acids, or from about 30 amino acids to about 40 amino acids.
- a cleavage reaction may be monitored, preferably in real time, by spiking the protein or polypeptide sample with a short test FRET (fluorescence resonance energy transfer) peptide comprising a peptide sequence containing a proteinase or endopeptidase cleavage site.
- FRET fluorescence resonance energy transfer
- a fluorescent group and a quencher group are attached to either end of the peptide sequence containing the cleavage site, and fluorescence resonance energy transfer between the quencher and the fluorophore leads to low fluorescence.
- the quencher and fluorophore are separated giving a large increase in fluorescence.
- a cleavage reaction can be stopped when a certain fluorescence intensity is achieved, allowing a reproducible cleavage endpoint to be achieved.
- a target is or comprises a sample of macromolecules (e.g ., peptides, polypeptides, or proteins) which can undergo protein fractionation methods where proteins or peptides are separated by one or more properties such as cellular location, molecular weight, hydrophobicity, isoelectric point, or protein enrichment methods.
- macromolecules e.g., peptides, polypeptides, or proteins
- a subset of macromolecules (e.g., proteins) within a sample is fractionated such that a subset of the macromolecules is sorted from the rest of the sample. For example, the sample may undergo fractionation methods prior to attachment to a support.
- protein enrichment methods may be used to select for a specific protein or peptide (see, e.g., Whiteaker et al., 2007, Anal. Biochem. 362:44-54, incorporated by reference in its entirety) or to select for a particular post translational modification (see, e.g., Huang et al., 2014. J. Chromatogr. A 1372:1-17, incorporated by reference in its entirety).
- a particular class or classes of proteins such as immunoglobulins, or immunoglobulin (Ig) isotypes such as IgG, can be affinity enriched or selected for analysis.
- a protein sample dynamic range can be modulated by fractionating the protein sample using standard fractionation methods, including electrophoresis and liquid chromatography (Zhou et al., 2012, Anal Chem 84(2): 720-734), or partitioning the fractions into compartments (e.g., droplets) loaded with limited capacity protein binding beads/resin (e.g. hydroxylated silica particles) (McCormick, 1989, Anal Biochem 181(1): 66-74) and eluting bound protein. Excess protein in each compartmentalized fraction is washed away.
- standard fractionation methods including electrophoresis and liquid chromatography (Zhou et al., 2012, Anal Chem 84(2): 720-734), or partitioning the fractions into compartments (e.g., droplets) loaded with limited capacity protein binding beads/resin (e.g. hydroxylated silica particles) (McCormick, 1989, Anal Biochem 181(1): 66-74) and eluting bound
- electrophoretic methods include capillary electrophoresis (CE), capillary isoelectric focusing (CIEF), capillary isotachophoresis (CITP), free flow electrophoresis, gel-eluted liquid fraction entrapment electrophoresis (GELFrEE).
- electrophoretic methods include capillary electrophoresis (CE), capillary isoelectric focusing (CIEF), capillary isotachophoresis (CITP), free flow electrophoresis, gel-eluted liquid fraction entrapment electrophoresis (GELFrEE).
- liquid chromatography protein separation methods include reverse phase (RP), ion exchange (IE), size exclusion (SE), hydrophilic interaction, etc.
- compartment partitions include emulsions, droplets, microwells, physically separated regions on a flat substrate, etc.
- Exemplary protein binding beads/resins include silica nanoparticles derivatized with phenol groups or hydroxyl groups (e.g ., StrataClean Resin from Agilent Technologies, RapidClean from LabTech, etc.). By limiting the binding capacity of the beads/resin, highly-abundant proteins eluting in a given fraction will only be partially bound to the beads, and excess proteins removed.
- a partition barcode which comprises assignment of a unique barcode to a subsampling of macromolecules from a population of macromolecules within a sample.
- This partition barcode may be comprised of identical barcodes arising from the partitioning of macromolecules within compartments labeled with the same barcode (e.g., a barcoded bead population in which multiple beads share the same barcode).
- the use of physical compartments effectively subsamples the original sample to provide assignment of partition barcodes. For instance, a set of beads labeled with 10,000 different compartment barcodes is provided. Furthermore, suppose in a given assay, that a population of 1 million beads are used in the assay.
- single molecule partitioning and partition barcoding of polypeptides is accomplished by labeling polypeptides (chemically or enzymatically) with an amplifiable DNA UMI tag (e.g., recording tag) at the N or C terminus, or both.
- DNA tags are attached to the body of the polypeptide (internal amino acids) via non-specific photo labeling or specific chemical attachment to reactive amino acids such as lysines.
- Information from the recording tag attached to the terminus of the peptide is transferred to the DNA tags via an enzymatic emulsion PCR (Williams et al., Nat Methods, (2006) 3(7):545-550; Schutze et al., Anal Biochem.
- a nanoemulsion is employed such that, on average, there is fewer than a single polypeptide per emulsion droplet with size from 50 nm-1000 nm (Nishikawa et al., J Nucleic Acids. (2012) 2012: 923214; Gupta et al., Soft Matter. (2016) 12(11):2826-41; Sole et al., Langmuir (2006, 22(20):8326-8332).
- PCR all the components of PCR are included in the aqueous emulsion mix including primers, dNTPs, Mg2+, polymerase, and PCR buffer.
- the recording tag is designed with a T7/SP6 RNA polymerase promoter sequence to generate transcripts that hybridize to the DNA tags attached to the body of the polypeptide (Ryckelynck et al., RNA. (2015) 21(3):458-469).
- a reverse transcriptase (RT) copies the information from the hybridized RNA molecule to the DNA tag.
- emulsion PCR or IVT/RT can be used to effectively transfer information from the terminus recording tag to multiple DNA tags attached to the body of the polypeptide.
- a sample of macromolecule targets e.g ., peptides, polypeptides, or proteins
- a compartment separates or isolates a subset of macromolecules from a sample of macromolecules.
- the compartment may be an aqueous compartment (e.g., microfluidic droplet), a solid compartment (e.g., picotiter well or microtiter well on a plate, tube, vial, bead), or a separated region on a surface.
- a compartment may comprise one or more beads to which macromolecules may be immobilized.
- macromolecules in a compartment is labeled with a compartment tag including a barcode.
- the macromolecules in one compartment can be labeled with the same barcode or macromolecules in multiple compartments can be labeled with the same barcode.
- Compartment barcoding can be accomplished in a number of ways including direct incorporation of unique barcodes into each droplet by droplet joining (Bio-Rad Laboratories), by introduction of barcoded beads into droplets (10X Genomics), or by combinatorial barcoding of components of the droplet post encapsulation and gelation using and split-pool combinatorial barcoding as described by Gunderson et al. (International Patent Publication No. WO 2016/130704, incorporated by reference in its entirety). A similar combinatorial labeling scheme can also be applied to nuclei (Vitak et al., Nat Methods (2017) 14(3):302-308).
- the target e.g ., macromolecule
- the target is joined to a support before performing the binding reaction.
- a support with a large carrying capacity to immobilize a large number of targets (e.g., macromolecules).
- the preparation of the targets including joining the target to a support is performed prior to performing the binding reaction.
- the preparation of the target including joining the macromolecule to nucleic acid molecule or a oligonucleotide may be performed prior to or after immobilizing the target.
- a plurality of targets are attached to a support prior to the binding reaction and contacting with a binding agent.
- a support can be any solid or porous support including, but not limited to, a bead, a microbead, an array, a glass surface, a silicon surface, a plastic surface, a filter, a membrane, a PTFE membrane, nylon, a microtiter well, an ELISA plate, a spinning interferometry disc, a nitrocellulose membrane, a nitrocellulose-based polymer surface, a nanoparticle, or a microsphere.
- Materials for a support include but are not limited to acrylamide, agarose, cellulose, dextran, nitrocellulose, glass, gold, quartz, polystyrene, polyethylene vinyl acetate, polypropylene, polyester, polymethacrylate, polyacrylate, polyethylene, polyethylene oxide, polysilicates, polycarbonates, poly vinyl alcohol (PVA), Teflon, fluorocarbons, nylon, silicon rubber, silica, polyanhydrides, polyglycolic acid, polyvinylchloride, polylactic acid, polyorthoesters, functionalized silane, polypropylfumerate, collagen, glycosaminoglycans, polyamino acids, or any combination thereof.
- a support is a bead, for example, a polystyrene bead, a polymer bead, a polyacrylate bead, an agarose bead, a cellulose bead, a dextran bead, an acrylamide bead, a solid core bead, a porous bead, a paramagnetic bead, a glass bead, a silica-based bead, or a controlled pore bead, or any combinations thereof.
- the support is a porous agarose bead.
- the support may comprise any suitable solid material, including porous and non-porous materials, to which a macromolecule, e.g ., a polypeptide, can be associated directly or indirectly, by any means known in the art, including covalent and non-covalent interactions, or any combination thereof.
- a support may be two- dimensional (e.g., planar surface) or three-dimensional (e.g., gel matrix or bead).
- a support can be any support surface including, but not limited to, a bead, a microbead, an array, a glass surface, a silicon surface, a plastic surface, a filter, a membrane, a PTFE membrane, a PTFE membrane, a nitrocellulose membrane, a nitrocellulose-based polymer surface, nylon, a microtiter well, an ELISA plate, a spinning interferometry disc, a polymer matrix, a nanoparticle, or a microsphere.
- Materials for a support include but are not limited to acrylamide, agarose, cellulose, dextran, nitrocellulose, glass, gold, quartz, polystyrene, polyethylene vinyl acetate, polypropylene, polyester, polymethacrylate, polyacrylate, polyethylene, polyethylene oxide, polysilicates, polycarbonates, poly vinyl alcohol (PVA), Teflon, fluorocarbons, nylon, silicon rubber, polyanhydrides, polyglycolic acid, polyvinylchloride, polylactic acid, polyorthoesters, functionalized silane, polypropylfumerate, collagen, glycosaminoglycans, polyamino acids, dextran, or any combination thereof.
- Supports further include thin film, membrane, bottles, dishes, fibers, woven fibers, shaped polymers such as tubes, particles, beads, microspheres, microparticles, or any combination thereof.
- the bead can include, but is not limited to, a ceramic bead, a polystyrene bead, a polymer bead, a polyacrylate bead, a methylstyrene bead, an agarose bead, a cellulose bead, a dextran bead, an acrylamide bead, a solid core bead, a porous bead, a paramagnetic bead, a glass bead, or a controlled pore bead, a silica-based bead, or any combinations thereof.
- a bead may be spherical or an irregularly shaped.
- a bead or support may be porous.
- a bead’s size may range from nanometers, e.g, 100 nm, to millimeters, e.g, 1 mm.
- beads range in size from about 0.2 micron to about 200 microns, or from about 0.5 micron to about 5 micron.
- beads can be about 1, 1.5, 2, 2.5, 2.8, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, 10, 10.5, 15, or 20 pm in diameter.
- “a bead” support may refer to an individual bead or a plurality of beads.
- the solid surface is a nanoparticle.
- the nanoparticles range in size from about 1 nm to about 500 nm in diameter, for example, between about 1 nm and about 20 nm, between about 1 nm and about 50 nm, between about 1 nm and about 100 nm, between about 10 nm and about 50 nm, between about 10 nm and about 100 nm, between about 10 nm and about 200 nm, between about 50 nm and about 100 nm, between about 50 nm and about 150, between about 50 nm and about 200 nm, between about 100 nm and about 200 nm, or between about 200 nm and about 500 nm in diameter.
- the nanoparticles can be about 10 nm, about 50 nm, about 100 nm, about 150 nm, about 200 nm, about 300 nm, or about 500 nm in diameter. In some embodiments, the nanoparticles are less than about 200 nm in diameter.
- Various reactions may be used to attach the targets (e.g ., macromolecules) to a support (e.g., a solid or a porous support).
- the targets e.g., macromolecules
- the targets may be attached directly or indirectly to the support.
- the targets are attached to the support via a nucleic acid.
- Exemplary reactions include the copper catalyzed reaction of an azide and alkyne to form a triazole (Huisgen 1, 3-dipolar cycloaddition), strain-promoted azide alkyne cycloaddition (SPAAC), reaction of a diene and dienophile (Diels- Alder), strain-promoted alkyne-nitrone cycloaddition, reaction of a strained alkene with an azide, tetrazine or tetrazole, alkene and azide [3+2] cycloaddition, alkene and tetrazine inverse electron demand Diels-Alder (IEDDA) reaction (e.g., m- tetrazine (mTet) or phenyl tetrazine (pTet) and trans-cyclooctene (TCO)); or pTet and an alkene), alkene and tetrazole
- Exemplary displacement reactions include reaction of an amine with: an activated ester; an N- hydroxysuccinimide ester; an isocyanate; an isothioscyanate, an aldehyde, an epoxide, or the like.
- iEDDA click chemistry is used for immobilizing macromolecules (e.g, polypeptides) to a support since it is rapid and delivers high yields at low input concentrations.
- m-tetrazine rather than tetrazine is used in an iEDDA click chemistry reaction, as m-tetrazine has improved bond stability.
- phenyl tetrazine is used in an iEDDA click chemistry reaction.
- a polypeptide is labeled with a bifunctional click chemistry reagent, such as alkyne- NHS ester (acetylene-PEG-NHS ester) reagent or alkyne-benzophenone to generate an alkyne-labeled polypeptide.
- a bifunctional click chemistry reagent such as alkyne- NHS ester (acetylene-PEG-NHS ester) reagent or alkyne-benzophenone to generate an alkyne-labeled polypeptide.
- an alkyne can also be a strained alkyne, such as cyclooctynes including Dibenzocyclooctyl (DBCO), etc.
- DBCO Dibenzocyclooctyl
- the target molecules can be spaced appropriately to accommodate methods of performing the binding reaction and any downstream analysis steps to be used to assess the target. For example, it may be advantageous to space the target molecules that optimally to allow a nucleic acid-based method for assessing and sequencing the proteins to be performed.
- the method for assessing and sequencing protein targets involve a binding agent which binds to the target molecules and the binding agent comprises a coding tag with information that is transferred to a nucleic acid attached to the target molecules.
- spacing of the targets on the support is determined based on the consideration that information transfer from a coding tag of a binding agent bound to one target molecule may reach a neighboring molecule.
- the surface of the support is passivated (blocked).
- passivated surface refers to a surface that has been treated with outer layer of material.
- Methods of passivating surfaces include standard methods from the fluorescent single molecule analysis literature, including passivating surfaces with polymer like polyethylene glycol (PEG) (Pan et ah, 2015, Phys. Biol. 12:045006), polysiloxane ( e.g ., Pluronic F-127), star polymers (e.g., star PEG) (Groll et ak, 2010, Methods Enzymol. 472: 1- 18), hydrophobic dichlorodimethylsilane (DDS) + self-assembled Tween-20 (Hua et ak, 2014, Nat.
- PEG polyethylene glycol
- DDS hydrophobic dichlorodimethylsilane
- density of macromolecules e.g., proteins, polypeptide, or peptides
- density of macromolecules can be titrated on the surface or within the volume of a solid substrate by spiking a competitor or “dummy” reactive molecule when immobilizing the proteins, polypeptides or peptides to the solid substrate.
- the density of functional coupling groups for attaching the target may be titrated on the substrate surface.
- multiple target molecules e.g., macromolecules
- the volume (e.g., porous supports) of a support such that adjacent molecules are spaced apart at a distance of about 50 nm to about 500 nm, or about 50 nm to about 400 nm, or about 50 nm to about 300 nm, or about 50 nm to about 200 nm, or about 50 nm to about 100 nm.
- multiple molecules are spaced apart on the surface of a support with an average distance of at least 50 nm, at least 60 nm, at least 70 nm, at least 80 nm, at least 90 nm, at least 100 nm, at least 150 nm, at least 200 nm, at least 250 nm, at least 300 nm, at least 350 nm, at least 400 nm, at least 450 nm, or at least 500 nm. In some embodiments, multiple molecules are spaced apart on the surface of a support with an average distance of at least 50 nm.
- molecules are spaced apart on the surface or within the volume of a support such that, empirically, the relative frequency of inter- to intra-molecular events (e.g. transfer of information) is ⁇ 1:10; ⁇ 1:100; ⁇ 1:1,000; or ⁇ 1:10,000.
- the plurality of target molecules is coupled on the support spaced apart at an average distance between two adjacent molecules which ranges from about 50 to 100 nm, from about 50 to 250 nm, from about 50 to 500 nm, from about 50 to 750 nm, from about 50 to 1,000 nm, from about 50 to 1,500 nm, from about 50 to 2,000 nm, from about 100 to 250 nm, from about 100 to 500 nm, from about 200 to 500 nm, from about 300 to 500 nm, from about 100 to 1000 nm, from about 500 to 600 nm, from about 500 to 700 nm, from about 500 to 800 nm, from about 500 to 900 nm, from about 500 to 1,000 nm, from about 500 to 2,000 nm, from about 500 to 5,000 nm, from about 1,000 to 5,000 nm, or from about 3,000 to 5,000 nm.
- appropriate spacing of the target molecules (e.g, macromolecules) on the support is accomplished by titrating the ratio of available attachment molecules on the substrate surface.
- the substrate surface e.g., bead surface
- the substrate surface is functionalized with a carboxyl group (COOH) which is treated with an activating agent (e.g., activating agent is EDC and Sulfo-NHS).
- an activating agent e.g., activating agent is EDC and Sulfo-NHS
- the substrate surface (e.g., bead surface) comprises NHS moieties.
- a mixture of mPEG ⁇ -NFb and NIT-PEG, ,-mTet is added to the activated beads (wherein n is any number, such as 1-100).
- the ratio between the mPEG3-NH2 (not available for coupling) and NH2-PEG24-mTet (available for coupling) is titrated to generate an appropriate density of functional moieties available to attach the polypeptides on the substrate surface.
- the mean spacing between coupling moieties (e.g., NH2-PEG4-mTet) on the solid surface is at least 50 nm, at least 100 nm, at least 250 nm, or at least 500 nm.
- the ratio of ME-PEG n -mTet to mPEG3-NH2 is about or greater than 1 : 1000, about or greater than 1 : 10,000, about or greater than 1 : 100,000, or about or greater than 1 : 1,000,000.
- the recording tag attaches to the ME-PEG n -mTet.
- the spacing of the target molecules (e.g, macromolecules) on the support is achieved by controlling the concentration and/or number of available COOH or other functional groups on the support.
- the provided methods for performing a binding reaction may be used in or in combination with an assay for analyzing the target, such as in a macromolecule analysis assay.
- additional treatments and reactions may be performed with the target before or after the binding reaction.
- some of the additional reactions and treatments may be performed while the stable complex comprising the binding agent, target, and stabilizing components are intact.
- the target or plurality of targets is obtained from a sample and immobilized on a support (e.g, on a bead).
- the binding reaction is useful for identifying the target or a portion thereof, such as by using a binding agent with a known binding profile.
- the binding agent comprises one or more detectable labels.
- the macromolecule analysis assay includes contacting the macromolecule with a binding agent capable of binding to the macromolecule and forming a stable complex, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent; and transferring the information of the coding tag to a recording tag (associated with the target macromolecule) to generate an extended recording tag.
- the stable complex is formed by linking the stabilizing components (associated or joined to the target and binding agent).
- transferring the information of the coding tag to the recording tag to extend the recording tag may be repeated one or more times.
- the analysis assay is performed on immobilized target molecules bound by two or more cognate binding agents (e.g, antibodies). After a cognate antibody binding event, a stable complex can be formed, and a combined primer extension and DNA nicking step may be used to transfer information from the coding tags of bound antibodies to the recording tag.
- a method for analyzing a macromolecule comprising the steps of: (a) providing a macromolecule and an associated recording tag joined to a support; (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent and the binding agent and the macromolecule each comprises or is associated with a stabilizing component; (c) linking the stabilizing components to form a stable complex; (d) transferring the information of the coding tag to the recording tag to generate an extended recording tag; and analyzing the extended recording tag.
- step (a) is performed before steps (b), (c) and (d).
- step (b) is performed before step (c) and step (d).
- step (c) is performed before step (d).
- the method further comprises removing the one or more binding agents. In some embodiments, removing the one or more binding agents is performed after step (d).
- the method is performed with one or more target peptides and the method further includes step (e) removing the terminal amino acid (e.g ., N-terminal amino acid (NTAA)) of the peptide to expose a new terminal amino acid of the peptide.
- NTAA N-terminal amino acid
- the method includes treating the target peptide with a reagent for modifying a terminal amino acid of the peptide.
- the target peptide is contacted with the reagent for modifying a terminal amino acid before step (b).
- the target peptide is contacted with the reagent for modifying a terminal amino acid before removing the terminal amino acid.
- the method further includes the step of removing the binding agent after transferring information from the coding tag to the recording tag.
- the stable complex is disrupted prior to removing the binding agent.
- the disrupting may be performed by introducing a destabilizing agent, such as heat, a denaturing agent, an enzyme or a competitor molecule.
- the bound binding agent and annealed coding tag can be removed following transfer of the identifying information (e.g., primer extension) by using highly denaturing conditions (e.g, 0.1-0.2 N NaOH, 6M Urea,
- the provided methods for analysis of macromolecules can further include providing a plurality of adaptor molecules each comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag (or portion thereof), and a secondary tag; and transferring the information of the secondary tag to the recording tag to generate an extended recording tag.
- the information transferred from the secondary tag of an adaptor molecule includes identifying information regarding the identity of the macromolecule or portion thereof bound by the binding agent.
- the extended recording tag associated with the macromolecule for analysis can comprise the information from one or more secondary tags.
- the methods provided may include preparing, selecting, and providing a plurality of adaptor molecules.
- more than one coding tag associated a binding agent is configured to hybridize to adaptor molecules comprising the same secondary tag.
- the adaptor molecules comprise a first hybridization sequence and a secondary tag, wherein the first hybridization sequence or portion thereof is complementary to the coding tag or region therein.
- the first hybridization sequence of the adaptor molecule comprises a single stranded region for hybridizing to the coding tag (or region therein) associated with the binding agent.
- the adaptor molecule further comprises a second hybridization sequence substantially complementary to at least a portion of the recording tag.
- the macromolecule analysis assay comprises: providing a macromolecule and an associated recording tag joined to a support, wherein the macromolecule is also associated with a stabilizing component; contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, to allow binding between the macromolecule and the binding agent, wherein the binding agent is also associated with a stabilizing component; providing a linking agent and linking the stabilizing components to form a stable complex; providing a plurality of adaptor molecules each comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag, and a secondary tag, to allow hybridization between the adaptor molecule (or the first hybridization sequence) and the coding tag (or the portion of the coding tag); transferring the information of the secondary tag to the recording tag to generate an extended recording tag; and analyzing the extended recording tag.
- information is transferred from the secondary tag of the adaptor molecule to the recording tag.
- the method further comprises transferring information of a coding tag associated with the binding agent to the recording tag associated with the target to generate an extended recording tag, thereby generating an extended recording tag.
- transferring information of the coding tag to the recording tag is performed after the stabilizing components are linked.
- transferring information of the coding tag to the recording tag is performed after the stable complex is formed.
- transferring information of the coding tag to the recording tag is performed after introducing the linking agent or introducing light to the binding reaction.
- the method generally includes contacting a binding agent to terminal amino acid (e.g ., NTAA or CTAA) of a polypeptide, protein or peptide, providing an adaptor molecule and allowing hybridization of complementary sequences, and transferring the information from the secondary tag to the recording tag associated with the polypeptide, protein or peptide, thereby generating a first order extended recording tag (see Figs. 1, 2, 6 or 7).
- the method comprises labeling or modifying the macromolecule (e.g. peptide) prior to or after the polypeptide, protein or peptide is contacted with the binding agent.
- the terminal amino acid of the polypeptide, protein or peptide bound by the binding agent may be a chemically labeled or modified terminal amino acid.
- the method further includes removing or eliminating the terminal amino acid (e.g, NTAA or CTAA) from the polypeptide, protein or peptide after the information transfer step.
- the terminal amino acid eliminated may be a chemically labeled or modified terminal amino acid. Removal of the NTAA by contacting with an enzyme or chemical reagents converts the penultimate amino acid of the polypeptide, protein or peptide to a terminal amino acid.
- the polypeptide analysis may include one or more cycles of binding with additional binding agents to the terminal amino acid, providing a plurality of adaptor molecules and allow hybridization between the coding tag and first hybridization sequence of the adaptor molecule, and transferring information from the secondary tags to the extended nucleic acid thereby generating a higher order extended recording tag containing information regarding two or more binding agents, and eliminating the terminal amino acid in a cyclic manner. Additional binding, hybridizing, transferring information, and removal, can occur as described above up to n amino acids to generate an n th order extended nucleic acid, which collectively represent the polypeptide, protein or peptide. In some of any provided embodiments, steps including the NTAA in the described exemplary approach can be performed instead with a C terminal amino acid (CTAA).
- CTCAA C terminal amino acid
- the order of the steps in the process for a degradation- based peptide or polypeptide sequencing assay can be reversed or be performed in various orders.
- the terminal amino acid labeling can be conducted before and/or after the polypeptide is bound to the binding agent.
- the polypeptide analysis assay includes performing an assay which utilizes the recording tag associated with the target macromolecule, e.g ., the polypeptide.
- the recording tag associated with the target polypeptide is used in the polypeptide analysis assay which includes transferring identifying information from one or more coding tags to the recording tag.
- contacting of the first binding agent and second binding agent to the target, and optionally any further binding agents are performed at the same time.
- the first binding agent and second binding agent, and optionally any further order binding agents can be pooled together, for example to form a library of binding agents.
- the first binding agent and second binding agent, and optionally any further order binding agents, rather than being pooled together are added simultaneously to the polypeptide.
- a library of binding agents comprises at least 20 binding agents that selectively bind to the 20 standard, naturally occurring amino acids.
- a library of binding agents may comprise binding agents that selectively bind to the modified amino acids.
- first binding agent and second binding agent, and optionally any further order binding agents are each contacted with the polypeptide in separate binding cycles, added in sequential order.
- multiple binding agents are used at the same time in parallel. This parallel approach saves time and reduces non-specific binding by non-cognate binding agents to a site that is bound by a cognate binding agent (because the binding agents are in competition).
- identifying information of its linked coding tag is transferred to the recording tag (e.g, recording tag) associated with the peptide, thereby generating an extended recording tag.
- the nucleic acid associated with the protein or peptide for analysis can comprise the recording tag and information from one or more coding tags.
- the recording tag further comprises barcodes and/or other nucleic acid components.
- the identifying information from the coding tag of the binding agent is transferred to the recording tag (or other nucleic acid components) attached thereto.
- the transfer of the identifying information can be achieved by any suitable means such as by extension or ligation.
- a spacer is added to the end of the recording tag, and the spacer comprises a sequence that is capable of hybridizing with a sequence on the coding tag to facilitate transfer of the identifying information.
- Coding tag information associated with a specific binding agent may be transferred to a recording tag using a variety of methods.
- information of a coding tagA spacer sequence on the 3’ -terminus of a recording tag or an extended recording tag anneals with complementary spacer sequence on the 3’ terminus of a coding tag and a polymerase (e.g., strand-displacing polymerase) extends the recording tag sequence, using the annealed coding tag as a template.
- oligonucleotides complementary to coding tag encoder sequence and 5’ spacer can be pre annealed to the coding tags to prevent hybridization of the coding tag to internal encoder and spacer sequences present in an extended recording tag.
- the 3’ terminal spacer, on the coding tag, remaining single stranded, preferably binds to the terminal 3’ spacer on the recording tag.
- a nascent recording tag can be coated with a single stranded binding protein to prevent annealing of the coding tag to internal sites.
- the nascent recording tag can also be coated with RecA (or related homologues such as uvsX) to facilitate invasion of the 3’ terminus into a completely double stranded coding tag (Bell et al., 2012, Nature 491 :274-278).
- RecA or related homologues such as uvsX
- This configuration prevents the double stranded coding tag from interacting with internal recording tag elements, yet is susceptible to strand invasion by the RecA coated 3’ tail of the extended recording tag (Bell, et al., 2015, Elife 4: e08646).
- the presence of a single-stranded binding protein can facilitate the strand displacement reaction.
- information from a coding tag linked to a binding agent can be transferred to the nucleic acid associated with the polypeptide while the binding agent is bound to the polypeptide.
- An extended nucleic acid associated with the macromolecule, e.g ., the peptide, with identifying information from the coding tag may comprise information from a binding agent’s coding tag representing each binding cycle performed.
- an extended nucleic acid may also experience a “missed” binding cycle, e.g., if a binding agent fails to bind to the polypeptide, because the coding tag was missing, damaged, or defective, because the primer extension reaction failed. Even if a binding event occurs, transfer of information from the coding tag may be incomplete or less than 100% accurate, e.g, because a coding tag was damaged or defective, because errors were introduced in the primer extension reaction).
- an extended nucleic acid may represent 100%, or up to 95%,
- the coding tag information present in the extended nucleic acid may have at least 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 100% identity the corresponding coding tags.
- an extended recording tag associated with the immobilized peptide may comprise information from multiple coding tags representing multiple, successive binding events.
- a single, concatenated extended recording tag associated with the immobilized peptide can be representative of a single polypeptide.
- transfer of coding tag information to the recording tag associated with the immobilized peptide also includes transfer to an extended recording tag as would occur in methods involving multiple, successive binding events.
- Coding tag information associated with a specific binding agent may be transferred using a variety of methods.
- information of a coding tag is transferred to a recording tag associated with the immobilized peptide via primer extension (Chan et al., Curr Opin Chem Biol. (2015) 26: 55-61).
- a spacer sequence on the 3 ’-terminus of a recording tag anneals with complementary spacer sequence on the 3’ terminus of a coding tag and a polymerase (e.g., strand-displacing polymerase) extends the nucleic acid sequence on the recording tag, using the annealed coding tag as a template.
- oligonucleotides complementary to coding tag encoder sequence and 5’ spacer can be pre-annealed to the coding tags to prevent hybridization of the coding tag to internal encoder and spacer sequences present in an extended nucleic acid.
- the 3’ terminal spacer, on the coding tag, remaining single stranded, preferably binds to the terminal 3’ spacer on the recording tag (or any barcodes or other nucleic acid components associated).
- a nascent recording tag associated with the immobilized peptide can be coated with a single stranded binding protein to prevent annealing of the coding tag to internal sites.
- ligation e.g, an enzymatic or chemical ligation, a splint ligation, a sticky end ligation, a single-strand (ss) ligation such as a ssDNA ligation, or any combination thereof
- a polymerase-mediated reaction e.g, primer extension of single-stranded nucleic acid or double-stranded nucleic acid, or any combination thereof.
- Most type A polymerases are devoid of 3 ’ exonuclease activity (endogenous or engineered removal), such as Klenow exo-, T7 DNA polymerase exo- (Sequenase 2.0), and Taq polymerase catalyzes non-templated addition of a nucleotide, preferably an adenosine base (to lesser degree a G base, dependent on sequence context) to the 3’ blunt end of a duplex amplification product.
- exonuclease activity endogenous or engineered removal
- Klenow exo- preferably T7 DNA polymerase exo- (Sequenase 2.0)
- Taq polymerase catalyzes non-templated addition of a nucleotide, preferably an adenosine base (to lesser degree a G base, dependent on sequence context) to the 3’ blunt end of a duplex amplification product.
- a thymidine base in the coding tag between the spacer sequence distal from the binding agent and the adjacent barcode sequence accommodates the sporadic inclusion of a non-templated adenosine nucleotide on the 3’ terminus of the spacer sequence of the recording tag.
- the extended recording tag associated with the immobilized peptide can anneal to the coding tag and undergo primer extension.
- competitor also referred to as blocking
- oligonucleotides complementary to nucleic acids containing spacer sequences can be added to binding reactions to minimize non-specific interactions.
- the blocking oligonucleotides contain a sequence that is complementary to the coding tag or a portion thereof attached to the binding agent. In some embodiments, blocking oligonucleotides are relatively short.
- Blocking oligonucleotides may comprise a terminator nucleotide at its 3’ end to prevent primer extension.
- the annealing of the spacer sequence on the recording tag to the complementary spacer sequence on the coding tag is metastable under the primer extension reaction conditions (i.e., the annealing Tm is similar to the reaction temperature). This allows the spacer sequence of the coding tag to displace any blocking oligonucleotide annealed to the spacer sequence of the recording tag (or extensions thereof).
- Coding tag information associated with a specific binding agent may be transferred to a nucleic acid on the recording tag associated with the immobilized peptide via ligation.
- Electroligase ® a ligation may be a chemical ligation reaction, such as chemical ligation using standard chemical ligation or “click chemistry” (Gunderson et ah, Genome Res (1998) 8(11): 1142-1153; Peng et ah, European J Org Chem (2010) (22): 4194- 4197; El-Sagheeret ah, Proc Natl Acad Sci U S A (2011) 108(28): 11338-11343; El-Sagheer et ah, Org Biomol Chem (2011) 9(1): 232-235; Sharma et ah, Anal Chem (2012) 84(14): 6104-6109; Roloff et ah, Bioorg Med Chem (2013) 21(12): 3458-3464; Lito
- coding tag information can be transferred using topoisomerase.
- the binding event information is transferred from a coding tag to the recording tag associated with the immobilized peptide in a cyclic fashion.
- Cross-reactive binding events can be informatically filtered out after sequencing by requiring that at least two different coding tags, identifying two or more independent binding events, map to the same class of binding agents (cognate to a particular protein).
- the coding tag may contain an optional UMI sequence in addition to one or more spacer sequences. Universal priming sequences may also be included in extended nucleic acids on the recording tag associated with the immobilized peptide for amplification and NGS sequencing.
- the final extended recording tag containing information from one or more binding agents is optionally flanked by universal priming sites to facilitate downstream amplification and/or DNA sequencing.
- the forward universal priming site e.g ., Alumina’s P5-S1 sequence
- the reverse universal priming site e.g., Alumina’s P7-S2’ sequence
- the addition of forward and reverse priming sites can be done independently of a binding agent.
- the target macromolecule e.g, protein or polypeptide
- a nucleic acid molecule or a oligonucleotide e.g, DNA recording tag
- a plurality of target macromolecules in the sample is provided with recording tags.
- the recording tags may be associated or attached, directly or indirectly to the target macromolecules using any suitable means.
- the recording tags may be associated or attached, directly or indirectly to the target macromolecules prior to contacting the target with a binding agent.
- At least one recording tag is associated or co-localized directly or indirectly with the target macromolecule.
- a UMI has a different base sequence than the spacer or encoder sequences within the binding agents’ coding tags to facilitate distinguishing these components during sequence analysis.
- the recording tags associated with a library of polypeptides share a common spacer sequence.
- the recording tags associated with a library of polypeptides have binding cycle specific spacer sequences that are complementary to the binding cycle specific spacer sequences of their cognate binding agents.
- the spacer sequence in the recording tag is designed to have minimal complementarity to other regions in the recording tag; likewise, the spacer sequence in the coding tag should have minimal complementarity to other regions in the coding tag.
- the spacer sequence of the recording tags and coding tags should have minimal sequence complementarity to components such unique molecular identifiers, barcodes (e.g., compartment, partition, sample, spatial location), universal primer sequences, encoder sequences, cycle specific sequences, etc. present in the recording tags or coding tags.
- information of one or more tags are transferred to the recording tag (e.g., via primer extension or ligation) to extend the recording tag.
- the spacer sequence in the coding tag should have minimal complementarity to other regions in the coding tag.
- the spacer sequence of the recording tags and coding tags should have minimal sequence complementarity to components such unique molecular identifiers, barcodes ( e.g ., compartment, partition, sample, spatial location), universal primer sequences, encoder sequences, cycle specific sequences, etc. present in the recording tags or coding tags.
- a recording tag comprises from 5’ to 3’ direction: a universal forward (or 5’) priming sequence, information transferred from the coding tag, and a spacer sequence.
- an extended recording tag comprises from 5’ to 3’ direction: a universal forward (or 5’) priming sequence, information transferred from the coding tag, optionally other barcodes (e.g., sample barcode, partition barcode, compartment barcode, or any combination thereof), a spacer sequence, a universal reverse (or 3’) priming sequence.
- a recording tag comprises from 5’ to 3’ direction: a universal forward (or 5’) priming sequence, information transferred from the coding tag, optionally other barcodes (e.g., sample barcode, partition barcode, compartment barcode, or any combination thereof), an optional UMI, and a spacer sequence.
- the coding tag associated with the binding agent is or comprises a polynucleotide with any suitable length, e.g, a nucleic acid molecule of about 2 bases to about 100 bases, including any integer including 2 and 100 and in between, that comprises identifying information for its associated binding agent.
- a coding tag may comprise an encoder sequence or a sequence with identifying information, which is optionally flanked by one spacer on one side or optionally flanked by a spacer on each side.
- a coding tag may also be comprised of an optional UMI and/or an optional binding cycle-specific barcode.
- a coding tag may refer to the coding tag that is directly attached to a binding agent, to a complementary sequence hybridized to the coding tag directly attached to a binding agent (e.g., for double stranded coding tags), or to coding tag information present in an extended nucleic acid on the recording tag.
- a coding tag may further comprise a binding cycle specific spacer or barcode, a unique molecular identifier, a universal priming site, or any combination thereof.
- a coding tag is partially double stranded, which prevents annealing of the coding tag to internal encoder and spacer sequences in a growing extended recording tag.the binding agent.
- the identifying information from the coding tag comprises information regarding the identity of the target bound by the binding agent.
- a coding tag comprises an encoder sequence that provides identifying information regarding the associated binding agent.
- An encoder sequence is about 3 bases to about 30 bases, about 3 bases to about 20 bases, about 3 bases to about 10 bases, or about 3 bases to about 8 bases. In some embodiments, an encoder sequence is about 3 bases, 4 bases, 5 bases, 6 bases, 7 bases, 8 bases, 9 bases, 10 bases, 11 bases, 12 bases, 13 bases, 14 bases, 15 bases,
- the length of the encoder sequence determines the number of unique encoder sequences that can be generated. Shorter encoding sequences generate a smaller number of unique encoding sequences, which may be useful when using a small number of binding agents. In a specific embodiment, a set of > 50 unique encoder sequences are used for a binding agent library.
- each unique binding agent within a library of binding agents has a unique encoder sequence.
- 20 unique encoder sequences may be used for a library of 20 binding agents that bind to the 20 standard amino acids.
- 30 unique encoder sequences may be used.
- two or more different binding agents may share the same encoder sequence.
- two binding agents that each bind to a different standard amino acid may share the same encoder sequence.
- a coding tag further comprises a spacer sequence at one end or both ends.
- a spacer within a coding tag is shorter than the encoder sequence, e.g., at least 1 base, 2, bases, 3 bases, 4 bases, 5 bases, 6, bases, 7 bases, 8 bases, 9 bases, 10 bases, 11 bases, 12 bases, 13 bases, 14 bases, 15 bases, 20 bases, or 25 bases shorter than the encoder sequence.
- a spacer within a coding tag is the same length as the encoder sequence.
- the spacer is binding agent specific so that a spacer from a previous binding cycle only interacts with a spacer from the appropriate binding agent in a current binding cycle.
- a spacer sequence may be used as the primer annealing site for a primer extension reaction, or a splint or sticky end in a ligation reaction.
- a 5’ spacer on a coding tag may optionally contain pseudo complementary bases to a 3’ spacer on the recording tag to increase T.
- the coding tags within a library of binding agents do not have a binding cycle specific spacer sequence.
- two or more binding agents that each bind to different targets have associated coding tags share the same spacers.
- coding tags associated with two or more binding agents share coding tags with the same sequence or a portion thereof.
- the coding tags within a collection of binding agents share a common spacer sequence used in an assay (e.g . the entire library of binding agents used in a multiple binding cycle method possess a common spacer in their coding tags).
- the coding tags are comprised of a binding cycle tags, identifying a particular binding cycle.
- the coding tags within a library of binding agents have a binding cycle specific spacer sequence.
- a coding tag comprises one binding cycle specific spacer sequence.
- a coding tag for binding agents used in the first binding cycle comprise a “cycle 1” specific spacer sequence
- a coding tag for binding agents used in the second binding cycle comprise a “cycle 2” specific spacer sequence, and so on up to “n” binding cycles.
- coding tags for binding agents used in the first binding cycle comprise a “cycle 1” specific spacer sequence and a “cycle 2” specific spacer sequence
- coding tags for binding agents used in the second binding cycle comprise a “cycle 2” specific spacer sequence and a “cycle 3” specific spacer sequence, and so on up to “n” binding cycles.
- coding tags associated with binding agents used to bind in an alternating cycles comprises different binding cycle specific spacer sequences.
- a coding tag for binding agents used in the first binding cycle comprise a “cycle 1” specific spacer sequence
- a coding tag for binding agents used in the second binding cycle comprise a “cycle 2” specific spacer sequence
- a coding tag for binding agents used in the third binding cycle also comprises the “cycle 1” specific spacer sequence
- a coding tag for binding agents used in the fourth binding cycle comprises the “cycle 2” specific spacer sequence.
- a cycle specific spacer sequence can also be used to concatenate information of coding tags onto a single recording tag when a population of recording tags is associated with a polypeptide.
- the first binding cycle transfers information from the coding tag to a randomly-chosen recording tag, and subsequent binding cycles can prime only the extended recording tag using cycle dependent spacer sequences.
- coding tags for binding agents used in the first binding cycle comprise a “cycle 1” specific spacer sequence and a “cycle 2” specific spacer sequence
- coding tags for binding agents used in the second binding cycle comprise a “cycle 2” specific spacer sequence and a “cycle 3” specific spacer sequence, and so on up to “n” binding cycles.
- Coding tags of binding agents from the first binding cycle are capable of annealing to recording tags via complementary cycle 1 specific spacer sequences.
- the cycle 2 specific spacer sequence is positioned at the 3’ terminus of the extended recording tag at the end of binding cycle 1.
- Coding tags of binding agents from the second binding cycle are capable of annealing to the extended recording tags via complementary cycle 2 specific spacer sequences.
- the cycle 3 specific spacer sequence is positioned at the 3’ terminus of the extended recording tag at the end of binding cycle 2, and so on through “n” binding cycles.
- This embodiment provides that transfer of binding information in a particular binding cycle among multiple binding cycles will only occur on (extended) recording tags that have experienced the previous binding cycles.
- a binding agent may fail to bind to a cognate polypeptide.
- Oligonucleotides comprising binding cycle specific spacers after each binding cycle as a “chase” step can be used to keep the binding cycles synchronized even if the event of a binding cycle failure. For example, if a cognate binding agent fails to bind to a polypeptide during binding cycle 1, adding a chase step following binding cycle 1 using oligonucleotides comprising both a cycle 1 specific spacer, a cycle 2 specific spacer, and a “null” encoder sequence.
- the “null” encoder sequence can be the absence of an encoder sequence or, preferably, a specific barcode that positively identifies a “null” binding cycle.
- the “null” oligonucleotide is capable of annealing to the recording tag via the cycle 1 specific spacer, and the cycle 2 specific spacer is transferred to the recording tag.
- binding agents from binding cycle 2 are capable of annealing to the extended recording tag via the cycle 2 specific spacer despite the failed binding cycle 1 event.
- the “null” oligonucleotide marks binding cycle 1 as a failed binding event within the extended recording tag.
- a coding tag comprises a cleavable or nickable DNA strand within the second (3’) spacer sequence proximal to the binding agent.
- the 3’ spacer may have one or more uracil bases that can be nicked by uracil-specific excision reagent (USER). USER generates a single nucleotide gap at the location of the uracil.
- the 3’ spacer may comprise a recognition sequence for a nicking endonuclease that hydrolyzes only one strand of a duplex.
- the enzyme used for cleaving or nicking the 3’ spacer sequence acts only on one DNA strand (the 3’ spacer of the coding tag), such that the other strand within the duplex belonging to the (extended) recording tag is left intact.
- the enzyme used for cleaving or nicking the 3’ spacer sequence acts only on one DNA strand (the 3’ spacer of the coding tag), such that the other strand within the duplex belonging to the (extended) recording tag is left intact.
- These embodiments is particularly useful in assays analyzing proteins in their native conformation, as it allows the non-denaturing removal of the binding agent from the (extended) recording tag after primer extension has occurred and leaves a single stranded DNA spacer sequence on the extended recording tag available for subsequent binding cycles.
- the coding tags may also be designed to contain palindromic sequences. Inclusion of a palindromic sequence into a coding tag allows a nascent, growing, extended recording tag to fold upon itself as coding tag information is transferred.
- An extended recording tag can be built up from a series of binding events using coding tags comprising analyte-specific spacers and encoder sequences.
- a first binding event employs a binding agent with a coding tag comprised of a generic 3’ spacer primer sequence and an analyte-specific spacer sequence at the 5’ terminus for use in the next binding cycle; subsequent binding cycles then use binding agents with encoded analyte- specific 3’ spacer sequences.
- a pair of cognate binding agents to a particular polypeptide analyte is used in two binding cycles to identify the analyte.
- the first cognate binding agent contains a coding tag comprised of a generic spacer 3’ sequence for priming extension on the generic spacer sequence of the recording tag, and an encoded analyte-specific spacer at the 5’ end, which will be used in the next binding cycle.
- the 3’ analyte-specific spacer of the second binding agent is matched to the 5’ analyte-specific spacer of the first binding agent.
- Cross-reactive binding agents will not be able to prime extension on the recording tag, and no amplifiable extended recording tag product generated.
- This approach greatly enhances the specificity of the methods disclosed herein.
- the same principle can be applied to triplet binding agent sets, in which 3 cycles of binding are employed.
- a generic 3’ Sp sequence on the recording tag interacts with a generic spacer on a binding agent coding tag.
- Primer extension transfers coding tag information, including an analyte specific 5’ spacer, to the recording tag.
- Subsequent binding cycles employ analyte specific spacers on the binding agents’ coding tags.
- a coding tag may include a terminator nucleotide incorporated at the 3’ end of the 3’ spacer sequence. After a binding agent binds to a polypeptide and their corresponding coding tag and recording tags anneal via complementary spacer sequences, it is possible for primer extension to transfer information from the coding tag to the recording tag, or to transfer information from the recording tag to the coding tag. Addition of a terminator nucleotide on the 3’ end of the coding tag prevents transfer of recording tag information to the coding tag. It is understood that for embodiments described herein involving generation of extended coding tags, it may be preferable to include a terminator nucleotide at the 3’ end of the recording tag to prevent transfer of coding tag information to the recording tag.
- the coding tag sequence can be optimized for the particular sequencing analysis platform. Sequences comprising identifying information from the coding tag can be designed to be optimally electrically distinguishable in transit through a nanopore.
- a coding tag may further comprise a unique molecular identifier for the binding agent to which the coding tag is linked.
- a UMI for the binding agent may be useful in embodiments utilizing extended coding tags or di-tag molecules for sequencing readouts, which in combination with the encoder sequence provides information regarding the identity of the binding agent and number of unique binding events for a polypeptide.
- n-1 NTAA Removal of the n labeled NTAA by contacting with an enzyme or chemical reagents converts the n-1 amino acid of the peptide to an N-terminal amino acid, which is referred to herein as an n-1 NTAA.
- a second binding agent is contacted with the peptide and binds to the n-1 NTAA, and the second binding agent’s coding tag information is transferred to the first order extended nucleic acid thereby generating a second order extended nucleic acid (e.g ., for generating a concatenated n th order extended nucleic acid representing the peptide).
- n-2 NTAA Elimination of the n-1 labeled NTAA converts the n-2 amino acid of the peptide to an N-terminal amino acid, which is referred to herein as n-2 NTAA. Additional binding, transfer, labeling, and removal, can occur as described above up to n amino acids to generate an n th order extended nucleic acid or n separate extended nucleic acids, which collectively represent the peptide.
- an n “order” when used in reference to a binding agent, coding tag, or extended nucleic acid refers to the n binding cycle, wherein the binding agent and its associated coding tag is used or the n binding cycle where the extended nucleic acid is created (e.g. on recording tag).
- steps including the NTAA in the described exemplary approach can be performed instead with a C terminal amino acid (CTAA).
- CTCAA C terminal amino acid
- the terminal amino acid is removed or cleaved from the peptide to expose a new terminal amino acid.
- the terminal amino acid is an NTAA.
- the terminal amino acid is a CTAA.
- Cleavage of a terminal amino acid can be accomplished by any number of known techniques, including chemical cleavage and enzymatic cleavage.
- an engineered enzyme that catalyzes or reagent that promotes the removal of the PITC-derivatized or other labeled N-terminal amino acid is used.
- the terminal amino acid is removed or eliminated using any of the methods as described in International Patent Publication No. WO 2019/089846.
- cleavage of a terminal amino uses a carboxypeptidase, an aminopeptidase, a dipeptidyl peptidase, a dipeptidyl aminopeptidase or a variant, mutant, or modified protein thereof; a hydrolase or a variant, mutant, or modified protein thereof; a mild Edman degradation reagent; an Edmanase enzyme; anhydrous TFA, a base; or any combination thereof.
- the mild Edman degradation uses a dichloro or monochloro acid; the mild Edman degradation uses TFA, TCA, or DCA; or the mild Edman degradation uses triethylamine, triethanolamine, or triethylammonium acetate (Et3NHOAc).
- the reagent for removing the amino acid comprises a base.
- the base is a hydroxide, an alkylated amine, a cyclic amine, a carbonate buffer, trisodium phosphate buffer, or a metal salt.
- the hydroxide is sodium hydroxide
- the alkylated amine is selected from methylamine, ethylamine, propylamine, dimethylamine, diethylamine, dipropylamine, trimethylamine, triethylamine, tripropylamine, cyclohexylamine, benzylamine, aniline, diphenylamine, N,N- Diisopropylethylamine (DIPEA), and lithium diisopropylamide (LDA);
- the cyclic amine is selected from pyridine, pyrimidine, imidazole, pyrrole, indole, piperidine, prolidine, 1,8- diazabicyclo[5.4.0]undec-7-ene (DBU), and l,5-diazabicyclo[4.3.0]non-5-ene (DBN);
- the carbonate buffer comprises sodium carbonate, potassium carbonate, calcium carbonate, sodium bicarbonate, potassium bicarbonate, or calcium bicarbonate
- Enzymatic cleavage of a NTAA may be accomplished by an aminopeptidase or other peptidases.
- Aminopeptidases naturally occur as monomeric and multimeric enzymes, and may be metal or ATP-dependent.
- Natural aminopeptidases have very limited specificity, and generically cleave N-terminal amino acids in a processive manner, cleaving one amino acid off after another.
- aminopeptidases e.g ., metalloenzymatic aminopeptidase
- an aminopeptidase may be engineered such than it only cleaves an N-terminal amino acid if it is modified by a group such as PTC, modified-PTC, Cbz, DNP, SNP, acetyl, guanidinyl, diheterocyclic methanimine, etc.
- the aminopeptidase cleaves only a single amino acid at a time from the N-terminus, and allows control of the degradation cycle.
- the modified aminopeptidase is non-selective as to amino acid residue identity while being selective for the N-terminal label.
- the modified aminopeptidase is selective for both amino acid residue identity and the N-terminal label.
- Engineered aminopeptidase mutants that bind to and cleave individual or small groups of labelled (biotinylated) NTAAs have been described (see, PCT Publication No.
- Aminopeptidases are enzymes that cleave amino acids from the N-terminus of proteins or peptides. Natural aminopeptidases have very limited specificity, and generically eliminate N-terminal amino acids in a processive manner, cleaving one amino acid off after another (Kishor et al., 2015, Anal. Biochem. 488:6-8). However, residue specific aminopeptidases have been identified (Eriquez et al., J. Clin. Microbiol.
- Aminopeptidases may be engineered to specifically bind to 20 different NTAAs representing the standard amino acids that are labeled with a specific moiety (e.g ., PTC, DNP, SNP, etc.). Control of the stepwise degradation of the N-terminus of the peptide is achieved by using engineered aminopeptidases that are only active (e.g., binding activity or catalytic activity) in the presence of the label.
- a specific moiety e.g ., PTC, DNP, SNP, etc.
- Patent Publication No. US 2014/0273004 describes engineering aminoacyl tRNA synthetases (aaRSs) as specific NTAA binders.
- aaRSs engineering aminoacyl tRNA synthetases
- the amino acid binding pocket of the aaRSs has an intrinsic ability to bind cognate amino acids, but generally exhibits poor binding affinity and specificity. Moreover, these natural amino acid binders don’t recognize N-terminal labels. Directed evolution of aaRS scaffolds can be used to generate higher affinity, higher specificity binding agents that recognized the N-terminal amino acids in the context of an N-terminal label.
- the aminopeptidase may be engineered to be non specific, such that it does not selectively recognize one particular amino acid over another, but rather just recognizes the labeled N-terminus.
- cyclic cleavage is attained by using an engineered acylpeptide hydrolase (APH) to cleave an acetylated NTAA.
- APH engineered acylpeptide hydrolase
- amidination (guanidinylation) of the NTAA is employed to enable mild cleavage of the labeled NTAA using NaOH (Hamada, (2016) Bioorg Med Chem Lett 26(7): 1690-1695).
- the method further comprises contacting the polypeptide with a proline aminopeptidase under conditions suitable to cleave an N-terminal proline before step (b).
- a proline aminopeptidase is an enzyme that is capable of specifically cleaving an N-terminal proline from a polypeptide.
- PAP enzymes that cleave N-terminal prolines are also referred to as proline iminopeptidases (PIPs).
- Known monomeric PAPs include family members from B. coagulans, L. delbrueckii, N.gonorrhoeae, F. meningosepticum, S. marcescens, T. acidophilum, L. plantarum (MEROPS S33.001) Nakajima et al., J Bacteriol. (2006) 188(4): 1599-606; Kitazono et al., Bacteriol (1992) 174(24):7919-7925).
- PAPs including I) hansenii (Bolumar et al., (2003) 86(1-2): 141-151) and similar homologues from other species (Basten et al., Mol Genet Genomics (2005) 272(6):673-679). Either native or engineered variants/mutants of PAPs may be employed.
- carboxypeptidases may also be modified in the same fashion as aminopeptidases to engineer carboxypeptidases that specifically bind to CTAAs having a C- terminal label.
- carboxypeptidase cleaves only a single amino acid at a time from the C-terminus, and allows control of the degradation cycle.
- the modified carboxypeptidase is non-selective as to amino acid residue identity while being selective for the C-terminal label.
- the modified carboxypeptidase is selective for both amino acid residue identity and the C-terminal label.
- the extended recording tag generated from performing the provided methods comprises information transferred from one or more coding tags.
- the extended recording tags further comprise identifying information from one or more coding tags.
- the extended recording tags are amplified (or a portion thereof) prior to determining at least the sequence of the coding tag(s) in the extended recording tag.
- the extended recording tags (or a portion thereof) are released prior to determining at least the sequence of the coding tag(s) in the extended recording tag.
- the transferred secondary sequences are analyzed.
- the extended nucleic acid is any nucleic acid molecule or sequenceable polymer molecule (see, e.g., Niu et al., 2013, Nat. Chem. 5:282-292; Roy et al., 2015, Nat. Commun. 6:7237; Lutz, 2015, Macromolecules 48:4759-4767; each of which are incorporated by reference in its entirety) that comprises identifying information for a macromolecule, e.g, a polypeptide.
- the length of the final extended recording tag generated by the methods described herein is dependent upon multiple factors, including the length of the coding tag(s) (e.g ., barcode and spacer), the length of the nucleic acids (e.g, optionally including any unique molecular identifier, spacer, universal priming site, barcode, or combinations thereof).
- the tag can be capped by addition of a universal reverse priming site via ligation, primer extension or other methods known in the art.
- the universal forward priming site in the nucleic acid e.g, on the recording tag
- the universal reverse priming site is compatible with the universal reverse priming site that is appended to the final extended nucleic acid.
- a universal reverse priming site is an Illumina P7 primer (5 ’ -C AAGC AGAAGACGGC ATACGAGAT - 3’ - SEQ ID NO:2) or an Illumina P5 primer (5 ’-AATGATACGGCGACCACCGA-3 ’ - SEQ ID NO:l).
- the sense or antisense P7 may be appended, depending on strand sense of the nucleic acid to which the identifying information from the secondary or coding tag is transferred to.
- An extended nucleic acid library can be cleaved or amplified directly from the support (e.g., beads) and used in traditional next generation sequencing assays and protocols.
- a primer extension reaction is performed on a library of single stranded extended nucleic acids (e.g., extended on the recording tag) to copy complementary strands thereof.
- the polypeptide sequencing assay e.g., ProteoCode assay
- Extended nucleic acid recording tags can be processed and analysed using a variety of nucleic acid sequencing methods. In some embodiments, extended recording tags containing the information from one or more secondary or coding tags and any other nucleic acid components are processed and analysed. In some embodiments, the collection of extended recording tags can be concatenated. In some embodiments, the extended recording tag can be amplified prior to determining the sequence.
- the recording tag or extended recording tag comprises information from one or more secondary or coding tags is analysed and/or sequenced.
- the method includes analyzing the identifying information regarding the binding agent of the macromolecule analysis assay transferred to the recording tag.
- sequencing methods include, but are not limited to, chain termination sequencing (Sanger sequencing); next generation sequencing methods, such as sequencing by synthesis, sequencing by ligation, sequencing by hybridization, polony sequencing, ion semiconductor sequencing, and pyrosequencing; and third generation sequencing methods, such as single molecule real time sequencing, nanopore-based sequencing, duplex interrupted sequencing, and direct imaging of DNA using advanced microscopy.
- chain termination sequencing Sanger sequencing
- next generation sequencing methods such as sequencing by synthesis, sequencing by ligation, sequencing by hybridization, polony sequencing, ion semiconductor sequencing, and pyrosequencing
- third generation sequencing methods such as single molecule real time sequencing, nanopore-based sequencing, duplex interrupted sequencing, and direct imaging of DNA using advanced microscopy.
- Suitable sequencing methods for use in the invention include, but are not limited to, the following methods known in the art, such as sequencing by hybridization, sequencing by synthesis technology (e.g ., HiSeqTM and SolexaTM, Illumina), SMRTTM (Single Molecule Real Time) technology ( Pacific Biosciences), true single molecule sequencing (e.g.,
- Heli ScopeTM Helicos Biosciences
- massively parallel next generation sequencing e.g, SOLiDTM, Applied Biosciences; Solexa and HiSeqTM, Illumina
- massively parallel semiconductor sequencing e.g, Ion Torrent
- pyrosequencing technology e.g, GS FLX and GS Junior Systems, Roche/454
- nanopore sequence e.g, Oxford Nanopore Technologies
- a library of nucleic acids may be amplified in a variety of ways.
- a library of nucleic acids e.g., recording tags comprising information from one or more secondary or coding tags
- undergo exponential amplification e.g., via PCR or emulsion PCR.
- Emulsion PCR is known to produce more uniform amplification (Hori, Fukano et ak, Biochem Biophys Res Commun (2007) 352(2): 323-328).
- a library of nucleic acids e.g, extended nucleic acids
- the library of nucleic acids can be amplified using primers compatible with the universal forward priming site and universal reverse priming site contained therein.
- a library of nucleic acids e.g., the recording tag
- Sequences that can be added to the termini of the extended nucleic acids include library specific index sequences to allow multiplexing of multiple libraries in a single sequencing run, adaptor sequences, read primer sequences, or any other sequences for making the library of extended nucleic acids compatible for a sequencing platform.
- a 20 m ⁇ PCR reaction volume is set up using an extended nucleic acid library eluted from ⁇ 1 mg of beads ( ⁇ 10 ng), 200 mM dNTP, 1 mM of each forward and reverse amplification primers, 0.5 m ⁇ (1U) of Phusion Hot Start enzyme (New England Biolabs) and subjected to the following cycling conditions: 98° C for 30 sec followed by 20 cycles of 98° C for 10 sec, 60° C for 30 sec, 12° C for 30 sec, followed by 12° C for 7 min, then hold at 4° C.
- the library of nucleic acids can undergo target enrichment.
- target enrichment can be used to selectively capture or amplify extended nucleic acids representing macromolecules (e.g., polypeptides) of interest from a library of extended nucleic acids before sequencing.
- target enrichment for protein sequencing is challenging because of the high cost and difficulty in producing highly-specific binding agents for target proteins. In some cases, antibodies are notoriously non specific and difficult to scale production across thousands of proteins.
- the methods of the present disclosure circumvent this problem by converting the protein code into a nucleic acid code which can then make use of a wide range of targeted DNA enrichment strategies available for DNA libraries.
- peptides of interest can be enriched in a sample by enriching their corresponding extended nucleic acids.
- Methods of targeted enrichment are known in the art, and include hybrid capture assays, PCR-based assays such as TruSeq custom Amplicon (Illumina), padlock probes (also referred to as molecular inversion probes), and the like (see, Mamanova et al., (2010) Nature Methods 7: 111-118; Bodi et al., J. Biomol. Tech.
- a library of nucleic acids is enriched via a hybrid capture-based assay.
- the library of extended nucleic acids is hybridized to target-specific oligonucleotides that are labeled with an affinity tag (e.g., biotin).
- an affinity tag e.g., biotin
- Extended nucleic acids hybridized to the target-specific oligonucleotides are “pulled down” via their affinity tags using an affinity ligand (e.g., streptavidin coated beads), and background (non-specific) extended nucleic acids are washed away.
- enriched extended nucleic acids e.g., extended nucleic acids
- positive enrichment e.g, eluted from the beads.
- oligonucleotides complementary to the corresponding extended nucleic acid library representations of peptides of interest can be used in a hybrid capture assay.
- sequential rounds or enrichment can also be carried out, with the same or different bait sets.
- “tiled” bait oligonucleotides can be designed across the entire nucleic acid representation of the protein.
- primer extension and ligation-based mediated amplification enrichment can be used to select and module fraction enriched of library elements representing a subset of polypeptides.
- Competing oligonucleotides can also be employed to tune the degree of primer extension, ligation, or amplification. In the simplest implementation, this can be accomplished by having a mix of target specific primers comprising a universal primer tail and competing primers lacking a 5 universal primer tail. After an initial primer extension, only primers with the 5 universal primer sequence can be amplified. The ratio of primer with and without the universal primer sequence controls the fraction of target amplified.
- the inclusion of hybridizing but non-extending primers can be used to modulate the fraction of library elements undergoing primer extension, ligation, or amplification.
- Targeted enrichment methods can also be used in a negative selection mode to selectively remove extended nucleic acids from a library before sequencing.
- undesirable extended nucleic acids that can be removed are those representing over abundant polypeptide species, e.g., for proteins, albumin, immunoglobulins, etc.
- a competitor oligonucleotide bait hybridizing to the target but lacking a biotin moiety, can also be used in the hybrid capture step to modulate the fraction of any particular locus enriched.
- the competitor oligonucleotide bait competes for hybridization to the target with the standard biotinylated bait effectively modulating the fraction of target pulled down during enrichment.
- the ten orders dynamic range of protein expression can be compressed by several orders using this competitive suppression approach, especially for the overly abundant species such as albumin.
- the fraction of library elements captured for a given locus relative to standard hybrid capture can be modulated from 100% down to 0% enrichment.
- library normalization techniques can be used to remove overly abundant species from the extended nucleic acid library. This approach works best for defined length libraries originating from peptides generated by site-specific protease digestion such as trypsin, LysC, GluC, etc.
- normalization can be accomplished by denaturing a double-stranded library and allowing the library elements to re-anneal. The abundant library elements re-anneal more quickly than less abundant elements due to the second-order rate constant of bimolecular hybridization kinetics (Bochman, Paeschke et al. 2012).
- the ssDNA library elements can be separated from the abundant dsDNA library elements using methods known in the art, such as chromatography on hydroxyapatite columns (VanderNoot, et al., 2012, Biotechniques 53:373-380) or treatment of the library with a duplex-specific nuclease (DSN) from Kamchatka crab (Shagin et al., (2002) Genome Res. 12:1935-42) which destroys the dsDNA library elements.
- methods known in the art such as chromatography on hydroxyapatite columns (VanderNoot, et al., 2012, Biotechniques 53:373-380) or treatment of the library with a duplex-specific nuclease (DSN) from Kamchatka crab (Shagin et al., (2002) Genome Res. 12:1935-42) which destroys the dsDNA library elements.
- a library of nucleic acids is concatenated by ligation or end-complementary PCR to create a long DNA molecule comprising multiple different extended recorder tags, extended coding tags, or di-tags, respectively (Du et al., (2003) BioTechniques 35:66-72; Muecke et al., (2008) Structure 16:837-841; U.S. Patent No. 5,834,252, each of which is incorporated by reference in its entirety).
- This embodiment is preferable for nanopore sequencing in which long strands of DNA are analyzed by the nanopore sequencing device.
- nucleic acids e.g., extended nucleic acids
- the nucleic acids can be analysed directly on the support, such as a flow cell or beads that are compatible for loading onto a flow cell surface (optionally microcell patterned), wherein the flow cell or beads can integrate with a single molecule sequencer or a single molecule decoding instrument.
- a flow cell or beads that are compatible for loading onto a flow cell surface (optionally microcell patterned)
- the flow cell or beads can integrate with a single molecule sequencer or a single molecule decoding instrument.
- hybridization of several rounds of pooled fluorescently-labeled of decoding oligonucleotides (Gunderson et al., (2004) Genome Res.
- the binding agents can be labeled with cycle-specific coding tags as described above (see also, Gunderson et al., (2004) Genome Res. 14:970-7).
- the resulting sequences can be collapsed by their UMIs if used and then associated to their corresponding polypeptides and aligned to the totality of the proteome.
- Resulting sequences can also be collapsed by their compartment tags and associated to their corresponding compartmental proteome, which in a particular embodiment contains only a single or a very limited number of protein molecules. Both protein identification and quantification can easily be derived from this digital peptide information.
- the methods disclosed herein can be used for analysis, including detection, quantitation and/or sequencing, of a plurality of macromolecules simultaneously (multiplexing).
- Multiplexing refers to analysis of a plurality of macromolecules (e.g. polypeptides) in the same assay.
- the plurality of macromolecules can be derived from the same sample or different samples.
- the plurality of macromolecules can be derived from the same subject or different subjects.
- the plurality of macromolecules that are analyzed can be different macromolecules, or the same macromolecule derived from different samples.
- a plurality of macromolecules includes 2 or more macromolecules, 5 or more macromolecules, 10 or more macromolecules, 50 or more macromolecules, 100 or more macromolecules, 500 or more macromolecules, 1,000 or more macromolecules, 5,000 or more macromolecules, 10,000 or more macromolecules, 50,000 or more macromolecules, 100,000 or more macromolecules, 500,000 or more macromolecules, or 1,000,000 or more macromolecules.
- kits and articles of manufacture comprising components for preforming a binding reaction by forming a stable complex of binding agents, targets, and the stabilizing components associated.
- the kits further contain other reagents for treating and analyzing the target macromolecules (e.g., proteins, polypeptides, or peptides).
- the kits and articles of manufacture may include any one or more of the reagents and components used in the methods described in Section I and II.
- the kit comprises reagents for preparing samples for preforming the binding reaction, such as for preparing targets from a sample and joining with stabilizing components.
- the kit comprises a plurality of binding agents wherein each binding agent is associated with one or more stabilizing components.
- kits contain components for performing a binding reaction comprising contacting a binding agent with a target, wherein the binding agent and the target each comprises or is associated with a stabilizing component; allowing the binding agent to interact with a binding site located on the target; and linking the stabilizing components to form a stable complex; wherein each of the stabilizing components is attached to or associated with the binding agent and the target, respectively, at a site different from the binding site between the binding agent and the target.
- the kits optionally include instructions for performing the binding reaction.
- kits comprise one or more of the following components: binding agent(s), stabilizing component(s), linking agent(s), solid support(s), recording tag(s), reagent(s) for attaching the stabilizing components, reagent(s) for transferring information, sequencing reagent(s), and/or any reagents as described in the methods for performing the binding reaction and analyzing macromolecules (e.g ., proteins, polypeptides, or peptides), enzyme(s), buffer(s), etc.
- macromolecules e.g ., proteins, polypeptides, or peptides
- kits also include other components for treating the macromolecules (e.g., proteins, polypeptides, or peptides), preforming a binding reaction, and analysis of the same including other reagent(s) for analysis of the target.
- the macromolecules e.g., proteins, polypeptides, or peptides
- preforming a binding reaction e.g., preforming a binding reaction
- analysis of the same including other reagent(s) for analysis of the target.
- components used to prepare a reaction mixture e.g., the reaction mixture is a solution.
- the reaction mixture includes one or more of the following: stabilizing component(s), linking agent(s), solid support(s), recording tag(s), reagent(s) for attaching or associating the stabilizing components, reagent(s) for transferring information, sequencing reagent(s), binding agent(s) with associated stabilizing component s) and/or coding tag(s), buffer(s).
- kits for performing a binding reaction comprising a library of binding agents, wherein each binding agent comprises or is associated with one or more stabilizing components, and a coding tag comprising identifying information regarding the binding moiety.
- the binding moiety is capable of binding to one or more N-terminal, internal, or C-terminal amino acids of the target peptide, or capable of binding to the one or more N-terminal, internal, or C-terminal amino acids of a peptide modified by a functionalizing reagent.
- the kit also includes linking agents, wherein the linking agent comprises a chemical reagent, a non-biological reagent, a biological reagent, or a combination thereof.
- the linking agent comprises a polypeptide or protein.
- the linking agent comprises a metal ion.
- the kit includes a binding agent comprising a coding tag, which comprises identifying information regarding the binding agent; an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag, and a secondary tag.
- the binding agent is configured to bind a macromolecule associated with a recording tag and reagents for transferring information from the secondary tag from the adaptor molecule to the recording tag are also provided.
- the kit includes a plurality or set of adaptor molecules.
- the kit includes a plurality or set of binding agents.
- the set of binding agents is configured to be compatible with the set of adaptor molecules.
- provided herein are components used to prepare a reaction mixture.
- the reaction mixture is a solution.
- the reaction mixture includes one or more of the following: adaptor molecule(s), binding agent(s) and associated coding tag(s), solid support(s), recording tag(s), reagent(s) for transferring information, sequencing reagent(s), and/or buffer(s).
- the kit comprises: a binding agent comprising a coding tag, which comprises identifying information regarding the binding agent, wherein the binding agent is configured to bind a macromolecule associated with a first stabilizing component and with a recording tag joined to a support, and wherein the binding agent is associated with a second stabilizing component; the recording tag associated with the first stabilizing component; a linking agent configured to linking the first and second stabilizing components together after binding of the binding agent to the macromolecule to form a stable complex comprising the binding agent, the macromolecule and the stabilizing components.
- a binding agent comprising a coding tag, which comprises identifying information regarding the binding agent, wherein the binding agent is configured to bind a macromolecule associated with a first stabilizing component and with a recording tag joined to a support, and wherein the binding agent is associated with a second stabilizing component; the recording tag associated with the first stabilizing component; a linking agent configured to linking the first and second stabilizing components together after binding of the binding agent to the macromolecule to form a stable complex comprising the binding agent
- the kit may also comprise an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag, and a secondary tag, wherein an information of the secondary tag is configured for transfer from the adaptor molecule to the recording tag to generate an extended recording tag after the coding tag associated with the binding agent hybridizes with the first hybridization sequence on the adaptor molecule.
- kits and articles of manufacture further comprise a plurality of nucleic acid molecules or oligonucleotides.
- the kits include a plurality of barcodes.
- the barcode(s) may include a compartment barcode, a partition barcode, a sample barcode, a fraction barcode, or any combination thereof.
- the barcode comprises a unique molecule identifier (UMI).
- the barcode comprises a DNA molecule, DNA with pseudo-complementary bases, an RNA molecule, a BNA molecule, an XNA molecule, a LNA molecule, a PNA molecule, a gRNA molecule, a non-nucleic acid sequenceable polymer, e.g ., a polysaccharide, a polypeptide, a peptide, or a polyamide, or a combination thereof.
- the barcodes are configured to attach the target macromolecules, e.g. , the proteins, in the sample or to attach to nucleic components associated with the targets.
- the kit further comprises reagents for treating the target macromolecules, e.g. , the proteins. Any combination of fractionation, enrichment, and subtraction methods, of the proteins may be performed.
- the reagent may be used to fragment or digest the proteins.
- the kit comprises reagents and components to fractionate, isolate, subtract, enrich proteins.
- the kits further comprises a protease such as trypsin, LysN, or LysC.
- the kit comprises a support for immobilizing the one or more targets and reagents for immobilizing the target on a support.
- the kit also comprises one or more buffers or reaction fluids necessary for any of the binding reaction to occur. Buffers including wash buffers, reaction buffers, and binding buffers, elution buffers and the like are known to those or ordinary skill in the arts.
- the kits further include buffers and other components to accompany other reagents described herein.
- the reagents, buffers, and other components may be provided in vials (such as sealed vials), vessels, ampules, bottles, jars, flexible packaging (e.g, sealed Mylar or plastic bags), and the like. Any of the components of the kits may be sterilized and/or sealed.
- the kit includes one or more reagents for nucleic acid sequence analysis.
- the reagent for sequence analysis is for use in sequencing by synthesis, sequencing by ligation, single molecule sequencing, single molecule fluorescent sequencing, sequencing by hybridization, polony sequencing, ion semiconductor sequencing, pyrosequencing, single molecule real-time sequencing, nanopore-based sequencing, or direct imaging of DNA using advanced microscopy, or any combination thereof.
- the subject kits may further include instructions for using the components of the kit to practice the subject methods, i.e., instructions for sample preparation, treatment and/or analysis.
- the instructions are directed to methods of performing the binding reaction with target macromolecules (e.g ., proteins, polypeptides, or peptides).
- target macromolecules e.g ., proteins, polypeptides, or peptides.
- the kits described herein may also include other materials desirable from a commercial and user standpoint, including other buffers, diluents, filters, syringes, and package inserts with instructions for performing any methods described herein.
- kit components any of the above-mentioned kit components, and any molecule, molecular complex or conjugate, reagent (e.g., chemical or biological reagents), agent, structure (e.g, support, surface, particle, or bead), reaction intermediate, reaction product, binding complex, or any other article of manufacture disclosed and/or used in the exemplary kits and methods, may be provided separately or in any suitable combination in order to form a kit.
- reagent e.g., chemical or biological reagents
- agent e.g., chemical or biological reagents
- structure e.g, support, surface, particle, or bead
- reaction intermediate reaction product, binding complex, or any other article of manufacture disclosed and/or used in the exemplary kits and methods
- a method for performing a binding reaction comprising: contacting a binding agent with a target, wherein the binding agent and the target each comprises or is associated with a stabilizing component; allowing the binding agent to interact with a binding site located on the target; and linking the stabilizing components to form a stable complex comprising the binding agent, the target and the stabilizing components; wherein the stabilizing components are attached to or associated with the binding agent and the target, respectively, at a site different from the binding site between the binding agent and the target.
- the stabilizing components each comprises a biological molecule, a chemical molecule, a small molecule or a combination thereof.
- the linking agent comprises at least one polynucleotide or nucleic acid comprising a sequence which hybridizes to at least one of the stabilizing components.
- the support is a three- dimensional support (e.g., a porous matrix or a bead).
- the support is a polystyrene bead, a polyacrylate bead, a polymer bead, an agarose bead, a cellulose bead, a dextran bead, an acrylamide bead, a solid core bead, a porous bead, a paramagnetic bead, a glass bead, a controlled pore bead, a silica-based bead, or a combination thereof.
- the recording tag is a DNA molecule, an RNA molecule, a PNA molecule, a BNA molecule, an XNA, molecule, an LNA molecule, a gRNA molecule, or a combination thereof.
- the destabilizing agent comprises heat, a denaturing agent, an enzyme, or a competitor molecule.
- a method for analyzing a macromolecule comprising the steps of:
- binding agent capable of binding to the macromolecule
- the binding agent comprises a coding tag with identifying information regarding the binding agent and the binding agent and the macromolecule each comprises or is associated with a stabilizing component
- linking agent comprises a chemical reagent, a non-biological reagent, a biological reagent, or a combination thereof.
- linking agent comprises a polypeptide or a protein.
- the linking agent comprises a metal ion.
- the stabilizing components each comprises a biological molecule, a chemical molecule, a small molecule or a combination thereof.
- linking agent comprises at least one polynucleotide or nucleic acid comprising a sequence which hybridizes to at least one of the stabilizing components.
- any one of embodiments 47-72, wherein the support is a three- dimensional support (e.g., a porous matrix or a bead).
- the support is a polystyrene bead, a polyacrylate bead, a polymer bead, an agarose bead, a cellulose bead, a dextran bead, an acrylamide bead, a solid core bead, a porous bead, a paramagnetic bead, a glass bead, a controlled pore bead, a silica-based bead, or a combination thereof.
- the recording tag is a DNA molecule, an RNA molecule, a PNA molecule, a BNA molecule, an XNA, molecule, an LNA molecule, a gRNA molecule, or a combination thereof.
- N-terminal amino acid (NTAA) of the protein or peptide (e) removing the N-terminal amino acid (NTAA) of the protein or peptide to expose a new NTAA of the protein or peptide.
- nucleic acid sequencing method is sequencing by synthesis, sequencing by ligation, sequencing by hybridization, polony sequencing, ion semiconductor sequencing, or pyrosequencing.
- nucleic acid sequencing method is single molecule real-time sequencing, nanopore-based sequencing, or direct imaging of DNA using advanced microscopy.
- the binding agent is an aminopeptidase or variant, mutant, or modified protein thereof; an aminoacyl tRNA synthetase or variant, mutant, or modified protein thereof; an anticalin or variant, mutant, or modified protein thereof; a ClpS, ClpS2, or variant, mutant, or modified protein thereof; a UBR box protein or variant, mutant, or modified protein thereof; or a modified small molecule that binds amino acid(s), i.e. vancomycin or a variant, mutant, or modified molecule thereof; or an antibody or binding fragment thereof; or any combination thereof.
- a method for analyzing a macromolecule comprising the steps of:
- binding agent capable of binding to the macromolecule, wherein the binding agent comprises or is associated with a second stabilizing component
- the linking agent comprises a polypeptide.
- the binding agent is fluorescently labeled to enable detection of the contact between the macromolecule and the binding agent; and analyzing the macromolecule comprises detecting fluorescence from the binding agent after contacting the macromolecule.
- first or second stabilizing component comprises a polynucleotide
- the linking agent comprises a linking polynucleotide that hybridizes to the polynucleotide of one of the stabilizing components.
- 106 The method of any one of embodiments 97-105, wherein the macromolecule comprises a polypeptide and the binding agent or a binding agent from the plurality of binding agents is capable of binding to a N-terminal amino acid (NTAA) of the polypeptide or to a modified NTAA of the polypeptide.
- NTAA N-terminal amino acid
- analyzing the macromolecule comprises identifying at least one amino acid residue of the polypeptide.
- providing a macromolecule comprises providing the polypeptide associated with a recording tag; the binding agent or each binding agent from the plurality of binding agents comprises or is associated with a coding tag with identifying information regarding the binding agent; obtaining an information about the binding agent comprises transferring an information from the coding tag to the recording tag after binding of the binding agent to the macromolecule to generate an extended recording tag; and identifying at least one amino acid residue of the polypeptide comprises analyzing the extended recording tag.
- the method of embodiment 108 further comprising: providing an adaptor molecule comprising a first hybridization sequence and a secondary tag, wherein the first hybridization sequence is substantially complementary to at least a portion of the coding tag, to allow hybridization between the first hybridization sequence and the coding tag; and transferring information of the secondary tag to the recording tag to generate an extended recording tag, wherein the information of the secondary tag is transferred from the adaptor molecule to the recording tag after the coding tag associated with the binding agent hybridizes with the first hybridization sequence on the adaptor molecule.
- the transferring information comprises contacting the coding tag with a reagent for transferring the identifying information, the reagent comprising a reagent for primer extension reaction, a chemical ligation reagent or a biological ligation reagent.
- a kit for analyzing a macromolecule comprising: a binding agent comprising a coding tag, which comprises identifying information regarding the binding agent, wherein the binding agent is configured to bind a macromolecule associated with a first stabilizing component and with a recording tag joined to a support, and wherein the binding agent is associated with a second stabilizing component; the recording tag associated with the first stabilizing component; a linking agent configured to linking the first and second stabilizing components together after binding of the binding agent to the macromolecule to form a stable complex comprising the binding agent, the macromolecule and the stabilizing components.
- kit of embodiment 117 wherein the kit comprises a plurality of binding agents and wherein at least one binding agent of the plurality of binding agents is capable of binding to the macromolecule and each binding agent of the plurality of binding agents comprises or is associated with the second stabilizing component.
- kits of any one of embodiments 117-119, wherein the coding tag and/or the recording tag comprises a unique molecular identifier (UMI) or a barcode sequence.
- UMI unique molecular identifier
- kit of any one of embodiments 117-120 further comprising: an adaptor molecule comprising a first hybridization sequence substantially complementary to at least a portion of the coding tag, and a secondary tag, wherein an information of the secondary tag is configured for transfer from the adaptor molecule to the recording tag to generate an extended recording tag after the coding tag associated with the binding agent hybridizes with the first hybridization sequence on the adaptor molecule.
- a method for analyzing a macromolecule comprising the steps of:
- step (b) contacting the macromolecule with a binding agent capable of binding to the macromolecule, wherein the binding agent comprises a coding tag with identifying information regarding the binding agent, to allow binding between the macromolecule and the binding agent; (c) providing an adaptor molecule comprising a first hybridization sequence and a secondary tag, wherein the first hybridization sequence is substantially complementary to at least a portion of the coding tag, to allow hybridization between the first hybridization sequence and the coding tag, wherein step (c) is performed before, after or simultaneously with step (b);
- step (b) comprises contacting a plurality of macromolecules with a plurality of binding agents and step (c) comprises providing a plurality of adaptor molecules, wherein the plurality of adaptor molecules comprises at least one adaptor molecule capable of hybridizing to at least one coding tag associated with the binding agent.
- the adaptor molecule further comprises a second hybridization sequence substantially complementary to a sequence at the 3’ terminus of the recording tag or substantially complementary to a region on the recording tag generated from a previous information transfer of the secondary tag from the adaptor molecule to the recording tag, and wherein information transfer of the secondary tag from the adaptor molecule to the recording tag occurs after: the first hybridization sequence on the adaptor molecule hybridizes to the coding tag of the binding agent; and the second hybridization sequence of the adaptor molecule hybridizes to a portion of the recording tag.
- a pin-based magnetic particle processor (Kingfisher, Thermo) was used for unit- automation of the panning procedure, which enables the handling of 96 magnetic pins, corresponding to the positions of a 96-well microtitre plate, essentially as described in Zoltan Konthur et al., Semi-automated Magnetic Bead-Based Antibody Selection from Phage Display Libraries, Springer Protocols Handbook, Antibody Engineering, pp 267-287, 2010, DOI 10.1007/978-3-642-01144-3 18, which is incorporated herein by reference.
- phage libraries were panned against different NTAA target peptides. Clones from the panning output were isolated and characterized using a panel of peptides in a multiplex Luminex binding assay. Specific binders were isolated against a variety of modified and non-modified NTAAs.
- a highly-selective engineered ClpS2 variant for a NTAA F binder (binder specifically recognizing F at the N-terminus of a polypeptide) was obtained.
- Agrobacterium tumefaciens ClpS2 (4YJM, starting scaffold MSDSPVDLKPKPKVKPKLERPKLYKVMLLNDDYTPREFVTVVLKAVFRMSEDTGRRV MMTAHRFGSAVVVVCERDIAETKAKEATDLGKEAGFPLMFTTEPEE as set forth in SEQ ID NO: 6) was cloned into a phage display vector with PelB leader sequence at N-terminus.
- lipocalins were used as starting scaffolds for directed evolution toward modified NTAAs by phage display technology.
- Anticalins have an intrinsic cup-like binding pocket, highly stable structure, good recombinant expression in E. coli ., binding pocket evolvability using phage display, and demonstrated potential for strong and specific binding to small molecules.
- Many anticalins have an intrinsic ability to bind a modified- dipeptide residue.
- N-terminal modifier agents M such that when combined with the PI amino acid (N-terminal residue), the M-Pl moiety occupies the anticalin b -barrel core, with the PI sidechain oriented closer to the surface of the pocket.
- P2 residue (penultimate residue) of the peptide to be located just outside the pocket or affinity determining region and contribute less energy to binding.
- Pyrazole methanimine (PMI) is used as the N-terminal modifier agent.
- Two selective engineered NTAA binders were obtained based on lipocalin/anticalin scaffolds. 31-F binder specifically recognizing F at the N-terminus of a polypeptide was obtained from the following starting scaffold, SEQ ID NO: 7:
- Example 2 Stable binding reaction performed in a polypeptide analysis assay.
- This example describes a binding reaction performed to form a stable complex in a polypeptide analysis assay which involves information transfer for encoding amino acid sequence information of the target peptide into DNA sequence of an extended recording tag.
- Target peptides attached to corresponding barcodes were joined to immobilized, bead-attached nucleic acid recording tags containing a biotin molecule (as the stabilizing component) at its 5’ end (FIG. 3A).
- the target peptides assessed included two peptides with an N-terminal amino phenylalanine (F) (“FA”, contained same peptides but different DNA barcode) and three peptides with an N-terminal alanine (A) (“AA” and “AFA”).
- a recording tag only control (“RT”) was also performed which did not have a target peptide joined to the recording tag.
- F-binding agents configured to recognize peptide with a N-terminal phenylalanine were conjugated with nucleic acids (coding tag and DNA for associating with the biotin as the stabilizing component) and linker components.
- the nucleic acid associated with the binding agent contained a barcode (BC’) with identifying information regarding the binding agent flanked by two spacer (SP’) sequences useful for hybridization during information transfer extension reactions.
- BC barcode
- SP spacer
- the coding tag specific for each binder is attached to SpyTag via a PEG linker, and the resulting SpyTag-PL’ is conjugated to binder-SpyCatcher fusion protein, so the corresponding stabilizing component contains DNA-PEG Linker- Sp’ -Coding Tag BC’-Sp’.
- the immobilized recording tags and target peptides were pre-washed with 0.1 M NaOH and 0.1% Tween20, 2 times of PBS+0.1% Tween20, incubated with PierceTM Protein- Free T20 (PBS) Blocking Buffer (Thermo Scientific, Cat# 37573) at 37°C for 15 minutes, and washed two times with PBST (PBS+0.1% Tween20). After the washes, 200 nM of the DNA- conjugated F-binding agent and 300 nM of nucleic acids joined to a biotin which is complementary to the stabilizing component DNA was provided at 25°C for 30 min, as shown in FIG. 3B.
- 3C-3D As a negative control, PBST was added to the reaction instead of neutravidin and streptavidin. In the encoding control condition, the encoding mixture was added at the step that the linking agent was added in the other samples before the long and stringent wash with PBST at 37°C for 45 minutes instead of the later step as described above.
- streptavidin and neutravidin provided as the linking agent showed encoding efficiencies at least comparable to the encoding control where encoding is performed before the long and stringent wash (PBST at 37°C for 45min), serving as a control for the encoding/information transfer reaction.
- PBST at 37°C for 45min long and stringent wash
- biotin molecules stabilizing components
- This example describes an exemplary assay system including information transfer using a splint adaptor molecule containing a first hybridization sequence complementary to a region on the coding tag and a second hybridization sequence complementary to a region on the recording tag.
- Phosphorylated DNA recording tag was attached to three different peptides, with an amino-terminal sequence of FA, AFA and AA respectively (FA-peptide: FAGVAMPGAEDDVVGSGSGK as set forth in SEQ ID NO: 3; AFA-peptide: AFAGVAMPGAEDDVVGSGSK as set forth in SEQ ID NO: 4; AA-peptide: AAGVAMPGAEDDVVGSGSK as set forth in SEQ ID NO: 5).
- the DNA recording tag with no peptide attached was also used.
- the peptide-DNA conjugates and no-peptide DNA recording tags were immobilized on magnetic beads (Dynabeads, Thermo Fisher, USA).
- an oligo binder that is configured to hybridize to a sequence at the 5’ end of the recording tag was used (FIG. 8).
- the oligo binder contained a nucleic acid coding tag containing a barcode.
- Two different coding tag sequences were tested.
- a DNA splint adaptor molecule was introduced that contained a first hybridization sequence complementary to the coding tag associated with the oligo binder and a second hybridization sequence that is complementary to the spacer region (Sp’) in the recording tags (FIG. 8).
- splint adaptor molecules comprising a first hybridization sequence that contained a 3 nucleotide mismatch, a 4 nucleotide mismatch, or a 7 nucleotide (full) mismatch, or a second hybridization sequence that contained a 4 nucleotide mismatch, or a 8 nucleotide (full) mismatch, were used.
- oligo binder and DNA-peptide chimera immobilized beads was incubated with the splint adaptor molecule at 37°C for 30 minutes, then at room temperature for 15 minutes. After a 1 -minute wash in PBST (PBS+ Tween20), the beads were incubated with encoding mixture containing 50 mM Tris-HCl, pH7.5, 2 mM MgSCE, 50 mM NaCl, 1 mM DTT, 0.1% Tween 20, 0.1 mg/mL BSA, 0.125 mM dNTPs, 0.125 units/pL Klenow fragment (3 ’->5’ exo-) (MCLAB, USA) at 37°C for 5 minutes.
- PBST PBS+ Tween20
- the beads were washed once with PBST with 10% formamide, once with 0.1 M NaOH and once with PBST with 10% formamide. The resulting beads were resuspended in PBST.
- the information of splint adaptor molecule (corresponding to the coding tag) was transferred to the recording tag, thereby generating an extended recording tag.
- the extended recording tag of the assay was subjected to qPCR with corresponding primers.
- NGS next-generation sequencing
- the recoding tags were capped to add a universal priming sequence. Capping was done using an extension reaction.
- the extended recording tags of the assay were subjected to PCR amplification and analyzed by NGS. As shown in Table 2, information transfer from the splint adaptor molecule to the recording tag was observed when the splint adaptor contained 0 mismatched nucleotides with the coding tag. In comparison, low signal was observed when mismatches were present. This was also the case when the splint adaptor molecule was not provided (negative control) or when the Klenow polymerase was not provided (negative control). The data indicates that hybridization between the sequence on the splint adaptor molecule and corresponding complementary region on the coding tag and the recording tag was sufficient to facilitate transfer of information from the splint adaptor molecule, resulting in an extended recording tag.
- Example 4 Exemplary assay showing information transfer via an adaptor molecule.
- This example describes a specific embodiment for information transfer between a peptide molecule conjugated to a recording tag and a DNA-conjugated binder molecule conjugated that recognizes an N-terminal amino acid of the peptide molecule (Fig. 9).
- Information transfer occurs via a splint adaptor molecule containing a first hybridization sequence (PLE) complementary to a region on the coding tag (PL1), followed by a PEG-based linker, a spacer sequence (Sp’), a barcode sequence (BC’) and another spacer sequence (Sp’) complementary to a region on the recording tag (Sp) (Fig. 9A).
- PLE first hybridization sequence
- Sp spacer sequence
- BC barcode sequence
- Sp spacer sequence
- RT-peptide chimeras were created by first “activating” the 5’ amine on the RT oligonucleotides by coupling to TCO- PEG12-NHS ester (Click Chemistry Tools).
- the RT oligonucleotide designed with an internal alkyne group, was coupled to azide-containing FA, AA and AF peptides (FA-peptide or FA-PA: FAGVAMPGAEDDVVGSGSGK as set forth in SEQ ID NO: 3; AFA-peptide or AFA-PA: AFAGVAMPGAEDDVVGSGSK as set forth in SEQ ID NO: 4; AA-peptide or AA-PA: AAGVAMPGAEDDVVGSGSK as set forth in SEQ ID NO: 5).
- FA-peptide or FA-PA FAGVAMPGAEDDVVGSGSGK as set forth in SEQ ID NO: 3
- AFA-peptide or AFA-PA AFAGVAMPGAEDDVVGSGSK as set forth in SEQ ID NO: 4
- AA-peptide or AA-PA AAGVAMPGAEDDVVGSGSK as set forth in SEQ ID NO: 5).
- the four chimeras that contain FA peptide, AA peptide, AF peptide or no peptide were combined and immobilized to mTet beads using iEDDA TCO-mTet chemistry.
- This 4-plex model system has been demonstrated an intra-molecular single molecule binding and specific encoding of an engineered F-binder on low recording tag density beads (1:10,000 and 1:100,000), see US 20200348308 Al. Absolute loading of the four different chimeras on beads was measured by an universal PA antibody since all three peptide types contained a PA antigen sequence, and all four chimeras were loaded in roughly equal amounts on the beads.
- a hybridization-based immobilization was employed as disclosed in WO 2020/223000.
- the four RT -peptide chimeras were hybridized and ligated to hairpin capture DNAs attached to magnetic beads (Dynabeads, Thermo Fisher, USA). These capture beads were generated by attachment of hairpin capture nucleic acids to the beads using trans- cyclooctene (TCO) and methyltetrazine (mTet)-based click chemistry.
- TCO trans- cyclooctene
- mTet methyltetrazine
- TCO-modified short hairpin capture nucleic acids (16 basepair stem, 5 base loop, 24 base 5’ overhang) were reacted with mTet-coated magnetic beads.
- phosphorylated RT-peptide chimeras (10 nM) were annealed to the hairpin DNAs attached to beads in 5x SSC, 0.02% SDS, and incubated for 30 minutes at 37 °C.
- the beads were washed once with PBST and resuspended in lx Quick ligation solution (New England Biolabs, USA) with T4 DNA ligase.
- RNA samples were washed three times with 0.1 M NaOH+0.1% Tween 20 and three times with PBST.
- the total immobilized RT- peptide chimeras were quantified by qPCR using specific primer sets.
- peptides can be immobilized onto beads using a non-hybridization based method that did not involve a ligation step.
- the non-hybridization based method can be performed by incubating 30 mM TCO-modified DNA-tagged peptides including amino FA-terminal peptides, amino AFA- terminal peptides, and amino AA-terminal peptides, with mTet-coated magnetic beads overnight at 25 °C as described in US 20200348308 Al.
- a binding and encoding assay was performed utilizing splint adaptor molecules as shown in Fig. 9A.
- a specific F-binder engineered from ClpS2 obtained by Phage display library screening as disclosed in the previous example conjugated with two different coding tags - hybridization sequences Payload seq 1 (PL1) and Payload seq 1 (PL2) was used for the assay.
- PL1 and PL2 were designed as random sequences that contain -50% CG and do not interact with corresponding barcode and spacer sequences on the adaptor molecules.
- the complementary sequences of PL1 and PL2 (am-PLU and am-PL2’, SEQ ID NOs: 16-17) were attached to SpyTag via a PEG linker, and the resulting SpyTag-PLU and SpyTag-PL2’ were conjugated to F binder- SpyCatcher fusion protein, thus creating F binder-PLl’ and F binder-PL2’ fusions.
- the F binder-PLl’ and F binder-PL2’ were mixed with splint adaptor molecules, CT PL1 S6 and CT PL2 S6 (SEQ ID NOs: 18-19), respectively, in 1:4 molar ratio, forming two binder mixes.
- Both adaptor molecules also include the C3 spacer at the 3’ terminus of the oligonucleotide.
- the C3 spacer is a three carbon spacer.
- the C3 spacer located at the 3’ terminus of the oligonucleotide prohibits extension of the oligonucleotide by a polymerase and prevents ligation at the 3’ end.
- splint adaptor molecules as shown in Fig. 9A allows for hybridization between splint adaptor molecule, coding tag and recording tag, providing conditions for transfer information between the coding tag and the recording tag.
- the bead-immobilized RT -peptide chimeras were incubated with 200 nM of each binder mix in 150 pL of PBST (PBS+ Tween20) at room temperature.
- the beads were washed twice with 200 pL of PBST with 500 mM NaCl and resuspended in encoding mixture containing 50 mM Tris-HCl, pH7.5, 2 mM MgSCE, 50 mM NaCl, 1 mM DTT, 0.1% Tween 20, 0.1 mg/mL BSA, 0.125 mM dNTPs, 0.125 units/pL Klenow fragment (3 ’->5’ exo-) (MCLAB, USA) at 37 °C for 5 minutes. The beads were washed once with 0.1 M NaOH+0.01% Tween 20 and twice with PBST.
- the information of splint adaptor molecule (corresponding to the coding tag) was transferred to the recording tag, thereby generating an extended recording tag.
- NGS next-generation sequencing
- the recoding tags were capped to add a universal priming sequence. Capping was done using an extension reaction.
- the extended recording tags of the assay were subjected to PCR amplification and analyzed by NGS. Encoding yield on each peptide was obtained by evaluating percentage of recoding tags containing adaptor molecule’s barcode information that was transferred during the assay.
- a high encoding yield during the assay was observed only for the target peptide FA-PA having F as the N-terminal amino acid (Fig. 9B), showing specificity of the information transfer for the cognate F-binder fusions.
- the splint adapter molecule can be annealed to the binder-peptide complex post binding and washing.
- Example 5 Exemplary multi-cycle assay showing information transfer via an adaptor molecule.
- An exemplary two-cycle encoding assay is performed as follows.
- the first cycle encoding is performed as described in Example 3.
- Chimeric molecules comprised of peptide- DNA recording tags are immobilized on magnetic beads as described in Example 3.
- a mixture of several N-terminal modified selective binders is used in the assay to interact with an immobilized polypeptide; each binder comprises an engineered protein specifically recognizing a particular N-terminal amino acid of the polypeptide and associated with a coding tag containing information about the binder.
- the coding tag specific for each binder (PL’) is attached to SpyTag via a PEG linker, and the resulting SpyTag-PL’ is conjugated to binder- SpyCatcher fusion protein.
- a set of 20 exemplar coding tag adapter sequences (PL’ sequences) of 20 nucleotides in length include the following sequences (SEQ ID NOs: 20-39) based on Elmas, A., et al. (2013).
- the binder-PL’ conjugates are mixed with corresponding adaptor molecules, containing a complementary PL sequences, a spacer and a unique barcode (BC’), in 1 :4 molar ratio.
- the binder-specific coding tag adapter sequences are comprised of 15 to 30-mer barcodes which have been designed to hybrid efficiently and orthogonally to other members barcodes. Exemplar approaches to generating orthogonal hybridizing sets of barcodes (20-25 nt.) are disclosed in: Elmas, A., et al. (2013). "Designing DNA Barcodes Orthogonal in Melting Temperature by Simulated Annealing Optimization.” Nucleic acid therapeutics 23: 140-151; Pierce, S. E., et al. (2006).
- the barcodes can be further filtered to remove cross-reactivity with other DNA components of the ProteoCode system including universal priming sequences, spacer sequences, recording tag barcodes, etc.
- Both adaptor molecules also include the 3SpC3 spacer at the 3’ terminus of the oligonucleotide.
- the C3 spacer is a three carbon spacer.
- the C3 spacer located at the 3’ terminus of the oligonucleotide prohibits extension of the oligonucleotide by a polymerase and prevents ligation at the 3’ end.
- the beads are washed twice with 200 pL PBS-T with 500 mM NaCl, and resuspended in encoding mixture containing 50 mM Tris-HCl, pH7.5, 2 mM MgS04, 50 mM NaCl, 1 mM DTT, 0.1% Tween 20, 0.1 mg/mL BSA, 0.125 mM dNTPs, 0.125 units/uL Klenow fragment (3’-5’ exo-) (MCLAB, USA) at 37 °C for 5 minutes.
- the beads are washed once with 0.1 M NaOH + 0.01% Tween 20 and twice with PBS-T.
- the information regarding this binder is transferred from binding-specific barcode in the adaptor molecule to the recording tag (becomes encoded in the recording tag).
- the adaptor molecules may also contain a cycle-specific barcode (in this case a unique barcode for the first cycle) that will be also encoded in the recording tag after information transfer.
- the N-terminal amino acid of the polypeptide is cleaved off by mild Edman-like methods described, for example in US 20200348307 A1 or WO 2020223133 Al.
- the N-terminal amino acid of the polypeptide is cleaved off enzymatically by engineered cleavases described, for example in WO 2020198264 Al.
- the cleavage exposes a new N-terminal amino acid of the polypeptide, and the system is ready for the second cycle.
- the same set of binder coding tag combinations can be used (for example, binders A and B), but with a new set of adaptor molecules.
- a mixture will contain 200 nM binder A- PLAU+ adaptor A_2 nd cycle (GGCTGTTTGTGGCTCTACCA— -AG AG AT GGC AC GT GGT A AGAG, SEQ ID NO: 42) and 200 nM binder B-PL2’+ adaptor molecule2_2 nd cycle (T GG AG AT C GTT GC T T GT ACC — AG AG AT GGT GC GT GGT A AG AG, SEQ ID NO: 43) in PBS-T.
- Both adaptor molecules (adaptor moleculel_2 nd cycle and adaptor molecule2_2 nd cycle) also include the 3SpC3 spacer at the 3’ terminus of the oligonucleotide.
- Adaptor molecules used for the second cycle will contain a cycle-specific barcode unique for the second cycle.
- the encoding conditions are the same for the first and the second cycle.
- either the third cycle can be done (following the NTAA cleavage) with new adaptor molecules containing a cycle-specific barcode unique for the third cycle, or the extended recording tag is capped by addition of a universal priming sequence to finish the encoding process.
- the capped recording tag is subjected to PCR amplification followed by sequencing and analysis using a next-generation sequencing method.
- the described approach overcomes a necessity to make and use a new pool of binder fusions for each cycle of encoding; rather, a new pool of cycle-specific adapter molecules is used for each cycle together with a single universal set of coding tag-conjugated binders.
- Example 6 Stable binding reaction performed in a polypeptide analysis assay.
- This example describes “stabilizing” reaction immediately after a binding/washing event to form a stable complex of the binder with the polypeptide.
- the “stabilized” binding event enables robust information transfer without concern of binder dissociating from the polypeptide during encoding. This is particularly useful in a couple of different embodiments: 1) when using binders with appreciable off rates, and 2) when encoding using adapter molecules in which the adapter molecules are hybridized to the “clamped” (stabilized via use of stabilizing components) binder, stringent washing performed, and the extension step performed without regard to binder dissociation.
- a model bead set comprised of target polypeptides attached to corresponding barcodes are immobilized to hairpin nucleic acid recording tags containing a desthiobiotin (DSB) molecule (as the stabilizing component) at its 5’ end.
- the target polypeptides include two polypeptides with an N-terminal phenylalanine (F) (“FA”, containing the same polypeptides but different DNA barcode) and three polypeptides with an N-terminal alanine (A) (“AA” and “AFA”).
- F N-terminal phenylalanine
- A N-terminal alanine
- a recording tag only control (“RT”) is also performed devoid of a target peptide.
- F- binding agents recognizing an N-terminal phenylalanine residue are conjugated with “hybridizable” DNA coding tags.
- the nucleic acid associated with the binding agent contains a barcode (BC’) with identifying information regarding the binding agent.
- the “hybridizable” coding tag barcodes for the binding agents are designed to be 15-25 mers DNA sequences with good hybridization properties and minimal cross-hybridization between the barcodes as described in (Elmas, A., et al. (2013). "Designing DNA Barcodes Orthogonal in Melting Temperature by Simulated Annealing Optimization.” Nucleic acid therapeutics 23: 140-151).
- the immobilized recording tags and target polypeptides are pre-washed with 0.1 M NaOH and 0.1% Tween 20, washed two times with PBST (PBS+0.1% Tween 20), incubated with PierceTM Protein-Free T20 (PBS) Blocking Buffer (Thermo Scientific, Cat# 37573) at 37°C for 15 minutes, and washed two times with PBST. After the washes, 200 nM of the DNA- conjugated F-binding agent and 300 nM of a biotinylated splint adapter, complementary to a region of the binder coding tag, is added to the beads and incubated at 25°C for 30 min.
- the beads are then incubated with the extension mix for 5 minutes (0.125 U/pL Klenow fragment (3 ’->5’ exo-), 125 uM dNTP mixture (dATP, dCTP, dGTP, and dTTP), 50 mM Tris-HCl (pH, 7.5), 2 mM MgS0 4 , 50 mMNaCl, 1 mM DTT, 0.1% Tween 20, and 0.1 mg/mL BSA) to copy the information from the adapter molecule to the recording tag resulting in the extended recording tag having BC and Sp at the 3’ end.
- 125 U/pL Klenow fragment 3 ’->5’ exo-
- 125 uM dNTP mixture dATP, dCTP, dGTP, and dTTP
- 50 mM Tris-HCl pH, 7.5
- 2 mM MgS0 4 50 mMNaCl
- 1 mM DTT 0.1% Tween 20
- the linking agent (SA), adapter molecule, and binding agents are removed by incubation in 4 mM biotin in 0.1 M NaOH/0.1% Tween 20 for 10 min.
- This step takes advantage of the particular setup, in which the first stabilizing component (DSB) has a lower affinity to the linking agent (SA) in comparison to an affinity of the second stabilizing component (biotin) to the linking agent. This allows for efficient disruption of the stable complex before commencing the next encoding cycle.
- One or more additional encoding cycles can be performed similarly as described above.
- a nucleic acid (the capping oligonucleotide set forth in SEQ ID NO: 44) was added into the encoding mixture and incubated at 25°C for 10 minutes to add a universal priming sequence to the recording tags (extended or unextended) using an extension reaction to generate a final product for NGS readout.
- the extended recording tags of the assay were subjected to PCR amplification and analyzed by next- generation sequencing (NGS).
- Example 7 Use of stabilizing components for efficient encoding of different binders.
- Target peptides attached to corresponding barcodes were joined to immobilized, bead-attached recording tags containing a desthiobiotin (DSB) molecule (as the stabilizing component) at its 5’ end (Fig. 11).
- a recording tag only control (“RT”) was also included in the experiments, which did not contain a target peptide joined to the recording tag.
- the beads were treated with an N-terminal modifier agent (pyrazole methanimine, PMI) to generate PMI-modified peptides as described in WO 2019/089846.
- 31-F and 44-L binding agents configured to recognize peptides with N-terminally PMI-modified phenylalanine or leucine residues were conjugated with nucleic acids (coding tag and DNA for associating with the biotin as the stabilizing component).
- the nucleic acid associated with the binding agent contained a barcode (BC’) with identifying information regarding the binding agent flanked by two spacer (SP’) sequences useful for hybridization during information transfer extension reactions.
- the coding tag specific for each binder is attached to SpyTag via a PEG linker, and the resulting SpyTag-PL’ is conjugated to binder- SpyCatcher fusion protein, so the corresponding stabilizing component contains DNA-PEG Linker- Sp’ -Coding Tag BC’-Sp’.
- the immobilized recording tags and target peptides were pre-washed with 0.1 M NaOH and 0.1% Tween20, 2 times of PBS+0.1% Tween20, incubated with PierceTM Protein- Free T20 (PBS) Blocking Buffer at 37°C for 15 minutes, and washed with PBST (PBS+0.1% Tween20). After the wash, 200 nM of the DNA-conjugated binding agents and 300 nM of the nucleic acid complementary to the stabilizing component DNA joined to a biotin molecule was incubated at 25°C for 30 min.
- PBS PierceTM Protein- Free T20
- the samples were incubated with the encoding mixture for 5 minutes (0.125 U/pL Klenow fragment (3 ’->5’ exo-), dNTP mixture (125 mM for each), 50 mM Tris-HCl (pH, 7.5), 2 mM MgS0 4 , 50 mM NaCl, 1 mM DTT, 0.1% Tween 20, and 0.1 mg/mL BSA) to copy the information from the coding tag to the recording tag (RT) to generate an extended recording tag, resulting BC and Sp at the 3’ end of the RT.
- Example 8 Exemplary kit.
- each binding agent is associated with a coding tag, which comprises identifying information regarding the binding agent (a unique barcode sequence) and is additionally associated with biotin (stabilizing component) as described in Example 2, Example 7, Fig. 11 and Fig.
- the kit comprises additional binding agents that recognize other specific NTAA of a polypeptide; 2) a polynucleotide containing a unique molecular identifier (EIMI) or a barcode sequence fused to dethiobiotin configured to be linked to a target polypeptide by means of a click chemistry (nucleotides are coupled to TCO-PEG12-NHS ester and then coupled to azide-containing polypeptide as described in Example 4); and 3) streptavidin.
- EIMI unique molecular identifier
- barcode sequence fused to dethiobiotin configured to be linked to a target polypeptide by means of a click chemistry (nucleotides are coupled to TCO-PEG12-NHS ester and then coupled to azide-containing polypeptide as described in Example 4); and 3) streptavidin.
- This kit can be used to determine an amino acid sequence of a target polypeptide by methods described in Examples 2-4 and 7.
- peg9 spacer internal nine atom polyethylene glycol spacer
- peg6 spacer internal six atom polyethylene glycol spacer
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Hematology (AREA)
- Urology & Nephrology (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- Analytical Chemistry (AREA)
- Food Science & Technology (AREA)
- Pathology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Microbiology (AREA)
- Organic Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Tropical Medicine & Parasitology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202062958176P | 2020-01-07 | 2020-01-07 | |
PCT/US2021/012224 WO2021141924A1 (en) | 2020-01-07 | 2021-01-05 | Methods for stable complex formation and related kits |
Publications (2)
Publication Number | Publication Date |
---|---|
EP4073263A1 true EP4073263A1 (de) | 2022-10-19 |
EP4073263A4 EP4073263A4 (de) | 2023-11-08 |
Family
ID=76655264
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP21737980.9A Pending EP4073263A4 (de) | 2020-01-07 | 2021-01-05 | Verfahren zur stabilen komplexbildung und entsprechende kits |
Country Status (2)
Country | Link |
---|---|
US (1) | US11169157B2 (de) |
EP (1) | EP4073263A4 (de) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP4430206A1 (de) | 2021-11-10 | 2024-09-18 | Encodia, Inc. | Verfahren zur barcodierung von makromolekülen in einzelnen zellen |
EP4206674A1 (de) | 2021-12-28 | 2023-07-05 | Encodia, Inc. | Serotypisierungs- und antikörperprofilierungsassays mit hohem durchsatz |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69931467T3 (de) | 1998-11-06 | 2018-12-27 | Lsi Medience Corporation | Neue komplexe, die vernetztes avidin enthalten, verfahren zur herstellung und analyseverfahren, das diese verwendet |
AU772069B2 (en) | 1999-03-23 | 2004-04-08 | Merck Patent Gesellschaft Mit Beschrankter Haftung | Protein isolation and analysis |
DE602004023960D1 (de) | 2003-09-18 | 2009-12-17 | Nuevolution As | Methode zur Gewinnung struktureller Informationen kodierter Moleküle und zur Selektion von Verbindungen |
WO2007062664A2 (en) | 2005-12-01 | 2007-06-07 | Nuevolution A/S | Enzymatic encoding methods for efficient synthesis of large libraries |
US8642744B2 (en) | 2010-10-28 | 2014-02-04 | Agilent Technologies, Inc. | Crosslinking reagent for analysis of protein-protein interaction |
WO2016164530A1 (en) | 2015-04-07 | 2016-10-13 | University Of Maryland, College Park | Compositions and methods for high throughput protein sequencing |
US10870872B2 (en) | 2016-04-04 | 2020-12-22 | President And Fellows Of Harvard College | Enzymatic nucleic acid synthesis |
CN110199019B (zh) * | 2016-05-02 | 2024-09-10 | Encodia有限公司 | 采用核酸编码的大分子分析 |
WO2019089851A1 (en) | 2017-10-31 | 2019-05-09 | Encodia, Inc. | Methods and kits using nucleic acid encoding and/or label |
US20200348307A1 (en) | 2017-10-31 | 2020-11-05 | Encodia, Inc. | Methods and compositions for polypeptide analysis |
EP3704249A4 (de) | 2017-10-31 | 2021-10-27 | Encodia, Inc. | Kits zur analyse unter verwendung von nukleinsäurecodierung und/oder markierung |
MX2020006803A (es) | 2017-12-29 | 2020-10-28 | Nautilus Biotechnology Inc | Estrategias de descodificacion para identificacion de proteinas. |
-
2021
- 2021-01-05 EP EP21737980.9A patent/EP4073263A4/de active Pending
- 2021-01-14 US US17/149,324 patent/US11169157B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20210208150A1 (en) | 2021-07-08 |
US11169157B2 (en) | 2021-11-09 |
EP4073263A4 (de) | 2023-11-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230340458A1 (en) | Methods and kits using nucleic acid encoding and/or label | |
WO2019089846A1 (en) | Methods and compositions for polypeptide analysis | |
US20210396762A1 (en) | Methods for peptide analysis employing multi-component detection agent and related kits | |
AU2020247918B2 (en) | Modified cleavases, uses thereof and related kits | |
US20220214353A1 (en) | Methods for spatial analysis of proteins and related kits | |
US11634709B2 (en) | Methods for preparing analytes and related kits | |
US20230016396A1 (en) | Methods of polypeptide sequencing | |
CA3141321A1 (en) | Methods and related kits for spatial analysis | |
US11169157B2 (en) | Methods for stable complex formation and related kits | |
US20230220589A1 (en) | Polypeptide terminal binders and uses thereof | |
US20220214350A1 (en) | Methods for stable complex formation and related kits | |
US20230056532A1 (en) | Methods for information transfer and related kits | |
US20230136966A1 (en) | Sequential encoding methods and related kits | |
WO2021141924A1 (en) | Methods for stable complex formation and related kits | |
US20240294981A1 (en) | Sequential encoding methods and related kits | |
US20220127754A1 (en) | Methods and compositions of accelerating reactions for polypeptide analysis and related uses | |
US20240042446A1 (en) | Automated treatment of macromolecules for analysis and related apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20220712 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) | ||
A4 | Supplementary search report drawn up and despatched |
Effective date: 20231010 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G01N 33/543 20060101ALI20231004BHEP Ipc: C40B 70/00 20060101ALI20231004BHEP Ipc: C40B 20/04 20060101ALI20231004BHEP Ipc: C12Q 1/6869 20180101ALI20231004BHEP Ipc: C12Q 1/6811 20180101ALI20231004BHEP Ipc: C12Q 1/68 20180101AFI20231004BHEP |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
17Q | First examination report despatched |
Effective date: 20240827 |