CN108291218B - 核酸酶非依赖性靶向基因编辑平台及其用途 - Google Patents
核酸酶非依赖性靶向基因编辑平台及其用途 Download PDFInfo
- Publication number
- CN108291218B CN108291218B CN201680053489.6A CN201680053489A CN108291218B CN 108291218 B CN108291218 B CN 108291218B CN 201680053489 A CN201680053489 A CN 201680053489A CN 108291218 B CN108291218 B CN 108291218B
- Authority
- CN
- China
- Prior art keywords
- rna
- leu
- protein
- lys
- cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 101710163270 Nuclease Proteins 0.000 title claims description 53
- 238000010362 genome editing Methods 0.000 title abstract description 19
- 108090000623 proteins and genes Proteins 0.000 claims description 189
- 210000004027 cell Anatomy 0.000 claims description 184
- 102000004169 proteins and genes Human genes 0.000 claims description 153
- 230000000694 effects Effects 0.000 claims description 112
- 108020004414 DNA Proteins 0.000 claims description 97
- 238000000034 method Methods 0.000 claims description 91
- 150000007523 nucleic acids Chemical class 0.000 claims description 83
- 239000012636 effector Substances 0.000 claims description 76
- 108091033409 CRISPR Proteins 0.000 claims description 72
- 102000039446 nucleic acids Human genes 0.000 claims description 67
- 108020004707 nucleic acids Proteins 0.000 claims description 67
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 58
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 57
- 230000035772 mutation Effects 0.000 claims description 50
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 47
- 230000008685 targeting Effects 0.000 claims description 47
- 108020005004 Guide RNA Proteins 0.000 claims description 46
- 229920001184 polypeptide Polymers 0.000 claims description 44
- 230000014509 gene expression Effects 0.000 claims description 34
- 102000040430 polynucleotide Human genes 0.000 claims description 34
- 108091033319 polynucleotide Proteins 0.000 claims description 34
- 239000002157 polynucleotide Substances 0.000 claims description 34
- 230000004570 RNA-binding Effects 0.000 claims description 30
- 230000027455 binding Effects 0.000 claims description 26
- 210000000130 stem cell Anatomy 0.000 claims description 25
- 108020001507 fusion proteins Proteins 0.000 claims description 23
- 102000037865 fusion proteins Human genes 0.000 claims description 23
- 201000010099 disease Diseases 0.000 claims description 21
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 21
- 230000004048 modification Effects 0.000 claims description 19
- 238000012986 modification Methods 0.000 claims description 19
- 238000010354 CRISPR gene editing Methods 0.000 claims description 18
- 241000193996 Streptococcus pyogenes Species 0.000 claims description 18
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims description 18
- 101710125418 Major capsid protein Proteins 0.000 claims description 17
- 230000000295 complement effect Effects 0.000 claims description 15
- 239000013604 expression vector Substances 0.000 claims description 15
- 230000001580 bacterial effect Effects 0.000 claims description 14
- 238000000338 in vitro Methods 0.000 claims description 13
- 101710132601 Capsid protein Proteins 0.000 claims description 12
- 101710094648 Coat protein Proteins 0.000 claims description 12
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 claims description 12
- 101710141454 Nucleoprotein Proteins 0.000 claims description 12
- 101710083689 Probable capsid protein Proteins 0.000 claims description 12
- 210000004962 mammalian cell Anatomy 0.000 claims description 11
- 230000009615 deamination Effects 0.000 claims description 10
- 238000006481 deamination reaction Methods 0.000 claims description 10
- 241000894007 species Species 0.000 claims description 10
- 206010064571 Gene mutation Diseases 0.000 claims description 9
- 108060004795 Methyltransferase Proteins 0.000 claims description 9
- 108010017842 Telomerase Proteins 0.000 claims description 9
- 239000003153 chemical reaction reagent Substances 0.000 claims description 9
- 230000002255 enzymatic effect Effects 0.000 claims description 9
- 210000001082 somatic cell Anatomy 0.000 claims description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 8
- 230000033616 DNA repair Effects 0.000 claims description 8
- 230000015572 biosynthetic process Effects 0.000 claims description 8
- 230000001717 pathogenic effect Effects 0.000 claims description 8
- 102000044126 RNA-Binding Proteins Human genes 0.000 claims description 7
- 229940104302 cytosine Drugs 0.000 claims description 7
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 7
- 108091092566 Extrachromosomal DNA Proteins 0.000 claims description 6
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 claims description 6
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 claims description 6
- 241000283984 Rodentia Species 0.000 claims description 6
- 241000194020 Streptococcus thermophilus Species 0.000 claims description 6
- 210000005260 human cell Anatomy 0.000 claims description 6
- 101710159080 Aconitate hydratase A Proteins 0.000 claims description 5
- 101710159078 Aconitate hydratase B Proteins 0.000 claims description 5
- 102000016397 Methyltransferase Human genes 0.000 claims description 5
- 101710105008 RNA-binding protein Proteins 0.000 claims description 5
- 241000251539 Vertebrata <Metazoa> Species 0.000 claims description 5
- 125000000539 amino acid group Chemical group 0.000 claims description 5
- 241001515965 unidentified phage Species 0.000 claims description 5
- 241000251468 Actinopterygii Species 0.000 claims description 4
- 241000283690 Bos taurus Species 0.000 claims description 4
- 230000005778 DNA damage Effects 0.000 claims description 4
- 231100000277 DNA damage Toxicity 0.000 claims description 4
- 108010046331 Deoxyribodipyrimidine photo-lyase Proteins 0.000 claims description 4
- 102100034343 Integrase Human genes 0.000 claims description 4
- 108010061833 Integrases Proteins 0.000 claims description 4
- 102000015335 Ku Autoantigen Human genes 0.000 claims description 4
- 108010025026 Ku Autoantigen Proteins 0.000 claims description 4
- 102000003960 Ligases Human genes 0.000 claims description 4
- 108090000364 Ligases Proteins 0.000 claims description 4
- 241000588650 Neisseria meningitidis Species 0.000 claims description 4
- 230000010718 Oxidation Activity Effects 0.000 claims description 4
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 claims description 4
- 108091093078 Pyrimidine dimer Proteins 0.000 claims description 4
- 102000018120 Recombinases Human genes 0.000 claims description 4
- 108010091086 Recombinases Proteins 0.000 claims description 4
- 241000191967 Staphylococcus aureus Species 0.000 claims description 4
- 241000193985 Streptococcus agalactiae Species 0.000 claims description 4
- 102000008579 Transposases Human genes 0.000 claims description 4
- 108010020764 Transposases Proteins 0.000 claims description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 claims description 4
- 230000029936 alkylation Effects 0.000 claims description 4
- 238000005804 alkylation reaction Methods 0.000 claims description 4
- 210000004102 animal cell Anatomy 0.000 claims description 4
- 230000027832 depurination Effects 0.000 claims description 4
- 210000001671 embryonic stem cell Anatomy 0.000 claims description 4
- 210000004602 germ cell Anatomy 0.000 claims description 4
- 244000052769 pathogen Species 0.000 claims description 4
- 239000013635 pyrimidine dimer Substances 0.000 claims description 4
- 241000271566 Aves Species 0.000 claims description 3
- 241000283707 Capra Species 0.000 claims description 3
- 241001494479 Pecora Species 0.000 claims description 3
- 230000026279 RNA modification Effects 0.000 claims description 3
- 241000589892 Treponema denticola Species 0.000 claims description 3
- 210000000349 chromosome Anatomy 0.000 claims description 3
- 239000003446 ligand Substances 0.000 claims description 3
- 108091023037 Aptamer Proteins 0.000 claims description 2
- 239000002126 C01EB10 - Adenosine Substances 0.000 claims description 2
- 108091008103 RNA aptamers Proteins 0.000 claims description 2
- 229960005305 adenosine Drugs 0.000 claims description 2
- 239000012895 dilution Substances 0.000 claims description 2
- 238000010790 dilution Methods 0.000 claims description 2
- 239000003814 drug Substances 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims description 2
- 210000004263 induced pluripotent stem cell Anatomy 0.000 claims 2
- 230000008836 DNA modification Effects 0.000 claims 1
- 241000589886 Treponema Species 0.000 claims 1
- 108091028113 Trans-activating crRNA Proteins 0.000 description 163
- 235000018102 proteins Nutrition 0.000 description 145
- 206010009944 Colon cancer Diseases 0.000 description 47
- 125000003729 nucleotide group Chemical group 0.000 description 47
- 239000002773 nucleotide Substances 0.000 description 43
- 235000001014 amino acid Nutrition 0.000 description 34
- 229940024606 amino acid Drugs 0.000 description 32
- 150000001413 amino acids Chemical class 0.000 description 32
- 125000005647 linker group Chemical group 0.000 description 32
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 29
- 101150090202 rpoB gene Proteins 0.000 description 26
- 230000004927 fusion Effects 0.000 description 25
- 230000007115 recruitment Effects 0.000 description 25
- 108091079001 CRISPR RNA Proteins 0.000 description 24
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 24
- 239000000203 mixture Substances 0.000 description 24
- 239000013598 vector Substances 0.000 description 24
- 101100038261 Methanococcus vannielii (strain ATCC 35089 / DSM 1224 / JCM 13029 / OCM 148 / SB) rpo2C gene Proteins 0.000 description 22
- 210000001161 mammalian embryo Anatomy 0.000 description 22
- 101150085857 rpo2 gene Proteins 0.000 description 22
- 102000004190 Enzymes Human genes 0.000 description 20
- 108090000790 Enzymes Proteins 0.000 description 20
- 102220013975 rs397516854 Human genes 0.000 description 19
- 238000006243 chemical reaction Methods 0.000 description 17
- 238000012937 correction Methods 0.000 description 17
- 238000011282 treatment Methods 0.000 description 17
- 125000003275 alpha amino acid group Chemical group 0.000 description 16
- WYWHKKSPHMUBEB-UHFFFAOYSA-N tioguanine Chemical compound N1C(N)=NC(=S)C2=C1N=CN2 WYWHKKSPHMUBEB-UHFFFAOYSA-N 0.000 description 16
- 210000002257 embryonic structure Anatomy 0.000 description 15
- 238000001890 transfection Methods 0.000 description 15
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 description 14
- 230000005782 double-strand break Effects 0.000 description 13
- 230000001965 increasing effect Effects 0.000 description 13
- 108010091358 Hypoxanthine Phosphoribosyltransferase Proteins 0.000 description 12
- 230000001404 mediated effect Effects 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 10
- 238000002474 experimental method Methods 0.000 description 10
- 230000036438 mutation frequency Effects 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- 102000014914 Carrier Proteins Human genes 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- 230000008901 benefit Effects 0.000 description 9
- 108091008324 binding proteins Proteins 0.000 description 9
- 238000013461 design Methods 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 9
- 229960001225 rifampicin Drugs 0.000 description 9
- 230000001225 therapeutic effect Effects 0.000 description 9
- 230000003612 virological effect Effects 0.000 description 9
- 102000012758 APOBEC-1 Deaminase Human genes 0.000 description 8
- 102000053602 DNA Human genes 0.000 description 8
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 8
- 102100029098 Hypoxanthine-guanine phosphoribosyltransferase Human genes 0.000 description 8
- 206010028980 Neoplasm Diseases 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000001727 in vivo Methods 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 229960003087 tioguanine Drugs 0.000 description 8
- 108010004483 APOBEC-3G Deaminase Proteins 0.000 description 7
- 244000063299 Bacillus subtilis Species 0.000 description 7
- 235000014469 Bacillus subtilis Nutrition 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 7
- 108020004705 Codon Proteins 0.000 description 7
- 102100038076 DNA dC->dU-editing enzyme APOBEC-3G Human genes 0.000 description 7
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 7
- 108010062796 arginyllysine Proteins 0.000 description 7
- 201000011510 cancer Diseases 0.000 description 7
- 238000012239 gene modification Methods 0.000 description 7
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 7
- 238000002744 homologous recombination Methods 0.000 description 7
- 230000006801 homologous recombination Effects 0.000 description 7
- 108010054155 lysyllysine Proteins 0.000 description 7
- 235000004252 protein component Nutrition 0.000 description 7
- 230000000392 somatic effect Effects 0.000 description 7
- 238000010453 CRISPR/Cas method Methods 0.000 description 6
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 6
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 6
- 208000026350 Inborn Genetic disease Diseases 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- 230000003197 catalytic effect Effects 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 239000000562 conjugate Substances 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 208000016361 genetic disease Diseases 0.000 description 6
- 230000037431 insertion Effects 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 239000003550 marker Substances 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 230000004770 neurodegeneration Effects 0.000 description 6
- 208000015122 neurodegenerative disease Diseases 0.000 description 6
- 239000013600 plasmid vector Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 102100040397 C->U-editing enzyme APOBEC-1 Human genes 0.000 description 5
- 102000005381 Cytidine Deaminase Human genes 0.000 description 5
- 108010031325 Cytidine deaminase Proteins 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 230000002950 deficient Effects 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- 229910052739 hydrogen Inorganic materials 0.000 description 5
- 239000001257 hydrogen Substances 0.000 description 5
- 238000001638 lipofection Methods 0.000 description 5
- 238000010369 molecular cloning Methods 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 4
- 208000035473 Communicable disease Diseases 0.000 description 4
- 241000699802 Cricetulus griseus Species 0.000 description 4
- 102000000311 Cytosine Deaminase Human genes 0.000 description 4
- 108010080611 Cytosine Deaminase Proteins 0.000 description 4
- 238000010442 DNA editing Methods 0.000 description 4
- 230000004568 DNA-binding Effects 0.000 description 4
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- 241000700721 Hepatitis B virus Species 0.000 description 4
- 241000238631 Hexapoda Species 0.000 description 4
- 241000186869 Lactobacillus salivarius Species 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 238000010459 TALEN Methods 0.000 description 4
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 4
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000002759 chromosomal effect Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- 230000021615 conjugation Effects 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000005017 genetic modification Effects 0.000 description 4
- 235000013617 genetically modified food Nutrition 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 108010057302 methyl(arginyl)-lysyl-prolyl-tryptophyl-tert-leucyl-leucine Proteins 0.000 description 4
- 238000002156 mixing Methods 0.000 description 4
- 239000000178 monomer Substances 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 230000004853 protein function Effects 0.000 description 4
- 230000012743 protein tagging Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 230000004083 survival effect Effects 0.000 description 4
- 208000024891 symptom Diseases 0.000 description 4
- -1 tracRNA Proteins 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 230000009261 transgenic effect Effects 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 3
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 3
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 3
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- 240000001046 Lactobacillus acidophilus Species 0.000 description 3
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 3
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 3
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 3
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 3
- 241001467578 Microbacterium Species 0.000 description 3
- 241000192041 Micrococcus Species 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 241000700159 Rattus Species 0.000 description 3
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 241000194017 Streptococcus Species 0.000 description 3
- 241000187747 Streptomyces Species 0.000 description 3
- 241000187191 Streptomyces viridochromogenes Species 0.000 description 3
- 210000001744 T-lymphocyte Anatomy 0.000 description 3
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 3
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 3
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 3
- 102100037111 Uracil-DNA glycosylase Human genes 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 239000001506 calcium phosphate Substances 0.000 description 3
- 229910000389 calcium phosphate Inorganic materials 0.000 description 3
- 235000011010 calcium phosphates Nutrition 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 230000001684 chronic effect Effects 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 230000014789 establishment of RNA localization Effects 0.000 description 3
- 230000004077 genetic alteration Effects 0.000 description 3
- 231100000118 genetic alteration Toxicity 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 230000002779 inactivation Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000000520 microinjection Methods 0.000 description 3
- 108091027963 non-coding RNA Proteins 0.000 description 3
- 102000042567 non-coding RNA Human genes 0.000 description 3
- 108091008819 oncoproteins Proteins 0.000 description 3
- 102000027450 oncoproteins Human genes 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 230000000149 penetrating effect Effects 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 108020001580 protein domains Proteins 0.000 description 3
- 238000011002 quantification Methods 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 125000006850 spacer group Chemical group 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 3
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 2
- 208000030507 AIDS Diseases 0.000 description 2
- 102100036664 Adenosine deaminase Human genes 0.000 description 2
- 102000055025 Adenosine deaminases Human genes 0.000 description 2
- 241000640374 Alicyclobacillus acidocaldarius Species 0.000 description 2
- 241000203069 Archaea Species 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 2
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 2
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 2
- 241000906059 Bacillus pseudomycoides Species 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 101150029409 CFTR gene Proteins 0.000 description 2
- 108010051109 Cell-Penetrating Peptides Proteins 0.000 description 2
- 102000020313 Cell-Penetrating Peptides Human genes 0.000 description 2
- 241000193163 Clostridioides difficile Species 0.000 description 2
- 201000003883 Cystic fibrosis Diseases 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 102100029100 Hematopoietic prostaglandin D synthase Human genes 0.000 description 2
- GJMHMDKCJPQJOI-IHRRRGAJSA-N His-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 GJMHMDKCJPQJOI-IHRRRGAJSA-N 0.000 description 2
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- 241000270322 Lepidosauria Species 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 2
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 2
- 238000010357 RNA editing Methods 0.000 description 2
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 2
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 2
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- 101710154778 Thymidylate synthase 1 Proteins 0.000 description 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 2
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 108020005202 Viral DNA Proteins 0.000 description 2
- 208000036142 Viral infection Diseases 0.000 description 2
- 210000004504 adult stem cell Anatomy 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000004640 cellular pathway Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- 230000000447 dimerizing effect Effects 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 108091006047 fluorescent proteins Proteins 0.000 description 2
- 102000034287 fluorescent proteins Human genes 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 238000006384 oligomerization reaction Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 239000005022 packaging material Substances 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Substances [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 229960002180 tetracycline Drugs 0.000 description 2
- 229930101283 tetracycline Natural products 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 150000003522 tetracyclines Chemical class 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000003151 transfection method Methods 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- MWBWWFOAEOYUST-UHFFFAOYSA-N 2-aminopurine Chemical compound NC1=NC=C2N=CNC2=N1 MWBWWFOAEOYUST-UHFFFAOYSA-N 0.000 description 1
- AGFIRQJZCNVMCW-UAKXSSHOSA-N 5-bromouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 AGFIRQJZCNVMCW-UAKXSSHOSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- 108010029988 AICDA (activation-induced cytidine deaminase) Proteins 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- 241001290773 Acidiphilium acidophilum Species 0.000 description 1
- 241001464929 Acidithiobacillus caldus Species 0.000 description 1
- 241000605222 Acidithiobacillus ferrooxidans Species 0.000 description 1
- 241001468182 Acidobacterium Species 0.000 description 1
- 206010069754 Acquired gene mutation Diseases 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 101710142939 Adenosine deaminase 1 Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- 241000590031 Alteromonas Species 0.000 description 1
- 208000031873 Animal Disease Models Diseases 0.000 description 1
- 208000031295 Animal disease Diseases 0.000 description 1
- 241000269350 Anura Species 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- QQJSJIBESHAJPM-IHRRRGAJSA-N Arg-Cys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QQJSJIBESHAJPM-IHRRRGAJSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 1
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 1
- CGYKCTPUGXFPMG-IHPCNDPISA-N Asn-Tyr-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CGYKCTPUGXFPMG-IHPCNDPISA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 101710192393 Attachment protein G3P Proteins 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 108020004513 Bacterial RNA Proteins 0.000 description 1
- 101100263837 Bovine ephemeral fever virus (strain BB7721) beta gene Proteins 0.000 description 1
- 108010040467 CRISPR-Associated Proteins Proteins 0.000 description 1
- 241000252229 Carassius auratus Species 0.000 description 1
- 241000269333 Caudata Species 0.000 description 1
- 108091092236 Chimeric RNA Proteins 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 1
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 1
- PJWIPBIMSKJTIE-DCAQKATOSA-N Cys-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N PJWIPBIMSKJTIE-DCAQKATOSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- MKMKILWCRQLDFJ-DCAQKATOSA-N Cys-Lys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MKMKILWCRQLDFJ-DCAQKATOSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 1
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 1
- 108010079245 Cystic Fibrosis Transmembrane Conductance Regulator Proteins 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical group OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 108010009540 DNA (Cytosine-5-)-Methyltransferase 1 Proteins 0.000 description 1
- 102100036279 DNA (cytosine-5)-methyltransferase 1 Human genes 0.000 description 1
- 102100024812 DNA (cytosine-5)-methyltransferase 3A Human genes 0.000 description 1
- 102100024810 DNA (cytosine-5)-methyltransferase 3B Human genes 0.000 description 1
- 102000011724 DNA Repair Enzymes Human genes 0.000 description 1
- 108010076525 DNA Repair Enzymes Proteins 0.000 description 1
- 102100040263 DNA dC->dU-editing enzyme APOBEC-3A Human genes 0.000 description 1
- 102100040262 DNA dC->dU-editing enzyme APOBEC-3B Human genes 0.000 description 1
- 102100040261 DNA dC->dU-editing enzyme APOBEC-3C Human genes 0.000 description 1
- 102100040264 DNA dC->dU-editing enzyme APOBEC-3D Human genes 0.000 description 1
- 102100040266 DNA dC->dU-editing enzyme APOBEC-3F Human genes 0.000 description 1
- 102100038050 DNA dC->dU-editing enzyme APOBEC-3H Human genes 0.000 description 1
- 101710082737 DNA dC->dU-editing enzyme APOBEC-3H Proteins 0.000 description 1
- 230000007067 DNA methylation Effects 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 102100029791 Double-stranded RNA-specific adenosine deaminase Human genes 0.000 description 1
- 241001454374 Drosophila <fruit fly, subgenus> Species 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 101100316840 Enterobacteria phage P4 Beta gene Proteins 0.000 description 1
- 241000709744 Enterobacterio phage MS2 Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000660147 Escherichia coli str. K-12 substr. MG1655 Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 208000000666 Fowlpox Diseases 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 101710179596 Gene 3 protein Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 208000034951 Genetic Translocation Diseases 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- SOIAHPSKKUYREP-CIUDSAMLSA-N Gln-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SOIAHPSKKUYREP-CIUDSAMLSA-N 0.000 description 1
- VVWWRZZMPSPVQU-KBIXCLLPSA-N Gln-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VVWWRZZMPSPVQU-KBIXCLLPSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- ZXQPJYWZSFGWJB-AVGNSLFASA-N Glu-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXQPJYWZSFGWJB-AVGNSLFASA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 1
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000909242 Homo sapiens DNA (cytosine-5)-methyltransferase 3A Proteins 0.000 description 1
- 101000909249 Homo sapiens DNA (cytosine-5)-methyltransferase 3B Proteins 0.000 description 1
- 101000964378 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3A Proteins 0.000 description 1
- 101000964385 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3B Proteins 0.000 description 1
- 101000964383 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3C Proteins 0.000 description 1
- 101000964382 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3D Proteins 0.000 description 1
- 101000964377 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3F Proteins 0.000 description 1
- 101000865408 Homo sapiens Double-stranded RNA-specific adenosine deaminase Proteins 0.000 description 1
- 101000755690 Homo sapiens Single-stranded DNA cytosine deaminase Proteins 0.000 description 1
- 208000023105 Huntington disease Diseases 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- HYLIOBDWPQNLKI-HVTMNAMFSA-N Ile-His-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HYLIOBDWPQNLKI-HVTMNAMFSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000283953 Lagomorpha Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 208000035752 Live birth Diseases 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 239000006391 Luria-Bertani Medium Substances 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- CAVGLNOOIFHJOF-SRVKXCTJSA-N Lys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N CAVGLNOOIFHJOF-SRVKXCTJSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- KVNLHIXLLZBAFQ-RWMBFGLXSA-N Lys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N KVNLHIXLLZBAFQ-RWMBFGLXSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000501784 Marinobacter sp. Species 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- XKJUFUPCHARJKX-UWVGGRQHSA-N Met-Gly-His Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 XKJUFUPCHARJKX-UWVGGRQHSA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- 108030004080 Methylcytosine dioxygenases Proteins 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 1
- 241000919925 Nitrosococcus halophilus Species 0.000 description 1
- 241001515112 Nitrosococcus watsonii Species 0.000 description 1
- 241000203622 Nocardiopsis Species 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108010088535 Pep-1 peptide Proteins 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- 241000286209 Phasianidae Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- RTUWVJVJSMOGPL-KKUMJFAQSA-N Phe-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RTUWVJVJSMOGPL-KKUMJFAQSA-N 0.000 description 1
- UXQFHEKRGHYJRA-STQMWFEESA-N Phe-Met-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O UXQFHEKRGHYJRA-STQMWFEESA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- TXJJXEXCZBHDNA-ACRUOGEOSA-N Phe-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N TXJJXEXCZBHDNA-ACRUOGEOSA-N 0.000 description 1
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 229920002873 Polyethylenimine Polymers 0.000 description 1
- 102100037935 Polyubiquitin-C Human genes 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101800001494 Protease 2A Proteins 0.000 description 1
- 101800001066 Protein 2A Proteins 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 229930185560 Pseudouridine Natural products 0.000 description 1
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 102000014450 RNA Polymerase III Human genes 0.000 description 1
- 108010078067 RNA Polymerase III Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 241000191025 Rhodobacter Species 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- 241000270295 Serpentes Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 102100022433 Single-stranded DNA cytosine deaminase Human genes 0.000 description 1
- 241000586493 Spirosoma Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 241000194022 Streptococcus sp. Species 0.000 description 1
- 241000187438 Streptomyces fradiae Species 0.000 description 1
- 241001518258 Streptomyces pristinaespiralis Species 0.000 description 1
- 241000933146 Streptomyces roseus Species 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 101710192266 Tegument protein VP22 Proteins 0.000 description 1
- 241000186339 Thermoanaerobacter Species 0.000 description 1
- 241000228178 Thermoascus Species 0.000 description 1
- 241000605118 Thiobacillus Species 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical group OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 1
- RYXOUTORDIUWNI-BPUTZDHNSA-N Trp-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RYXOUTORDIUWNI-BPUTZDHNSA-N 0.000 description 1
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- KXIQQAWIPDDVOE-BPUTZDHNSA-N Trp-Pro-Cys Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O KXIQQAWIPDDVOE-BPUTZDHNSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 1
- CYLQUSBOSWCHTO-BPUTZDHNSA-N Trp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CYLQUSBOSWCHTO-BPUTZDHNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- FWOVTJKVUCGVND-UFYCRDLUSA-N Tyr-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FWOVTJKVUCGVND-UFYCRDLUSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108010056354 Ubiquitin C Proteins 0.000 description 1
- 101710172430 Uracil-DNA glycosylase inhibitor Proteins 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- 108010069429 Val-Pro-Met-Leu-Lys Proteins 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 210000005006 adaptive immune system Anatomy 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000009824 affinity maturation Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 125000003368 amide group Chemical group 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000003708 ampul Substances 0.000 description 1
- 238000011558 animal model by disease Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000340 anti-metabolite Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 229940100197 antimetabolite Drugs 0.000 description 1
- 239000002256 antimetabolite Substances 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 235000013877 carbamide Nutrition 0.000 description 1
- 150000004649 carbonic acid derivatives Chemical class 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 101150038500 cas9 gene Proteins 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 108020001778 catalytic domains Proteins 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 229920006317 cationic polymer Polymers 0.000 description 1
- 108091092356 cellular DNA Proteins 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000005757 colony formation Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000013601 cosmid vector Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000000412 dendrimer Substances 0.000 description 1
- 229920000736 dendritic polymer Polymers 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000013020 embryo development Effects 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 230000010856 establishment of protein localization Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000007499 fusion processing Methods 0.000 description 1
- 230000009395 genetic defect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 230000006195 histone acetylation Effects 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 150000002485 inorganic esters Chemical class 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 102000008371 intracellularly ATP-gated chloride channel activity proteins Human genes 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000004777 loss-of-function mutation Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 230000003228 microsomal effect Effects 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 210000002894 multi-fate stem cell Anatomy 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 231100000150 mutagenicity / genotoxicity testing Toxicity 0.000 description 1
- 231100000376 mutation frequency increase Toxicity 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 210000005155 neural progenitor cell Anatomy 0.000 description 1
- 230000006780 non-homologous end joining Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 230000009438 off-target cleavage Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 150000002895 organic esters Chemical class 0.000 description 1
- 230000000065 osmolyte Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 239000000123 paper Substances 0.000 description 1
- 230000006320 pegylation Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000000863 peptide conjugate Substances 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000010452 phosphate Chemical group 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical group [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 108010011110 polyarginine Proteins 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 208000009305 pseudorabies Diseases 0.000 description 1
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 1
- 230000004144 purine metabolism Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000006722 reduction reaction Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- HBMJWWWQQXIZIP-UHFFFAOYSA-N silicon carbide Chemical compound [Si+]#[C-] HBMJWWWQQXIZIP-UHFFFAOYSA-N 0.000 description 1
- 229910010271 silicon carbide Inorganic materials 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000037439 somatic mutation Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010012567 tyrosyl-glycyl-glycyl-phenylalanyl Proteins 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 150000003672 ureas Chemical class 0.000 description 1
- 150000003673 urethanes Chemical class 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Crystallography & Structural Chemistry (AREA)
- Mycology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明公开了用于靶向基因编辑的系统及相关用途。
Description
相关申请的交叉引用
本申请要求于2015年7月15日提交的美国临时申请号62/192,876的优先权。该申请的内容通过引用整体并入本文。
政府利益
本文所公开的发明至少部分是在美国国务院富布赖特外国学生计划的资助号15130816的政府支持下完成的。因此,美国政府对本发明具有一定的权利。
发明领域
本发明涉及用于靶向基因编辑的系统及相关用途。
发明背景
靶向基因编辑是真核细胞、胚胎和动物基因操作的有力工具。使用它,靶向的基因组位置和/或特定的染色体序列可以被缺失、失活或修饰。目前的几种方法依赖于使用工程改造的核酸酶,例如锌指核酸酶(ZFN)或转录激活剂样效应子核酸酶(TALEN)。这些嵌合核酸酶含有与非特异性DNA切割结构域连接的可编程的、序列特异性DNA结合模块。由于每个新的基因组靶需要设计包含新的序列特异性DNA结合模块的新的ZFN或TALEN,所以这些定制设计的核酸酶制备起来往往是昂贵且耗时的。而且,ZFN和TALEN的特异性使得它们能够介导从靶解离的(off-target)切割。最近开发的基因组修饰在技术上利用细菌簇规律间隔有短回文重复序列(clusters of regularly interspaced short palindromic repeats,CRISPR)相关蛋白9(Cas9)(一种RNA指导的DNA内切核酸酶)诱导DNA靶位点上特定的双链断裂(DSB)。RNA-Cas9复合物识别其同源DNA靶序列并与其碱基配对,导致靶切割形成DSB。
然而,一个尚未解决的主要问题是如何校正体细胞中的基因突变。目前现有技术的共同效应子是核酸酶,所述核酸酶导致DNA DSB,其进而触发细胞通路的激活,如同源重组和非同源末端连接。这个过程有许多主要缺点。首先,由于末端连接的最终产品具有不可预知的性质,DSB以随机和不可预测的方式导致框架内(in-frame)和框架移位(frame-shift)突变,这限制了其用于直接临床应用。其次,DSB可能引起非局部致突变事件,如染色体易位,这是该过程的不良后果。在体内,这些变化可能是潜在有害的。第三,修复或校正通常需要DSB介导的同源重组,其活性在治疗剂尤为重要的大多数体细胞组织/细胞中低或甚至不存在。
因此,目前的基于核酸酶的技术对基因编辑的适用性有限,并且存在对不依赖于导致双链断裂的核酸酶活性的靶向基因修饰技术的需求。
发明概述
本发明通过提供靶向基因编辑系统和相关用途来解决上述需求。本文中涉及的胚胎为非人动物胚胎。
因此,本发明的一个方面提供了一种系统,其包含:(i)靶向序列的蛋白,或编码其的多核苷酸,(ii)RNA支架,或编码其的DNA多核苷酸,和(iii)非核酸酶效应子融合蛋白,或编码其的多核苷酸。RNA支架包含(a)靶向核酸的基序,其包含与靶核酸序列互补的指导RNA序列,(b)CRISPR基序,其能够结合靶向序列的蛋白,和(c)募集RNA基序。非核酸酶效应子融合蛋白包含(a)结合RNA的结构域,其能够结合募集RNA基序,(b)接头序列,和(c)效应子结构域。非核酸酶效应子融合蛋白具有酶促活性。
对于上述系统,靶向序列的蛋白可以是CRISPR蛋白。优选地,靶向序列的蛋白质不具有核酸酶活性。靶向序列的蛋白的实例包括选自由以下组成的组的菌种的dCas9:化脓性链球菌(Streptococcus pyogenes)、无乳链球菌(Streptococcus agalactiae)、金黄色葡萄球菌(Staphylococcus aureus)、嗜热链球菌(Streptococcus thermophilus)、嗜热链球菌(Streptococcus thermophilus)、脑膜炎奈瑟球菌(Neisseria meningitidis)和齿垢密螺旋体(Treponema denticola)。
在上述RNA支架中,募集RNA基序和RNA结合结构域可以是选自以下组成组的对:(1)端粒酶Ku结合基序和Ku蛋白或其RNA结合部分,(2)端粒酶Sm7结合基序和Sm7蛋白或其RNA结合部分,(3)MS2噬菌体操纵子茎环和MS2外壳蛋白(MCP)或其RNA结合部分,(4)PP7噬菌体操纵子茎环和PP7外壳蛋白(PCP)或其RNA结合部分,(5)SfMu噬菌体Com茎环和Com RNA结合蛋白或其RNA结合部分,和(6)非天然RNA适体和相应的适体配体或其RNA结合部分。
在上述非核酸酶效应子融合蛋白中,接头序列的长度可以是0-100(例如1-100、5-80、10-50和20-30)个氨基酸残基。酶促活性是脱氨基活性,甲基转移酶活性,脱甲基酶活性,DNA修复活性,DNA损伤活性,歧化酶活性,烷基化活性,脱嘌呤活性,氧化活性,嘧啶二聚体形成活性,整合酶活性,转座酶活性,重组酶活性,聚合酶活性,连接酶活性,解旋酶活性,光裂合酶活性或糖基化酶活性。在一些实施方案中,酶促活性是脱氨基活性(例如,胞嘧啶脱氨基活性或腺苷脱氨基活性),甲基转移酶活性或脱甲基酶活性。RNA结合结构域不是Cas9或其功能等价物,也不是其RNA结合结构域。
还提供了编码上述系统的组分(i)-(iii)中的一种或多种的分离的核酸,包含所述核酸的表达载体,或包含所述核酸的宿主细胞。
在第二个方面,本发明提供了位点特异性修饰靶DNA的方法。该方法包括使靶核酸与上述系统的组分(i)-(iii)接触。靶核酸可以在细胞中。靶核酸可以是RNA,染色体外DNA或染色体上的基因组DNA。细胞可以选自由以下组成的组:古细菌细胞,细菌细胞,真核细胞,真核单细胞生物,体细胞,非人动物生殖细胞,干细胞,植物细胞,藻类细胞,动物细胞,无脊椎动物细胞,脊椎动物细胞,鱼细胞,蛙细胞,鸟类细胞,哺乳动物细胞,猪细胞,牛细胞,山羊细胞,绵羊细胞,啮齿动物细胞,大鼠细胞,小鼠细胞,非人灵长类动物细胞和人细胞。
细胞可以在人或非人受试者中或来源于人或非人受试者。人或非人受试者具有基因的基因突变。在一些实施方案中,所述受试者患有由基因突变引起的疾病或有患该疾病的风险。在这种情况下,位点特异性修饰校正了基因突变或使基因表达失活。在其他实施方案中,受试者具有病原体或处于暴露于该病原体的风险中,并且位点特异性修饰使病原体基因失活。
本发明进一步提供了包含上述系统或其一个或多个组分的试剂盒。该系统可以进一步含有一种或多种选自由以下组成组的组分:用于重构和/或稀释的试剂和用于将核酸或多肽引入宿主细胞的试剂。
在下面的描述中阐述了本发明的一个或多个实施方案的细节。从说明书和权利要求中,本发明的其它特征、目的和优点将是显而易见的。
附图简要说明
图1A、1B、1C、1D和1E是用于体内靶向基因编辑的示例性的核酸酶非依赖性的CasRcure或CRC平台的一组示意图。图1A.平台的组分,从左往右:(1)靶向序列的组分dCas9,(2)RNA支架,其含有指导RNA基序(用于靶向序列)、CRISPR基序(用于dCas9结合)和募集RNA基序(用于募集效应子-RNA结合蛋白融合体),和(3)效应子-RNA结合域融合蛋白。系统可以经安排为靶向DNA或RNA分子上的特定核苷酸(右)。图1B.如果效应子蛋白融合体作为单体起作用,则该系统可以靶向靶位点的上游(左侧)或下游(右侧)的单个位点。图1C.如果效应子蛋白需要二聚化才能发挥正确的催化功能,则可以将该系统多重化以同时靶向靶位点的上游和下游的多个序列,从而使效应子蛋白二聚化(右)。或者,将效应子蛋白募集至单个位点可足以增加其对相邻效应子蛋白的亲和力,促进二聚化(右)。图1D.被募集并定位在靶位点的四聚体效应子酶的实例可通过双(左)或单靶向(右)实现。图1E.可用于编辑RNA靶(例如逆转录病毒失活)的系统。
图2A、2B、2C、2D、2E和2F显示AID的靶向募集能够引入核苷酸转换的位点特异性转换。图2A.沿大肠杆菌的rpoB基因的RRDR簇I的靶区域(SEQ ID No:23和24)的示意图。显示了(顶部)DNA序列(SEQ ID No:23),其中显示了PAM(加框)和可突变位置(箭头);(中部)这些实验中使用的gRNA的结合位点,所有的gRNA被编程为靶向模板链(TS,-);(底部)蛋白质序列(SEQ ID No:25),其中显示了涉及利福平耐药性的关键氨基酸(箭头)。图2B.用指定的gRNA处理大肠杆菌MG1655细胞并在含有120μM利福平的平板中选择。图2C.从图2B的顶部图计算突变频率。图2D.来自用rpoB_TS-4gRNA处理的AIDCRC(顶部,SEQ ID NO:26)和未处理的细胞(中部,SEQ ID NO:27)的代表性测序结果。C1592>T突变导致蛋白质序列中的S531F改变(底部,SEQ ID NO:28和29),该突变是已知诱导Rif的突变(Petersen-Mahrt,et al.,Nature 418,99-104(2002),Xu,M.,et al.,Journal of Bacteriology 187,2783-2792,doi:10.1128/JB.187.8.2783-2792.2005(2005),和Zenkin,N.,et al.,AntimicrobialAgents and Chemotherapy 49,1587-1590,doi:10.1128/AAC.49.4.1587-1590.2005(2005))。以C和S(野生型)以及T和F(突变体)显示修饰的核苷酸和氨基酸残基。图2E.用gRNA rpoB_3、rpoB_TS-4和乱序序列(scramble)(SEQ ID No:30-41)处理AIDCRC的突变分布。图2F.数据表明CRC积极地将位于未配对的链(原间隔区(protospacer))上的靶胞嘧啶残基脱氨基,优先更接近5'末端。
图3A和3B显示了CRC系统模块化:靶向模块的工程化提高了突变频率。图3A.当用rpoB_TS-4gRNA靶向时,将靶向模块从dCas9改变为nCas9D10A提高了系统效率,将利福平平板上的存活分数从高于对照18倍(AIDCRC)增加至43倍(AIDCRCD10A)。图3B.用rpoB_TS-4作为靶(SEQ ID No:30-32)的AIDCRCD10A处理的突变分布。C1592在100%的克隆中被修饰,其中75%的C被突变为T,25%的C被突变为A。
图4A和4B显示了CRC系统模块化:效应子模块的工程化提高了突变频率。图4A.APOBEC3G(APO3GCRCD10A)和APOBEC1(APO3GCRCD10A)作为效应子用原系统AIDCRC一起进行测试。当用rpoB_TS-4gRNA靶向时,用APOBEC1处理提高了相对于AIDCRCD10A的突变频率。APO3GCRCD10A比AIDCRC活性低。图4B.用rpoB_TS-4作为靶(SEQ ID NO:30-32)处理Apo1CRCD10A的突变分布(以%表示)。在100%的克隆中观察到C1592>T转换。另外,25%的分析克隆是双突变体,转换C1590>T,没有氨基酸改变。
图5A和5B显示了CRC系统模块化:增加RNA募集支架的数量提高了突变频率。图5A.增加募集支架的数量同时靶向相同的位置使突变效率从高于它们各自的编码乱序gRNA对照50倍(rpoB_TS-4 1xMS2)增加到140倍(rpoB_TS-4 2xMS2)。图5B.用rpoB_TS-4_2xMS2作为靶(SEQ ID NO:30-32)处理AIDCRCD10A的突变分布(以%表示)。C1592在100%的克隆中被修饰,62.5%的C被突变为T,37.5%的C被突变为A。
图6A、6B、6C和6D显示CRC系统能够修饰哺乳动物细胞中染色体外DNA中的靶核苷酸,恢复蛋白质功能。图6A.这些实验中使用的构建体的示意图。(上图)蛋白质编码基因在人泛素C启动子(UbC)的控制下作为多顺反子构建体被克隆,以确保该系统的两种蛋白质组分的化学计量浓度。(下图)嵌合gRNA_MS2构建体在U6或H1启动子的控制下被克隆,以分别用5'-G或5'-A表达靶。图6B.nfEGFPY66C缺陷荧光团周围靶区域的示意图。显示了(顶部)在这些实验中使用的gRNA的结合位点,所有的gRNA被编程为靶向非模板链(NT,+);(中部)DNA序列(SEQ ID NO:42和43),其中显示PAM(加框)和可突变位置(箭头);(底部)蛋白质序列(SEQID No:44),其中显示了消除EGFP荧光的突变氨基酸(箭头)。图6C.293T细胞中nfEGFPY66C靶向。用nfEGFPY66CNT-1和用效率较低的nfEGFPY66C_NT-2处理诱导EGFP信号,而用乱序gRNA检测不到信号。此外,将CRC平台与不同的基因编辑系统(BE3)进行比较,所述基因编辑系统需要胞苷脱氨酶蛋白质与Cas9蛋白质直接融合用于募集,并且需要共同表达尿嘧啶DNA糖基化酶(UGI)的抑制剂以提高效率。BF,明场。图6D.使用nfEGFPY66CNT-1作为靶向gRNA,用AIDCRCD10A和BE3系统处理GFP阳性细胞的定量(以%表示)。
图7A和7B显示处理CRC系统可导致哺乳动物细胞内源基因中的位点特异性核苷酸转换。图7A.中国仓鼠HPRT基因外显子3上靶区域示意图。显示了(顶部)DNA序列(SEQ IDNo:45和46),其中显示了PAM(加框)和可突变位置(箭头);(中部)在这些实验中使用的gRNA的结合位点,gRNA被编程以靶向模板链(TS,-);(底部)蛋白质序列(SEQ ID No:47),其中显示了涉及HPRT蛋白质不稳定性的关键氨基酸(箭头)。图7B.用AIDCRCD10A、BE3靶向HPRT或未处理后抗6-TG的V79-4细胞的定量。与未处理细胞相比,AIDCRCD10A处理的存活分数是未处理细胞的140倍,而BE3是40倍。
发明详述
目前的基因特异性编辑技术主要基于核酸酶诱导的DNA DSB和由此产生的DSB诱导的同源重组。由于大多数体细胞中同源重组的活性很低或不存在,所以这些技术在大多数疾病的体细胞组织中的病理性基因突变的治疗校正方面的用途有限。
如本文所公开的,本发明(至少部分)基于允许基因或RNA转录物的DNA序列定向编辑的新型平台或系统。该系统不依赖于核酸酶活性,不产生DSB,也不依赖于DSB介导的同源重组。此外,该平台的RNA支架的这种设计是模块化的,其允许以极其灵活和方便的方式靶向任何期望的DNA或RNA序列。实质上,这种方法使人们能够将DNA或RNA编辑酶引导至体细胞(包括干细胞)中的任何DNA或RNA序列。通过精确编辑靶DNA或RNA序列,该酶能够校正基因疾病中的突变基因,使病毒感染细胞中的病毒基因组失活,消除神经退行性疾病中致病蛋白质的表达,或沉默癌症中的致癌蛋白质。此外,这种方法能够通过离体编辑干细胞或祖细胞的基因组用于基于细胞的治疗。除了治疗应用外,该系统还能够作为强大的研究工具广泛应用于有任何生物的基因组的靶向修饰。
基因编辑平台
本发明的一个方面提供了一种基因编辑平台,其克服了目前核酸酶和DSB依赖的基因组工程和基因编辑技术的上述限制。该平台被命名为CasRcure系统或CRC系统,具有三个功能组分:(1)经工程化用于靶向序列的基于核酸酶缺陷型CRISPR/Cas的模块;(2)基于RNA支架的模块,用于将平台指导至靶序列以及用于募集校正模块;和(3)作为效应子校正模块的非核酸酶DNA/RNA修饰酶,如胞嘧啶脱氨酶(例如激活诱导的胞嘧啶脱氨酶,AID)。CasRcure系统一起允许特定的DNA/RNA测序锚定,将效应DNA/RNA修饰酶灵活地和模块化地募集到特定序列,并引发在体细胞中有活性的细胞通路,用于校正基因信息,特别是点突变。
图1中示出了示例性CasRcure系统的示意图。更具体地,该系统包括图1A中总结的三个结构和功能组分:(1)序列靶向模块(例如,dCas9蛋白);(2)用于序列识别和用于效应子募集的RNA支架(含有指导RNA基序、CRISPR RNA基序和募集RNA基序的RNA分子),和(3)效应子(非核酸酶DNA修饰酶,如与结合到募集RNA基序的小蛋白融合的AID)。三个组分可以构建在单个表达载体中或者在两个到三个分开的表达载体中。三个具体组分的整体和组合构成了技术平台的使能(enabling)。
如本文所公开的,募集机制之间有许多明确的区别:RNA支架介导的募集系统(CRC)相对于Cas9与效应子蛋白(BE3)的直接融合。以下实例中显示的结果表明,RNA支架介导的募集比染色体外靶(图6C和6D)和内源基因(图7B)的直接融合更有效。另外,CRC系统不依赖UNG抑制(DNA修复酶),而BE3使用有效的UNG抑制肽(UGI)。全局或局部DNA修复抑制可能导致不期望的、不可控制的、潜在的有害结果。另外,CRC系统的模块化设计可以实现灵活的系统工程化。模块是可以互换的,可以轻松实现不同模块的许多组合。另一方面,直接融合总是需要新的融合过程来设计新的模块。此外,RNA支架介导的募集可能促进效应子蛋白的寡聚化,而直接融合将由于空间位阻而不会形成寡聚体。
a.序列靶向模块
上述系统的序列靶向组分基于来自细菌物种的CRISPR/Cas系统。原始的功能性细菌CRISPR-Cas系统需要三个组分:提供核酸酶活性的Cas蛋白和称为CRISPR RNA(crRNA)和反式作用RNA(tracrRNA)的两个短的非编码RNA物质,这两个RNA物质形成所谓的指导RNA(gRNA)。II型CRISPR是表征最好的系统之一,并以四个连续的步骤进行靶向的DNA双链断裂。第一,从CRISPR基因座转录两个非编码RNA,pre-crRNA和tracrRNA。第二,tracrRNA与pre-crRNA分子的重复区域杂交,并介导pre-crRNA分子加工成含有单独间隔区序列的成熟crRNA分子。第三,成熟的crRNA:tracrRNA复合物(即所谓的指导RNA)通过在crRNA上的间隔区序列与包含3个核苷酸(nt)原间隔区邻近基序(PAM)的靶DNA上的原间隔区的互补序列之间的Watson-Crick碱基配,指引Cas核酸酶(例如Cas9)至靶DNA。PAM序列对于Cas9靶向是必需的。最后,Cas核酸酶介导靶DNA的切割以在靶位点内产生双链断裂。在其天然情况下,CRISPR/Cas系统作为一种适应性免疫系统,可以保护细菌免受反复病毒感染,并且PAM序列可以作为自我/非自我识别信号,而Cas9蛋白具有核酸酶活性。已显示CRISPR/Cas系统在体外和体内都具有巨大的基因编辑潜力。
在本文公开的发明中,序列识别机制可以以类似的方式实现。也就是说,突变Cas蛋白(例如在其核酸酶催化结构域中含有突变的dCas9蛋白质因此不具有核酸酶活性,或者在催化结构域之一处部分突变的nCas9蛋白质因此不具有用于产生DSB的核酸酶活性)特异性识别含有短的间隔区序列(典型地长20个核苷酸)的非编码RNA支架分子,其将Cas蛋白指导至其靶DNA或RNA序列。后者侧翼是3'PAM。
在本发明中可以使用各种Cas蛋白。Cas蛋白(CRISPR相关蛋白或CRISPR蛋白,可互换使用)是指I型、II型或III型CRISPR-Cas系统(其具有RNA指导的DNA结合)的蛋白或由此衍生的蛋白。合适的CRISPR/Cas蛋白的非限制性实例包括Cas3,Cas4,Cas5,Cas5e(或CasD),Cas6,Cas6e,Cas6f,Cas7,Cas8a1,Cas8a2,Cas8b,Cas8c,Cas9,Cas10,Cas10d,CasF,CasG,CasH,Csy1,Csy2,Csy3,Cse1(或CasA),Cse2(或CasB),Cse3(或CasE),Cse4(或CasC),Csc1,Csc2,Csa5,Csn2,Csm2,Csm3,Csm4,Csm5,Csm6,Cmr1,Cmr3,Cmr4,Cmr5,Cmr6,Csb1,Csb2,Csb3,Csx17,Csx14,Csx10,Csx16,CsaX,Csx3,Csz1,Csx15,Csf1,Csf2,Csf3,Csf4和Cu1966。参见例如WO2014144761、WO2014144592、WO2013176772、US20140273226和US20140273233,其内容通过引用整体并入本文。
在一个实施方案中,Cas蛋白衍生自II型CRISPR-Cas系统。在示例性实施方案中,Cas蛋白是或衍生自Cas9蛋白。Cas9蛋白可以来自化脓性链球菌(Streptococcuspyogenes),嗜热链球菌(Streptococcus thermophilus),链球菌(Streptococcus sp.),达松维尔拟诺卡氏菌(Nocardiopsis dassonvillei),始旋链霉菌(Streptomycespristinaespiralis),绿色产色链霉菌(Streptomyces viridochromogenes),绿色产色链霉菌(Streptomyces viridochromogenes),玫瑰链孢囊菌(Streptosporangium roseum),玫瑰链孢囊菌(Streptosporangium roseum),酸热脂环酸杆菌(Alicyclobacillusacidocaldarius),假蕈状芽孢杆菌(Bacillus pseudomycoides),Bacillusselenitireducens,Exiguobacterium sibiricum,德氏乳杆菌(Lactobacillusdelbrueckii),唾液乳杆菌(Lactobacillus salivarius),海洋微颤蓝细菌(Microscillamarina),Burkholderiales bacterium,Polaromonas naphthalenivorans,Polaromonassp.,Crocosphaera watsonii,蓝杆藻(Cyanothece sp.),铜绿微囊蓝细菌(Microcystisaeruginosa),聚球菌(Synechococcus sp.),伯糖醋盐杆菌(Acetohalobium arabaticum),Ammonifex degensii,Caldicelulosiruptor becscii,Candidatus Desulforudis,肉毒梭菌(Clostridium botulinum),艰难梭菌(Clostridium difficile),大芬戈尔德菌(Finegoldia magna),嗜热盐碱厌氧菌(Natranaerobius thermophilus),丙酸互营细菌(Pelotomaculum thermopropionicum),Acidithiobacillus caldus,嗜酸氧化亚铁硫杆菌(Acidithiobacillus ferrooxidans),紫色硫细菌(Allochromatium vinosum),海杆菌(Marinobacter sp.),嗜盐硝化球菌(Nitrosococcus halophilus),Nitrosococcuswatsoni,Pseudoalteromonas haloplanktis,Ktedonobacter racemifer,Methanohalobium evestigatum,多鱼腥蓝细菌(Anabaena variabilis),产泡沫节球蓝细菌(Nodularia spumigena),Nostoc sp.,最大节螺蓝细菌(Arthrospira maxima),Arthrospira platensis,Arthrospira sp.,Lyngbya sp.,原型微鞘藻(Microcoleuschthonoplastes),Oscillatoria sp.,Petrotoga mobilis,Thermosipho africanus,或Acaryochloris marina。
通常,Cas蛋白包括至少一个RNA结合结构域。RNA结合结构域与指导RNA相互作用。Cas蛋白可以是野生型Cas蛋白或不具有核酸酶活性的修饰形式。可以修饰Cas蛋白以增加核酸结合亲和力和/或特异性,改变酶活性,和/或改变蛋白质的另一种性质。例如,可修饰、缺失或失活所述蛋白的核酸酶(即DNase、RNase)结构域。或者,可以截短所述蛋白以去除对于该蛋白功能不是必需的结构域。所述蛋白也可以被截短或修饰以优化效应子结构域的活性。
在一些实施方案中,Cas蛋白可以是野生型Cas蛋白(例如Cas9)或其片段的突变体。在其他实施方案中,Cas蛋白可以衍生自突变Cas蛋白。例如,可以修饰Cas9蛋白的氨基酸序列以改变蛋白质的一种或多种性质(例如,核酸酶活性,亲和力,稳定性等)。或者,可以从Cas9蛋白质中去除不涉及RNA靶向的所述蛋白的结构域,使得修饰的Cas9蛋白小于野生型Cas9蛋白。在一些实施方案中,本系统利用来自化脓性链球菌(S.pyogenes)的Cas9蛋白,或者如细菌中所编码或者经密码子优化以在哺乳动物细胞中表达。
突变Cas蛋白是指野生型蛋白的多肽衍生物,例如具有一个或多个点突变、插入、缺失、截短、融合蛋白的蛋白或其组合。该突变体具有RNA指导的DNA结合活性或RNA指导的核酸酶活性中的至少一种或两者。通常,修饰形式与野生型蛋白如下面的SEQ ID No.1至少50%(例如50%和100%之间,包括例如50%、60%、70%、75%、80%、85%、90%、95%和99%)相同。
可以作为重组多肽获得Cas蛋白(以及本发明中描述的其他蛋白组分)。为了制备重组多肽,编码它的核酸可以与编码融合伴侣的另一核酸(例如谷胱甘肽-S-转移酶(GST)、6x-His表位标签或M13基因3蛋白)连接。所得到的融合核酸在合适的宿主细胞中表达可以通过本领域已知的方法分离的融合蛋白。可以进一步处理分离的融合蛋白,例如通过酶消化来除去融合伴侣并获得本发明的重组多肽。或者,可以化学合成(参见例如Creighton,"Proteins:Structures and Molecular Principles,"W.H.Freeman&Co.,NY,1983),或者如本文所述通过重组DNA技术产生这些蛋白。本领域技术人员可以参考Frederick M.Ausubelet al.,Current Protocols in Molecular Biology,John Wiley&Sons,2003;和Sambrooket al.,Molecular Cloning,A Laboratory Manual,"Cold Spring Harbor Press,ColdSpring Harbor,NY,2001得到另外的指导。
本发明中描述的Cas蛋白可以以纯化或分离的形式提供,或者可以是组合物的一部分。优选地,在以组合物形式的情况下,首先将这些蛋白纯化至一定程度,更优选至高水平的纯度(例如,约80%、90%、95%或99%或更高)。根据本发明的组合物可以是任何类型的所需组合物,但是通常是适合用作或包含在用于RNA指导的靶向的组合物中的含水组合物。本领域技术人员很清楚可以包含在这种核酸酶反应组合物中的各种物质。
如本文所公开的,可以使用核酸酶失活的Cas9(dCas9,例如来自化脓性链球菌D10A的H840A突变蛋白,图1A)或者核酸酶缺陷切口酶Cas9(nCas9,例如来自化脓性链球菌D10A突变蛋白,图1A和图2F)。dCas9或nCas9也可以来自各种细菌物种。表1列出了dCas9以及它们相应的PAM要求实例的非穷尽列表。
表1.
b.用于序列识别和效应子募集的RNA支架:
本文公开的平台的第二组分是RNA支架,其具有三个亚组分:可编程的指导RNA基序、CRISPR RNA基序和募集RNA基序。该支架可以是单个RNA分子,或者是多个RNA分子的复合物。如本文所公开的,可编程的指导RNA、CRISPR RNA和Cas蛋白一起形成用于序列靶向和识别的基于CRISPR/Cas的模块,而募集RNA基序通过RNA-蛋白结合对募集携带基因修正的蛋白效应子。因此,该第二组分连接校正模块和序列识别模块。
可编程指导RNA
一个关键的亚组分是可编程的指导RNA。由于其简单和高效,CRISPR-Cas系统已被用于在各种生物的细胞中进行基因组编辑。该系统的特异性由靶DNA与定制设计的指导RNA之间的碱基配对决定。通过设计和调整指导RNA的碱基配对性质,只要靶序列中存在PAM序列,就可以靶向任何感兴趣的序列。
在本文公开的RNA支架的亚组分中,指导序列提供靶向特异性。它包括与预先选定的感兴趣的靶位点互补并能够与之杂交的区域。在各种实施方案中,该指导序列可以包含约10个核苷酸至超过约25个核苷酸。例如,指导序列与相应的靶位点序列之间的碱基配对区域可以长约10,11,12,13,14,15,16,17,18,19,20,22,23,24,25,或超过25个核苷酸。在示例性实施方案中,指导序列长约17-20个核苷酸,例如20个核苷酸。
选择合适的靶核酸的一个要求是它具有3'PAM位点/序列。每个靶序列及其相应的PAM位点/序列在本文中被称为Cas靶向位点。表征最好的系统之一II型CRISPR系统只需要Cas9蛋白和与靶序列互补的指导RNA来影响靶切割。化脓性链球菌的II型CRISPR系统使用具有N12-20NGG的靶位点,其中NGG代表来自化脓性链球菌的PAM位点,N12-20代表直接位于PAM位点5'的12-20个核苷酸。来自其他细菌物种的其他PAM位点序列包括NGGNG,NNNNGATT,NNAGAA,NNAGAAW和NAAAAC。参见例如US 20140273233,WO 2013176772,Cong et al.,(2012),Science 339(6121):819–823,Jinek et al.,(2012),Science 337(6096):816–821,Mali et al,(2013),Science 339(6121):823–826,Gasiunas et al.,(2012),ProcNatl Acad Sci U S A.109(39):E2579–E2586,Cho et al.,(2013)Nature Biotechnology31,230–232,Hou et al.,Proc Natl Acad Sci U S A.2013Sep 24;110(39):15644-9,Mojica et al.,Microbiology.2009Mar;155(Pt 3):733-40,和www.addgene.org/CRISPR/。这些文献的内容通过引用整体并入本文。
靶核酸链可以是宿主细胞中基因组DNA上的两条链中的任一条。这样的基因组双链DNA的实例包括但不一定限于宿主细胞染色体,线粒体DNA和稳定维持的质粒。然而,可以理解的是,本方法可以对存在于宿主细胞中的其它dsDNA实施,例如非稳定的质粒DNA、病毒DNA和噬菌粒DNA,只要存在Cas靶向的位点而不管宿主细胞dsDNA的性质。本方法也可以对RNA实施。
CRISPR基序
除了上述指导序列之外,本发明的RNA支架还包括额外的活性或非活性亚组分。在一个实例中,支架含有具有tracrRNA活性的CRISPR基序。例如,支架可以是杂合RNA分子,其中上述可编程指导RNA与tracrRNA融合以模拟天然crRNA:tracrRNA双链体。示例性的杂合crRNA:tracRNA,sgRNA序列:5'-(20nt指导)-GUUUAAGAGCUAUGCUGGAAACAGCAUAGCAAGUUUAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUUUUU-3’(SEQ ID No:4;Chenet al.Cell.2013Dec 19;155(7):1479-91)。本领域已知各种tracrRNA序列,并且实例包括以下tracrRNA及其活性部分。如本文所用,tracrRNA的活性部分保留与Cas蛋白(例如Cas9或dCas9)形成复合物的能力。参见例如WO2014144592。产生crRNA-tracrRNA杂合RNA的方法是本领域已知的。参见例如WO2014099750,US 20140179006,和US 20140273226。这些文献的内容通过引用整体并入本文。
GGAACCAUUCAAAACAGCAUAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGC(SEQ ID No:5);
UAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGC(SEQ IDNo:6);
AGCAUAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGC(SEQID No:7);
CAAAACAGCAUAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGC(SEQ ID No:8);
UAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUG(SEQ ID No:9);
UAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCA(SEQ ID No:10);和
UAGCAAGUUAAAAUAAGGCUAGUCCG(SEQ ID No:11)。
在一些实施方案中,tracrRNA活性和指导序列是两个独立的RNA分子,它们一起形成指导RNA和相关支架。在这种情况下,具有tracrRNA活性的分子应该能够与具有指导序列的分子相互作用(通常通过碱基配对)。
募集RNA基序
RNA支架的第三个亚组分是募集RNA模体,其连接校正模块和序列识别模块。这个连接对于本文公开的平台是至关重要的。
将效应子/DNA编辑酶募集到靶序列的一种方法是通过效应子蛋白与dCas9的直接融合。效应子酶(“校正模块”)与序列识别所需的蛋白质(如dCas9)直接融合已经在序列特异性转录活化或抑制方面取得了成功,但蛋白质-蛋白质融合设计可能会产生空间位阻,其对于需要为其活性形成多聚体复合物的酶而言不是理想的。事实上,大多数核苷酸编辑酶(如AID或APOBEC3G)的DNA编辑催化活性都需要形成二聚体、四聚体或更高级寡聚体。与dCas9直接融合(其以确定的构象锚定到DNA上)将阻碍在正确位置形成功能性寡聚酶复合物。
相比之下,本文公开的平台基于RNA支架介导的效应子蛋白募集。更具体地,该平台利用了各种RNA基序/RNA结合蛋白结合对的优势。为此目的,设计RNA支架以使与RNA结合蛋白(例如MS2外壳蛋白,MCP)特异性结合的RNA基序(例如,MS2操纵子基序)连接至gRNA-CRISPR支架(图1A)。
结果,本文公开的平台的这种RNA支架组分是设计的RNA分子,其不仅含有用于特异性DNA/RNA序列识别的gRNA基序、用于dCas9结合的CRISPR RNA基序,而且含有用于效应子募集的募集RNA基序(图1A)。通过这种方式,募集的效应子蛋白融合蛋白可以通过其与募集RNA基序结合的能力而募集到靶位点。由于RNA支架介导的募集的灵活性,功能单体以及二聚体、四聚体或寡聚体可能相对容易在靶DNA或RNA序列附近形成。图1B-E中示出了示例构象。这些RNA募集基序/结合蛋白对可以衍生自天然存在的来源(例如RNA噬菌体或酵母端粒酶),或可以是人工设计的(例如,RNA适体及其相应的结合蛋白配体)。表2总结了可以在CasRcure系统中使用的募集RNA基序/RNA结合蛋白对的非穷尽列表。
表2.能够用于本发明的募集RNA基序以及它们配对的RNA结合蛋白/蛋白结构域的实例。
*募集的蛋白融合至效应子蛋白,例如参见表3。
以下列出了上述结合对的序列。
1.端粒酶Ku结合基序/Ku异二聚体
a.Ku结合发夹
5’-TTCTTGTCGTACTTATAGATCGCTACGTTATTTCAATTTTGAAAATCTGAGTCCTGGGAGTGCGGA-3’(SEQ ID No:12)
b.Ku异二聚体
MSGWESYYKTEGDEEAEEEQEENLEASGDYKYSGRDSLIFLVDASKAMFESQSEDELTPFDMSIQCIQSVYISKIISSDRDLLAVVFYGTEKDKNSVNFKNIYVLQELDNPGAKRILELDQFKGQQGQKRFQDMMGHGSDYSLSEVLWVCANLFSDVQFKMSHKRIMLFTNEDNPHGNDSAKASRARTKAGDLRDTGIFLDLMHLKKPGGFDISLFYRDIISIAEDEDLRVHFEESSKLEDLLRKVRAKETRKRALSRLKLKLNKDIVISVGIYNLVQKALKPPPIKLYRETNEPVKTKTRTFNTSTGGLLLPSDTKRSQIYGSRQIILEKEETEELKRFDDPGLMLMGFKPLVLLKKHHYLRPSLFVYPEESLVIGSSTLFSALLIKCLEKEVAALCRYTPRRNIPPYFVALVPQEEELDDQKIQVTPPGFQLVFLPFADDKRKMPFTEKIMATPEQVGKMKAIVEKLRFTYRSDSFENPVLQQHFRNLEALALDLMEPEQAVDLTLPKVEAMNKRLGSLVDEFKELVYPPDYNPEGKVTKRKHDNEGSGSKRPKVEYSEEELKTHISKGTLGKFTVPMLKEACRAYGLKSGLKKQELLEALTKHFQD>(SEQ ID No:13)
MVRSGNKAAVVLCMDVGFTMSNSIPGIESPFEQAKKVITMFVQRQVFAENKDEIALVLFGTDGTDNPLSGGDQYQNITVHRHLMLPDFDLLEDIESKIQPGSQQADFLDALIVSMDVIQHETIGKKFEKRHIEIFTDLSSRFSKSQLDIIIHSLKKCDISERHSIHWPCRLTIGSNLSIRIAAYKSILQERVKKTWTVVDAKTLKKEDIQKETVYCLNDDDETEVLKEDIIQGFRYGSDIVPFSKVDEEQMKYKSEGKCFSVLGFCKSSQVQRRFFMGNQVLKVFAARDDEAAAVALSSLIHALDDLDMVAIVRYAYDKRANPQVGVAFPHIKHNYECLVYVQLPFMEDLRQYMFSSLKNSKKYAPTEAQLNAVDALIDSMSLAKKDEKTDTLEDLFPTTKIPNPRFQRLFQCLLHRALHPREPLPPIQQHIWNMLNPPAEVTTKSQIPLSKIKTLFPLIEAKKKDQVTAQEIFQDNHEDGPTAK(SEQ ID No:14)
2.端粒酶Sm7结合基序/Sm7同七聚体
a.Sm共有位点(单链)
5’-AATTTTTGGA-3’(SEQ ID No:15)
b.单体Sm–样蛋白(古细菌)
GSVIDVSSQRVNVQRPLDALGNSLNSPVIIKLKGDREFRGVLKSFDLHMNLVLNDAEELEDGEVTRRLGTVLIRGDNIVYISP(SEQ ID No:16)
3.MS2噬菌体操纵子茎环/MS2外壳蛋白
a.MS2噬菌体操纵子茎环
5’-GCGCACATGAGGATCACCCATGTGC-3’(SEQ ID No:17)
b.MS2外壳蛋白
MASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID No:18)
4.PP7噬菌体操纵子茎环/PP7外壳蛋白
a.PP7噬菌体操纵子茎环
5’-aTAAGGAGTTTATATGGAAACCCTTA-3’(SEQ ID No:19)
b.PP7外壳蛋白(PCP)
MSKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDCSTSVCGELPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGR.(SEQ ID No:20)
5.SfMu Com茎环/SfMu Com结合蛋白
a.SfMu Com茎环
5’-CTGAATGCCTGCGAGCATC-3’(SEQ ID No:21)
b.SfMu Com结合蛋白
MKSIRCKNCNKLLFKADSFDHIEIRCPRCKRHIIMLNACEHPTEKHCGKREKITHSDETVRY(SEQ IDNo:22)
RNA支架可以是单个RNA分子,或者是多个RNA分子的复合物。例如,指导RNA、CRISPR基序和募集RNA基序可以是一个长的单RNA分子的三个区段。或者,它们中的一个、两个或三个可以在独立的分子上。在后一种情况下,三种组分可以通过共价或非共价连接或结合(包括例如Watson-Crick碱基配对)连接在一起形成支架。
在一个实例中,RNA支架可以包含两个独立的RNA分子。第一RNA分子可以包含可编程的指导RNA和能够与互补区形成茎双链体结构的区域。除了CRISPR基序和募集DNA基序之外,第二RNA分子可以包含互补区。通过这种茎双链体结构,第一和第二RNA分子形成本发明的RNA支架。在一个实施方案中,第一和第二RNA分子各自包含与其他序列碱基配对的(约6至约20个核苷酸的)序列。同样,CRISPR基序和募集DNA基序也可以在不同的RNA分子上并且与另一个茎双链体结构在一起。
本发明的RNA和相关支架可以通过本领域已知的各种方法(包括基于细胞的表达、体外转录和化学合成)制备。使用TC-RNA化学(参见例如美国专利8,202,983)化学合成相对较长的RNA(长达200聚体或更多)的能力能够产生具有胜过四个基本核糖核苷酸(A、C、G和U)所能够具有的性质的特殊特征的RNA。
可以使用本领域已知的宿主细胞系统或体外翻译-转录系统,用重组技术制备Cas蛋白-指导RNA支架复合物。这样的系统和技术的细节可以在例如WO2014144761、WO2014144592、WO2013176772、US20140273226、和US20140273233中找到,其内容通过引用整体并入本文。复合物可以从细胞的细胞材料或其中产生它们的体外翻译-转录系统中分离或纯化(至少至某种程度)。
RNA支架可以包括一个或多个修饰。这样的修饰可以包括引入至少一个非天然存在的核苷酸或修饰的核苷酸或其类似物。修饰的核苷酸可以在核糖、磷酸和/或碱基部分被修饰。修饰的核苷酸可以包括2'-O-甲基类似物、2'-脱氧类似物或2'-氟类似物。核酸主链可以被修饰,例如,可以使用硫代磷酸酯主链。锁核酸(LNA)或桥式核酸(BNA)的使用也可以是可以的。修饰的碱基的其他实例包括但不限于2-氨基嘌呤,5-溴-尿苷,假尿苷,肌苷,7-甲基鸟苷。这些修饰可能适用于CRISPR系统的任何组分。在优选的实施方案中,对RNA成分(例如指导RNA序列)进行这些修饰。
c.效应子:非核酸酶DNA修饰酶
本发明公开的平台的第三组分是非核酸酶效应子。所述效应子不是核酸酶,并且不具有核酸酶活性,但可以具有其他类型DNA修饰酶的活性。酶促活性的实例包括但不限于脱氨活性,甲基转移酶活性,脱甲基酶活性,DNA修复活性,DNA损伤活性,歧化酶活性,烷基化活性,脱嘌呤活性,氧化活性,嘧啶二聚体形成活性,整合酶活性,转座酶活性,重组酶活性,聚合酶活性,连接酶活性,解旋酶活性,光裂合酶活性或糖基化酶活性。在一些实施方案中,效应子具有胞嘧啶脱氨酶(例如AID,APOBEC3G)、腺苷脱氨酶(例如ADA)、DNA甲基转移酶和DNA脱甲基酶的活性。
在优选实施方案中,该第三组分是具有RNA结合结构域和效应子结构域的缀合物或融合蛋白。这两个结构域可以通过接头连接。
RNA结合结构域
尽管本发明可以使用各种RNA结合结构域,但不应使用Cas蛋白(如Cas9)或其变体(如dCas9)的RNA结合结构域。如上所述,与dCas9直接融合(其以确定的构象锚定于DNA)将阻碍在正确位置形成功能性寡聚酶复合物。相反,本发明利用了各种其他RNA基序-RNA结合蛋白结合对。实例包括表2中列出的那些实例。
通过这种方式,可以通过RNA结合结构域与募集RNA基序结合的能力将效应子蛋白募集到靶位点。由于RNA支架介导的募集的灵活性,功能单体以及二聚体、四聚体或寡聚体可相对容易地在靶DNA或RNA序列附近形成。
效应子结构域
效应子组分包含活性部分,即效应子结构域。在一些实施方案中,效应子结构域包含非核酸酶蛋白(例如脱氨酶)的天然存在的活性部分。在其他实施方案中,效应子结构域包含非核酸酶蛋白的天然存在的活性部分的修饰的氨基酸序列(例如,置换、缺失、插入)。效应子结构域具有酶活性。该活性的实例包括脱氨活性,甲基转移酶活性,脱甲基酶活性,DNA修复活性,DNA损伤活性,歧化酶活性,烷基化活性,脱嘌呤活性,氧化活性,嘧啶二聚体形成活性,整合酶活性,转座酶活性,重组酶活性,聚合酶活性,连接酶活性,解旋酶活性,光裂合酶活性,糖基化酶活性,DNA甲基化,组蛋白乙酰化活性,或组蛋白甲基化活性。
接头
上述两个结构域以及本文所公开的其他结构域可以通过接头,例如但不限于化学修饰,肽接头,化学接头,共价或非共价键或蛋白质融合,或通过本领域技术人员已知的工具连接。连接可以是永久的或可逆的。参见例如美国专利号4625014、5057301和5514363,美国申请号20150182596和20100063258,以及WO2012142515,其内容通过引用整体并入本文。在一些实施方案中,可以包括几种接头以利用缀合物中每个接头和每个蛋白质结构域的期望性质。例如,柔性接头和增加缀合物溶解度的接头可以考虑单独使用或与其它接头一起使用。肽接头可以通过表达编码接头的DNA和缀合物中的一个或多个蛋白质结构域来连接。接头可以是酸可切割、可光裂合和热敏接头。缀合方法是本领域技术人员众所周知的,并且包括用于本发明。
在一些实施方案中,RNA结合结构域和效应子结构域可以通过肽接头连接。可以通过表达框架内编码两个结构域和接头的核酸来连接肽接头。任选地,可以在结构域的氨基末端和羧基末端之一或两者处连接接头肽。在一些实例中,接头是如美国专利号6,165,476、5,856,456,美国申请号20150182596和2010/0063258以及国际申请WO2012/142515中公开的免疫球蛋白铰链区接头,其每一篇通过引用整体并入本文。
其他结构域
效应子融合蛋白可以包含其他结构域。在某些实施方案中,效应子融合蛋白可以包含至少一个核定位信号(NLS)。通常,NLS包含一段碱性氨基酸。核定位信号在本领域是已知的(参见例如Lange et al.,J.Biol.Chem.,2007,282:5101-5105)。NLS可以位于融合蛋白的N端、C端或内部位置中。
在一些实施方案中,融合蛋白可以包含至少一个细胞穿透结构域以促进蛋白递送到靶细胞中。在一个实施方案中,细胞穿透结构域可以是细胞穿透肽序列。本领域已知各种细胞穿透肽序列,实例包括HIV-1TAT蛋白,人HBV的TLM,Pep-1,VP22和多聚精氨酸肽序列。
在其他实施方案中,融合蛋白可以包含至少一个标志物结构域。标志物结构域的非限制性实例包括荧光蛋白、纯化标签和表位标签。在一些实施方案中,标志物结构域可以是荧光蛋白。在其他实施方案中,标志物结构域可以是纯化标签和/或表位标签。参见例如US 20140273233。
在一个实施方案中,使用AID作为实例来说明系统如何工作。AID是一种胞苷脱氨酶,其能够催化DNA或RNA情况下胞嘧啶脱氨反应。当被带到靶向位点时,AID将C碱基改为U碱基。在分裂细胞中,这可能导致C到T点突变。或者,C到U的变化可以触发细胞DNA修复途径,主要是剪切修复途径,其将去除错配的U-G碱基对,并替换为T-A、A-T、C-G或G-C对。结果,在靶C-G位点产生点突变。由于剪切修复途径在大多数(如果不是全部的话)体细胞中存在,因此募集AID至靶位点可以将C-G碱基对校正为其他碱基对。在这种情况下,如果C-G碱基对是体细胞组织/细胞中引起潜在疾病的基因突变,则上述方法能够用于校正突变并由此治疗疾病。
同样,如果引起潜在疾病的基因突变是特定位点上的A-T碱基对,则能够使用相同方法将腺苷脱氨酶募集至特定位点,其中腺苷脱氨酶在此能够将A-T碱基对校正为其他碱基对。预计其他效应子酶会产生其他类型的碱基配对变化。表3中详细列出了DNA/RNA修饰酶实例的非穷尽列表。
表3.能够用于本发明的效应子蛋白的实例
效应子蛋白全称:
AID:激活诱导胞嘧啶脱氨酶,又称AICDA
APOBEC1:载脂蛋白B mRNA编辑酶,催化多肽样1.
APOBEC3A:载脂蛋白B mRNA编辑酶,催化多肽样3A
APOBEC3B:载脂蛋白B mRNA编辑酶,催化多肽样3B
APOBEC3C:载脂蛋白B mRNA编辑酶,催化多肽样3C
APOBEC3D:载脂蛋白B mRNA编辑酶,催化多肽样3D
APOBEC3F:载脂蛋白B mRNA编辑酶,催化多肽样3F
APOBEC3G:载脂蛋白B mRNA编辑酶,催化多肽样3G
APOBEC3H:载脂蛋白B mRNA编辑酶,催化多肽样3H
ADA:腺苷脱氨酶
ADAR1:作用于RNA的腺苷脱氨酶1
Dnmt1:DNA(胞嘧啶-5-)-甲基转移酶1
Dnmt3a:DNA(胞嘧啶-5-)-甲基转移酶3α
Dnmt3b:DNA(胞嘧啶-5-)-甲基转移酶3β
Tet1:甲基胞嘧啶双加氧酶
上述三个具体组分构成了该技术平台。可以从表1-3中的列表中分别选择每种组分以实现特定的治疗/效用目标。
在一个实例中,使用(i)来自化脓性链球菌的dCas9作为靶向序列的蛋白,(ii)含有指导RNA序列、CRISPR RNA基序和MS2操纵子基序的RNA支架,和(iii)含有与MS2操纵子结合蛋白MCP融合的人AID的效应子融合体,来构建CasRcure系统。下面列出了这些组分的序列
化脓性链球菌dCas9蛋白序列(SEQ ID No.1)
MDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD
(加下划线的残基:D10A、H840A活性位点突变体)
RNA支架表达盒(化脓性链球菌),含有20-核苷酸可编程序列、CRISPR RNA基序和MS2操纵子基序(SEQ ID No.2):
(N20:可编程序列;加下划线:CRISPR RNA基序;粗体:MS2基序;斜体:终止子)
效应子AID-MCP融合体(SEQ ID No.3):
像上述的Cas蛋白一样,非核酸酶效应子也可以作为重组多肽获得。制备重组多肽的技术是本领域已知的。参见例如Creighton,"Proteins:Structures and MolecularPrinciples,"W.H.Freeman&Co.,NY,1983);Ausubel et al.,Current Protocols inMolecular Biology,John Wiley&Sons,2003;和Sambrook et al.,Molecular Cloning,ALaboratory Manual,"Cold Spring Harbor Press,Cold Spring Harbor,NY,2001。
可以使用一种到三种表达载体来表达本文公开的平台/系统的上述三种组分。该系统可以被编程为实际上靶向任何DNA或RNA序列。
表达系统
为了使用上述平台,可能需要从编码它们的核酸中表达一种或多种蛋白质和RNA组分。这可以以各种方式进行。例如,编码RNA支架或蛋白质的核酸可被克隆到一个或多个中间载体中,用于引入原核或真核细胞中进行复制和/或转录。中间载体通常是用于储存或操作编码RNA支架或蛋白质的核酸以产生RNA支架或蛋白质的原核载体,例如质粒或穿梭载体或昆虫载体。也可以将核酸克隆到一个或多个表达载体中,用于施用于植物细胞,动物细胞,优选哺乳动物细胞或人细胞,真菌细胞,细菌细胞或原生动物细胞。因此,本发明提供了编码上述任何RNA支架或蛋白质的核酸。优选地,所述核酸经分离和/或纯化。
本发明还提供了具有编码上述RNA支架或蛋白质中的一个或多个的序列的重组构建体或载体。构建体的实例包括已经以正向或反向方向插入了本发明的核酸序列的载体如质粒或病毒载体。在优选的实施方案中,构建体还包括与该序列可操作连接的调控序列,包括启动子。大量合适的载体和启动子是本领域技术人员已知的,并且是可商购获得的。用于原核和真核宿主的合适克隆和表达载体也描述于例如Sambrook et al.(2001,MolecularCloning:ALaboratory Manual,Cold Spring Harbor Press)中。
载体是指能够运输已经与其连接的另一核酸的核酸分子。载体能够自主复制或整合到宿主DNA中。载体的实例包括质粒、粘粒或病毒载体。本发明的载体包括适于在宿主细胞中表达核酸的形式的核酸。优选地,载体包括与待表达的核酸序列可操作地连接的一个或多个调控序列。“调控序列”包括启动子、增强子和其他表达控制元件(例如多腺苷酸化信号)。调控序列包括指导核苷酸序列组成型表达的调控序列,以及诱导型调控序列。表达载体的设计可取决于诸如待转化、转染或感染的宿主细胞的选择,期望的RNA或蛋白质的表达水平等因素。
表达载体的实例包括染色体、非染色体和合成的DNA序列,细菌质粒,噬菌体DNA,杆状病毒,酵母质粒,由质粒和噬菌体DNA组合衍生得到的载体,病毒DNA如牛痘、腺病毒、禽痘病毒和伪狂犬病毒。但是,可以使用任何其他载体,只要其在宿主中是可复制且能存活的。可以通过多种方法将合适的核酸序列插入载体中。通常,可以通过本领域已知的方法将编码上述RNA或蛋白之一的核酸序列插入合适的限制性内切核酸酶位点内。这样的方法和相关的亚克隆方法在本领域技术人员的范围内。
载体可以包括用于扩增表达的合适序列。此外,表达载体优选含有一个或多个选择性标志物基因以提供用于选择转化的宿主细胞的表型性状,例如用于真核细胞培养物的二氢叶酸还原酶或新霉素抗性,或例如在大肠杆菌中的四环素或氨苄青霉素抗性。
用于表达RNA的载体可以包括RNAPol III启动子例如H1、U6或7SK启动子,以驱动RNA的表达。这些人启动子允许质粒转染后在哺乳动物细胞中表达RNA。或者,可以使用T7启动子,例如用于体外转录,并且可以在体外转录和纯化RNA。
含有如上所述的合适核酸序列以及合适的启动子或控制序列的载体可用于转化、转染或感染合适的宿主以使宿主表达上述的RNA或蛋白质。合适的表达宿主的实例包括细菌细胞(例如大肠杆菌(E.coli),链霉菌(Streptomyces),鼠伤寒沙门氏菌(Salmonellatyphimurium)),真菌细胞(酵母),昆虫细胞(例如果蝇(Drosophila)和草地贪夜蛾(Spodoptera frugiperda)(Sf9)),动物细胞(例如CHO、COS和HEK 293),腺病毒,和植物细胞。选择合适的宿主在本领域技术人员的范围内。在一些实施方案中,本发明提供了通过用具有编码所述RNA或多肽或蛋白质之一的核苷酸序列的表达载体转化、转染或感染宿主细胞来产生上述RNA或蛋白质的方法。然后在合适的条件下培养宿主细胞,这可以表达RNA或蛋白质。
可以使用本领域已知的用于将外源核苷酸序列引入宿主细胞的任何方法。实例包括使用磷酸钙转染,聚凝胺,原生质体融合,电穿孔,核转染,脂质体,显微注射,裸DNA,质粒载体,病毒载体(游离型和整合型),以及用于将克隆的基因组DNA、cDNA、合成DNA或其他外源基因物质引入宿主细胞的任何其它众所周知的方法。
方法
本发明的另一方面包括用于修饰细胞、非人动物胚胎、人或非人动物中的靶DNA序列(例如染色体序列)或靶RNA序列的方法。该方法包括将上述(i)靶向序列的蛋白或编码其的多核苷酸,(ii)RNA支架或编码其的DNA多核苷酸和(iii)核酸酶效应子融合蛋白或编码其的多核苷酸引入细胞或非人动物胚胎中。RNA支架将靶向序列的蛋白和融合蛋白指导至靶位点处的靶多核苷酸,并且融合蛋白的效应子结构域修饰该序列。如本文所公开的,对靶向序列的蛋白质(例如cas9蛋白质)进行修饰,使得内切核酸酶活性被消除。
在某些实施方案中,效应子蛋白作为单体起作用。在该情况下,如图1B所示,本发明的系统可以靶向靶位点的上游(左)或下游(右)的单个位点。在其他实施方案中,效应子蛋白需要二聚化以获得合适的催化功能。为此目的,可以将该系统多重化以同时靶向至靶位点的上游和下游靶序列,从而使效应子蛋白二聚化(图1C,左)。或者,将效应子蛋白募集到单个位点可足以增加其对相邻效应子蛋白的亲和力,促进二聚化(图1C,右)。在又一些其他实施方案中,如图1D所示,四聚体效应子酶可以被募集并定位在靶位点。这可以通过双靶向(图1D,左)或单靶向(图1D,右)实现。本发明中公开的系统也可用于编辑RNA靶(例如逆转录病毒失活)。参见图1E。在该情况下,如果效应子蛋白需要装配功能性寡聚体,则如图1C和1D的右图所示,单靶向至RNA分子可促进寡聚化。
靶多核苷酸不具有序列限制,除了该序列紧跟在PAM序列(下游或3')之后。PAM的实例包括但不限于NGG、NGGNG和NNAGAAW(其中N定义为任何核苷酸,W定义为A或T)。以上给出了PAM序列的其他实例,本领域技术人员将能够鉴定用于给定CRISPR蛋白的进一步的PAM序列。靶位点可以位于基因的编码区域、基因的内含子、基因间的控制区域等中。基因可以是蛋白质编码基因或RNA编码基因。
靶多核苷酸可以是细胞内源的或外源的任何多核苷酸。例如,靶多核苷酸可以是存在于真核细胞核内的多核苷酸。靶多核苷酸可以是编码基因产物(例如蛋白质)或非编码序列(例如,调控多核苷酸)的序列。
本发明的该系统的蛋白质组分可以作为分离的蛋白质引入细胞或非人动物胚胎中。在一个实施方案中,每种蛋白质可以包含至少一个细胞穿透结构域,其促进蛋白质的细胞摄取。在其他实施方案中,编码一种或多种蛋白质的mRNA分子或DNA分子可被引入细胞或非人动物胚胎中。通常,编码蛋白质的DNA序列可操作地连接到将在目的细胞或非人动物胚胎中起作用的启动子序列。DNA序列可以是线性的,或者DNA序列可以是载体的一部分。在其他实施方案中,可以将蛋白质作为包含上述蛋白质和RNA支架的RNA-蛋白质复合物引入细胞或非人动物胚胎中。
在替代实施方案中,编码所述蛋白质的DNA可以进一步包含编码RNA支架的组分的一个或多个序列。通常,编码所述蛋白质和RNA支架的DNA序列可操作地连接到合适的启动子控制序列,所述启动子控制序列允许蛋白质和RNA支架在细胞或非人动物胚胎中分别表达。编码所述蛋白质和RNA支架的DNA序列可以进一步包含额外的表达控制、调控和/或加工序列。编码所述蛋白质和指导RNA的DNA序列可以是线性的或可以是载体的一部分。
在通过编码RNA的DNA分子将RNA引入细胞的实施方案中,RNA编码序列可以可操作地连接至启动子控制序列以在真核细胞中表达指导RNA。例如,RNA编码序列可以可操作地连接到被RNA聚合酶III(Pol III)识别的启动子序列。合适的Pol III启动子的实例包括但不限于哺乳动物U6或H1启动子。在示例性实施方案中,RNA编码序列与小鼠或人U6启动子连接。在其他示例性实施方案中,RNA编码序列与小鼠或人H1启动子连接。
编码蛋白质和/或RNA的DNA分子可以是线性或环状的。在一些实施方案中,DNA序列可以是载体的一部分。合适的载体包括质粒载体,噬菌粒,粘粒,人工/微型染色体,转座子和病毒载体。在示例性实施方案中,编码所述蛋白质和/或RNA的DNA存在于质粒载体中。合适的质粒载体的非限制性实例包括pUC,pBR322,pET,pBluescript及其变体。载体可以包含额外的表达控制序列(例如增强子序列,Kozak序列,多聚腺苷酸化序列,转录终止序列等),可选择的标志物序列(例如抗生素抗性基因),复制起点等。
本发明的这个系统的蛋白质组分(或编码它们的核酸)和RNA组分(或编码它们的DNA)可通过各种方法引入细胞或非人动物胚胎中。通常,非人动物胚胎是感兴趣物种的受精的单细胞阶段胚胎。在一些实施方案中,转染细胞或非人动物胚胎。合适的转染方法包括磷酸钙介导的转染,核转染(或电穿孔),阳离子聚合物转染(例如DEAE-葡聚糖或聚乙烯亚胺),病毒转导,病毒微体转染,病毒粒子转染,脂质体转染(liposome transfection),阳离子脂质体转染,免疫脂质体转染,非脂质体脂质转染,树状聚合物转染,热休克转染,磁转染,lipofection,基因枪递送,impalefection,声致穿孔(sonoporation),光学转染,以及核酸的专有药剂增强的摄取。转染方法是本领域众所周知的(参见例如"CurrentProtocols in Molecular Biology"Ausubel et al.,John Wiley&Sons,New York,2003或"Molecular Cloning:A Laboratory Manual"Sambrook&Russell,Cold Spring HarborPress,Cold Spring Harbor,N.Y.,3rd edition,2001)。在其他实施方案中,通过显微注射将分子引入细胞或非人动物胚胎中。例如,可以将分子注射入一个细胞的非人动物胚胎的原核中。
本发明的该系统的蛋白质组分(或编码它们的核酸)和RNA组分(或编码它们的DNA)可以同时或顺序地引入细胞或非人动物胚胎中。所述蛋白质(或其编码核酸)与RNA(或编码RNA的DNA)的比例通常将近似化学计量,使得它们可以形成RNA-蛋白质复合物。类似地,两种不同蛋白质(或编码核酸)的比例将近似化学计量。在一个实施方案中,所述蛋白质组分和RNA组分(或编码它们的DNA序列)在同一核酸或载体内一起递送。
该方法还包括维持细胞或非人动物胚胎在合适的条件下,使得指导RNA将效应子蛋白指导至靶序列中的靶向位点,并且效应子结构域修饰靶序列。
通常,可以维持细胞在适合于细胞生长和/或维持的条件下。合适的细胞培养条件在本领域中是众所周知的,并且描述于例如Current Protocols in Molecular Biology"Ausubel et al.,John Wiley&Sons,New York,2003or"Molecular Cloning:ALaboratoryManual"Sambrook&Russell,Cold Spring Harbor Press,Cold Spring Harbor,N.Y.,3rdedition,2001),Santiago et al.(2008)PNAS 105:5809-5814;Moehle et al.(2007)PNAS104:3055-3060;Urnov et al.(2005)Nature 435:646-651;和Lombardo et al.(2007)Nat.Biotechnology 25:1298-1306中。本领域技术人员知道本领域已知的用于培养细胞的方法,并且能够并且会根据细胞类型而变化。在所有情况下,可以使用常规优化来确定针对特定细胞类型的最佳技术。
可以在体外(例如在细胞培养物中)培养非人动物胚胎。通常,将非人动物胚胎在合适的温度和合适的培养基中以必要的O2/CO2比例培养以允许蛋白质和RNA支架的表达(如果需要的话)。培养基的合适的非限制性实例包括M2,M16,KSOM,BMOC和HTF培养基。本领域技术人员将意识到,培养条件能够并且将根据非人动物胚胎的种类而变化。在所有情况下,可以使用常规优化来确定特定种类非人动物胚胎的最佳培养条件。在一些情况下,细胞系可以衍生自体外培养的非人动物胚胎,或者胚胎干细胞系。
或者,可以通过将非人动物胚胎转移到雌性宿主的子宫中来在体内培养胚胎。一般来说,雌性寄主来自与非人动物胚胎相同或相似的物种。优选地,雌性宿主是假孕的。制备假孕雌性宿主的方法是本领域已知的。另外,将非人动物胚胎转移到雌性宿主中的方法是已知的。在体内培养非人动物胚胎允许胚胎发育并且能够导致衍生自胚胎的动物的活产。这种动物将在身体的每个细胞中包含修饰的染色体序列。
各种真核细胞适用于该方法。例如,细胞可以是人细胞,非人哺乳动物细胞,非哺乳动物脊椎动物细胞,无脊椎动物细胞,昆虫细胞,植物细胞,酵母细胞或单细胞真核生物。各种非人动物胚胎适用于该方法。例如,胚胎可以是1细胞、2细胞或4细胞的非人的哺乳动物胚胎。示例性的非人哺乳动物胚胎,包括1细胞胚胎,包括但不限于小鼠,大鼠,仓鼠,啮齿动物,兔,猫,犬,羊,猪,牛,马和灵长类动物胚胎。在其他实施方案中,细胞可以是干细胞。合适的干细胞包括但不限于胚胎干细胞,ES样干细胞,成体干细胞,多能(pluripotent)干细胞,诱导多能(pluripotent)干细胞,多能(multipotent)干细胞,寡能(oligopotent)干细胞,单能(unipotent)干细胞和其他干细胞。在示例性实施方案中,细胞是哺乳动物细胞或胚胎是非人哺乳动物胚胎。
用途和应用
本文公开的系统和方法广泛具有各种用途,包括修饰和编辑(例如失活和激活)多种细胞类型中的靶多核苷酸。因此,这些系统和方法具有广泛的应用,例如用在研究和治疗中。
许多毁灭性的人类疾病有一个共同的原因:基因改变或突变。患者的致病性突变要么是通过父母遗传获得的,要么是由环境因素引起的。这些疾病包括但不限于以下类别。第一,一些基因疾病是由种系突变引起的。一个实例是囊性纤维化,其是由父母遗传的突变CFTR基因造成的。第二,一些疾病(如慢性病毒感染性疾病)是由外部环境因素引起的基因改变引起的。一个实例是AIDS,其是由人HIV病毒基因组插入感染的T细胞的基因组引起的。第三,一些神经退行性疾病涉及基因改变。一个实例是亨廷顿舞蹈病,其是由受影响的患者的亨廷顿基因中CAG三核苷酸的扩增引起的。最后,癌症是由癌细胞中积累的各种体细胞突变引起的。因此,校正致病性基因突变(或功能性校正序列)为治疗这些疾病提供了有吸引力的治疗机会。
体细胞基因编辑是许多人疾病的有吸引力的治疗策略。为了实现成功的治疗性基因编辑,三个关键因素被认为是至关重要的:(i)如何实现序列特异性识别(“序列识别模块”);(ii)如何校正发生的突变(“校正模块”);和(iii)如何将“校正模块”连接到“序列识别模块”以一起实现序列特异性校正。实现每种个体任务的方法有很多。然而,目前现有的平台或技术都不能实现最佳和实用的体细胞基因编辑。更具体而言,目前的基因特异性编辑技术主要基于核酸酶诱导的DNADSB和随后DSB诱导的同源重组,其在大多数体细胞中的活性低或不存在。因此,这些技术在大多数疾病的体细胞组织中的病理性基因突变的治疗性校正中使用受限。
相反,本发明公开的系统和方法允许不依赖于核酸酶活性的基因或RNA转录物的DNA序列定向编辑。该系统和方法不产生DSB,或不依赖于DSB介导的同源重组。此外,该系统的这种设计是模块化的,其允许以极其灵活和方便的方式靶向任何期望的DNA或RNA序列。实质上,这种方法使人们能够将DNA或RNA编辑酶指导至体细胞(包括干细胞)中的任何DNA或RNA序列。通过精确编辑靶DNA或RNA序列,该酶能够校正基因疾病中的突变基因,使感染细胞中的病毒基因组失活,消除神经退行性疾病中致病蛋白的表达,或沉默癌症中的致癌蛋白。因此,本发明公开的系统和方法可用于校正包括上述基因疾病、慢性感染性疾病、神经退行性疾病和癌症在内的疾病的潜在基因改变。
基因疾病
据估计,由已知的基因突变引起的疾病超过六千种。校正病理组织/器官中发生的引起疾病的突变能够减轻或治愈疾病。例如,在美国,每3,000人中就有一人受到囊性纤维化影响。它是由突变的CFTR基因的遗传引起的,并且70%的患者具有相同的突变,导致在位置508处缺失苯丙氨酸的三核苷酸缺失(称为“ΔPhe 508”)。ΔPhe 508导致CFTR的错位和降解。本发明中公开的系统和方法能够用于在受影响的组织(肺)中将Val 509残基(GTT)转换成Phe 509(TTT),从而功能上校正ΔPhe 508突变。
慢性传染病
本发明中公开的系统和方法也能够用于特异性失活并入人细胞/组织的病毒基因组中的任何基因。例如,本发明公开的系统和方法允许人们创建用于早期终止必需的病毒基因的翻译的终止密码子,从而补救或治愈慢性衰弱的感染性疾病。例如,目前的AIDS治疗可以减少病毒载量,但不能完全消除阳性T细胞中的休眠HIV。本文公开的系统和方法能够用于通过引入一个或两个终止密码子,永久性地失活人T细胞整合的HIV基因组中一个或两个关键HIV基因的表达。另一个实例是乙型肝炎病毒(HBV)。本文公开的系统和方法能够用于特异性失活一个或两个关键被整合到人基因组中的HBV基因并沉默HBV生命周期。
神经退行性疾病
一些神经退行性疾病是由功能获得突变引起的。例如,SOD1G93A导致肌萎缩侧索硬化症(ALS)的发展。本发明公开的系统和方法能够用于通过引入终止密码子或通过改变剪接位点来校正该突变或消除突变蛋白的表达。
癌症
许多基因(包括肿瘤抑制基因、致癌基因和DNA修复基因)有助于癌症的发展。这些基因突变常导致各种癌症。使用本发明中公开的系统和方法,能够特异性靶向和校正这些突变。因此,通过在催化位点或剪接位点引入点突变能够使致癌性致癌蛋白功能性消除或者能够消除其表达。
干细胞基因修饰
在一些实施方案中,能够使用本发明中公开的系统和方法基因修饰干细胞或祖细胞。合适的细胞包括例如干细胞(成体干细胞、胚胎干细胞、iPS细胞等)和祖细胞(例如心脏祖细胞、神经祖细胞等)。合适的细胞包括哺乳动物干细胞和祖细胞,包括例如啮齿动物干细胞、啮齿动物祖细胞、人干细胞、人祖细胞等。合适的宿主细胞包括体外宿主细胞,例如分离的宿主细胞。
在一些实施方案中,本发明能够用于组织的离体靶向和精确基因修饰,校正潜在的基因缺陷。离体校正后,组织可以返回到患者。此外,该技术能够广泛用于基于细胞的疗法以校正基因疾病。
动物和植物中的基因编辑
上述系统和方法能够用于产生具有一种或多种感兴趣的基因修饰的转基因非人动物或植物。在一些实施方案中,转基因非人动物针对基因修饰是纯合的。在一些实施方案中,转基因非人动物针对基因修饰是杂合的。在一些实施方案中,转基因非人动物是脊椎动物,例如鱼(例如,斑马鱼,金鱼,河豚,洞穴鱼等),两栖动物(青蛙,蝾螈等),鸟类(例如鸡,火鸡等),爬行动物(例如蛇,蜥蜴等),哺乳动物(例如有蹄类动物,例如猪,牛,山羊,绵羊等);兔类动物(例如兔子);啮齿动物(例如大鼠,小鼠);非人灵长类动物。
如上所述,本发明能够用于治疗动物疾病,其方式类似于治疗人类疾病。或者,它能够用于产生具有特定基因突变的敲入动物疾病模型以用于研究、药物发现和靶验证。上述系统和方法还能够用于向各种生物的ES细胞或非人动物胚胎引入点突变,用于培育和改善动物种群和作物品质。
将外源核酸导入植物细胞的方法在本领域中是众所周知的。合适的方法包括病毒感染(例如双链DNA病毒),转染,接合,原生质体融合,电穿孔,粒子枪技术,磷酸钙沉淀,直接显微注射,碳化硅晶须技术,农杆菌介导的转化等。方法的选择通常取决于被转化的细胞类型和发生转化的环境(即体外、离体或体内)。
试剂盒
本发明进一步提供了含有用于执行上述方法的试剂的试剂盒,包括CRISPR:Cas指导的靶结合或校正反应。为此,用于本文公开的方法的一种或多种反应组分例如RNA、Cas蛋白、融合效应子蛋白和相关核酸可以以试剂盒的形式供应以供使用。在一个实施方案中,试剂盒包含CRISPR蛋白或编码Cas蛋白的核酸、效应子蛋白,上述RNA支架中的一个或多个,上述一组RNA分子。在其他实施方案中,试剂盒可以包含一种或多种其他反应组分。在这样的试剂盒中,将适量的一种或多种反应组分提供在一个或多个容器中或保持在基底上。
试剂盒的附加组分的实例包括但不限于一种或多种宿主细胞,用于将外源核苷酸序列引入宿主细胞的一种或多种试剂,用于检测RNA或蛋白质的表达或验证靶核酸状态的一种或多种试剂(例如探针或PCR引物),以及用于反应的缓冲液或培养基(1X或浓缩形式)。试剂盒还可以包括一种或多种以下组分:支持物,终止、修饰或消化试剂,渗压剂和用于检测的设备。
所使用的反应组分可以以各种形式提供。例如,可以将组分(例如酶、RNA、探针和/或引物)悬浮在水溶液中或者作为冷冻干燥或冻干的粉末、丸粒或珠。在后一种情况下,组分在重构时形成组分的完全混合物用于测定。本发明的试剂盒可以在任何合适的温度下提供。例如,为了将包含蛋白质组分或其复合物的试剂盒储存在液体中,优选将它们提供并维持在0℃以下,优选在-20℃或以下,或者以其他方式处于冷冻状态。
试剂盒或系统可以含有(足以用于至少一种测定的量的)本文所述组分的任何组合。在一些应用中,一种或多种反应组分可以以预先测量的单次使用量提供在单独的、通常为一次性的管或等同的容器中。利用这样的配置,可以通过将靶核酸或含有靶核酸的样品或细胞直接添加到单独的管中,来进行RNA指导的反应。试剂盒中提供的组分的量可以是任何合适的量,并且可以取决于产品针对的目标市场。其中提供组分的容器可以是能够容纳所提供的形式的任何常规容器,例如微量离心管,微量滴定板,安瓿瓶,瓶子或整体测试装置,诸如流体装置、盒、横向流或其他类似的装置。
试剂盒还可以包括用于容纳容器或容器组合的包装材料。用于这样的试剂盒和系统的通常包装材料包括固体基质(例如,玻璃,塑料,纸,箔,微粒等),其以各种配置中的任一种(例如小瓶,微量滴定板孔,微阵列等)容纳反应组分或检测探针。试剂盒可以进一步包括以有形形式记录的用于使用组分的说明书。
定义
核酸或多核苷酸是指DNA分子(例如但不限于cDNA或基因组DNA)或RNA分子(例如但不限于mRNA),并且包括DNA或RNA类似物。DNA或RNA类似物可以由核苷酸类似物合成。DNA或RNA分子可以包括非天然存在的部分,例如修饰的碱基,修饰的主链,RNA中的脱氧核糖核苷酸等。核酸分子可以是单链或双链的。
当提及核酸分子或多肽时,术语“分离的”是指核酸分子或多肽基本上不含与其在自然界中与之有关或一起发现的至少一种其他组分。
如本文所用,术语“指导RNA”通常是指能够结合CRISPR蛋白并将CRISPR蛋白靶向靶DNA内的特定位置的RNA分子(或总体地RNA分子组)。指导RNA可以包含两个区段:靶向DNA的指导区段和结合蛋白质的区段。靶向DNA的区段包含与靶序列互补(或至少可以在严谨条件下杂交)的核苷酸序列。结合蛋白质的区段与CRISPR蛋白如Cas9或Cas9相关多肽相互作用。这两个区段可以位于同一RNA分子中,或者位于两个或更多个独立的RNA分子中。当两个区段处于独立的RNA分子中时,包含靶向DNA的指导区段的分子有时被称为CRISPRRNA(crRNA),而包含结合蛋白质的区段的分子被称为反式激活RNA(tracrRNA)。
如本文所用,术语“靶核酸”或“靶”是指含有靶核酸序列的核酸。靶核酸可以是单链或双链的,通常是双链DNA。本文使用的“靶核酸序列”、“靶序列”或“靶区域”是指希望使用CRISPR系统结合或修饰的特定序列或其互补序列。靶序列可以在细胞基因组内的体外或体内核酸内,其可以是任何形式的单链或双链核酸。
“靶核酸链”是指与本文所公开的指导RNA进行碱基配对的靶核酸的链。也就是说,与crRNA和指导序列杂交的靶核酸的链被称为“靶核酸链”。不与指导序列互补的靶核酸的另一链被称为“非互补链”。在双链靶核酸(例如DNA)的情况下,每条链可以是设计crRNA和指导RNA的“靶核酸链”并用于实施本发明,只要有合适的PAM位点。
如本文所用,术语“衍生自”是指这样的过程,其中使用第一组分(例如第一分子)或来自该第一组分的信息来分离、衍生或制备不同的第二组分(例如与第一分子不同的第二分子)。例如,哺乳动物密码子优化的Cas9多核苷酸衍生自野生型Cas9蛋白氨基酸序列。而且,包括Cas9单突变切口酶(nCas9,如nCas9D10A)和Cas9双突变体无核酸酶(dCas9,如dCas9 D10AH840A)的哺乳动物密码子优化的Cas9多核苷酸变体衍生自编码野生型哺乳动物密码子优化的Cas9蛋白的多核苷酸。
如本文所用,术语“野生型”是本领域技术人员理解的技术术语,并且是指与生物、菌株、基因或特征的通常形式,其天然存在且不同于突变体或变体形式。
如本文所用,术语“变体”是指与第二组合物(例如第二分子,也称为“亲本”分子)有关的第一组合物(例如第一分子)。变体分子可以衍生自、分离自、基于或同源于母体分子。例如,哺乳动物密码子优化的Cas9的突变形式(hspCas9)(包括Cas9单突变切口酶和Cas9双突变体无核酸酶)是哺乳动物密码子优化的野生型Cas9的变体(hspCas9)。术语变体可以用于描述多核苷酸或多肽。
应用于多核苷酸时,变体分子可以与原始亲本分子具有完全的核苷酸序列同一性,或者可以与亲本分子具有小于100%的核苷酸序列同一性。例如,基因核苷酸序列的变体可以是与原始的核苷酸序列相比,核苷酸序列至少50%,60%,70%,80%,90%,95%,98%,99%或更多相同的第二核苷酸序列。多核苷酸变体还包括包含完整亲本多核苷酸并且还包含额外的融合核苷酸序列的多核苷酸。多核苷酸变体还包括作为亲本多核苷酸的部分或子序列的多核苷酸,例如本文公开的多核苷酸的独特子序列(例如,如通过标准序列比较和比对技术所确定的)也包括在本发明中。
在另一方面,多核苷酸变体包括相对于亲本核苷酸序列含有较小、细微或不重要的改变的核苷酸序列。例如,较小、细微或不重要的改变包括以下对核苷酸序列的改变:(i)不改变相应多肽的氨基酸序列,(ii)出现在多核苷酸的蛋白质编码开放阅读框之外,(iii)导致可能影响相应氨基酸序列但对多肽的生物学活性几乎没有影响的缺失或插入,(iv)核苷酸改变导致氨基酸被化学上相似的氨基酸置换。在多核苷酸不编码蛋白质(例如,tRNA或crRNA或tracrRNA)的情况下,该多核苷酸的变体可以包括不导致多核苷酸功能丧失的核苷酸改变。在另一方面,本发明涵盖产生功能相同的核苷酸序列的公开核苷酸序列的保守变体。本领域技术人员将认识到,所公开的核苷酸序列的许多变体被包括在本发明中。
当应用于蛋白质时,变体多肽可以具有与原始亲本多肽的完全氨基酸序列同一性,或者可以具有与亲本蛋白质的小于100%的氨基酸同一性。例如,氨基酸序列的变体可以是相比于原始氨基酸序列,氨基酸序列至少50%,60%,70%,80%,90%,95%,98%,99%或更多相同的第二氨基酸序列。
多肽变体包括包含完整亲本多肽并且还包含额外的融合氨基酸序列的多肽。多肽变体还包括作为亲本多肽的部分或子序列的多肽,例如本文公开的多肽的独特子序列(例如,如通过标准序列比较和比对技术所确定的)也包括在本发明中。
在另一方面,多肽变体包括相对于亲本氨基酸序列含有较小、细微或不重要的改变的多肽。例如,较小、细微或不重要的改变包括对多肽的生物学活性具有小的影响或没有影响的氨基酸改变(包括置换、缺失和插入),并产生功能相同的多肽,包括添加非功能性肽序列的氨基酸改变。在其他方面,本发明的变体多肽改变亲本分子的生物学活性,例如已经被修饰或失去核酸酶活性的Cas9多肽的突变变体。本领域技术人员将理解,本发明涵盖所公开的多肽的许多变体。
在一些方面,本发明的多核苷酸或多肽变体可以包括改变、添加或缺失小百分比的核苷酸或氨基酸位置的变体分子,例如通常小于约10%,小于约5%,小于4%,小于2%或小于1%。
如本文所用,核苷酸或氨基酸序列中的术语“保守置换”是指核苷酸序列中的改变,其(i)由于三联体密码子冗余性而不导致氨基酸序列中的任何相应变化,或(ii)导致原始亲本氨基酸被具有化学相似结构的氨基酸置换。提供功能上相似的氨基酸的保守置换表在本领域是众所周知的,其中一个氨基酸残基被另一个具有相似化学性质的氨基酸残基置换(例如,芳族侧链或带正电的侧链),并且因此基本上不改变所得多肽分子的功能特性。
以下是含有类似化学性质的天然氨基酸分组,其中组内的置换是“保守”氨基酸置换。以下所示的这种分组不是硬性的,因为当考虑到不同的功能特性时,这些天然氨基酸可以被置于不同的分组中。具有非极性和/或脂肪族侧链的氨基酸包括:甘氨酸,丙氨酸,缬氨酸,亮氨酸,异亮氨酸和脯氨酸。具有极性不带电荷侧链的氨基酸包括:丝氨酸,苏氨酸,半胱氨酸,甲硫氨酸,天冬酰胺和谷氨酰胺。具有芳族侧链的氨基酸包括:苯丙氨酸,酪氨酸和色氨酸。具有带正电侧链的氨基酸包括:赖氨酸,精氨酸和组氨酸。具有带负电侧链的氨基酸包括:天冬氨酸和谷氨酸。
“Cas9突变体”或“Cas9变体”是指野生型Cas9蛋白如化脓性链球菌Cas9蛋白(即,SEQ ID NO:1)的蛋白或多肽衍生物,例如具有一个或多个点突变、插入、缺失、截短、融合蛋白或其组合的蛋白。它基本上保留了Cas9蛋白的RNA靶向活性。该蛋白或多肽可以包含SEQID NO:1的片段,由其组成或基本上由其组成。通常,突变体/变体与SEQ ID NO:1至少50%(例如50%至100%,包括端值之间的任何数值)相同。突变体/变体可以与RNA分子结合并经由RNA分子靶向特定的DNA序列,并且可以另外具有核酸酶活性。这些结构域的实例包括RuvC样基序(SEQ ID NO:1中的氨基酸7-22、759-766和982-989)和HNH基序(氨基酸837-863)。参见Gasiunas et al.,Proc Natl Acad Sci U S A.2012September 25;109(39):E2579–E2586和WO2013176772。
“互补性”是指核酸通过传统Watson-Crick碱基配对或其他非传统类型与另一核酸序列形成氢键的能力。百分比互补性表示核酸分子中可与第二核酸序列形成氢键(例如Watson-Crick碱基配对)的残基的百分比(例如,10中有5,6,7,8,9,10个则为50%,60%,70%,80%,90%和100%互补)。“完全互补”是指核酸序列的所有连续残基将与第二核酸序列中相同数量的连续残基氢键结合。如本文所用的“基本上互补的”是指在8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,30,35,40,45,50,或更多个核苷酸的区域上至少60%,65%,70%,75%,80%,85%,90%,95%,97%,98%,99%,或100%的互补性程度,或指在严格条件下杂交的两条核酸。
如本文所用,用于杂交的“严格条件”是指与靶序列具有互补性的核酸主要与靶序列杂交且基本上不与非靶序列杂交的条件。严格条件通常是序列相关的,并且取决于许多因素。通常,序列越长,序列与其靶序列特异性杂交的温度越高。严格条件的非限制性实例详细描述于Tijssen(1993),Laboratory Techniques In Biochemistry And MolecularBiology-Hybridization With Nucleic Acid Probes Part I,Second Chapter"Overviewof principles of hybridization and the strategy of nucleic acid probe assay",Elsevier,N.Y.中。
“杂交”是指在特定的杂交条件下完全或部分互补的核酸链一起形成其中两条构成链通过氢键连接的双链结构或区域的过程。虽然氢键通常在腺嘌呤和胸腺嘧啶或尿嘧啶(A和T或U)或胞嘧啶和鸟嘌呤(C和G)之间形成,但也可以形成其它碱基对(例如,Adams etal.,The Biochemistry of the Nucleic Acids,11th ed.,1992)。
如本文所用,“表达”是指通过其多核苷酸从DNA模板转录(例如转录成mRNA和其他RNA转录物)的过程和/或通过其转录的mRNA随后被翻译成肽、多肽、或蛋白质的过程。转录物和编码的多肽可以统称为“基因产物”。如果多核苷酸衍生自基因组DNA,则表达可包括mRNA在真核细胞中的剪接。
术语“多肽”、“肽”和“蛋白质”在本文中可互换使用,是指任何长度的氨基酸的聚合物。该聚合物可以是直链或支链的,它可以包含修饰的氨基酸,并且可以被非氨基酸中断。该术语还包括已被修饰的氨基酸聚合物;例如二硫键形成,糖基化,脂质化,乙酰化,磷酸化,聚乙二醇化或任何其他操作,例如与标记组分缀合。如本文所用,术语“氨基酸”包括天然和/或非天然或合成氨基酸,包括甘氨酸和D或L光学异构体,以及氨基酸类似物和肽模拟物。
术语“融合多肽”或“融合蛋白”是指通过将两个或更多个多肽序列连接在一起而产生的蛋白质。包含在本发明中的融合多肽包括嵌合基因构建体的翻译产物,所述嵌合基因构建体将编码第一多肽(例如RNA结合结构域)的核酸序列与编码第二多肽(例如效应子结构域)的核酸序列连接,形成单个开放阅读框架。换句话说,“融合多肽”或“融合蛋白”是由肽键或经由数种肽连接的两种或更多种蛋白质的重组蛋白质。融合蛋白还可以包含两个结构域之间的肽接头。
术语“接头”是指用于连接两个或更多个实体的任何工具、实体或部分。接头可以是共价接头或非共价接头。共价接头的实例包括共价键或共价附接至一个或多个待连接的蛋白质或结构域的接头部分。接头还可以是非共价键,例如通过金属中心例如铂原子的有机金属键。对于共价键,可以使用各种官能团,例如酰胺基团,包括碳酸衍生物,醚,酯,包括有机和无机酯,氨基,尿烷,尿素等。为了提供连接,可以通过氧化、羟基化、置换、还原等修饰所述结构域以提供偶联位点。缀合方法是本领域技术人员众所周知的,并且包括在本发明中。接头部分包括但不限于化学接头部分,或者例如肽接头部分(接头序列)。应该理解的是,不显著降低RNA结合结构域和效应子结构域功能的修饰是优选的。
如本文所用,本文使用的术语“缀合”或“连接”是指两个或更多个实体附接形成一个实体。缀合物包括肽-小分子缀合物以及肽-蛋白质/肽缀合物。
术语“受试者”和“患者”在本文中可互换使用,指脊椎动物,优选哺乳动物,更优选人。哺乳动物包括但不限于鼠类,猿猴,人,农场动物,运动动物和宠物。也包括体内获得的或体外培养的生物实体的组织、细胞及其后代。在一些实施方案中,受试者可以是无脊椎动物,例如昆虫或线虫;而在其他情况下,受试者可能是植物或真菌。
如本文所用,“治疗”或“减轻”或“改善”可互换使用。这些术语是指用于获得有益的或期望的结果的方法,包括但不限于治疗益处和/或预防益处。治疗益处是指治疗中的一种或多种疾病、病况或症状的任何治疗相关的改善或作用。对于预防益处,可将组合物施用于具有发展特定疾病、病况或症状的风险的受试者,或施用于报告疾病的一种或多种生理症状但疾病、病况或症状可能还没有表现出来的受试者。
如本文所用,当涉及任何组分集合时使用的术语“接触”包括将要接触的组分混合成同一混合物(例如,加入到同一隔室或溶液中)的任何方法,不一定需要所述组分之间的实际物理接触。所述组分可以以任何顺序或任何组合(或子组合)接触,并且可以包括随后(任选在添加其它所述组分之前)从混合物中除去一种或一些所述组分的情况。例如,“将A与B和C接触”包括以下任何和所有情况:(i)将A与C混合,然后将B加入到混合物中;(ii)将A和B混合成混合物,将B从混合物中除去,然后将C加入到混合物中;(iii)将A加入到B和C的混合物中。将靶核酸或细胞与一种或多种反应组分(例如Cas蛋白或指导RNA)“接触”包括以下任何或所有情况:(i)使靶或细胞与反应混合物的第一组分接触以产生混合物,然后将反应混合物的其它组分以任何顺序或组合加入到混合物中;和(ii)反应混合物在与靶或细胞混合之前完全形成。
如本文所用的术语“混合物”是指散布且不是以任何特定的顺序的元素的组合。混合物是异源的,不能在空间上分离成不同的成分。元素混合物的实例包括溶解在相同水溶液中的许多不同元素,或随机或以不特定顺序附接到固体载体上的许多不同元素,其中不同元素不是在空间上不同。换句话说,混合物是不可寻址的。
如本文所公开的,提供了许多数值范围。可以理解的是,除非上下文清楚地另外指出,在该范围的上限和下限之间的每个中间值至下限单位的十分之一也被具体地公开。在所述范围内的任何规定值或中间值与该规定范围内的任何其他规定值或中间值之间的每个较小范围都包含在本发明内。这些较小范围的上限和下限可以独立地包括在该范围内或排除在外,并且其中任一个、没有一个或两个限值都包括在较小范围内的每个范围也包含在本发明内,受限于规定的范围中任何具体排除的限制。在所述范围包括一个或两个限值的情况下,排除那些所包括的限值中的任一个或两个的范围也包括在本发明中。术语“约”通常是指所指数字的正或负10%。例如,“约10%”可以表示9%至11%的范围,“约20”可以表示18-22。“约”的其他含义可以从上下文明显看出,例如四舍五入,所以例如“约1”也可以表示0.5到1.4。
实施例
实施例1CRC系统导致细菌基因组中靶胞嘧啶核苷酸的位点特异性突变。
在本实施例中,使用大肠杆菌MG1655菌株作为模型。细菌RNA聚合酶亚单位β基因(rpoB)中的突变使细胞对抗生素利福平具有抗性(Jin,et al.,Journal of MolecularBiology 202,45-58,(1988),和Goldstein,et al.,J Antibiot 67,625-630,doi:10.1038/ja.2014.107(2014))。可以单独分离和分析突变体,并且可以计算突变频率。AID是B细胞特异性蛋白质,属于胞苷脱氨酶的APOBEC家族,并且在抗体多样化和亲和力成熟过程中参与体细胞高变和类别转换重组(Odegard,et al.,Nat Rev Immunol 6,573-583(2006),和Noia,et al.Annual Review of Biochemistry 76,1-22,doi:doi:10.1146/annurev.biochem.76.061705.090740(2007))。因此,对于这些实验组,使用AID作为非核酸酶效应子蛋白来靶向来自大肠杆菌MG1655的rpoB基因。
构建体和系统配置
诱导型启动子
所有蛋白质编码构建体均设计成在Tet诱导型启动子的控制下。使用浓度为30nM的无水四环素(ATc;Sigma)作为诱导剂。
Cas9构建体
本系统的核心特征是引入精确的核苷酸修饰而不产生DSB。为此,使用核酸酶缺陷型Cas9作为DNA靶向模块,即催化缺陷型Cas9(Cas9D10A/H840A,dCas9)和Cas9切口酶(nCas9D10A或nCas9H840A)(Jinek,M.et al.,Science 337,816-821,doi:10.1126/science.1225829(2012))。已经使用Cas9切口酶通过偏移双DNA切刻来减少脱靶DSB(Ran,F.A.et al.,Cell154,1380-1389,doi:10.1016/j.cell.2013.08.021(2013),和Shen,B.et al.,Nat Meth11,399-402,doi:10.1038/nmeth.2857(2014)),并且dCas9已被工程改造以发挥不依赖于核酸酶活性的各种活性。参见Fujita,T.et al.,Biochemical and biophysical researchcommunications 439,132-136,(2013),Perez-Pinera,P.et al.Nat Meth 10,973-976,doi:10.1038/nmeth.2600(2013),Mali,P.et al.Nat Biotechnol 31,833-838,doi:10.1038et al./nbt.2675(2013),Zalatan,J.G.et al.,Cell 160,339-350,doi:10.1016/j.cell.2014.11.052(2015),Qi,L.S.et al.,Cell 152,1173-1183,doi:10.1016/j.cell.2013.02.022(2013),Larson,M.H.et al.,Nature protocols 8,2180-2196,doi:10.1038/nprot.2013.132(2013),Hilton,I.B.et al.,Nat Biotech 33,510-517,doi:10.1038/nbt.3199(2015),Thakore,P.I.et al.,Nat Meth 12,1143-1149,doi:10.1038/nmeth.363(2015),Chen,B.et al.,Cell 155,1479-1491,doi:10.1016/j.cell.2013.12.001(2013),和Fu,Y.et al.,Nature communications 7,doi:10.1038/ncomms11707(2016)。因此,这些变体在很大程度上被认为是安全的,并且代表开发本研究中提出的系统的理想候选者。
靶向募集系统
该系统被工程改造成RNA支架介导的募集平台。包括本研究中使用的构建体的示意图的示意性代表在图1A中示出。Cas9变体被设计为独立构建体,而gRNA被工程改造成嵌合RNA物质,其中噬菌体RNA支架被合成融合到CRISPR RNA支架的3'末端。噬菌体RNA支架募集特定的RNA结合蛋白,其又被连接至非核酸酶效应子蛋白(图1B)。RNA支架募集系统衍生自噬菌体MS2及其相互作用伴侣MS2外壳蛋白(MCP)。
靶向gRNA
靶是细菌rpoB基因。一起被称为利福平抗性决定区(RRDR)的三个簇中的突变赋予细胞对抗生素利福平的抗性(RifR)(Goldstein,et al.,J Antibiot 67,625-630,doi:10.1038/ja.2014.107(2014))。设计了一组四个gRNA,以沿着RRDR簇I序列靶向关键氨基酸(即S512、D516、H526和S531;图2A)。Jin,et al.,Journal of molecular biology 202,45-58,(1988)和Jin,D.J.et al.,Methods in Enzymology Vol.Volume 273 300-319(Academic Press,1996)
实验方法
化学感受态大肠杆菌MG1655细胞用10-20ng总DNA转化,所述总DNA包含编码第1部分所述构建体的质粒组合。转化后,选择细胞并在含有合适的抗生素的Luria-Bertani培养液中诱导。选择/诱导后,测量OD,连续稀释细胞,将108-104个细胞涂布在含有利福平(120μM)的LB琼脂平板上。针对铺板效率,将200个细胞铺板在没有利福平的选择性琼脂平板上。过夜孵育后,计数集落并对突变频率进行计数检查。另外,通过PCR扩增来自分离的集落的rpoB基因并进行测序以探寻突变。
结果
AID的靶向募集导致C到T的位点特异性转换。
靶向rpoB的RRDR(簇I)区域的一组四个gRNA被用于将AID募集到靶位点(图2A)。用rpoB_TS-4进行的以及至更小程度用rpoB_TS-3进行的CRC靶向提高了利福平培养基中MG1655细胞的存活分数(图2B、2C)。衍生自rpoB_TS-4处理的克隆的序列分析揭示了高特异性,使C1592突变为T,伴随着丝氨酸531到苯丙氨酸的氨基酸改变,这是已知得到RifR细胞的突变(Petersen-Mahrt,et al.,Nature 418,99-104(2002),Xu,M.,et al.,Journal ofBacteriology 187,2783-2792,doi:10.1128/JB.187.8.2783-2792.2005(2005),和Zenkin,N.,et al.,Antimicrobial Agents and Chemotherapy 49,1587-1590,doi:10.1128/AAC.49.4.1587-1590.2005(2005))(图2D)。图2E总结了rpoB_TS-3、rpoB_TS-4和乱序序列的突变分布。观察到高度增加的突变频率和rpoB_TS-4处理上的修饰的核苷酸的位置,并且rpoB_TS-3处理的效率降低,这表明靶胞嘧啶必须位于CRISPRR-环留下的未配对的链上,优先更接近原间隔区的5'末端(即突变频率TS4>TS3,均靶向和修饰相同的核苷酸,图2A、2C和2E)。这与AID积极地使单链DNA上的胞嘧啶残基脱氨基一致(Odegard,et al.,Nat Rev Immunol 6,573-583(2006),Noia,et al.,Annual Review of Biochemistry 76,1-22,doi:doi:10.1146/annurev.biochem.76.061705.090740(2007),Smith,H.C.,etal.,Seminars in Cell&Developmental Biology 23,258-268,doi:10.1016/j.semcdb.2011.10.004(2012),和Ranganathan,V.,et al.,Nature communications 5,doi:10.1038/ncomms5516(2014))。靶向模型的示意图如图2F所示。
CRC模块化
将靶向模块从dCas9改变为nCas9D10A提高了C到T/A转换的效率
就利福平平板上的存活分数比对照组提高了18至43倍方面,将靶向模块从dCas9改变为nCas9D10A提高了系统效率(图3A)。突变分析揭示了与AIDCRC处理相同的靶核苷酸特异性。在这种情况下,C1592在100%的克隆中被修饰,75%的C被突变为T和25%的C被突变为A(图3B)。
其他非核酸酶效应子APOBEC3G和APOBEC1的靶向募集能够引入C到T/A的位点特异
性转换
除了AID作为效应子蛋白之外,我们还测试了来自APOBEC家族的其他胞苷脱氨酶,即APOBEC3G和APOBEC1(图4A)。与原系统AIDCRCD10A相比,APOBEC1提高了靶突变频率。APOBEC3G比原型系统活跃程度低。以rpoB_TS-4作为靶向构建体的Apo1CRCD10A处理细胞的突变分析显示100%C1592>T转换。另外,25%的分析克隆是双突变体,转换C1590>T,没有氨基酸改变(图4B)。
增加RNA募集支架的数量可以增强突变频率,而不会改变C到T/A转换的特异性。
添加串联多聚体募集支架可能会增加靶区域上的效应子存在,从而增强系统的效率。为此,我们工程改造了rpoB_TS-4以包含两个MS2环(2xMS2)。我们比较具有一个MS2环的rpoB_TS-4(1xMS2)和rpoB_TS-4 2xMS2(图5A)之间的靶向效率。结果表明,依据RifR,增加募集环的数量实际上增强了突变频率,表明效应子蛋白的存在增加。以rpoB_TS-4_2xMS2为靶向构建体的AIDCRCD10A处理的细胞的突变分析显示C1592核苷酸在100%的克隆中被修饰,62.5%的C被突变为T和37.5%的C被突变为A(图5B)。这些结果表明,工程改造募集模块不会影响系统的靶向特异性。
总之,这些结果表明,CRC系统的模块化设计方便了工程改造过程,并为进一步改进系统开启了可能性。
实施例2CRC系统导致哺乳动物系统中位点特异性核苷酸转换
实验设计:工程改造该系统用于哺乳动物表达
我们接下来试图工程改造该系统用于哺乳动物表达。为此,我们使用哺乳动物密码子优化的nCas9D10A接着由自我切割的P2A肽分离的AID_MCP融合体,将原核AIDCRCD10A系统概括为多顺反子构建体。在泛素C启动子的控制下克隆该构建体。gRNA_2xMS2盒在U6或H1启动子的控制下分别克隆用于具有5'-G或5'-A的靶(Ranganathan,V.,et al.,Naturecommunications 5,doi:10.1038/ncomms5516(2014))。图6A中示出了这些实验组中使用的构建体的示意图。
靶向染色体外DNA:EGFP回复突变测定
EGFP被工程改造为具有破坏其荧光团的功能丧失点突变(197A>G,Y66C),因此使蛋白质不发荧光(nfEGFPY66C)。然后将突变GFP的表达载体转染到哺乳动物细胞中并作为该系统的底物。这个实验的目的是“校正”这种功能丧失的突变。当校正的基因被转录和翻译时,校正将恢复蛋白质功能,在荧光显微镜下可以将其可视化为荧光细胞。
实验方法
用10μg包含编码nfEGFPY66C、AIDCRCD10A和gRNA构建体的靶质粒的DNA的组合转染大约7×105个293T细胞。为了比较,在这些实验组中使用第三代碱基编辑器系统(BE3,Komor,A.C.,et al.,Nature advance online publication,doi:10.1038/nature17946)。BE3是稍微相似的具有不同的募集机制(Cas9与APOBEC1的直接融合)的系统并且包括抑制尿嘧啶DNA糖基化酶(参与DNA修复的酶)的肽。过夜孵育后,在荧光显微镜下分析细胞以观察GFP信号。
结果
发现上述CRC系统能够修饰染色体外DNA中的靶核苷酸,恢复蛋白质功能。由于靶胞嘧啶位于模板链(TS,-)上,所以设计了两个gRNA以结合靶核苷酸周围的非模板链(NT,+)(图6B)。靶胞嘧啶分别位于nfEGFPY66C_NT-1和nfEGFPY66C_NT-2原间隔区内的位置5和12上。用编码nCas9D10A、AID_MCP、gRNA(nfEGFPY66C_NT-1或nfEGFPY66C_NT-2或乱序序列)和靶构建体nfEGFPY66C的DNA转染293T细胞。在用nfEGFPY66C_NT-1和nfEGFPY66C_NT-2处理的细胞上检测到EGFP信号,但是没有在用乱序序列处理的细胞上检测到EGFP信号(图6C)。由于靶胞嘧啶的位置,与nfEGFPY66C_NT-2相比,nfEGFPY66C_NT-1处理的细胞中的EGFP信号更强。nfEGFPY66C_NT-1可能使得靶向的C更容易接近AID(图6C,中和右图)。另外,将CRC平台与不同的基因编辑系统(BE3)进行比较,所述基因编辑系统利用胞苷脱氨酶蛋白与Cas9蛋白的直接融合进行募集并且需要共同表达尿嘧啶DNA糖基化酶(UGI)的抑制剂以提高效率。意外地发现,即使没有局部UNG抑制(没有尿嘧啶DNA糖基化酶抑制剂UGI的表达),效应子和序列靶向模块通过RNA支架连接的CRC系统比BE3系统更有效(图6C、6D和7B)。
这些结果证实了来自细菌系统的发现,并且表明该系统以可编程方式有效地使人细胞中染色体外DNA中的特定胞嘧啶残基脱氨基。使用nfEGFPY66C_NT-1作为靶向gRNA,来自用AIDCRCD10A和BE3处理的GFP阳性细胞的定量表明CRC系统具有比BE3更好的转换效率(图6D)。
实施例3 CRC系统导致哺乳动物细胞内源基因中的位点特异性核苷酸转换靶向内源性基因座:中国仓鼠HPRT基因
受到从细菌阴性选择系统观察到的阳性结果的鼓舞,我们决定在哺乳动物中使用类似的方法。次黄嘌呤-鸟嘌呤磷酸核糖转移酶(HPRT)是参与嘌呤代谢的酶,已知沿其编码序列的突变引起对抗代谢物6-硫鸟嘌呤的抗性(6-TGR)(O'Neill,J.P.et al.,Nature269,815-816(1977))。对于这些实验,我们旨在用CRC系统突变HPRT基因以破坏其功能,然后用6-TG选择突变细胞用于进一步分析。
实验方法
用10μg包含AIDCRCD10A构建体和gRNA HPRT_TS-1表达载体的DNA的组合转染大约7×105个中国仓鼠V79-4细胞。为了比较,还用BE3和gRNA HPRT_TS-1处理细胞。按照先前描述的哺乳动物诱变操作方案,培养处理和未处理的细胞(Klein,C.B.,et al.,于CurrentProtocols in Toxicology(John Wiley&Sons,Inc.,2001))。简而言之,转染后,细胞保持7天,然后6-TG选择用于先前存在的HPRT mRNA和蛋白质的突变固定和转换。用60μM 6-TG选择细胞14天以允许形成6-TGR集落。对集落进行计数以估计突变频率,并分离个体集落并分别扩增用于测序分析。
结果
一个gRNA被设计为靶向来自中国仓鼠HPRT基因的外显子3(图7A)。gRNA靶向编码苯丙氨酸的密码子74,并且该残基中的突变与降低的HPRT蛋白稳定性有关(Davidson,B.L.,et al.,Gene 63,331-336,doi:http://dx.doi.org/10.1016/0378-1119(88)90536-7(1988))。用编码AIDCRCD10A或BE3构建体的DNA与靶向gRNA表达载体一起转染V79-4细胞。AIDCRCD10A系统导致突变,使得与BE3系统相比,细胞对6-TG处理的抗性具有更高的效率(即分别比未处理的细胞高140倍和40倍;图7B)。结果显示CRC系统能够靶向和修饰内源性哺乳动物基因座中的特定DNA序列。
上述优选实施方案的实施例和描述应该被认为是示例性的,而不是限制权利要求所限定的本发明。容易理解的是,可以使用上述特征的多种变化和组合,而不脱离如权利要求中所阐述的本发明。这样的变化不被视为脱离本发明的范围,并且所有这样的变化旨在被包括在所附权利要求书的范围内。本文引用的所有参考文献全文引入作为参考。
序列表
<110> 新泽西鲁特格斯州立大学(Rutgers, The State University of NewJersey)
金晟侃(Jin, Shengkan)
J-C·科兰特斯(Collantes, Juan-Carlos)
<120> 核酸酶非依赖性靶向基因编辑平台及其用途
<130> S2015-134/ 096738.00475
<150> 62/192,876
<151> 2015-07-15
<160> 47
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1368
<212> PRT
<213> 化脓性链球菌(Streptococcus pyogenes)
<400> 1
Met Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
Leu Ser Asp Tyr Asp Val Asp Ala Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala Lys
1010 1015 1020
Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe Tyr Ser
1025 1030 1035 1040
Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala Asn Gly Glu
1045 1050 1055
Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu Thr Gly Glu Ile
1060 1065 1070
Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val Arg Lys Val Leu Ser
1075 1080 1085
Met Pro Gln Val Asn Ile Val Lys Lys Thr Glu Val Gln Thr Gly Gly
1090 1095 1100
Phe Ser Lys Glu Ser Ile Leu Pro Lys Arg Asn Ser Asp Lys Leu Ile
1105 1110 1115 1120
Ala Arg Lys Lys Asp Trp Asp Pro Lys Lys Tyr Gly Gly Phe Asp Ser
1125 1130 1135
Pro Thr Val Ala Tyr Ser Val Leu Val Val Ala Lys Val Glu Lys Gly
1140 1145 1150
Lys Ser Lys Lys Leu Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile
1155 1160 1165
Met Glu Arg Ser Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala
1170 1175 1180
Lys Gly Tyr Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys
1185 1190 1195 1200
Tyr Ser Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser
1205 1210 1215
Ala Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1220 1225 1230
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys His
1250 1255 1260
Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys Arg Val
1265 1270 1275 1280
Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala Tyr Asn Lys
1285 1290 1295
His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn Ile Ile His Leu
1300 1305 1310
Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala Phe Lys Tyr Phe Asp
1315 1320 1325
Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser Thr Lys Glu Val Leu Asp
1330 1335 1340
Ala Thr Leu Ile His Gln Ser Ile Thr Gly Leu Tyr Glu Thr Arg Ile
1345 1350 1355 1360
Asp Leu Ser Gln Leu Gly Gly Asp
1365
<210> 2
<211> 129
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> RNA支架表达盒
<220>
<221> misc_feature
<222> (1)..(20)
<223> n是a, c, g, 或t
<400> 2
nnnnnnnnnn nnnnnnnnnn gttttagagc tagaaatagc aagttaaaat aaggctagtc 60
cgttatcaac ttgaaaaagt ggcaccgagt cggtgcgcgc acatgaggat cacccatgtg 120
ctttttttg 129
<210> 3
<211> 340
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 效应子AID -MCP融合物
<400> 3
Met Asp Ser Leu Leu Met Asn Arg Arg Lys Phe Leu Tyr Gln Phe Lys
1 5 10 15
Asn Val Arg Trp Ala Lys Gly Arg Arg Glu Thr Tyr Leu Cys Tyr Val
20 25 30
Val Lys Arg Arg Asp Ser Ala Thr Ser Phe Ser Leu Asp Phe Gly Tyr
35 40 45
Leu Arg Asn Lys Asn Gly Cys His Val Glu Leu Leu Phe Leu Arg Tyr
50 55 60
Ile Ser Asp Trp Asp Leu Asp Pro Gly Arg Cys Tyr Arg Val Thr Trp
65 70 75 80
Phe Thr Ser Trp Ser Pro Cys Tyr Asp Cys Ala Arg His Val Ala Asp
85 90 95
Phe Leu Arg Gly Asn Pro Asn Leu Ser Leu Arg Ile Phe Thr Ala Arg
100 105 110
Leu Tyr Phe Cys Glu Asp Arg Lys Ala Glu Pro Glu Gly Leu Arg Arg
115 120 125
Leu His Arg Ala Gly Val Gln Ile Ala Ile Met Thr Phe Lys Asp Tyr
130 135 140
Phe Tyr Cys Trp Asn Thr Phe Val Glu Asn His Glu Arg Thr Phe Lys
145 150 155 160
Ala Trp Glu Gly Leu His Glu Asn Ser Val Arg Leu Ser Arg Gln Leu
165 170 175
Arg Arg Ile Leu Leu Pro Leu Tyr Glu Val Asp Asp Leu Arg Asp Ala
180 185 190
Phe Arg Thr Leu Gly Leu Glu Leu Lys Thr Pro Leu Gly Asp Thr Thr
195 200 205
His Thr Ser Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Met
210 215 220
Ala Ser Asn Phe Thr Gln Phe Val Leu Val Asp Asn Gly Gly Thr Gly
225 230 235 240
Asp Val Thr Val Ala Pro Ser Asn Phe Ala Asn Gly Ile Ala Glu Trp
245 250 255
Ile Ser Ser Asn Ser Arg Ser Gln Ala Tyr Lys Val Thr Cys Ser Val
260 265 270
Arg Gln Ser Ser Ala Gln Asn Arg Lys Tyr Thr Ile Lys Val Glu Val
275 280 285
Pro Lys Gly Ala Trp Arg Ser Tyr Leu Asn Met Glu Leu Thr Ile Pro
290 295 300
Ile Phe Ala Thr Asn Ser Asp Cys Glu Leu Ile Val Lys Ala Met Gln
305 310 315 320
Gly Leu Leu Lys Asp Gly Asn Pro Ile Pro Ser Ala Ile Ala Ala Asn
325 330 335
Ser Gly Ile Tyr
340
<210> 4
<211> 93
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 杂合crRNA:tracRNA
<400> 4
guuuaagagc uaugcuggaa acagcauagc aaguuuaaau aaggcuaguc cguuaucaac 60
uugaaaaagu ggcaccgagu cggugcuuuu uuu 93
<210> 5
<211> 79
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> tracRNA
<400> 5
ggaaccauuc aaaacagcau agcaaguuaa aauaaggcua guccguuauc aacuugaaaa 60
aguggcaccg agucggugc 79
<210> 6
<211> 60
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> tracRNA
<400> 6
uagcaaguua aaauaaggcu aguccguuau caacuugaaa aaguggcacc gagucggugc 60
<210> 7
<211> 64
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> tracRNA
<400> 7
agcauagcaa guuaaaauaa ggcuaguccg uuaucaacuu gaaaaagugg caccgagucg 60
gugc 64
<210> 8
<211> 70
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> tracRNA
<400> 8
caaaacagca uagcaaguua aaauaaggcu aguccguuau caacuugaaa aaguggcacc 60
gagucggugc 70
<210> 9
<211> 45
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> tracRNA
<400> 9
uagcaaguua aaauaaggcu aguccguuau caacuugaaa aagug 45
<210> 10
<211> 32
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> tracRNA
<400> 10
uagcaaguua aaauaaggcu aguccguuau ca 32
<210> 11
<211> 26
<212> RNA
<213> 人工序列(Artificial Sequence)
<220>
<223> tracRNA
<400> 11
uagcaaguua aaauaaggcu aguccg 26
<210> 12
<211> 66
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> Ku结合发夹
<400> 12
ttcttgtcgt acttatagat cgctacgtta tttcaatttt gaaaatctga gtcctgggag 60
tgcgga 66
<210> 13
<211> 609
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 端粒酶Ku
<400> 13
Met Ser Gly Trp Glu Ser Tyr Tyr Lys Thr Glu Gly Asp Glu Glu Ala
1 5 10 15
Glu Glu Glu Gln Glu Glu Asn Leu Glu Ala Ser Gly Asp Tyr Lys Tyr
20 25 30
Ser Gly Arg Asp Ser Leu Ile Phe Leu Val Asp Ala Ser Lys Ala Met
35 40 45
Phe Glu Ser Gln Ser Glu Asp Glu Leu Thr Pro Phe Asp Met Ser Ile
50 55 60
Gln Cys Ile Gln Ser Val Tyr Ile Ser Lys Ile Ile Ser Ser Asp Arg
65 70 75 80
Asp Leu Leu Ala Val Val Phe Tyr Gly Thr Glu Lys Asp Lys Asn Ser
85 90 95
Val Asn Phe Lys Asn Ile Tyr Val Leu Gln Glu Leu Asp Asn Pro Gly
100 105 110
Ala Lys Arg Ile Leu Glu Leu Asp Gln Phe Lys Gly Gln Gln Gly Gln
115 120 125
Lys Arg Phe Gln Asp Met Met Gly His Gly Ser Asp Tyr Ser Leu Ser
130 135 140
Glu Val Leu Trp Val Cys Ala Asn Leu Phe Ser Asp Val Gln Phe Lys
145 150 155 160
Met Ser His Lys Arg Ile Met Leu Phe Thr Asn Glu Asp Asn Pro His
165 170 175
Gly Asn Asp Ser Ala Lys Ala Ser Arg Ala Arg Thr Lys Ala Gly Asp
180 185 190
Leu Arg Asp Thr Gly Ile Phe Leu Asp Leu Met His Leu Lys Lys Pro
195 200 205
Gly Gly Phe Asp Ile Ser Leu Phe Tyr Arg Asp Ile Ile Ser Ile Ala
210 215 220
Glu Asp Glu Asp Leu Arg Val His Phe Glu Glu Ser Ser Lys Leu Glu
225 230 235 240
Asp Leu Leu Arg Lys Val Arg Ala Lys Glu Thr Arg Lys Arg Ala Leu
245 250 255
Ser Arg Leu Lys Leu Lys Leu Asn Lys Asp Ile Val Ile Ser Val Gly
260 265 270
Ile Tyr Asn Leu Val Gln Lys Ala Leu Lys Pro Pro Pro Ile Lys Leu
275 280 285
Tyr Arg Glu Thr Asn Glu Pro Val Lys Thr Lys Thr Arg Thr Phe Asn
290 295 300
Thr Ser Thr Gly Gly Leu Leu Leu Pro Ser Asp Thr Lys Arg Ser Gln
305 310 315 320
Ile Tyr Gly Ser Arg Gln Ile Ile Leu Glu Lys Glu Glu Thr Glu Glu
325 330 335
Leu Lys Arg Phe Asp Asp Pro Gly Leu Met Leu Met Gly Phe Lys Pro
340 345 350
Leu Val Leu Leu Lys Lys His His Tyr Leu Arg Pro Ser Leu Phe Val
355 360 365
Tyr Pro Glu Glu Ser Leu Val Ile Gly Ser Ser Thr Leu Phe Ser Ala
370 375 380
Leu Leu Ile Lys Cys Leu Glu Lys Glu Val Ala Ala Leu Cys Arg Tyr
385 390 395 400
Thr Pro Arg Arg Asn Ile Pro Pro Tyr Phe Val Ala Leu Val Pro Gln
405 410 415
Glu Glu Glu Leu Asp Asp Gln Lys Ile Gln Val Thr Pro Pro Gly Phe
420 425 430
Gln Leu Val Phe Leu Pro Phe Ala Asp Asp Lys Arg Lys Met Pro Phe
435 440 445
Thr Glu Lys Ile Met Ala Thr Pro Glu Gln Val Gly Lys Met Lys Ala
450 455 460
Ile Val Glu Lys Leu Arg Phe Thr Tyr Arg Ser Asp Ser Phe Glu Asn
465 470 475 480
Pro Val Leu Gln Gln His Phe Arg Asn Leu Glu Ala Leu Ala Leu Asp
485 490 495
Leu Met Glu Pro Glu Gln Ala Val Asp Leu Thr Leu Pro Lys Val Glu
500 505 510
Ala Met Asn Lys Arg Leu Gly Ser Leu Val Asp Glu Phe Lys Glu Leu
515 520 525
Val Tyr Pro Pro Asp Tyr Asn Pro Glu Gly Lys Val Thr Lys Arg Lys
530 535 540
His Asp Asn Glu Gly Ser Gly Ser Lys Arg Pro Lys Val Glu Tyr Ser
545 550 555 560
Glu Glu Glu Leu Lys Thr His Ile Ser Lys Gly Thr Leu Gly Lys Phe
565 570 575
Thr Val Pro Met Leu Lys Glu Ala Cys Arg Ala Tyr Gly Leu Lys Ser
580 585 590
Gly Leu Lys Lys Gln Glu Leu Leu Glu Ala Leu Thr Lys His Phe Gln
595 600 605
Asp
<210> 14
<211> 485
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 端粒酶Ku
<400> 14
Met Val Arg Ser Gly Asn Lys Ala Ala Val Val Leu Cys Met Asp Val
1 5 10 15
Gly Phe Thr Met Ser Asn Ser Ile Pro Gly Ile Glu Ser Pro Phe Glu
20 25 30
Gln Ala Lys Lys Val Ile Thr Met Phe Val Gln Arg Gln Val Phe Ala
35 40 45
Glu Asn Lys Asp Glu Ile Ala Leu Val Leu Phe Gly Thr Asp Gly Thr
50 55 60
Asp Asn Pro Leu Ser Gly Gly Asp Gln Tyr Gln Asn Ile Thr Val His
65 70 75 80
Arg His Leu Met Leu Pro Asp Phe Asp Leu Leu Glu Asp Ile Glu Ser
85 90 95
Lys Ile Gln Pro Gly Ser Gln Gln Ala Asp Phe Leu Asp Ala Leu Ile
100 105 110
Val Ser Met Asp Val Ile Gln His Glu Thr Ile Gly Lys Lys Phe Glu
115 120 125
Lys Arg His Ile Glu Ile Phe Thr Asp Leu Ser Ser Arg Phe Ser Lys
130 135 140
Ser Gln Leu Asp Ile Ile Ile His Ser Leu Lys Lys Cys Asp Ile Ser
145 150 155 160
Glu Arg His Ser Ile His Trp Pro Cys Arg Leu Thr Ile Gly Ser Asn
165 170 175
Leu Ser Ile Arg Ile Ala Ala Tyr Lys Ser Ile Leu Gln Glu Arg Val
180 185 190
Lys Lys Thr Trp Thr Val Val Asp Ala Lys Thr Leu Lys Lys Glu Asp
195 200 205
Ile Gln Lys Glu Thr Val Tyr Cys Leu Asn Asp Asp Asp Glu Thr Glu
210 215 220
Val Leu Lys Glu Asp Ile Ile Gln Gly Phe Arg Tyr Gly Ser Asp Ile
225 230 235 240
Val Pro Phe Ser Lys Val Asp Glu Glu Gln Met Lys Tyr Lys Ser Glu
245 250 255
Gly Lys Cys Phe Ser Val Leu Gly Phe Cys Lys Ser Ser Gln Val Gln
260 265 270
Arg Arg Phe Phe Met Gly Asn Gln Val Leu Lys Val Phe Ala Ala Arg
275 280 285
Asp Asp Glu Ala Ala Ala Val Ala Leu Ser Ser Leu Ile His Ala Leu
290 295 300
Asp Asp Leu Asp Met Val Ala Ile Val Arg Tyr Ala Tyr Asp Lys Arg
305 310 315 320
Ala Asn Pro Gln Val Gly Val Ala Phe Pro His Ile Lys His Asn Tyr
325 330 335
Glu Cys Leu Val Tyr Val Gln Leu Pro Phe Met Glu Asp Leu Arg Gln
340 345 350
Tyr Met Phe Ser Ser Leu Lys Asn Ser Lys Lys Tyr Ala Pro Thr Glu
355 360 365
Ala Gln Leu Asn Ala Val Asp Ala Leu Ile Asp Ser Met Ser Leu Ala
370 375 380
Lys Lys Asp Glu Lys Thr Asp Thr Leu Glu Asp Leu Phe Pro Thr Thr
385 390 395 400
Lys Ile Pro Asn Pro Arg Phe Gln Arg Leu Phe Gln Cys Leu Leu His
405 410 415
Arg Ala Leu His Pro Arg Glu Pro Leu Pro Pro Ile Gln Gln His Ile
420 425 430
Trp Asn Met Leu Asn Pro Pro Ala Glu Val Thr Thr Lys Ser Gln Ile
435 440 445
Pro Leu Ser Lys Ile Lys Thr Leu Phe Pro Leu Ile Glu Ala Lys Lys
450 455 460
Lys Asp Gln Val Thr Ala Gln Glu Ile Phe Gln Asp Asn His Glu Asp
465 470 475 480
Gly Pro Thr Ala Lys
485
<210> 15
<211> 10
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> Sm共有位点
<400> 15
aatttttgga 10
<210> 16
<211> 83
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 单体的Sm -样蛋白(archaea)
<400> 16
Gly Ser Val Ile Asp Val Ser Ser Gln Arg Val Asn Val Gln Arg Pro
1 5 10 15
Leu Asp Ala Leu Gly Asn Ser Leu Asn Ser Pro Val Ile Ile Lys Leu
20 25 30
Lys Gly Asp Arg Glu Phe Arg Gly Val Leu Lys Ser Phe Asp Leu His
35 40 45
Met Asn Leu Val Leu Asn Asp Ala Glu Glu Leu Glu Asp Gly Glu Val
50 55 60
Thr Arg Arg Leu Gly Thr Val Leu Ile Arg Gly Asp Asn Ile Val Tyr
65 70 75 80
Ile Ser Pro
<210> 17
<211> 25
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> MS2噬菌体操纵子茎环
<400> 17
gcgcacatga ggatcaccca tgtgc 25
<210> 18
<211> 117
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> MS2外壳蛋白
<400> 18
Met Ala Ser Asn Phe Thr Gln Phe Val Leu Val Asp Asn Gly Gly Thr
1 5 10 15
Gly Asp Val Thr Val Ala Pro Ser Asn Phe Ala Asn Gly Ile Ala Glu
20 25 30
Trp Ile Ser Ser Asn Ser Arg Ser Gln Ala Tyr Lys Val Thr Cys Ser
35 40 45
Val Arg Gln Ser Ser Ala Gln Asn Arg Lys Tyr Thr Ile Lys Val Glu
50 55 60
Val Pro Lys Gly Ala Trp Arg Ser Tyr Leu Asn Met Glu Leu Thr Ile
65 70 75 80
Pro Ile Phe Ala Thr Asn Ser Asp Cys Glu Leu Ile Val Lys Ala Met
85 90 95
Gln Gly Leu Leu Lys Asp Gly Asn Pro Ile Pro Ser Ala Ile Ala Ala
100 105 110
Asn Ser Gly Ile Tyr
115
<210> 19
<211> 26
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> PP7噬菌体操纵子茎环
<400> 19
ataaggagtt tatatggaaa ccctta 26
<210> 20
<211> 128
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> PP7外壳蛋白(PCP)
<400> 20
Met Ser Lys Thr Ile Val Leu Ser Val Gly Glu Ala Thr Arg Thr Leu
1 5 10 15
Thr Glu Ile Gln Ser Thr Ala Asp Arg Gln Ile Phe Glu Glu Lys Val
20 25 30
Gly Pro Leu Val Gly Arg Leu Arg Leu Thr Ala Ser Leu Arg Gln Asn
35 40 45
Gly Ala Lys Thr Ala Tyr Arg Val Asn Leu Lys Leu Asp Gln Ala Asp
50 55 60
Val Val Asp Cys Ser Thr Ser Val Cys Gly Glu Leu Pro Lys Val Arg
65 70 75 80
Tyr Thr Gln Val Trp Ser His Asp Val Thr Ile Val Ala Asn Ser Thr
85 90 95
Glu Ala Ser Arg Lys Ser Leu Tyr Asp Leu Thr Lys Ser Leu Val Ala
100 105 110
Thr Ser Gln Val Glu Asp Leu Val Val Asn Leu Val Pro Leu Gly Arg
115 120 125
<210> 21
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> SfMu Com茎环
<400> 21
ctgaatgcct gcgagcatc 19
<210> 22
<211> 62
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> SfMu Com结合蛋白
<400> 22
Met Lys Ser Ile Arg Cys Lys Asn Cys Asn Lys Leu Leu Phe Lys Ala
1 5 10 15
Asp Ser Phe Asp His Ile Glu Ile Arg Cys Pro Arg Cys Lys Arg His
20 25 30
Ile Ile Met Leu Asn Ala Cys Glu His Pro Thr Glu Lys His Cys Gly
35 40 45
Lys Arg Glu Lys Ile Thr His Ser Asp Glu Thr Val Arg Tyr
50 55 60
<210> 23
<211> 110
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> Rpo B RRDR簇I
<400> 23
gcagcagtga aagagttctt cggttccagc cagctgtctc agtttatgga ccagaacaac 60
ccgctgtctg agattacgca caaacgtcgt atctccgcac tcggcccagg 110
<210> 24
<211> 110
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> Rpo B RRDR簇I
<400> 24
cctgggccga gtgcggagat acgacgtttg tgcgtaatct cagacagcgg gttgttctgg 60
tccataaact gagacagctg gctggaaccg aagaactctt tcactgctgc 110
<210> 25
<211> 36
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> Rpo B RRDR簇I蛋白
<400> 25
Ala Ala Val Lys Glu Phe Phe Gly Ser Ser Gln Leu Ser Gln Phe Met
1 5 10 15
Asp Gln Asn Asn Pro Leu Ser Glu Ile Thr His Lys Arg Arg Ile Ser
20 25 30
Ala Leu Gly Pro
35
<210> 26
<211> 30
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 处理的Rpo B RRDR簇I Ts4
<400> 26
acaaacgtcg tatcttcgca ctcggcccag 30
<210> 27
<211> 30
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的 Rpo B RRDR簇I Ts4
<400> 27
acaaacgtcg tatctccgca ctcggcccag 30
<210> 28
<211> 10
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 处理的Rpo B RRDR簇I Ts4
<400> 28
His Lys Arg Arg Ile Phe Ala Leu Gly Pro
1 5 10
<210> 29
<211> 10
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的Rpo B RRDR簇I Ts4
<400> 29
His Lys Arg Arg Ile Ser Ala Leu Gly Pro
1 5 10
<210> 30
<211> 12
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的Rpo B RRDR簇I Ts3
<400> 30
Thr His Lys Arg Arg Ile Ser Ala Leu Gly Pro Gly
1 5 10
<210> 31
<211> 12
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 处理的Rpo B RRDR簇I Ts4
<400> 31
Thr His Lys Arg Arg Ile Phe Ala Leu Gly Pro Gly
1 5 10
<210> 32
<211> 36
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的Rpo B RRDR簇I Ts4
<400> 32
acgcacaaac gtcgtatctc cgcactcggc ccaggc 36
<210> 33
<211> 9
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的秩乱
<400> 33
Gly Ser Ser Gln Leu Ser Gln Phe Met
1 5
<210> 34
<211> 9
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 处理的秩乱
<400> 34
Gly Ser Ser Gln Leu Ser Arg Phe Met
1 5
<210> 35
<211> 27
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的秩乱
<400> 35
ggttccagcc agctgtctca gtttatg 27
<210> 36
<211> 5
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的秩乱
<400> 36
Arg Ile Ser Ala Leu
1 5
<210> 37
<211> 5
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 处理的秩乱
<400> 37
Arg Ile Phe Ala Leu
1 5
<210> 38
<211> 15
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的秩乱
<400> 38
cgtatctccg cactc 15
<210> 39
<211> 4
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的秩乱
<400> 39
Gly Leu Ile Asn
1
<210> 40
<211> 4
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> 处理的秩乱
<400> 40
Gly Leu Phe Asn
1
<210> 41
<211> 15
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 未处理的秩乱
<400> 41
ggtctgatca actct 15
<210> 42
<211> 51
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> nfGFP正义
<400> 42
ccctggccca ccctcgtgac caccctgacc tgcggcgtgc agtgcttcag c 51
<210> 43
<211> 51
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> nfGFP反义
<400> 43
gctgaagcac tgcacgccgc aggtcagggt ggtcacgagg gtgggccagg g 51
<210> 44
<211> 17
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> nfGFP蛋白
<400> 44
Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Cys Gly Val Gln Cys Phe
1 5 10 15
Ser
<210> 45
<211> 51
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HPGRT正义
<400> 45
ctgaaggggg gctataaatt ctttgctgac ctgctggatt acattaaagc a 51
<210> 46
<211> 51
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> HPGRT反义
<400> 46
tgctttaatg taatccagca ggtcagcaaa gaatttatag ccccccttca g 51
<210> 47
<211> 17
<212> PRT
<213> 人工序列(Artificial Sequence)
<220>
<223> HPGRT蛋白
<400> 47
Leu Lys Gly Gly Tyr Lys Phe Phe Ala Asp Leu Leu Asp Tyr Ile Lys
1 5 10 15
Ala
Claims (33)
1.一种系统,其包含:
(i)靶向序列的蛋白,或编码其的多核苷酸,
(ii)RNA支架,或编码其的DNA多核苷酸,其包含
(a)靶向核酸的基序,其包含与靶核酸序列互补的指导RNA序列,
(b)CRISPR基序,其能够结合所述靶向序列的蛋白,和
(c)募集RNA基序,
和
(iii)非核酸酶效应子融合蛋白,或编码其的多核苷酸,其包含
(a)RNA结合结构域,其能够结合所述募集RNA基序,
(b)接头,和
(c)效应子结构域,其具有DNA/RNA修饰的酶促活性。
2.根据权利要求1所述的系统,其中所述靶向序列的蛋白是CRISPR蛋白。
3.根据权利要求1或2所述的系统,其中所述靶向序列的蛋白不具有核酸酶活性。
4.根据权利要求1或2所述的系统,其中所述靶向序列蛋白包含选自由以下组成的菌种的dCas9或nCas9的序列:化脓性链球菌(Streptococcus pyogenes)、无乳链球菌(Streptococcus agalactiae)、金黄色葡萄球菌(Staphylococcus aureus)、嗜热链球菌(Streptococcus thermophilus)、脑膜炎奈瑟球菌(Neisseria meningitidis)和齿垢密螺旋体(Treponema denticola)。
5.根据权利要求1或2所述的系统,其中所述募集RNA基序和所述RNA结合结构域是成对地选自由以下组成的组:
端粒酶Ku结合基序和Ku蛋白或其RNA结合部分,
端粒酶Sm7结合基序和Sm7蛋白或其RNA结合部分,
MS2噬菌体操纵子茎环和MS2外壳蛋白或其RNA结合部分,
PP7噬菌体操纵子茎环和PP7外壳蛋白或其RNA结合部分,
SfMu噬菌体Com茎环和Com RNA结合蛋白或其RNA结合部分,和
非天然RNA适体和相应的适体配体或其RNA结合部分。
6.根据权利要求1或2所述的系统,其中所述接头长度为0至100个氨基酸残基。
7.根据权利要求1或2所述的系统,其中所述酶促活性是脱氨基活性,甲基转移酶活性,脱甲基酶活性,DNA修复活性,DNA损伤活性,歧化酶活性,烷基化活性,脱嘌呤活性,氧化活性,嘧啶二聚体形成活性,整合酶活性,转座酶活性,重组酶活性,聚合酶活性,连接酶活性,解旋酶活性,光裂合酶活性或糖基化酶活性。
8.根据权利要求7所述的系统,其中所述酶促活性是胞嘧啶脱氨基活性或腺苷脱氨基活性。
9.根据权利要求1或2所述的系统,其中所述RNA结合结构域不是Cas9或其功能等价物,也不是其RNA结合结构域。
10.分离的一种或多种核酸,其编码权利要求1-9中任一项所述的系统的组分(i)-(iii)。
11.表达载体或宿主细胞,其包含权利要求10所述的一种或多种核酸,所述细胞不是胚胎干细胞或生殖细胞。
12.靶DNA的位点特异性修饰方法,包括使所述靶DNA或包含所述靶DNA的细胞体外与权利要求1-9中任一项所述的系统的组分(i)-(iii)接触。
13.根据权利要求1-9中任一项所述的系统、根据权利要求10所述的一种或多种核酸或根据权利要求11所述的表达载体或宿主细胞在制备用于细胞中靶DNA的位点特异性修饰的药物中的用途。
14.根据权利要求12所述的方法或权利要求13所述的用途,其中所述靶DNA是染色体外DNA。
15.根据权利要求12所述的方法或权利要求13所述的用途,其中所述靶DNA是染色体上的基因组DNA。
16.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞选自由以下组成的组:古细菌细胞,细菌细胞,真核单细胞生物,干细胞,植物细胞,鱼细胞,蛙细胞,鸟类细胞,猪细胞,牛细胞,山羊细胞,绵羊细胞,大鼠细胞,小鼠细胞,非人灵长类动物细胞和人细胞。
17.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是体细胞。
18.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是真核细胞。
19.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是动物细胞。
20.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是哺乳动物细胞。
21.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是啮齿动物细胞。
22.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是无脊椎动物细胞或脊椎动物细胞。
23.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞来源于人或非人受试者。
24.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是非人动物生殖细胞或藻类细胞。
25.根据权利要求23所述的方法或用途,其中所述人或非人受试者具有基因的突变。
26.根据权利要求25所述的方法或用途,其中所述受试者患有由所述基因突变引起的疾病或有患该疾病的风险。
27.根据权利要求26所述的方法或用途,其中所述位点特异性修饰校正所述基因突变或使该基因表达失活。
28.根据权利要求23所述的方法或用途,其中所述受试者具有病原体或处于暴露于该病原体的风险中。
29.根据权利要求28所述的方法或用途,其中所述位点特异性修饰使所述病原体基因失活。
30.根据权利要求12-15中任一项所述的方法或用途,其中所述细胞是诱导多能干细胞。
31.试剂盒,其包含权利要求1-9中任一项所述的系统。
32.根据权利要求31所述的试剂盒,其进一步包含一种或多种选自由以下组成的组的组分:用于重构和/或稀释的试剂和用于将核酸或多肽引入宿主细胞的试剂。
33.根据权利要求32所述的试剂盒,其中所述宿主细胞是诱导多能干细胞。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201562192876P | 2015-07-15 | 2015-07-15 | |
US62/192,876 | 2015-07-15 | ||
PCT/US2016/042413 WO2017011721A1 (en) | 2015-07-15 | 2016-07-15 | Nuclease-independent targeted gene editing platform and uses thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108291218A CN108291218A (zh) | 2018-07-17 |
CN108291218B true CN108291218B (zh) | 2022-08-19 |
Family
ID=57758208
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201680053489.6A Active CN108291218B (zh) | 2015-07-15 | 2016-07-15 | 核酸酶非依赖性靶向基因编辑平台及其用途 |
Country Status (9)
Country | Link |
---|---|
US (2) | US11479793B2 (zh) |
EP (2) | EP3322804B1 (zh) |
JP (2) | JP7044373B2 (zh) |
CN (1) | CN108291218B (zh) |
CA (2) | CA3168241A1 (zh) |
DK (1) | DK3322804T3 (zh) |
ES (1) | ES2892625T3 (zh) |
HK (1) | HK1258288A1 (zh) |
WO (1) | WO2017011721A1 (zh) |
Families Citing this family (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013066438A2 (en) | 2011-07-22 | 2013-05-10 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
US20150044192A1 (en) | 2013-08-09 | 2015-02-12 | President And Fellows Of Harvard College | Methods for identifying a target site of a cas9 nuclease |
US9359599B2 (en) | 2013-08-22 | 2016-06-07 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
US9526784B2 (en) | 2013-09-06 | 2016-12-27 | President And Fellows Of Harvard College | Delivery system for functional nucleases |
US9322037B2 (en) | 2013-09-06 | 2016-04-26 | President And Fellows Of Harvard College | Cas9-FokI fusion proteins and uses thereof |
US9340799B2 (en) | 2013-09-06 | 2016-05-17 | President And Fellows Of Harvard College | MRNA-sensing switchable gRNAs |
US9068179B1 (en) | 2013-12-12 | 2015-06-30 | President And Fellows Of Harvard College | Methods for correcting presenilin point mutations |
WO2016022363A2 (en) | 2014-07-30 | 2016-02-11 | President And Fellows Of Harvard College | Cas9 proteins including ligand-dependent inteins |
JP7044373B2 (ja) | 2015-07-15 | 2022-03-30 | ラトガース,ザ ステート ユニバーシティ オブ ニュージャージー | ヌクレアーゼ非依存的な標的化遺伝子編集プラットフォームおよびその用途 |
IL294014B2 (en) | 2015-10-23 | 2024-07-01 | Harvard College | Nucleobase editors and their uses |
JP6872560B2 (ja) * | 2016-05-06 | 2021-05-19 | エム. ウルフ、トッド | プログラム可能ヌクレアーゼのあるゲノム編集及びプログラム可能ヌクレアーゼのないゲノム編集のための改善された方法 |
CN107522787A (zh) * | 2016-06-15 | 2017-12-29 | 中国科学院上海生命科学研究院 | 在细胞内产生点突变的融合蛋白、其制备及用途 |
CA3032699A1 (en) | 2016-08-03 | 2018-02-08 | President And Fellows Of Harvard College | Adenosine nucleobase editors and uses thereof |
AU2017308889B2 (en) | 2016-08-09 | 2023-11-09 | President And Fellows Of Harvard College | Programmable Cas9-recombinase fusion proteins and uses thereof |
US11542509B2 (en) | 2016-08-24 | 2023-01-03 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
WO2018071868A1 (en) | 2016-10-14 | 2018-04-19 | President And Fellows Of Harvard College | Aav delivery of nucleobase editors |
US10745677B2 (en) | 2016-12-23 | 2020-08-18 | President And Fellows Of Harvard College | Editing of CCR5 receptor gene to protect against HIV infection |
JP7219972B2 (ja) | 2017-01-05 | 2023-02-09 | ラトガース,ザ ステート ユニバーシティ オブ ニュー ジャージー | Dna二本鎖切断に非依存的な標的化遺伝子編集プラットフォームおよびその用途 |
EP3592853A1 (en) | 2017-03-09 | 2020-01-15 | President and Fellows of Harvard College | Suppression of pain by gene editing |
JP2020510038A (ja) * | 2017-03-09 | 2020-04-02 | プレジデント アンド フェローズ オブ ハーバード カレッジ | がんワクチン |
JP2020510439A (ja) | 2017-03-10 | 2020-04-09 | プレジデント アンド フェローズ オブ ハーバード カレッジ | シトシンからグアニンへの塩基編集因子 |
KR102280546B1 (ko) | 2017-03-22 | 2021-07-22 | 고쿠리츠다이가쿠호진 고베다이가쿠 | 세포 내재성 dna 변형 효소를 사용하여 표적화된 dna의 핵산 염기를 특이적으로 변환시키는, 세포의 핵산 서열의 변환 방법 및 이에 사용하는 분자 복합체 |
IL269458B2 (en) | 2017-03-23 | 2024-02-01 | Harvard College | Nucleic base editors that include nucleic acid programmable DNA binding proteins |
CN107384920B (zh) * | 2017-05-10 | 2020-07-14 | 中山大学 | 一套基于化脓性链球菌的碱基编辑系统及其在基因编辑中的应用 |
WO2018209320A1 (en) | 2017-05-12 | 2018-11-15 | President And Fellows Of Harvard College | Aptazyme-embedded guide rnas for use with crispr-cas9 in genome editing and transcriptional activation |
CN110959040A (zh) | 2017-05-25 | 2020-04-03 | 通用医疗公司 | 具有改进的精度和特异性的碱基编辑器 |
JP2020534795A (ja) | 2017-07-28 | 2020-12-03 | プレジデント アンド フェローズ オブ ハーバード カレッジ | ファージによって支援される連続的進化(pace)を用いて塩基編集因子を進化させるための方法および組成物 |
WO2019027728A1 (en) * | 2017-07-31 | 2019-02-07 | Sigma-Aldrich Co. Llc | SYNTHETIC GUIDE RNA FOR CRISPR / CAS ACTIVATOR SYSTEMS |
US11319532B2 (en) | 2017-08-30 | 2022-05-03 | President And Fellows Of Harvard College | High efficiency base editors comprising Gam |
US11795443B2 (en) | 2017-10-16 | 2023-10-24 | The Broad Institute, Inc. | Uses of adenosine base editors |
CN111448313A (zh) * | 2017-11-16 | 2020-07-24 | 阿斯利康(瑞典)有限公司 | 用于改善基于Cas9的敲入策略的有效性的组合物和方法 |
CN111885915B (zh) | 2018-03-19 | 2023-04-28 | 瑞泽恩制药公司 | 使用crispr/cas系统对动物进行转录调制 |
CN108949830B (zh) * | 2018-08-03 | 2021-11-26 | 福州大学 | 一种在鱼类中实现基因组编辑、精确定点基因敲入的方法 |
CN110878321B (zh) * | 2018-09-06 | 2023-08-22 | 上海科技大学 | 一种用于肺炎克雷伯菌基因编辑的表达载体 |
CN109385425A (zh) * | 2018-11-13 | 2019-02-26 | 中山大学 | 一种高特异性ABE碱基编辑系统及其在β血红蛋白病中的应用 |
US11946040B2 (en) | 2019-02-04 | 2024-04-02 | The General Hospital Corporation | Adenine DNA base editor variants with reduced off-target RNA editing |
CN114096666A (zh) | 2019-02-13 | 2022-02-25 | 比姆医疗股份有限公司 | 治疗血红素病变的组合物和方法 |
DE112020001342T5 (de) | 2019-03-19 | 2022-01-13 | President and Fellows of Harvard College | Verfahren und Zusammensetzungen zum Editing von Nukleotidsequenzen |
WO2020232373A1 (en) * | 2019-05-15 | 2020-11-19 | Board Of Regents, The University Of Texas System | Crispr methods for treating cancers |
EP3972654A4 (en) * | 2019-05-21 | 2024-07-03 | Beam Therapeutics Inc | METHODS FOR EDITING SINGLE NUCLEOTIDE POLYMORPHISM USING PROGRAMMABLE BASE EDITOR SYSTEMS |
US20220275359A1 (en) * | 2019-08-01 | 2022-09-01 | Astellas Pharma Inc. | Guide rna for editing target, with functional nucleotide sequence added |
EP3783104A1 (en) * | 2019-08-20 | 2021-02-24 | Kemijski Institut | Coiled-coil mediated tethering of crispr-cas and exonucleases for enhanced genome editing |
CN112481309B (zh) * | 2019-09-12 | 2023-10-10 | 石家庄埃佤基因科技有限公司 | Ago蛋白的用途及组合物和基因编辑方法 |
WO2021081264A1 (en) | 2019-10-24 | 2021-04-29 | Pairwise Plants Services, Inc. | Optimized crispr-cas nucleases and base editors and methods of use thereof |
EP4069842A1 (en) | 2019-12-02 | 2022-10-12 | Shape Therapeutics Inc. | Therapeutic editing |
MX2022010835A (es) * | 2020-03-03 | 2022-09-29 | Univ Leland Stanford Junior | Recombinacion del genoma guiada por arn en escala de kilobase. |
WO2021175289A1 (zh) * | 2020-03-04 | 2021-09-10 | 中国科学院遗传与发育生物学研究所 | 多重基因组编辑方法和系统 |
DE112021002672T5 (de) | 2020-05-08 | 2023-04-13 | President And Fellows Of Harvard College | Vefahren und zusammensetzungen zum gleichzeitigen editieren beider stränge einer doppelsträngigen nukleotid-zielsequenz |
EP4179079A1 (en) | 2020-07-10 | 2023-05-17 | Horizon Discovery Limited | Method for producing genetically modified cells |
EP4247951A2 (en) * | 2020-11-19 | 2023-09-27 | Wake Forest University Health Sciences | Vectors, systems and methods for eukaryotic gene editing |
EP4274902A1 (en) * | 2021-01-05 | 2023-11-15 | Horizon Discovery Limited | Guide rna designs and complexes for tracr-less type v cas systems |
WO2022148955A1 (en) | 2021-01-05 | 2022-07-14 | Horizon Discovery Limited | Method for producing genetically modified cells |
EP4274903A1 (en) * | 2021-01-05 | 2023-11-15 | Horizon Discovery Limited | Guide rna designs and complexes for type v cas systems |
JP2024521806A (ja) | 2021-05-25 | 2024-06-04 | アソシエーション セントロ デ インベスティゲイション クーペレイティヴァ アン ナノシエンシアス “シーアイシー ナノグネ” | 合成Casタンパク質 |
GB202107671D0 (en) | 2021-05-28 | 2021-07-14 | Asociacion Centro De Investig Cooperativa En Nanociencias Cic Nanogune | Synthetic cas proteins |
WO2024096790A1 (en) * | 2022-10-31 | 2024-05-10 | Soledits Ab | Modular crispr system |
CN116726198B (zh) * | 2023-03-13 | 2024-02-13 | 陕西师范大学 | 一种抗肿瘤药物及其在脑胶质瘤治疗中的应用 |
WO2024196921A1 (en) | 2023-03-20 | 2024-09-26 | Pioneer Hi-Bred International, Inc. | Cas polypeptides with altered pam recognition |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015089406A1 (en) * | 2013-12-12 | 2015-06-18 | President And Fellows Of Harvard College | Cas variants for gene editing |
Family Cites Families (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US63258A (en) | 1867-03-26 | David king | ||
US4625014A (en) | 1984-07-10 | 1986-11-25 | Dana-Farber Cancer Institute, Inc. | Cell-delivery agent |
US5061641A (en) | 1988-04-01 | 1991-10-29 | Immunomedics, Inc. | Method for radiolabeling proteins |
US5057301A (en) | 1988-04-06 | 1991-10-15 | Neorx Corporation | Modified cellular substrates used as linkers for increased cell retention of diagnostic and therapeutic agents |
WO1994012520A1 (en) | 1992-11-20 | 1994-06-09 | Enzon, Inc. | Linker for linked fusion polypeptides |
US6165476A (en) | 1997-07-10 | 2000-12-26 | Beth Israel Deaconess Medical Center | Fusion proteins with an immunoglobulin hinge region linker |
WO2008002661A2 (en) | 2006-06-28 | 2008-01-03 | The Board Of Trustees Of The Leland Stanford Junior University | Fusion protein constructs |
EP2476689B1 (en) | 2007-05-10 | 2015-10-21 | Agilent Technologies, Inc. | Thiocarbon-protecting groups for RNA synthesis |
US20110104787A1 (en) | 2009-11-05 | 2011-05-05 | President And Fellows Of Harvard College | Fusion Peptides That Bind to and Modify Target Nucleic Acid Sequences |
EP3144320B9 (en) | 2011-04-13 | 2018-08-22 | Bristol-Myers Squibb Company | Fc fusion proteins comprising novel linkers or arrangements |
DE202013012241U1 (de) | 2012-05-25 | 2016-01-18 | Emmanuelle Charpentier | Zusammensetzungen für die durch RNA gesteuerte Modifikation einer Ziel-DNA und für die durch RNA gesteuerte Modulation der Transkription |
JP5918909B2 (ja) | 2012-06-25 | 2016-05-18 | ザ ブリガム アンド ウィメンズ ホスピタル インコーポレイテッドThe Brigham and Women’s Hospital, Inc. | 標的化治療薬 |
EP2825654B1 (en) | 2012-12-12 | 2017-04-26 | The Broad Institute, Inc. | Crispr-cas component systems, methods and compositions for sequence manipulation |
US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
RU2699523C2 (ru) | 2012-12-17 | 2019-09-05 | Президент Энд Фэллоуз Оф Харвард Коллидж | Рнк-направляемая инженерия генома человека |
US9833312B2 (en) | 2012-12-18 | 2017-12-05 | Lensvector Inc. | Liquid crystal optical device with advanced electric field control capability |
US20140273230A1 (en) | 2013-03-15 | 2014-09-18 | Sigma-Aldrich Co., Llc | Crispr-based genome modification and regulation |
KR102210322B1 (ko) | 2013-03-15 | 2021-02-01 | 더 제너럴 하스피탈 코포레이션 | Rna-안내 게놈 편집의 특이성을 증가시키기 위한 rna-안내 foki 뉴클레아제(rfn)의 용도 |
US9234213B2 (en) | 2013-03-15 | 2016-01-12 | System Biosciences, Llc | Compositions and methods directed to CRISPR/Cas genomic engineering systems |
KR20160034901A (ko) | 2013-06-17 | 2016-03-30 | 더 브로드 인스티튜트, 인코퍼레이티드 | 서열 조작에 최적화된 crispr-cas 이중 닉카아제 시스템, 방법 및 조성물 |
WO2015089486A2 (en) | 2013-12-12 | 2015-06-18 | The Broad Institute Inc. | Systems, methods and compositions for sequence manipulation with optimized functional crispr-cas systems |
EP3985124A1 (en) * | 2013-12-26 | 2022-04-20 | The General Hospital Corporation | Multiplex guide rnas |
EP4063503A1 (en) * | 2014-02-11 | 2022-09-28 | The Regents of the University of Colorado, a body corporate | Crispr enabled multiplexed genome engineering |
CN111500569A (zh) * | 2014-03-05 | 2020-08-07 | 国立大学法人神户大学 | 特异性转变靶向dna序列的核酸碱基的基因组序列的修饰方法、及其使用的分子复合体 |
WO2016054106A1 (en) | 2014-09-29 | 2016-04-07 | The Regents Of The University Of California | SCAFFOLD RNAs |
KR102531016B1 (ko) | 2014-11-21 | 2023-05-10 | 리제너론 파마슈티칼스 인코포레이티드 | 쌍 형성된 가이드 rna를 사용하는 표적화된 유전자 변형을 위한 방법 및 조성물 |
JP7044373B2 (ja) | 2015-07-15 | 2022-03-30 | ラトガース,ザ ステート ユニバーシティ オブ ニュージャージー | ヌクレアーゼ非依存的な標的化遺伝子編集プラットフォームおよびその用途 |
IL294014B2 (en) | 2015-10-23 | 2024-07-01 | Harvard College | Nucleobase editors and their uses |
JP7219972B2 (ja) | 2017-01-05 | 2023-02-09 | ラトガース,ザ ステート ユニバーシティ オブ ニュー ジャージー | Dna二本鎖切断に非依存的な標的化遺伝子編集プラットフォームおよびその用途 |
KR20240043783A (ko) | 2021-08-06 | 2024-04-03 | 럿거스, 더 스테이트 유니버시티 오브 뉴 저지 | 유전자 변형 세포의 제조 방법 |
-
2016
- 2016-07-15 JP JP2018502145A patent/JP7044373B2/ja active Active
- 2016-07-15 US US15/744,505 patent/US11479793B2/en active Active
- 2016-07-15 CN CN201680053489.6A patent/CN108291218B/zh active Active
- 2016-07-15 DK DK16825229.4T patent/DK3322804T3/da active
- 2016-07-15 CA CA3168241A patent/CA3168241A1/en active Pending
- 2016-07-15 EP EP16825229.4A patent/EP3322804B1/en active Active
- 2016-07-15 EP EP21186227.1A patent/EP3957731A1/en active Pending
- 2016-07-15 ES ES16825229T patent/ES2892625T3/es active Active
- 2016-07-15 CA CA2992580A patent/CA2992580C/en active Active
- 2016-07-15 WO PCT/US2016/042413 patent/WO2017011721A1/en unknown
-
2019
- 2019-01-15 HK HK19100652.8A patent/HK1258288A1/zh unknown
-
2022
- 2022-03-10 JP JP2022037065A patent/JP7364268B2/ja active Active
- 2022-04-25 US US17/660,523 patent/US20220267806A1/en active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015089406A1 (en) * | 2013-12-12 | 2015-06-18 | President And Fellows Of Harvard College | Cas variants for gene editing |
Non-Patent Citations (1)
Title |
---|
Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds;Jesse G Zalatan等;《Cell》;20150115;第160卷(第1-2期);第16-19页 * |
Also Published As
Publication number | Publication date |
---|---|
CA3168241A1 (en) | 2017-01-19 |
EP3957731A1 (en) | 2022-02-23 |
DK3322804T3 (da) | 2021-10-04 |
WO2017011721A1 (en) | 2017-01-19 |
JP7364268B2 (ja) | 2023-10-18 |
EP3322804B1 (en) | 2021-09-01 |
HK1258288A1 (zh) | 2019-11-08 |
CA2992580C (en) | 2022-09-20 |
CN108291218A (zh) | 2018-07-17 |
JP2018520686A (ja) | 2018-08-02 |
ES2892625T3 (es) | 2022-02-04 |
US11479793B2 (en) | 2022-10-25 |
EP3322804A4 (en) | 2019-01-30 |
US20220267806A1 (en) | 2022-08-25 |
CA2992580A1 (en) | 2017-01-19 |
EP3322804A1 (en) | 2018-05-23 |
US20180327784A1 (en) | 2018-11-15 |
JP7044373B2 (ja) | 2022-03-30 |
JP2022095633A (ja) | 2022-06-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108291218B (zh) | 核酸酶非依赖性靶向基因编辑平台及其用途 | |
JP7219972B2 (ja) | Dna二本鎖切断に非依存的な標的化遺伝子編集プラットフォームおよびその用途 | |
JP6878468B2 (ja) | 標的ゲノム修飾を増強するためのプログラム可能なdna結合タンパク質の使用 | |
CN110770342B (zh) | Dna被编辑了的真核细胞的制造方法、和在该方法中使用的试剂盒 | |
EP3138912B1 (en) | Crispr-based genome modification and regulation | |
JP7109547B2 (ja) | 真核ゲノム修飾のための操作されたCas9システム | |
US20220290134A1 (en) | Highly Efficient DNA Base Editors Mediated By RNA-Aptamer Recruitment For Targeted Genome Modification And Uses Thereof | |
JP2023156365A (ja) | Crispr/cas融合タンパク質およびシステム | |
US20230058352A1 (en) | High Fidelity SpCas9 Nucleases for Genome Modification | |
KR20120112266A (ko) | Cmah 유전자를 표적으로 하는 징크 핑거 뉴클레아제 및 이의 용도 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1258288 Country of ref document: HK |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TG01 | Patent term adjustment |