CN113785066A - 通过靶向dmpk基因治疗肌营养不良症的方法 - Google Patents
通过靶向dmpk基因治疗肌营养不良症的方法 Download PDFInfo
- Publication number
- CN113785066A CN113785066A CN202080032034.2A CN202080032034A CN113785066A CN 113785066 A CN113785066 A CN 113785066A CN 202080032034 A CN202080032034 A CN 202080032034A CN 113785066 A CN113785066 A CN 113785066A
- Authority
- CN
- China
- Prior art keywords
- seq
- lys
- leu
- glu
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000008685 targeting Effects 0.000 title claims abstract description 107
- 238000000034 method Methods 0.000 title claims description 62
- 201000006938 muscular dystrophy Diseases 0.000 title abstract description 11
- 108010052185 Myotonin-Protein Kinase Proteins 0.000 title description 57
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 149
- 239000002773 nucleotide Substances 0.000 claims abstract description 143
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 143
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 139
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 139
- 239000002157 polynucleotide Substances 0.000 claims abstract description 139
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 121
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 99
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 88
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 88
- 239000012636 effector Substances 0.000 claims abstract description 86
- 101710163270 Nuclease Proteins 0.000 claims abstract description 83
- 230000002950 deficient Effects 0.000 claims abstract description 82
- 238000013518 transcription Methods 0.000 claims abstract description 69
- 230000035897 transcription Effects 0.000 claims abstract description 69
- 108091033409 CRISPR Proteins 0.000 claims abstract description 20
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract 9
- 239000013598 vector Substances 0.000 claims description 47
- 241000191967 Staphylococcus aureus Species 0.000 claims description 25
- 210000003205 muscle Anatomy 0.000 claims description 25
- 108091006107 transcriptional repressors Proteins 0.000 claims description 24
- 239000008194 pharmaceutical composition Substances 0.000 claims description 23
- 102100025169 Max-binding protein MNT Human genes 0.000 claims description 20
- 108010059343 MM Form Creatine Kinase Proteins 0.000 claims description 17
- 108010089612 myosin-heavy-chain kinase Proteins 0.000 claims description 16
- 239000013607 AAV vector Substances 0.000 claims description 14
- 206010068871 Myotonic dystrophy Diseases 0.000 claims description 12
- 239000013603 viral vector Substances 0.000 claims description 11
- 239000013600 plasmid vector Substances 0.000 claims description 10
- 101150082516 HDT1 gene Proteins 0.000 claims description 8
- -1 MBD2B Proteins 0.000 claims description 8
- 102000006890 Methyl-CpG-Binding Protein 2 Human genes 0.000 claims description 8
- 108010072388 Methyl-CpG-Binding Protein 2 Proteins 0.000 claims description 8
- 108090000506 Protein phosphatase inhibitor 1 Proteins 0.000 claims description 8
- 101100365087 Arabidopsis thaliana SCRA gene Proteins 0.000 claims description 7
- 101100365680 Arabidopsis thaliana SGT1B gene Proteins 0.000 claims description 7
- 101100417900 Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) rbr3A gene Proteins 0.000 claims description 7
- 101000651906 Homo sapiens Paired amphipathic helix protein Sin3a Proteins 0.000 claims description 7
- 101150034686 PDC gene Proteins 0.000 claims description 7
- 102100027334 Paired amphipathic helix protein Sin3a Human genes 0.000 claims description 7
- 101150105073 SCR1 gene Proteins 0.000 claims description 7
- 101100134054 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) NTG1 gene Proteins 0.000 claims description 7
- 101150068906 snr-6 gene Proteins 0.000 claims description 7
- 241000702421 Dependoparvovirus Species 0.000 claims description 6
- 230000002265 prevention Effects 0.000 claims description 6
- 241000702423 Adeno-associated virus - 2 Species 0.000 claims description 4
- 241001655883 Adeno-associated virus - 1 Species 0.000 claims description 3
- 241000972680 Adeno-associated virus - 6 Species 0.000 claims description 3
- 241001164823 Adeno-associated virus - 7 Species 0.000 claims description 3
- 241001164825 Adeno-associated virus - 8 Species 0.000 claims description 3
- 102000004097 Protein phosphatase inhibitor 1 Human genes 0.000 claims 1
- 108020004414 DNA Proteins 0.000 description 172
- 241000282414 Homo sapiens Species 0.000 description 171
- 210000004027 cell Anatomy 0.000 description 128
- 230000014509 gene expression Effects 0.000 description 116
- 101000901659 Homo sapiens Myotonin-protein kinase Proteins 0.000 description 64
- 102000018658 Myotonin-Protein Kinase Human genes 0.000 description 39
- 239000002609 medium Substances 0.000 description 39
- 230000001105 regulatory effect Effects 0.000 description 39
- 239000000203 mixture Substances 0.000 description 33
- 108010054155 lysyllysine Proteins 0.000 description 31
- 108091028043 Nucleic acid sequence Proteins 0.000 description 27
- 239000000523 sample Substances 0.000 description 24
- 108091027544 Subgenomic mRNA Proteins 0.000 description 23
- 239000001963 growth medium Substances 0.000 description 23
- 239000013612 plasmid Substances 0.000 description 23
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 21
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 21
- 108010068380 arginylarginine Proteins 0.000 description 21
- 108010068265 aspartyltyrosine Proteins 0.000 description 21
- 241000699670 Mus sp. Species 0.000 description 20
- 241000880493 Leptailurus serval Species 0.000 description 19
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 17
- 238000006243 chemical reaction Methods 0.000 description 17
- 108010003700 lysyl aspartic acid Proteins 0.000 description 16
- 239000002953 phosphate buffered saline Substances 0.000 description 16
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 15
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 15
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 15
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 15
- 108010073969 valyllysine Proteins 0.000 description 15
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 14
- 102000004389 Ribonucleoproteins Human genes 0.000 description 14
- 108010081734 Ribonucleoproteins Proteins 0.000 description 14
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 14
- 239000002299 complementary DNA Substances 0.000 description 14
- 108010034529 leucyl-lysine Proteins 0.000 description 14
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 13
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 13
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 13
- 108091028113 Trans-activating crRNA Proteins 0.000 description 13
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 13
- 238000004519 manufacturing process Methods 0.000 description 13
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 12
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 12
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 12
- 238000011529 RT qPCR Methods 0.000 description 12
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 12
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 11
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 11
- 241000700605 Viruses Species 0.000 description 11
- 150000001413 amino acids Chemical group 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 108090000765 processed proteins & peptides Proteins 0.000 description 11
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 10
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 10
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 10
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 108010077515 glycylproline Proteins 0.000 description 10
- 108010028295 histidylhistidine Proteins 0.000 description 10
- 241000701022 Cytomegalovirus Species 0.000 description 9
- 241000713666 Lentivirus Species 0.000 description 9
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 9
- 230000000295 complement effect Effects 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 108010003137 tyrosyltyrosine Proteins 0.000 description 9
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 8
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 8
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 8
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 8
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 8
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 8
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 8
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 8
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 8
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 8
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 8
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 8
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 8
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 8
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 125000000539 amino acid group Chemical group 0.000 description 8
- 108010053037 kyotorphin Proteins 0.000 description 8
- 108010017391 lysylvaline Proteins 0.000 description 8
- 210000004940 nucleus Anatomy 0.000 description 8
- 230000002441 reversible effect Effects 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- 239000006228 supernatant Substances 0.000 description 8
- 210000001519 tissue Anatomy 0.000 description 8
- 102100038740 Activator of RNA decay Human genes 0.000 description 7
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 7
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 7
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 7
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 7
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 7
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 7
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 7
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 7
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 7
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 7
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 7
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 7
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 7
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 7
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 7
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 7
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 7
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 7
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 7
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 7
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 7
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 7
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 7
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 7
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 7
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 7
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 7
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 7
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 7
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 7
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 7
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 7
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 7
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 7
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 7
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 7
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 7
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 7
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 7
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 7
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 7
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 7
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 7
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 7
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 7
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 7
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 7
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 7
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 7
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 7
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 7
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 7
- CGWHAXBNGYQBBK-JBACZVJFSA-N Glu-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)C1=CC=C(O)C=C1 CGWHAXBNGYQBBK-JBACZVJFSA-N 0.000 description 7
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 7
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 7
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 7
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 7
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 7
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 7
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 7
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 7
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 7
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 7
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 7
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 7
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 7
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 7
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 7
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 7
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 7
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 7
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 7
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 7
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 7
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 7
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 7
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 7
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 7
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 7
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 7
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 7
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 7
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 7
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 7
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 7
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 7
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 7
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 7
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 7
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 7
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 7
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 7
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 7
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 7
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 7
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 7
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 7
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 7
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 7
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 7
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 7
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 7
- XKJUFUPCHARJKX-UWVGGRQHSA-N Met-Gly-His Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 XKJUFUPCHARJKX-UWVGGRQHSA-N 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 7
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 7
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 7
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 7
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 7
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 7
- 108010079005 RDV peptide Proteins 0.000 description 7
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 7
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 7
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 7
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 7
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 7
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 7
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 7
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 7
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 7
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 7
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 7
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 7
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 7
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 7
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 7
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 7
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 7
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 7
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 7
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 7
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 7
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 7
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 7
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 7
- 238000011534 incubation Methods 0.000 description 7
- 108010009298 lysylglutamic acid Proteins 0.000 description 7
- 108010051242 phenylalanylserine Proteins 0.000 description 7
- 108010005652 splenotritin Proteins 0.000 description 7
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 7
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 6
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 6
- 102000012422 Collagen Type I Human genes 0.000 description 6
- 108010022452 Collagen Type I Proteins 0.000 description 6
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 6
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 6
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 6
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 6
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 6
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 6
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 6
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 6
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 6
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 6
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 6
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 6
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 6
- 108010003201 RGH 0205 Proteins 0.000 description 6
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 6
- 238000000338 in vitro Methods 0.000 description 6
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 6
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 239000002245 particle Substances 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 238000003753 real-time PCR Methods 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 210000002363 skeletal muscle cell Anatomy 0.000 description 6
- 108010068794 tyrosyl-tyrosyl-glutamyl-glutamic acid Proteins 0.000 description 6
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 5
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 5
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 5
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 5
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 5
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 5
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 5
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 5
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 5
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 5
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 5
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 5
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 5
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 5
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 5
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 5
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 5
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 5
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 5
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 5
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 5
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 5
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 5
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 5
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 5
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 5
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 5
- 101710172824 CRISPR-associated endonuclease Cas9 Proteins 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 5
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 5
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 5
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 5
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 5
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 5
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 5
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 5
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 5
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 5
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 5
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 5
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 5
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 5
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 5
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 5
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 5
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 5
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 5
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 5
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 5
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 5
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 5
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 5
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 5
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 5
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 5
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 5
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 5
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 5
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 5
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 5
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 5
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 5
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 5
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 5
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 5
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 5
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 5
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 5
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 5
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 5
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 5
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 5
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 5
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 5
- TWEWRDAAIYBJTO-ULQDDVLXSA-N Met-Tyr-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N TWEWRDAAIYBJTO-ULQDDVLXSA-N 0.000 description 5
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 5
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 5
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 5
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 5
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 5
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 5
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 5
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 5
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 5
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 5
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 5
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 5
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 5
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 5
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 5
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 5
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 5
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 5
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 5
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 5
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 108010008355 arginyl-glutamine Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 230000010261 cell growth Effects 0.000 description 5
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 230000003828 downregulation Effects 0.000 description 5
- 230000006872 improvement Effects 0.000 description 5
- 108010091871 leucylmethionine Proteins 0.000 description 5
- 210000004185 liver Anatomy 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 238000010361 transduction Methods 0.000 description 5
- 230000026683 transduction Effects 0.000 description 5
- 238000011144 upstream manufacturing Methods 0.000 description 5
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 4
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 4
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 4
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 4
- 102100024108 Dystrophin Human genes 0.000 description 4
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 4
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 4
- 229930182566 Gentamicin Natural products 0.000 description 4
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 4
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 4
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 4
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 4
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 4
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 4
- 101000583839 Homo sapiens Muscleblind-like protein 1 Proteins 0.000 description 4
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 4
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 4
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 4
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 4
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 4
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 4
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 4
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 4
- 102100030965 Muscleblind-like protein 1 Human genes 0.000 description 4
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 4
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 4
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 4
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 4
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 4
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 4
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 229930189065 blasticidin Natural products 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 210000000234 capsid Anatomy 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 229960002518 gentamicin Drugs 0.000 description 4
- 238000004806 packaging method and process Methods 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 230000000754 repressing effect Effects 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 3
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 3
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 3
- UEONJSPBTSWKOI-CIUDSAMLSA-N Asn-Gln-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O UEONJSPBTSWKOI-CIUDSAMLSA-N 0.000 description 3
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 3
- 241000589601 Francisella Species 0.000 description 3
- 101150112014 Gapdh gene Proteins 0.000 description 3
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 3
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 3
- 101001091266 Homo sapiens Kinesin-like protein KIF13A Proteins 0.000 description 3
- 101000764260 Homo sapiens Troponin T, cardiac muscle Proteins 0.000 description 3
- 108010091358 Hypoxanthine Phosphoribosyltransferase Proteins 0.000 description 3
- 102100029098 Hypoxanthine-guanine phosphoribosyltransferase Human genes 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- 102100034865 Kinesin-like protein KIF13A Human genes 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 3
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 3
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 3
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 3
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 3
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 3
- RVGRUAULSDPKGF-UHFFFAOYSA-N Poloxamer Chemical compound C1CO1.CC1CO1 RVGRUAULSDPKGF-UHFFFAOYSA-N 0.000 description 3
- 238000002123 RNA extraction Methods 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 description 3
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 3
- 102100026893 Troponin T, cardiac muscle Human genes 0.000 description 3
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 3
- 239000004480 active ingredient Substances 0.000 description 3
- 239000000074 antisense oligonucleotide Substances 0.000 description 3
- 238000012230 antisense oligonucleotides Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 238000010805 cDNA synthesis kit Methods 0.000 description 3
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 3
- 230000001351 cycling effect Effects 0.000 description 3
- 230000007547 defect Effects 0.000 description 3
- 238000010195 expression analysis Methods 0.000 description 3
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 3
- 102000048595 human DMPK Human genes 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 150000007523 nucleic acids Chemical class 0.000 description 3
- 230000000144 pharmacologic effect Effects 0.000 description 3
- 229920001993 poloxamer 188 Polymers 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- 108020005345 3' Untranslated Regions Proteins 0.000 description 2
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 108090000565 Capsid Proteins Proteins 0.000 description 2
- 102100023321 Ceruloplasmin Human genes 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 2
- 102100036279 DNA (cytosine-5)-methyltransferase 1 Human genes 0.000 description 2
- 101000860092 Francisella tularensis subsp. novicida (strain U112) CRISPR-associated endonuclease Cas12a Proteins 0.000 description 2
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 2
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 2
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 2
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- LMMPTUVWHCFTOT-GARJFASQSA-N His-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O LMMPTUVWHCFTOT-GARJFASQSA-N 0.000 description 2
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 2
- 102100039996 Histone deacetylase 1 Human genes 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101001035024 Homo sapiens Histone deacetylase 1 Proteins 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 2
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 2
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- PRCHKVGXZVTALR-KKUMJFAQSA-N Lys-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N PRCHKVGXZVTALR-KKUMJFAQSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- 241000282567 Macaca fascicularis Species 0.000 description 2
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 241000588650 Neisseria meningitidis Species 0.000 description 2
- 229930040373 Paraformaldehyde Natural products 0.000 description 2
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- 241000317981 Podosphaera fuliginea Species 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 2
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- 241000714474 Rous sarcoma virus Species 0.000 description 2
- 108091081021 Sense strand Proteins 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 2
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 2
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 2
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- LFCQXIXJQXWZJI-BZSNNMDCSA-N Tyr-His-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O LFCQXIXJQXWZJI-BZSNNMDCSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- 230000001594 aberrant effect Effects 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 229940096422 collagen type i Drugs 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 238000000432 density-gradient centrifugation Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 239000003797 essential amino acid Substances 0.000 description 2
- 235000020776 essential amino acid Nutrition 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 108060003196 globin Proteins 0.000 description 2
- 102000018146 globin Human genes 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 210000003917 human chromosome Anatomy 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000007901 in situ hybridization Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- QWTDNUCVQCZILF-UHFFFAOYSA-N isopentane Chemical compound CCC(C)C QWTDNUCVQCZILF-UHFFFAOYSA-N 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 2
- 238000010172 mouse model Methods 0.000 description 2
- 210000000663 muscle cell Anatomy 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 210000003098 myoblast Anatomy 0.000 description 2
- 210000004165 myocardium Anatomy 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 229920002866 paraformaldehyde Polymers 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 239000002342 ribonucleoside Substances 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 239000006152 selective media Substances 0.000 description 2
- 210000002027 skeletal muscle Anatomy 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 2
- 239000001509 sodium citrate Substances 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 238000004448 titration Methods 0.000 description 2
- 239000012096 transfection reagent Substances 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- YRIZYWQGELRKNT-UHFFFAOYSA-N 1,3,5-trichloro-1,3,5-triazinane-2,4,6-trione Chemical compound ClN1C(=O)N(Cl)C(=O)N(Cl)C1=O YRIZYWQGELRKNT-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- SVYBEBLNQGDRHF-UHFFFAOYSA-N 4-amino-N-(5-ethyl-1,3,4-thiadiazol-2-yl)benzenesulfonamide Chemical compound S1C(CC)=NN=C1NS(=O)(=O)C1=CC=C(N)C=C1 SVYBEBLNQGDRHF-UHFFFAOYSA-N 0.000 description 1
- AWXGSYPUMWKTBR-UHFFFAOYSA-N 4-carbazol-9-yl-n,n-bis(4-carbazol-9-ylphenyl)aniline Chemical compound C12=CC=CC=C2C2=CC=CC=C2N1C1=CC=C(N(C=2C=CC(=CC=2)N2C3=CC=CC=C3C3=CC=CC=C32)C=2C=CC(=CC=2)N2C3=CC=CC=C3C3=CC=CC=C32)C=C1 AWXGSYPUMWKTBR-UHFFFAOYSA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 241000202702 Adeno-associated virus - 3 Species 0.000 description 1
- 241000580270 Adeno-associated virus - 4 Species 0.000 description 1
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- 101100043932 Arabidopsis thaliana SUVH4 gene Proteins 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- 108091032955 Bacterial small RNA Proteins 0.000 description 1
- 108091079001 CRISPR RNA Proteins 0.000 description 1
- 101100063432 Caenorhabditis elegans dim-1 gene Proteins 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108091092236 Chimeric RNA Proteins 0.000 description 1
- 102100032918 Chromobox protein homolog 5 Human genes 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- BUIYOWKUSCTBRE-CIUDSAMLSA-N Cys-Arg-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O BUIYOWKUSCTBRE-CIUDSAMLSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- OEDPLIBVQGRKGZ-AVGNSLFASA-N Cys-Tyr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O OEDPLIBVQGRKGZ-AVGNSLFASA-N 0.000 description 1
- 108010009540 DNA (Cytosine-5-)-Methyltransferase 1 Proteins 0.000 description 1
- 102100024812 DNA (cytosine-5)-methyltransferase 3A Human genes 0.000 description 1
- 102100024810 DNA (cytosine-5)-methyltransferase 3B Human genes 0.000 description 1
- 101710123222 DNA (cytosine-5)-methyltransferase 3B Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108010024491 DNA Methyltransferase 3A Proteins 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 101100339522 Drosophila virilis HP1A gene Proteins 0.000 description 1
- 108010069091 Dystrophin Proteins 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 241000589599 Francisella tularensis subsp. novicida Species 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 1
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 1
- 102100029768 Histone-lysine N-methyltransferase SETD1A Human genes 0.000 description 1
- 102100028998 Histone-lysine N-methyltransferase SUV39H1 Human genes 0.000 description 1
- 101100327120 Homo sapiens CBX5 gene Proteins 0.000 description 1
- 101000931098 Homo sapiens DNA (cytosine-5)-methyltransferase 1 Proteins 0.000 description 1
- 101000804521 Homo sapiens Dystrophia myotonica WD repeat-containing protein Proteins 0.000 description 1
- 101001053946 Homo sapiens Dystrophin Proteins 0.000 description 1
- 101000865038 Homo sapiens Histone-lysine N-methyltransferase SETD1A Proteins 0.000 description 1
- 101000696705 Homo sapiens Histone-lysine N-methyltransferase SUV39H1 Proteins 0.000 description 1
- 101001050886 Homo sapiens Lysine-specific histone demethylase 1A Proteins 0.000 description 1
- 101000573526 Homo sapiens Membrane protein MLC1 Proteins 0.000 description 1
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 1
- 101000615495 Homo sapiens Methyl-CpG-binding domain protein 3 Proteins 0.000 description 1
- 101001023043 Homo sapiens Myoblast determination protein 1 Proteins 0.000 description 1
- 101000635885 Homo sapiens Myosin light chain 1/3, skeletal muscle isoform Proteins 0.000 description 1
- 101001071233 Homo sapiens PHD finger protein 1 Proteins 0.000 description 1
- 101000837344 Homo sapiens T-cell leukemia translocation-altered gene protein Proteins 0.000 description 1
- 238000009015 Human TaqMan MicroRNA Assay kit Methods 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- 241001112693 Lachnospiraceae Species 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- FSNCEEGOMTYXKY-JTQLQIEISA-N Lycoperodine 1 Natural products N1C2=CC=CC=C2C2=C1CN[C@H](C(=O)O)C2 FSNCEEGOMTYXKY-JTQLQIEISA-N 0.000 description 1
- 108090000362 Lymphotoxin-beta Proteins 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- 102100024985 Lysine-specific histone demethylase 1A Human genes 0.000 description 1
- 101150094019 MYOG gene Proteins 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 1
- 102100021291 Methyl-CpG-binding domain protein 3 Human genes 0.000 description 1
- 108091092878 Microsatellite Proteins 0.000 description 1
- 241000711408 Murine respirovirus Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 101100351033 Mus musculus Pax7 gene Proteins 0.000 description 1
- 208000010428 Muscle Weakness Diseases 0.000 description 1
- 206010028289 Muscle atrophy Diseases 0.000 description 1
- 206010028372 Muscular weakness Diseases 0.000 description 1
- 102100035077 Myoblast determination protein 1 Human genes 0.000 description 1
- 102100038380 Myogenic factor 5 Human genes 0.000 description 1
- 101710099061 Myogenic factor 5 Proteins 0.000 description 1
- 102100030740 Myosin light chain 1/3, skeletal muscle isoform Human genes 0.000 description 1
- 101710109784 Myosin regulatory light chain 12B Proteins 0.000 description 1
- 101710092698 Myosin regulatory light chain 2 Proteins 0.000 description 1
- 102100026057 Myosin regulatory light chain 2, atrial isoform Human genes 0.000 description 1
- 101710098224 Myosin regulatory light chain 2, atrial isoform Proteins 0.000 description 1
- 101710112127 Myosin regulatory light chain 2, skeletal muscle isoform Proteins 0.000 description 1
- 102100026925 Myosin regulatory light chain 2, ventricular/cardiac muscle isoform Human genes 0.000 description 1
- 101710105127 Myosin regulatory light chain 2, ventricular/cardiac muscle isoform Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 1
- 108020003217 Nuclear RNA Proteins 0.000 description 1
- 102000043141 Nuclear RNA Human genes 0.000 description 1
- 102100036879 PHD finger protein 1 Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- 238000011530 RNeasy Mini Kit Methods 0.000 description 1
- 101150065817 ROM2 gene Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 101001053942 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) Diphosphomevalonate decarboxylase Proteins 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 102000011990 Sirtuin Human genes 0.000 description 1
- 108050002485 Sirtuin Proteins 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 241000194020 Streptococcus thermophilus Species 0.000 description 1
- 102100028692 T-cell leukemia translocation-altered gene protein Human genes 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 101800005109 Triakontatetraneuropeptide Proteins 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 102000013534 Troponin C Human genes 0.000 description 1
- GRSCONMARGNYHA-PMVMPFDFSA-N Trp-Lys-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GRSCONMARGNYHA-PMVMPFDFSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- 241000212749 Zesius chrysomallus Species 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 239000011436 cob Substances 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000009295 crossflow filtration Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- SPTYHKZRPFATHJ-HYZXJONISA-N dT6 Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)CO)[C@@H](O)C1 SPTYHKZRPFATHJ-HYZXJONISA-N 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000002298 density-gradient ultracentrifugation Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 229910003460 diamond Inorganic materials 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- AFABGHUZZDYHJO-UHFFFAOYSA-N dimethyl butane Natural products CCCC(C)C AFABGHUZZDYHJO-UHFFFAOYSA-N 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 238000010842 high-capacity cDNA reverse transcription kit Methods 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 239000012510 hollow fiber Substances 0.000 description 1
- 102000055456 human DMWD Human genes 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 108091066619 miR-6700 stem-loop Proteins 0.000 description 1
- 230000020763 muscle atrophy Effects 0.000 description 1
- 230000004220 muscle function Effects 0.000 description 1
- 201000000585 muscular atrophy Diseases 0.000 description 1
- 210000000107 myocyte Anatomy 0.000 description 1
- 239000002105 nanoparticle Substances 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 230000009437 off-target effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 239000000565 sealant Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 229940054269 sodium pyruvate Drugs 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- NMEHNETUFHBYEG-IHKSMFQHSA-N tttn Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 NMEHNETUFHBYEG-IHKSMFQHSA-N 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000003260 vortexing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/11—Protein-serine/threonine kinases (2.7.11)
- C12Y207/11001—Non-specific serine/threonine protein kinase (2.7.11.1), i.e. casein kinase or checkpoint kinase
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
- A61K48/0058—Nucleic acids adapted for tissue specific expression, e.g. having tissue specific promoters as part of a contruct
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
- C12N15/1137—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against enzymes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/70—Carbohydrates; Sugars; Derivatives thereof
- A61K31/7088—Compounds having three or more nucleosides or nucleotides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/70—Carbohydrates; Sugars; Derivatives thereof
- A61K31/7088—Compounds having three or more nucleosides or nucleotides
- A61K31/7105—Natural ribonucleic acids, i.e. containing only riboses attached to adenine, guanine, cytosine or uracil and having 3'-5' phosphodiester links
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K35/00—Medicinal preparations containing materials or reaction products thereof with undetermined constitution
- A61K35/66—Microorganisms or materials therefrom
- A61K35/76—Viruses; Subviral particles; Bacteriophages
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
- A61K38/16—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- A61K38/43—Enzymes; Proenzymes; Derivatives thereof
- A61K38/46—Hydrolases (3)
- A61K38/465—Hydrolases (3) acting on ester bonds (3.1), e.g. lipases, ribonucleases
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
- A61K48/0066—Manipulation of the nucleic acid to modify its expression pattern, e.g. enhance its duration of expression, achieved by the presence of particular introns in the delivered nucleic acid
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P21/00—Drugs for disorders of the muscular or neuromuscular system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P21/00—Drugs for disorders of the muscular or neuromuscular system
- A61P21/04—Drugs for disorders of the muscular or neuromuscular system for myasthenia gravis
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Epidemiology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Virology (AREA)
- General Chemical & Material Sciences (AREA)
- Physical Education & Sports Medicine (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Neurology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Orthopedic Medicine & Surgery (AREA)
- Mycology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Immunology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明提供了包含下述碱基序列的多核苷酸:(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域,所述多核苷酸预期可用于治疗肌营养不良症。
Description
与相关申请的交叉引用
本申请要求2019年5月28日提交的美国临时专利申请号62/853,373和2020年5月15日提交的美国临时专利申请号63/025,417的利益,所述临时专利申请的内容整体通过参考并入本文。
技术领域
本发明涉及通过靶向人类肌强直素蛋白激酶(DMPK;强直性肌营养不良症蛋白激酶)基因等来治疗肌营养不良症的方法。更具体来说。本发明涉及使用靶向人类DMPK基因的特定序列的向导RNA和转录阻遏物与CRISPR(成簇规则间隔短回文重复序列)效应蛋白的融合蛋白,通过阻遏人类DMPK基因的表达来治疗或预防肌营养不良症的方法和药物组合物等。
背景技术
肌营养不良症是与进行性肌萎缩和肌无力相关的遗传疾病的总称。即使在今天,对肌营养不良症有效的根本治疗药物仍不存在,只是进行对症治疗。在肌营养不良症中,1型强直性肌营养不良症(DM1)由DMPK基因中的突变引起。
DM1是由DMPK基因的3’非翻译区(3’UTR)中CTG重复序列的延长引起的常染色体显性遗传病,并且是一种类型的三联体重复序列疾病。已报道在DM1中,含有扩展的CUG重复序列的RNA从内源RNA靶中分离出CUG重复序列结合蛋白例如MBNL(盲肌样蛋白),从而导致异常的剪接模式、RNA稳定性/定位的改变等。这些发现表明,扩展的重复序列基因座的沉默具有治疗价值,并将各种不同的方法例如反义寡核苷酸、小RNA、小分子等用于沉默毒性RNA(参见Pinto B等,Mol Cell.2017Nov 2,68(3):479-490,其整体通过参考并入本文)。
例如,Jauvin等人用靶向DMPK基因的3’UTR的反义寡核苷酸(ASO)治疗作为DM1的小鼠模型的DMSXL小鼠,并显示DMPK mRNA水平降低,核RNA聚集体(RNA灶点)减少并且肌肉强度增加,而没有检测到明显毒性(参见Jauvin D等,Mol Ther Nucleic Acids.2017年6月16日,7:465-474,其整体通过参考并入本文)。
WO2018/002812公开了一种例如使用CRISPR/Cas9系统通过基因组编辑来编辑细胞中的DMPK基因的方法,所述方法可用于治疗DMPK相关病症或障碍例如DM1(参见WO2018/002812,其整体通过参考并入本文)。
Pinto等人和Batra等人演示了将失活/无核酸酶活性的Cas9(dCas9)应用于DM1治疗的可能性。具体来说,Pinto等人将dCas9与针对CTG重复序列区的gRNA组合,并显示dCas9有效阻断扩展的微卫星重复序列的转录,由此可以在体外和体内(在作为DM1的小鼠模型的HSALR小鼠中)改善由重复序列扩展造成的DM1的特征性表型(参见Pinto B等,MolCell.2017年11月2日,68(3):479-490,其整体通过参考并入本文)。另一方面,Batra等人显示与RNA内切核酸酶融合的dCas9和针对DMPK mRNA的CUG重复序列区的gRNA的组合可以降低CUG重复序列扩展RNA的水平,并改善DM1患者细胞中的剪接异常(参见Batra R等,Cell.2017年8月24日,170(5):899-912,其整体通过参考并入本文)。
发明内容
因此,本发明的一个目的是为肌营养不良症(特别是DM1)提供新的治疗方法。
本发明的另一个目的是提供可用于治疗肌营养不良症的新药剂。
将在下面的详细描述中变得显而易见的这些和其他目的通过本发明的下述发现得以实现,即使用靶向人类DMPK基因(基因ID:1760)的特定序列的向导RNA和转录阻遏物与核酸酶缺陷型CRISPR效应蛋白的融合蛋白,可以强烈抑制人类DMPK基因的表达。本发明人在这些发现的基础上完成了本发明。
因此,本发明提供了下述项:
(1)一种多核苷酸,其包含下述碱基序列:
(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和
(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域。
(2)上述(1)所述的多核苷酸,其包含下述碱基序列:
(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和
(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域。
(3)上述(1)或(2)所述的多核苷酸,其包含下述碱基序列:
(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和
(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQID NO:63、SEQ ID NO:136、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:135、SEQ ID NO:109或SEQ ID NO:111所示的区域中长度为18至24个核苷酸的连续区域。
(4)上述(1)所述的多核苷酸,其中所述编码向导RNA的碱基序列包含SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ IDNO:68、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ IDNO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ IDNO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:43、SEQID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ ID NO:81、SEQID NO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ ID NO:91、SEQID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列。
(5)上述(1)至(4)中的任一项所述的多核苷酸,其包含编码向导RNA的至少两个碱基序列,其中所述至少两个碱基序列是不同的。
(6)上述(1)至(5)中的任一项所述的多核苷酸,其中所述转录阻遏物选自KRAB、MeCP2、SIN3A、HDT1、MBD2B、NIPP1和HP1A。
(7)上述(6)所述的多核苷酸,其中所述转录阻遏物是KRAB。
(8)上述(1)至(7)中的任一项所述的多核苷酸,其中所述核酸酶缺陷型CRISPR效应蛋白是dCas9。
(9)上述(8)所述的多核苷酸,其中所述dCas9源自于金黄色葡萄球菌(Staphylococcus aureus)。
(10)上述(1)至(9)中的任一项所述的多核苷酸,其还包含用于所述编码向导RNA的碱基序列的启动子序列和/或用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的启动子序列。
(11)上述(10)所述的多核苷酸,其中用于所述编码向导RNA的碱基序列的启动子序列选自U6启动子、SNR6启动子、SNR52启动子、SCR1启动子、RPR1启动子、U3启动子和H1启动子。
(12)上述(11)所述的多核苷酸,其中用于所述编码向导RNA的碱基序列的启动子序列是U6启动子。
(13)上述(10)至(12)中的任一项所述的多核苷酸,其中用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的启动子序列是遍在启动子或肌肉特异性启动子。
(14)上述(13)所述的多核苷酸,其中所述遍在启动子选自EFS启动子、CMV启动子和CAG启动子。
(15)上述(13)所述的多核苷酸,其中所述肌肉特异性启动子选自CK8启动子、肌球蛋白重链激酶(MHCK)启动子、肌肉肌酸激酶(MCK)启动子、合成C5-12(Syn)启动子和Des启动子。
(16)上述(15)所述的多核苷酸,其中所述肌肉特异性启动子是CK8启动子。
(17)上述(10)至(16)中的任一项所述的多核苷酸,
其中所述编码向导RNA的碱基序列包含SEQ ID NO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQID NO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的碱基序列,
所述转录阻遏物是KRAB,
所述核酸酶缺陷型CRISPR效应蛋白是源自于金黄色葡萄球菌的dCas9,
用于所述编码向导RNA的碱基序列的启动子序列是U6启动子,并且
用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的启动子序列是CK8启动子。
(18)上述(17)所述的多核苷酸,
其中所述编码向导RNA的碱基序列包含SEQ ID NO:83所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:83所示的碱基序列。
(19)一种载体,其包含上述(1)至(18)中的任一项所述的多核苷酸。
(20)上述(19)所述的载体,其中所述载体是质粒载体或病毒载体。
(21)上述(20)所述的载体,其中所述病毒载体选自腺相关病毒(AAV)载体、腺病毒载体和慢病毒载体。
(22)上述(21)所述的载体,其中所述AAV载体选自AAV1、AAV2、AAV6、AAV7、AAV8、AAV9、Anc80、AAV587MTP、AAV588MTP、AAV-B1、AAVM41和AAVrh74。
(23)(22)所述的载体,其中所述AAV载体是AAV9。
(24)一种药物组合物,其包含上述(1)至(18)中的任一项所述的多核苷酸或上述(19)至(23)中的任一项所述的载体。
(25)上述(24)所述的药物组合物,其用于治疗或预防1型强直性肌营养不良症。
(26)一种治疗或预防1型强直性肌营养不良症的方法,所述方法包括向需要的对象给药上述(1)至(18)中的任一项所述的多核苷酸或上述(19)至(23)中的任一项所述的载体。
(27)上述(1)至(18)中的任一项所述的多核苷酸或上述(19)至(23)中的任一项所述的载体的用途,其用于治疗或预防1型强直性肌营养不良症。
(28)上述(1)至(18)中的任一项所述的多核苷酸或上述(19)至(23)中的任一项所述的载体的用途,其用于制造治疗或预防1型强直性肌营养不良症的药物组合物。
(29)一种核糖核蛋白,其包含下述组分:
(c)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白,和
(d)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA。
(30)上述(29)所述的核糖核蛋白,其包含下述组分:
(c)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白,和
(d)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA。
(31)上述(29)或(30)所述的核糖核蛋白,其包含下述组分:
(c)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白,和
(d)靶向人类DMPK基因的表达调控区中SEQ ID NO:63、SEQ ID NO:136、SEQ IDNO:83、SEQ ID NO:99、SEQ ID NO:135、SEQ ID NO:109或SEQ ID NO:111所示的区域中长度为18至24个核苷酸的连续区域的向导RNA。
(32)上述(29)所述的核糖核蛋白,其中所述向导RNA包含SEQ ID NO:157、SEQ IDNO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ IDNO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ IDNO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQID NO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQID NO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQID NO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列。
(33)上述(29)至(32)中的任一项所述的核糖核蛋白,其中所述转录阻遏物选自KRAB、MeCP2、SIN3A、HDT1、MBD2B、NIPP1和HP1A。
(34)上述(29)至(33)中的任一项所述的核糖核蛋白,其中所述转录阻遏物是KRAB。
(35)上述(29)至(34)中的任一项所述的核糖核蛋白,其中所述核酸酶缺陷型CRISPR效应蛋白是dCas9。
(36)上述(35)所述的核糖核蛋白,其中所述dCas9源自于金黄色葡萄球菌。
(37)上述(29)至(36)中的任一项所述的核糖核蛋白,
其中所述向导RNA包含SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ IDNO:177所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列,
其中所述转录阻遏物是KRAB,并且
其中所述核酸酶缺陷型CRISPR效应蛋白是源自于金黄色葡萄球菌的dCas9。
(38)上述(37)所述的核糖核蛋白,其中所述向导RNA包含SEQ ID NO:171所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:171所示的碱基序列。
(39)一种用于抑制人类DMPK基因的表达的组合物或药剂盒,所述组合物或药剂盒包含下述组分:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
(40)上述(39)所述的组合物或药剂盒,所述组合物或药剂盒包含下述组分:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
(41)上述(39)或(40)所述的组合物或药剂盒,所述组合物或药剂盒包含下述组分:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:63、SEQ ID NO:136、SEQ IDNO:83、SEQ ID NO:99、SEQ ID NO:135、SEQ ID NO:109或SEQ ID NO:111所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
(42)上述(39)所述的组合物或药剂盒,其中所述向导RNA包含SEQ ID NO:157、SEQID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ IDNO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ IDNO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ IDNO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列。
(43)上述(39)至(42)所述的组合物或药剂盒,所述组合物或药剂盒包含至少两个不同的向导RNA或编码至少两个不同的向导RNA的多核苷酸或至少两个编码向导RNA的多核苷酸,其中所述至少两个多核苷酸是不同的。
(44)上述(39)至(43)中的任一项所述的组合物或药剂盒,其中所述转录阻遏物选自KRAB、MeCP2、SIN3A、HDT1、MBD2B、NIPP1和HP1A。
(45)上述(44)所述的组合物或药剂盒,其中所述转录阻遏物是KRAB。
(46)上述(39)至(45)中的任一项所述的组合物或药剂盒,其中所述核酸酶缺陷型CRISPR效应蛋白是dCas9。
(47)上述(46)所述的组合物或药剂盒,其中所述dCas9源自于金黄色葡萄球菌。
(48)上述(39)至(47)中的任一项所述的组合物或药剂盒,
其中所述组合物或药剂盒包含编码所述融合蛋白的多核苷酸和编码所述向导RNA的多核苷酸,并且
其中所述编码所述融合蛋白的多核苷酸还包含用于所述融合蛋白的启动子序列,和/或所述编码所述向导RNA的多核苷酸还包含用于所述向导RNA的启动子序列。
(49)上述(48)所述的组合物或药剂盒,其中用于所述向导RNA的启动子序列选自U6启动子、SNR6启动子、SNR52启动子、SCR1启动子、RPR1启动子、U3启动子和H1启动子。
(50)上述(48)所述的组合物或药剂盒,其中用于所述融合蛋白的启动子序列是遍在启动子或肌肉特异性启动子。
(51)上述(50)所述的组合物或药剂盒,其中所述遍在启动子选自EFS启动子、CMV启动子和CAG启动子。
(52)上述(50)所述的组合物或药剂盒,其中所述肌肉特异性启动子选自CK8启动子、肌球蛋白重链激酶(MHCK)启动子、肌肉肌酸激酶(MCK)启动子、合成C5-12(Syn)启动子和Des启动子。
(53)上述(48)至(52)中的任一项所述的组合物或药剂盒,其中所述向导RNA包含SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列。
其中所述转录阻遏物是KRAB,
其中所述核酸酶缺陷型CRISPR效应蛋白是源自于金黄色葡萄球菌的dCas9,
其中用于所述向导RNA的启动子序列是U6启动子,并且
其中用于所述融合蛋白的启动子序列是CK8启动子。
(54)上述(53)所述的组合物或药剂盒,其中所述向导RNA包含SEQ ID NO:171所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:171所示的碱基序列。
(55)一种治疗或预防1型强直性肌营养不良症的方法,所述方法包括给药下述(e)和(f)的步骤:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
(56)上述(55)所述的方法,其包括给药下述(e)和(f)的步骤:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
(57)上述(55)或(56)所述的方法,其包括给药下述(e)和(f)的步骤:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:63、SEQ ID NO:136、SEQ IDNO:83、SEQ ID NO:99、SEQ ID NO:135、SEQ ID NO:109或SEQ ID NO:111所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
(58)上述(55)所述的方法,其中所述向导RNA包含SEQ ID NO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ IDNO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ IDNO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ IDNO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列。
(59)上述(55)至(58)所述的方法,其包括给药至少两个不同的向导RNA或编码至少两个不同的向导RNA的多核苷酸或至少两个编码向导RNA的多核苷酸,其中所述至少两个多核苷酸是不同的。
(60)上述(55)至(59)所述的方法,其中所述转录阻遏物选自KRAB、MeCP2、SIN3A、HDT1、MBD2B、NIPP1和HP1A。
(61)上述(60)所述的方法,其中所述转录阻遏物是KRAB。
(62)上述(55)至(61)中的任一项所述的方法,其中所述核酸酶缺陷型CRISPR效应蛋白是dCas9。
(63)上述(62)所述的方法,其中所述dCas9源自于金黄色葡萄球菌。
(64)上述(55)至(63)中的任一项所述的方法,
其中所述方法包括给药编码所述融合蛋白的多核苷酸和编码所述向导RNA的多核苷酸,并且
其中所述编码所述融合蛋白的多核苷酸还包含用于所述融合蛋白的启动子序列,和/或所述编码所述向导RNA的多核苷酸还包含用于所述向导RNA的启动子序列。
(65)上述(64)所述的方法,其中用于所述向导RNA的启动子序列选自U6启动子、SNR6启动子、SNR52启动子、SCR1启动子、RPR1启动子、U3启动子和H1启动子。
(66)上述(64)所述的方法,其中用于所述融合蛋白的启动子序列是遍在启动子或肌肉特异性启动子。
(67)上述(66)所述的方法,其中所述遍在启动子选自EFS启动子、CMV启动子和CAG启动子。
(68)上述(66)所述的方法,其中所述肌肉特异性启动子选自CK8启动子、肌球蛋白重链激酶(MHCK)启动子、肌肉肌酸激酶(MCK)启动子、合成C5-12(Syn)启动子和Des启动子。
(69)上述(64)至(68)中的任一项所述的方法,其中所述向导RNA包含SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ IDNO:177所示的碱基序列,
其中所述转录阻遏物是KRAB,
其中所述核酸酶缺陷型CRISPR效应蛋白是源自于金黄色葡萄球菌的dCas9,
其中用于所述向导RNA的启动子序列是U6启动子,并且
其中用于所述融合蛋白的启动子序列是CK8启动子。
(70)上述(69)所述的方法,其中所述向导RNA包含SEQ ID NO:171所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:171所示的碱基序列。
(71)下述组分的用途:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸,
所述组分用于制造治疗或预防1型强直性肌营养不良症的药物组合物。
(72)上述(71)所述的下述组分(e)和(f)的用途:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸,
所述组分用于制造治疗或预防1型强直性肌营养不良症的药物组合物。
(73)上述(71)或(72)所述的下述组分(e)和(f)的用途:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:63、SEQ ID NO:136、SEQ IDNO:83、SEQ ID NO:99、SEQ ID NO:135、SEQ ID NO:109或SEQ ID NO:111所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸,
所述组分用于制造治疗或预防1型强直性肌营养不良症的药物组合物。
(74)上述(71)所述的用途,其中所述向导RNA包含SEQ ID NO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ IDNO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ IDNO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ IDNO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列。
(75)上述(71)至(74)所述的用途,其包括使用至少两个不同的向导RNA或编码至少两个不同的向导RNA的多核苷酸或至少两个编码向导RNA的多核苷酸,其中所述至少两个多核苷酸是不同的。
(76)上述(71)至(75)所述的用途,其中所述转录阻遏物选自KRAB、MeCP2、SIN3A、HDT1、MBD2B、NIPP1和HP1A。
(77)上述(76)所述的用途,其中所述转录阻遏物是KRAB。
(78)上述(71)至(77)中的任一项所述的用途,其中所述核酸酶缺陷型CRISPR效应蛋白是dCas9。
(79)上述(78)所述的用途,其中所述dCas9源自于金黄色葡萄球菌。
(80)上述(71)至(79)中的任一项所述的用途,
其中所述用途包括使用编码所述融合蛋白的多核苷酸和使用编码所述向导RNA的多核苷酸,并且
其中所述编码所述融合蛋白的多核苷酸还包含用于所述融合蛋白的启动子序列,和/或所述编码所述向导RNA的多核苷酸还包含用于所述向导RNA的启动子序列。
(81)上述(80)所述的用途,其中用于所述向导RNA的启动子序列选自U6启动子、SNR6启动子、SNR52启动子、SCR1启动子、RPR1启动子、U3启动子和H1启动子。
(82)上述(80)所述的用途,其中用于所述融合蛋白的启动子序列是遍在启动子或肌肉特异性启动子。
(83)上述(82)所述的用途,其中所述遍在启动子选自EFS启动子、CMV启动子和CAG启动子。
(84)上述(82)所述的用途,其中所述肌肉特异性启动子选自CK8启动子、肌球蛋白重链激酶(MHCK)启动子、肌肉肌酸激酶(MCK)启动子、合成C5-12(Syn)启动子和Des启动子。
(85)上述(80)至(84)所述的用途,其中所述向导RNA包含SEQ ID NO:164、SEQ IDNO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列,
其中所述转录阻遏物是KRAB,
其中所述核酸酶缺陷型CRISPR效应蛋白是源自于金黄色葡萄球菌的dCas9,
其中用于所述向导RNA的启动子序列是U6启动子,并且
其中用于所述融合蛋白的启动子序列是CK8启动子。
(86)上述(85)所述的用途,其中所述向导RNA包含SEQ ID NO:171所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:171所示的碱基序列。
本发明的效果
根据本发明,可以抑制人类DMPK基因的表达,并且因此预计本发明能够治疗和/或预防DM1。
附图说明
当结合附图考虑时,通过参考下述详细描述将会更容易地获得对本发明及其许多附带优点的更完整的认识,因为它们变得更好理解,在所述附图中:
图1示出了SEQ ID NO:4至126所示的靶向序列的位置,其中黑框示出了显示出人类DMPK基因表达的不少于50%的降低的靶向序列的位置。
图2示出了使用dSaCas9-KRAB和包含分别由SEQ ID NO:4至126所示的靶向序列编码的crRNA的sgRNA评估的对人类DMPK基因的表达抑制作用的结果。水平轴示出了包含由每个靶向序列编码的crRNA的sgRNA,竖直轴示出了在使用每个sgRNA时DMPK基因的表达水平与在使用对照sgRNA时DMPK基因的表达水平(100%)的比率,并且误差条示出了标准偏差。
图3示出了当使用dSaCas9-KRAB和分别包含由靶向序列编码的crRNA的sgRNA控制人类DMPK基因的表达时,SEQ ID NO:4至126所示的靶向序列的位置与人类DMPK基因的表达水平之间的关系。
图4示出了在人类肌细胞中DMPK的下调。
图5显示了在DMSXL小鼠中AAV9-695抑制DMPK表达(A:胫前肌;B:心脏;C:肝脏)。
图6显示了在DMSXL小鼠中AAV9-245抑制DMPK表达(A:胫前肌;B:心脏;C:肝脏)。
图7显示了在DMSXL小鼠中AAV9-257抑制DMPK表达(A:胫前肌;B:心脏;C:肝脏)。
图8显示了在DMSXL小鼠中AAV9-695改善RNA灶点形成。
图9示出了在表达hDMPK sgRNA的iDM细胞中DMPK基因表达的抑制。
图10示出了在表达hDMPK sgRNA的iDM细胞中RNA灶点形成的改善(A:iDM-695细胞和iDM-对照细胞的典型图像;B:每个细胞中RNA灶点阳性的核的比率)。
图11示出了在表达hDMPK sgRNA的iDM细胞中剪接缺陷的改善(A:基因的凝胶图像和外显子模式;B:正常剪接产物的比率)。
具体实施方式
1.多核苷酸
本发明提供了一种包含下述碱基序列的多核苷酸(在后文中有时被称为“本发明的多核苷酸”):
(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和
(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域(即18至24个毗连核苷酸)。
本发明的多核苷酸被引入到所需细胞中并转录,产生核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白和靶向人类DMPK基因的表达调控区的特定区域的向导RNA。这些融合蛋白和向导RNA形成复合体(在后文中所述复合体有时被称为“核糖核蛋白;RNP”)并协同作用于上述特定区域,从而阻遏人类DMPK基因的转录。在本发明的一个实施方式中,所述人类DMPK基因的表达可以被抑制例如不少于约40%、不少于约50%、不少于约60%、不少于约70%、不少于约75%、不少于约80%、不少于约85%、不少于约90%、不少于约95%或约100%。
(1)定义
在本说明书中,“人类DMPK基因的表达调控区”意味着通过RNP与该区域的结合可以阻遏人类DMPK基因的表达的任何区域。也就是说,人类DMPK基因的表达调控区可能存在于人类DMPK基因的任何区域例如启动子区、增强子区、内含子、外显子和人类DMPK基因的邻近基因(例如人类DMWD(DM1基因座,含有WD重复序列)基因)中,只要人类DMPK基因的表达被RNP的结合阻遏即可。在本说明书中,当表达调控区用特定序列示出时,所述表达调控区在概念上包括正义链序列和反义链序列两者。
在本发明中,核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白被向导RNA召集到人类DMPK基因的表达调控区的特定区域中。在本说明书中,“靶向……的向导RNA”意味着“将融合蛋白召集到……中的向导RNA”。
在本说明书中,“向导RNA(被称为“gRNA”)”是包含基因组特异性CRISPR-RNA(被称为“crRNA”)的RNA。crRNA是与靶向序列(在晚些时候描述)的互补序列结合的RNA。当使用Cpf1作为CRISPR效应蛋白时,“向导RNA”是指包含由crRNA构成的RNA和附连到其5’-端的特定序列的RNA(例如在FnCpf1的情况下,SEQ ID NO:138所示的RNA序列)。当使用Cas9作为CRISPR效应蛋白时,“向导RNA”是指包含crRNA和附连到其3’-端的反式作用crRNA(被称为“tracrRNA”)的嵌合RNA(被称为“单一向导RNA(sgRNA)”)(参见例如Zhang F.等,Hum MolGenet.2014年9月15日;23(R1):R40-6和Zetsche B.等,Cell.2015年10月22日;163(3):759-71,它们整体通过参考并入本文)。
在本说明书中,与crRNA在人类DMPK基因的表达调控区中结合的序列互补的序列被称为“靶向序列”。也就是说,在本说明书中,“靶向序列”是存在于人类DMPK基因的表达调控区中并与PAM(前间区序列邻近基序)相邻的DNA序列。当使用Cpf1作为CRISPR效应蛋白时,PAM与靶向序列的5’-侧相邻。当使用Cas9作为CRISPR效应蛋白时,PAM与靶向序列的3’-侧相邻。所述靶向序列可以存在于人类DMPK基因的表达调控区的正义链序列侧或反义链序列侧上(参见例如上述的Zhang F.等,Hum Mol Genet.2014年9月15日;23(R1):R40-6和Zetsche B.等,Cell.2015年10月22日;163(3):759-71,它们整体通过参考并入本文)。
(2)核酸酶缺陷型CRISPR效应蛋白
在本发明中,使用核酸酶缺陷型CRISPR效应蛋白,与其融合的转录阻遏物被召集到人类DMPK基因的表达调控区。对本发明中使用的核酸酶缺陷型CRISPR效应蛋白(在后文中有时被简称为“CRISPR效应蛋白”)没有特别限制,只要它与gRNA形成复合体并被召集到人类DMPK基因的表达调控区即可。例如,可以包括核酸酶缺陷型Cas9(在后文中有时也被称为“dCas9”)或核酸酶缺陷型Cpf1(在后文中有时也被称为“dCpf1”)。
上述dCas9的实例包括但不限于源自于酿脓链球菌(Streptococcus pyogenes)的Cas9(SpCas9;PAM序列:NGG(N是A、G、T或C,下同))、源自于嗜热链球菌(Streptococcusthermophilus)的Cas9(St1Cas9;PAM序列:NNAGAAW(W是A或T,下同),St3Cas9;PAM序列:NGGNG)、源自于脑膜炎奈瑟氏菌(Neisseria meningitidis)的Cas9(NmCas9;PAM序列:NNNNGATT)或源自于金黄色葡萄球菌的Cas9(SaCas9;PAM序列:NNGRRT(R是A或G,下同))等的核酸酶缺陷型变体(参见例如Nishimasu等,Cell.2014年2月27日;156(5):935–49;Esvelt KM等,Nat Methods.2013年11月;10(11):1116-21;Zhang Y.Mol Cell.2015年10月15日;60(2):242-55;和Friedland AE等,Genome Biol.2015年11月24日;16:257,它们整体通过参考并入本文)。例如,在SpCas9的情况下,可以使用其中第10位处的Asp残基被转变成Ala残基并且第840位处的His残基被转变成Ala残基的双重突变体(有时被称为“dSpCas9”)(参见例如上述的Nishimasu等,Cell.2014,其整体通过参考并入本文)。或者,在SaCas9的情况下,可以使用其中第10位处的Asp残基被转变成Ala残基并且第580位处的Asn残基被转变成Ala残基的双重突变体(SEQ ID NO:139),或其中第10位处的Asp残基被转变成Ala残基并且第557位处的His残基被转变成Ala残基的双重突变体(SEQ ID NO:140)(在后文中这些双重突变体中的任一者有时被称为“dSaCas9”)(参见例如上述的Friedland AE等,GenomeBiol.2015,其整体通过参考并入本文)。
此外,在本发明的一个实施方式中,作为dCas9,也可以使用通过将上述dCas9的一部分氨基酸序列修饰而获得的变体,所述变体与gRNA形成复合体并被召集到人类DMPK基因的表达调控区。此类变体的实例包括具有部分缺失的氨基酸序列的截短变体。在本发明的一个实施方式中,可以使用在整体通过参考并入本文的WO2019/235627和WO2020/085441中描述的变体作为dCas9。具体来说,也可以使用通过从其中第10位处的Asp残基被转变成Ala残基并且第580位处的Asn残基被转变成Ala残基的双重突变体dSaCas9缺失第721至第745位氨基酸而获得的dSaCas9(SEQ ID NO:141)或其中将所述缺失的部分用肽连接物替换的dSaCas9(例如其中将所述缺失部分用GGSGGS连接物(SEQ ID NO:142)替换的dSaCas9如SEQID NO:143所示)(在后文中这些双重突变体中的任一者有时被称为“dSaCas9[-25]”),或通过缺失作为上述双重突变体dSaCas9的第482至第648位氨基酸而获得的dSaCas9(SEQ IDNO:144)或其中将所述缺失的部分用肽连接物替换的dSaCas9(其中将所述缺失部分用GGSGGS连接物替换的dSaCas9如SEQ ID NO:145所示)。
上述dCpf1的实例包括但不限于源自于新凶手弗朗西丝菌(Francisellanovicida)的Cpf1(FnCpf1;PAM序列:TTN)、源自于氨基酸球菌属菌种(Acidaminococcussp.)的Cpf1(AsCpf1;PAM序列:TTTN)或源自于毛螺菌科(Lachnospiraceae)细菌的Cpf1(LbCpf1;PAM序列:TTTA、TCTA、TCCA或CCCA)等的核酸酶缺陷型变体(参见例如Zetsche B.等,Cell.2015年10月22日;163(3):759-71;Yamano T等,Cell.2016年5月5日;165(4):949-62;和Yamano T等,Mol Cell.2017年8月17日;67(4):633-45,它们整体通过参考并入本文)。例如,在FnCpf1的情况下,可以使用其中第917位处的Asp残基被转变成Ala残基并且第1006位处的Glu残基被转变成Ala残基的双重突变体(参见例如上述的Zetsche B等,Cell.2015,其整体通过参考并入本文)。在本发明的一个实施方式中,作为dCpf1,也可以使用通过将上述dCpf1的一部分氨基酸序列修饰而获得的变体,所述变体与gRNA形成复合体并被召集到人类DMPK基因的表达调控区。
在本发明的一个实施方式中,将dCas9用作所述核酸酶缺陷型CRISPR效应蛋白。在一个实施方式中,所述dCas9是dSaCas9,并且在特定实施方式中,dSaCas9是dSaCas9[-25]。
包含编码核酸酶缺陷型CRISPR效应蛋白的碱基序列的多核苷酸可以例如通过下述方法来克隆:在cDNA序列信息的基础上合成覆盖编码所述蛋白质的所需部分的区域的oligoDNA引物,并使用从产生所述蛋白质的细胞制备的总RNA或mRNA级分作为模板,通过PCR方法扩增所述多核苷酸。此外,包含编码核酸酶缺陷型CRISPR效应蛋白的碱基序列的多核苷酸可以通过下述方法获得:使用已知的定点突变方法在编码克隆的CRISPR效应蛋白的核苷酸序列中引入突变,以将对核酸酶活性来说重要的位点处的氨基酸残基(可以包括例如在SaCas9的情况下第10位处的Asp残基、第557位处的His残基和第580位处的Asn残基,在FnCpf1的情况下第917位处的Asp残基和第1006位处的Glu残基等,但不限于此)转变成其他氨基酸。
或者,包含编码核酸酶缺陷型CRISPR效应蛋白的碱基序列的多核苷酸可以在cDNA序列信息的基础上通过化学合成或化学合成与PCR法或Gibson组装法的组合来获得,并且还可以被进一步构建成经历密码子优化以成为适合于在人类中表达的密码子的碱基序列。
(3)转录阻遏物
在本发明中,通过与所述核酸酶缺陷型CRISPR效应蛋白融合的转录阻遏物的作用阻遏人类DMPK基因的表达。在本说明书中,“转录阻遏物”意味着具有阻遏人类DMPK基因的基因转录的能力的蛋白质或其保留了所述功能的肽片段。对在本发明中使用的转录阻遏物没有特别限制,只要它可以阻遏人类DMPK基因的表达即可。它包括例如Kruppel结合框(KRAB)、MBD2B、v-ErbA、SID(包括SID的链状态(SID4X))、MBD2、MBD3、DNMT家族(例如DNMT1、DNMT3A、DNMT3B)、Rb、MeCP2、ROM2、LSD1、AtHD2A、SET1、HDAC11、SETD8、EZH2、SUV39H1、PHF19、SALI、NUE、SUVR4、KYP、DIM5、HDAC8、SIRT3、SIRT6、MESOLO4、SET8、HST2、COBB、SET-TAF1B、NCOR、SIN3A、HDT1、NIPP1、HP1A、ERF阻遏物结构域(ERD)及其具有转录阻遏能力的变体、其融合体等。在本发明的一个实施方式中,使用KRAB作为所述转录阻遏物。
包含编码转录阻遏物的碱基序列的多核苷酸可以通过化学合成或化学合成与PCR法或Gibson组装法的组合来构建。此外,包含编码转录阻遏物的碱基序列的多核苷酸还可以被构建成具有适合于在人类中表达的密码子的密码子优化的DNA序列。
包含编码转录阻遏物与核酸酶缺陷型CRISPR效应蛋白的融合蛋白的碱基序列的多核苷酸,可以通过将编码所述CRISPR效应蛋白的碱基序列直接地或在添加编码连接物、NLS(核定位信号)(例如SEQ ID NO:189或SEQ ID NO:191所示的碱基序列)、标签等的碱基序列后连接到编码所述转录阻遏物的碱基序列来获得。在本发明中,可以将所述转录阻遏物与所述核酸酶缺陷型CRISPR效应蛋白的N-端或C-端融合。作为连接物,可以使用氨基酸数目为约2至50的连接物,并且其具体实例包括但不限于其中甘氨酸(G)和丝氨酸(S)被交替连接的G-S-G-S连接物等。在本发明的一个实施方式中,作为包含编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的多核苷酸,可以使用SEQ ID NO:151所示的碱基序列,其编码作为融合蛋白的SV40 NLS、dSaCas9、NLS和KRAB。
(4)向导RNA
在本发明中,可以通过向导RNA将核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白召集到人类DMPK基因的表达调控区。正如在上述“(1)定义”中描述的,向导RNA包含crRNA,并且所述crRNA与靶向序列的互补序列结合。crRNA可能不与靶向序列的互补序列完全互补,只要所述向导RNA可以将所述融合蛋白召集到靶区域即可,并且可以是其中缺失、替换、插入和/或添加了至少1至3个碱基的序列。
当使用dCas9作为所述核酸酶缺陷型CRISPR效应蛋白时,可以例如使用已公布的gRNA设计网站(CRISPR设计工具、CRISPR指导等)确定所述靶向序列。具体来说,从靶基因(即人类DMPK基因)及其临近基因的序列,列出PAM(例如在SaCas9的情况下为NNGRRT)与其3’-侧相邻的长度为约20个核苷酸的候选靶向序列,并且可以使用这些候选靶向序列中在人类基因组中具有少量脱靶位点的候选序列作为所述靶向序列。所述靶向序列的碱基长度为18至24个核苷酸长,优选为18至23个核苷酸长,更优选为18至22个核苷酸长。作为用于预测脱靶位点数目的初筛,大量生物信息学工具是已知且可公开获得的,并且可用于预测具有最低脱靶效应的靶向序列。其实例包括生物信息学工具例如Benchling(https://benchling.com)和COSMID(具有错配、插入和缺失的CRISPR脱靶位点)(可以在互联网上的https://crispr.bme.gatech.edu网站获得)。使用这些工具,可以概述与gRNA所靶向的碱基序列的相似性。当使用的gRNA设计软件不具有搜索靶基因组的脱靶位点的功能时,所述脱靶位点可以例如通过将所述靶基因组针对候选靶向序列的3’-侧上的8至12个核苷酸(具有被靶向核苷酸序列的高区分能力的种子序列)进行Blast搜索来搜索。
在本发明的一个实施方式中,在存在于人类19号染色体(Chr19)的GRCh38.p12位置中的区域中,靠近DMPK基因的转录起始点的区域:45,777,342-45,784,715可以是所述人类DMPK基因的表达调控区。正如在实施例中所示,本发明人发现通过靶向上述区域中的45,778,884-45,783,985区域(图3中的2区),可以调控人类DMPK基因的表达。因此,在本发明的一个实施方式中,所述靶向序列可以是存在于人类19号染色体(Chr19)的GRCh38.p12位置中的区域中的45,778,884-45,783,985区域中的连续的长度为18至24个核苷酸、优选地长度为18至23个核苷酸、更优选地长度为18至22个核苷酸的碱基序列。
此外,本发明人发现,存在于上述区域45,778,884-45,783,985中的SEQ ID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域,优选地作为设计所述用于阻遏DMPK基因的表达的靶向序列的区域。因此,在本发明的一个实施方式中,所述靶向序列可以是这些区域中的连续的长度为18至24个核苷酸、优选地长度为18至23个核苷酸、更优选地长度为18至22个核苷酸的碱基序列。人类DMPK基因的表达调控区中每个序列的位置描述在表1和图1中。
在本发明的一个实施方式中,所述靶向序列可以是存在于上述区域45,778,884-45,783,985中的SEQ ID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:66、SEQ IDNO:68、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的区域中的连续的长度为18至24个核苷酸、优选地长度为18至23个核苷酸、更优选地长度为18至22个核苷酸的碱基序列,其被认为显示出人类DMPK基因表达的不少于50%的降低。人类DMPK基因的表达调控区中每个序列的位置描述在表1和图1中。
在本发明的另一个实施方式中,所述靶向序列可以是存在于上述区域45,778,884-45,783,985中的SEQ ID NO:63、SEQ ID NO:136、SEQ ID NO:83、SEQ ID NO:99、SEQ IDNO:135、SEQ ID NO:109或SEQ ID NO:111所示的区域中的连续的长度为18至24个核苷酸、优选地长度为18至23个核苷酸、更优选地长度为18至22个核苷酸的碱基序列,其被认为显示出人类DMPK基因表达的不少于75%的降低。人类DMPK基因的表达调控区中每个序列的位置描述在表1和图1中。
在本发明的又一个实施方式中,所述靶向序列可以是SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQ IDNO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ ID NO:81、SEQ IDNO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ ID NO:91、SEQ IDNO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列。SEQ ID NO:43和44所示的碱基序列是被包含在SEQ ID NO:127所示的区域中的靶向序列。SEQ ID NO:62和63所示的碱基序列是被包含在SEQ ID NO:128所示的区域中的靶向序列。SEQ ID NO:66至68所示的碱基序列是被包含在SEQ ID NO:129所示的区域中的靶向序列。SEQ ID NO:70至73所示的碱基序列是被包含在SEQ ID NO:130所示的区域中的靶向序列。SEQ ID NO:80至83所示的碱基序列是被包含在SEQ ID NO:131所示的区域中的靶向序列。SEQ ID NO:85和86所示的碱基序列是被包含在SEQ ID NO:132所示的区域中的靶向序列。SEQ ID NO:95至100所示的碱基序列是被包含在SEQ ID NO:133所示的区域中的靶向序列。SEQ ID NO:103、105和106所示的碱基序列是被包含在SEQ ID NO:134所示的区域中的靶向序列。SEQ ID NO:105和106所示的碱基序列是被包含在SEQ ID NO:135所示的区域中的靶向序列。SEQ ID NO:70和71所示的碱基序列是被包含在SEQ ID NO:136所示的区域中的靶向序列,SEQ ID NO:103至112所示的碱基序列是被包含在SEQ ID NO:137所示的区域中的靶向序列。人类DMPK基因的表达调控区中每个序列的位置描述在表1和图1中。
在本发明的一个实施方式中,编码crRNA的碱基序列可以是与所述靶向序列相同的碱基序列。例如,当将SEQ ID NO:5所示的靶向序列(CCCAGTCGAGGCCAAAGAAGA)引入到细胞中作为编码crRNA的碱基序列时,从所述序列转录的crRNA是CCCAGUCGAGGCCAAAGAAGA(SEQ ID NO:146)并且与TCTTCTTTGGCCTCGACTGGG(SEQ ID NO:147)结合,后者是与SEQ IDNO:5所示的碱基序列互补的序列,并存在于人类DMPK基因的表达调控区中。在另一个实施方式中,可以使用其中缺失、替换、插入和/或添加了至少1至3个碱基的靶向序列的碱基序列作为编码crRNA的碱基序列,只要向导RNA可以将融合蛋白召集到所述靶区域即可。因此,在本发明的一个实施方式中,作为编码crRNA的碱基序列,可以使用SEQ ID NO:43、SEQ IDNO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQ IDNO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ ID NO:81、SEQ IDNO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ ID NO:91、SEQ IDNO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQ IDNO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ ID NO:81、SEQ IDNO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ ID NO:91、SEQ IDNO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列。在本发明的另一个实施方式中,作为编码crRNA的碱基序列,可以使用SEQ IDNO:63、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:105、SEQID NO:106、SEQ ID NO:109或SEQ ID NO:111所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:63、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:83、SEQID NO:99、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:109或SEQ ID NO:111所示的碱基序列。在本发明的又一个实施方式中,作为编码crRNA的碱基序列,可以使用SEQ ID NO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的碱基序列。在本发明的一个实施方式中,作为编码crRNA的碱基序列,可以使用SEQ IDNO:83所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:83所示的碱基序列。
在本发明的一个实施方式中,可以使用SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:70、SEQ IDNO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ IDNO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ IDNO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ IDNO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ IDNO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列作为编码crRNA的碱基序列,以分别产生包含SEQ ID NO:157、SEQ ID NO:158、SEQ ID NO:159、SEQID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQ ID NO:170、SEQID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQ ID NO:181、SEQID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的crRNA或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:157、SEQ ID NO:158、SEQ IDNO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQ IDNO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQ IDNO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的crRNA的gRNA。在本发明的另一个实施方式中,所述gRNA可以包含SEQ ID NO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ IDNO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ IDNO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ IDNO:185或SEQ ID NO:186所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:157、SEQ ID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列。在本发明的一个实施方式中,所述gRNA可以包含SEQ ID NO:161、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:171、SEQ ID NO:177、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:183或SEQ ID NO:184所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:161、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:171、SEQ ID NO:177、SEQ ID NO:180、SEQ ID NO:181、SEQID NO:183或SEQ ID NO:184所示的碱基序列。在本发明的另一个实施方式中,所述gRNA可以包含SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:164、SEQ ID NO:169、SEQ IDNO:171或SEQ ID NO:177所示的碱基序列。在本发明的又一个实施方式中,所述gRNA可以包含SEQ ID NO:171所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQID NO:171所示的碱基序列。
当使用dCpf1作为所述核酸酶缺陷型CRISPR效应蛋白时,编码gRNA的碱基序列可以被设计成编码在5’-端附连有特定RNA的crRNA的DNA序列。这种附连到crRNA的5’-端的RNA和编码所述RNA的DNA序列可以由本领域普通技术人员根据所使用的dCpf1适合地选择。例如,当使用dFnCpf1时,可以使用其中将SEQ ID NO:148AATTTCTACTGTTGTAGAT附连到靶向序列的5’-侧的碱基序列作为编码gRNA的碱基序列(当转录成RNA时,下划线部分的序列形成碱基对,从而形成茎环结构)。所述待添加到5’-端的序列可以是其中缺失、替换、插入和/或添加了至少1至6个碱基的通常用于各种不同Cpf1蛋白的序列,只要gRNA在转录后可以将融合蛋白召集到表达调控区即可。
当使用dCas9作为所述核酸酶缺陷型CRISPR效应蛋白时,编码gRNA的碱基序列可以被设计成其中将编码已知tracrRNA的DNA序列连接到编码crRNA的DNA序列的3’-端的DNA序列。这种tracrRNA和编码所述tracrRNA的DNA序列可以由本领域普通技术人员根据所使用的dCas9适合地选择。例如,当使用dSaCas9时,将SEQ ID NO:149所示的碱基序列用作所述编码tracrRNA的DNA序列。所述编码tracrRNA的DNA序列可以是其中缺失、替换、插入和/或添加了至少1至6个碱基的编码通常用于各种不同Cas9蛋白的tracrRNA的碱基序列,只要gRNA在转录后可以将融合蛋白召集到表达调控区即可。
包含编码以这种方式设计的gRNA的碱基序列的多核苷酸可以使用已知的DNA合成方法化学合成。
在本发明的另一个实施方式中,本发明的多核苷酸可以包含至少两个不同的碱基序列,所述碱基序列分别编码靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ IDNO:46、SEQ ID NO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ IDNO:119所示的区域中长度为18至24个核苷酸的连续区域的gRNA。例如,所述多核苷酸可以包含至少两个不同的分别编码向导RNA的碱基序列,其中所述至少两个不同的碱基序列选自包含SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ IDNO:66、SEQ ID NO:68、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ IDNO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ IDNO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的序列的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ IDNO:66、SEQ ID NO:68、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ IDNO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ IDNO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列。在本发明的一个实施方式中,所述多核苷酸可以包含至少两个不同的分别编码向导RNA的碱基序列,其中所述至少两个不同的碱基序列选自包含SEQ ID NO:63、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:83、SEQ ID NO:99、SEQ IDNO:105、SEQ ID NO:106、SEQ ID NO:109或SEQ ID NO:111所示的序列的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:63、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:109或SEQ IDNO:111所示的碱基序列。在本发明的一个实施方式中,所述多核苷酸可以包含至少两个不同的分别编码向导RNA的碱基序列,其中所述至少两个不同的碱基序列选自包含SEQ IDNO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的序列的碱基序列,或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:70、SEQ ID NO:81、SEQ ID NO:83或SEQID NO:99所示的碱基序列。
(5)启动子序列
在本发明的一个实施方式中,启动子序列可以被可操作连接到每个编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列和/或编码gRNA的碱基序列的上游。对所述可能连接的启动子没有特别限制,只要它在靶细胞中显示出启动子活性即可。可能连接到编码gRNA的碱基序列上游的启动子序列的实例包括但不限于作为pol III启动子的U6启动子、SNR6启动子、SNR52启动子、SCR1启动子、RPR1启动子、U3启动子、H1启动子和tRNA启动子等。在本发明的一个实施方式中,可以使用U6启动子作为用于所述编码向导RNA的碱基序列的启动子序列。在本发明的一个实施方式中,当多核苷酸包含分别编码向导RNA的两个或更多个碱基序列时,可以将单一启动子序列可操作连接到所述两个或更多个碱基序列的上游。在另一个实施方式中,当多核苷酸包含分别编码向导RNA的两个或更多个碱基序列时,可以将启动子序列可操作连接到所述两个或更多个碱基序列中的每一者的上游,其中所述可操作连接到每个碱基序列的启动子序列可以是相同或不同的。
作为上述可能连接到编码融合蛋白的碱基序列上游的启动子序列,可以使用遍在启动子或肌肉特异性启动子。所述遍在启动子的实例包括但不限于EF-1α启动子、EFS启动子、CMV(巨细胞病毒)启动子、hTERT启动子、SRα启动子、SV40启动子、LTR启动子、CAG启动子、RSV(劳斯肉瘤病毒)启动子等。在本发明的一个实施方式中,可以使用EFS启动子、CMV启动子或CAG启动子作为所述遍在启动子。所述肌肉特异性启动子的实例包括但不限于CK8启动子、CK6启动子、CK1启动子、CK7启动子、CK9启动子、心肌肌钙蛋白C启动子、α-肌动蛋白启动子、肌球蛋白重链激酶(MHCK)启动子(例如MHCK7启动子等)、MHC启动子、肌球蛋白轻链2A启动子、肌营养不良蛋白启动子、肌肉肌酸激酶(MCK)启动子、dMCK启动子、tMCK启动子、enh348 MCK启动子、合成C5-12(Syn)启动子、Myf5启动子、MLC1/3f启动子、MLC-2启动子、MYOD启动子、Myog启动子、Pax7启动子、Des启动子、cTnC启动子等(对于肌肉特异性启动子的详情,参见US2011/0212529A1,McCarthy JJ等,Skeletal Muscle.2012年5月;2(1):8,Wang B.等,Gene Ther.2008年11月;15(22):1489-99等,它们整体通过参考并入本文)。在本发明的一个实施方式中,可以使用CK8启动子、肌球蛋白重链激酶(MHCK)启动子、肌肉肌酸激酶(MCK)启动子、合成C5-12(Syn)启动子或Des启动子作为所述肌肉特异性启动子。在本发明的一个实施方式中,可以使用CK8启动子作为所述肌肉特异性启动子。上述启动子可以具有任何修饰和/或改变,只要它在靶细胞中具有启动子活性即可。
在本发明的一个实施方式中,使用U6作为用于编码向导RNA的碱基序列的启动子,并且可以使用CK8启动子作为用于编码所述融合蛋白的碱基序列的启动子序列。具体来说,对于U6启动子来说,可以使用下述碱基序列:(i)SEQ ID NO:155所示的碱基序列,(ii)其中缺失、替换、插入和/或添加了1个或几个(例如2、3、4、5个或更多个)碱基并且在靶细胞中具有启动子活性的SEQ ID NO:155所示的碱基序列,或(iii)与SEQ ID NO:155所示的碱基序列具有不低于90%(例如90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或以上)同一性并且在靶细胞中显示出启动子活性的碱基序列。对于CK8启动子来说,可以使用下述碱基序列:(i)SEQ ID NO:187所示的碱基序列,(ii)其中缺失、替换、插入和/或添加了1个或几个(例如2、3、4、5个或更多个)碱基并且在靶细胞中具有启动子活性的SEQ ID NO:187所示的碱基序列,或(iii)与SEQ ID NO:187所示的碱基序列具有不低于90%(例如90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或以上)同一性并且在靶细胞中显示出启动子活性的碱基序列。
(6)其他碱基序列
此外,除了上述序列之外本发明的多核苷酸还可以包含已知序列例如多腺苷化(polyA)信号、Kozak共有序列等,用于提高由编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的转录产生的mRNA的翻译效率。此外,本发明的多核苷酸可以包含编码连接物序列的碱基序列、编码NLS的碱基序列和/或编码标签的碱基序列。
(7)本发明的示例性实施方式
在本发明的一个实施方式中,提供了一种多核苷酸,其包含:
编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,
用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的启动子序列,
分别编码向导RNA的一个或两个碱基序列,其中所述一个或两个碱基序列选自包含SEQ ID NO:63、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:109或SEQ ID NO:111所示的序列的碱基序列或包含其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:63、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:109或SEQ IDNO:111所示的序列的碱基序列,和
用于所述编码gRNA的碱基序列的启动子序列,
其中所述核酸酶缺陷型CRISPR效应蛋白是dSaCas9或dSaCas9[-25],
其中所述转录阻遏物选自KRAB、MeCP2、SIN3A、HDT1、MBD2B、NIPP1和HP1A,
其中所述用于编码融合蛋白的碱基序列的启动子序列选自EFS启动子、CMV启动子、CAG启动子、CK8启动子、肌球蛋白重链激酶(MHCK)启动子、肌肉肌酸激酶(MCK)启动子、合成C5-12(Syn)启动子和Des启动子,并且
其中所述用于编码gRNA的碱基序列的启动子序列选自U6启动子、SNR6启动子、SNR52启动子、SCR1启动子、RPR1启动子、U3启动子和H1启动子。
在本发明的一个实施方式中,提供了一种多核苷酸,其包含:
编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,
用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的CK8启动子,
分别编码向导RNA的一个或两个碱基序列,其中所述一个或两个碱基序列选自包含SEQ ID NO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的序列的碱基序列或包含其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:70、SEQ ID NO:81、SEQ IDNO:83或SEQ ID NO:99所示的序列的碱基序列,和
用于所述编码向导RNA的碱基序列的U6启动子,
其中所述核酸酶缺陷型CRISPR效应蛋白是dSaCas9,并且
其中所述转录阻遏物是KRAB。
在本发明的一个实施方式中,提供了一种多核苷酸,其包含:
编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,
用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的CK8启动子,
编码向导RNA的碱基序列,所述编码向导RNA的碱基序列包含SEQ ID NO:83所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:83所示的碱基序列,和
用于所述编码向导RNA的碱基序列的U6启动子,
其中所述核酸酶缺陷型CRISPR效应蛋白是dSaCas9,并且其中所述转录阻遏物是KRAB。
在本发明的多核苷酸的实施方式中,所述多核苷酸以从5’末端起的顺序包含(i)所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列和(ii)所述编码gRNA的碱基序列。在另一个实施方式中,所述多核苷酸以从5’末端起的顺序包含(ii)所述编码gRNA的碱基序列和(i)所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列。
2.载体
本发明提供了一种包含本发明的多核苷酸的载体(在后文中有时被称为“本发明的载体”)。本发明的载体可以是质粒载体或病毒载体。
当本发明的载体是质粒载体时,对所使用的质粒载体没有特别限制,并且可以是任何质粒载体例如克隆质粒载体和表达质粒载体。所述质粒载体通过使用已知方法将本发明的多核苷酸插入到质粒载体中来制备。
当本发明的载体是病毒载体时,所使用的病毒载体的实例包括但不限于腺相关病毒(AAV)载体、腺病毒载体、慢病毒载体、反转录病毒载体、仙台病毒载体等。在本说明书中,“病毒载体”还包括其衍生物。考虑到在基因疗法中的使用,优选地使用AAV载体,因为它可以长时间表达转入基因,并且它源自于非致病性病毒并具有高安全性。
包含本发明的多核苷酸的病毒载体可以通过已知方法来制备。简单来说,制备其中已插入本发明的多核苷酸的用于病毒表达的质粒载体,将所述载体转染到适合的宿主细胞中以允许包含本发明的多核苷酸的病毒载体的瞬时生产,并收集所述病毒载体。
在本发明的一个实施方式中,当使用AAV载体时,对所述AAV载体的血清型没有特别限制,只要可以在所述对象中阻遏人类DMPK基因的表达即可,并且可以使用AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAVrh.10等中的任一者(对于AAV的各种不同血清型,参见例如WO 2005/033321和EP2341068(A1),它们整体通过参考并入本文)。在本发明的另一个实施方式中,也可以使用从猴分离的AAV(例如AAVrh74(参见Mol Ther.2017年4月5日;25(4):855-869等,其整体通过参考并入本文)、从猪分离的AAV(例如AAVpo1(例如参见Gene Ther.2009年11月;16(11):1320-8,其整体通过参考并入本文))、作为AAV1、AAV2、AAV8和AAV9的预测祖先的Anc 80(参见Cell Rep.2015年8月11日;12(6):1056-68,其整体通过参考并入本文)等,只要可以在所述对象中阻遏人类DMPK基因的表达即可。AAV变体的实例包括但不限于具有修饰的衣壳的新血清型(例如WO 2012/057363,其整体通过参考并入本文)等。例如,在本发明的一个实施方式中,可以使用对肌肉细胞具有提高的感染力的具有修饰的衣壳的新血清型,例如AAV587MTP、AAV588MTP、AAV-B1、AAVM41等(参见Yu等,GeneTher.2009年8月;16(8):953-62,Choudhury等,Mol Ther.2016年8月;24(7):1247-57,和Yang等,Proc Natl Acad Sci U S A.2009年3月10日;106(10):3946-51,它们整体通过参考并入本文)。
在制备AAV载体时,可以使用已知的方法例如(1)使用质粒的方法,(2)使用杆状病毒的方法,(3)使用单纯性疱疹病毒的方法,(4)使用腺病毒的方法或(5)使用酵母的方法(例如Appl Microbiol Biotechnol.2018;102(3):1045–1054等,其整体通过参考并入本文)。例如,当通过使用质粒的方法制备AAV载体时,首先制备在野生型AAV基因组序列的两个末端处包含反向末端重复序列(ITR)并插入本发明的多核苷酸以代替编码Rep蛋白和衣壳蛋白的DNA的载体质粒。另一方面,将对于形成病毒粒子来说必需的编码Rep蛋白和衣壳蛋白的DNA插入到其他质粒中。此外,制备包含对于AAV的增殖来说必需的负责腺病毒的辅助作用的基因(E1A、E1B、E2A、VA和E4orf6)的质粒作为腺病毒辅助质粒。这三种质粒在宿主细胞中的共转染导致在所述细胞中产生重组AAV(即AAV载体)。作为宿主细胞,优选地使用能够提供负责上述辅助功能的基因的一部分基因产物(蛋白质)的细胞(例如293细胞等)。当使用此类细胞时,在上述腺病毒辅助质粒中不必携带编码可以由所述宿主细胞提供的蛋白质的基因。所述产生的AAV载体存在于培养基和/或细胞中。因此,通过在使用冻融等破坏所述宿主细胞后从培养基收集所述病毒,然后通过使用氯化铯的密度梯度超速离心方法、柱方法等对所述病毒级分进行分离和纯化,来制备所需AAV载体。
AAV载体在安全性、基因转导效率等方面具有极大优势,并被用于基因疗法。然而,已知可以被包装的多核苷酸的尺寸受到限制。例如,在本发明的一个实施方式中,包括包含编码dSaCas9与KRAB的融合蛋白的碱基序列、编码靶向人类DMPK基因的表达调控区的gRNA的碱基序列和作为启动子序列的CK8启动子序列和U6启动子序列以及ITR区的多核苷酸的碱基长度在内的全长为约4.9kb,并且所述多核苷酸可以被携带在单一AAV载体中。
3.用于治疗或预防DM1的药物组合物
本发明还提供了一种包含本发明的多核苷酸或本发明的载体的药物组合物(在后文中有时被称为“本发明的药物组合物”)。本发明的药物组合物可用于治疗或预防DM1。
本发明的药物组合物包含本发明的多核苷酸或本发明的载体作为活性成分,并且可以被制备成包含此类活性成分(即本发明的多核苷酸或本发明的载体)以及通常可药用载体的制剂。
在一个实施方式中,本发明的药物组合物被肠胃外给药,并且可以局部或系统性给药。本发明的药物组合物可以通过例如但不限于静脉内给药、动脉内给药、皮下给药、腹膜内给药或肌肉内给药来给药。
对给药到对象的本发明的药物组合物的剂量没有特别限制,只要它是对所述治疗和/或预防来说有效的量即可。它可以根据活性成分、剂型、对象的年龄和体重、给药日程安排、给药方法等适合地优化。
在本发明的一个实施方式中,本发明的药物组合物不仅可以给药到患有DM1的对象,而且可以预防性给药到基于遗传背景分析等可能在将来发展出DM1的对象。在本说明书中,术语“治疗”除了疾病的治愈之外还包括疾病的缓解。此外,术语“预防”除了预防疾病的发作之外还可以包括延迟疾病的发作。本发明的药物组合物也可以被称为“本发明的药剂”等。
4.治疗或预防DM1的方法
本发明还提供了一种治疗或预防DM1的方法,所述方法包括向需要的对象给药本发明的多核苷酸或本发明的载体(在后文中有时被称为“本发明的方法”)。此外,本发明包括本发明的多核苷酸或本发明的载体,其用于治疗或预防DM1。此外,本发明包括本发明的多核苷酸或本发明的载体的用途,其用于制造治疗或预防DM1的药物组合物。
本发明的方法可以通过将上述本发明的药物组合物给药到患有DM1的对象来实践,并且剂量、给药途径、对象等与上文提到的相同。
在使用本发明的方法的治疗开始之前以及治疗后的任何时间点,可以进行症状的测量以确定所述对象对所述治疗的响应。
本发明的方法可以改善DM1的任何症状例如骨骼肌和/或心肌的功能,但不限于此。对功能待改善的肌肉或组织没有特别限制,并且可以提到的是任何肌肉和组织和肌肉群。
5.核糖核蛋白
本发明提供了一种包含下述组分的核糖核蛋白(在后文中有时被称为“本发明的RNP”):
(c)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白,和
(d)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA。
对于包含在本发明的RNP中的核酸酶缺陷型CRISPR效应蛋白、转录阻遏物和向导RNA来说,可以使用在上述“1.多核苷酸”章节中详细解释的核酸酶缺陷型CRISPR效应蛋白、转录阻遏物和向导RNA。包含在本发明的RNP中的核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白,可以通过例如将编码所述融合蛋白的多核苷酸引入到细胞、细菌或其他生物体中以允许表达或通过使用所述多核苷酸的体外翻译系统来产生。此外,包含在本发明的RNP中的向导RNA可以通过例如化学合成或使用编码所述向导RNA的多核苷酸的体外转录系统来产生。将由此制备的融合蛋白和向导RNA混合,以制备本发明的RNP。如有必要,可以混合其他物质例如金粒子。为了将本发明的RNP直接递送到靶细胞、组织等,可以通过已知方法将所述RNP包封在脂质纳米粒子(LNP)中或装载在细胞外囊泡中。本发明的RNP可以通过已知方法引入到靶细胞、组织等中。例如,对于在LNP中的包封和引入方法而言,可以参考Lee K.等,Nat Biomed Eng.2017;1:889–901,WO 2016/153012等,所述文献整体通过参考并入本文。
在本发明的一个实施方式中,包含在本发明的RNP中的向导RNA靶向存在于人类19号染色体(Chr 19)的GRCh38.p12位置中的45,778,884-45,783,985区域中的连续的18至24个核苷酸长度,优选为18至23个核苷酸长度,更优选为18至22个核苷酸长度。
在一个实施方式中,所述向导RNA靶向SEQ ID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中连续的长度为18至24个核苷酸、优选地长度为18至23个核苷酸、更优选地长度为18至22个核苷酸的碱基序列。在另一个实施方式中,所述向导RNA靶向SEQ ID NO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:130、SEQ ID NO:131、SEQ IDNO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的区域中连续的长度为18至24个核苷酸、优选地长度为18至23个核苷酸、更优选地长度为18至22个核苷酸的碱基序列。在又一个实施方式中,所述向导RNA靶向SEQ ID NO:63、SEQ ID NO:136、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:135、SEQ ID NO:109或SEQID NO:111所示的区域中连续的长度为18至24个核苷酸、优选地长度为18至23个核苷酸、更优选地长度为18至22个核苷酸的碱基序列。在又一个实施方式中,所述向导RNA靶向包含SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ IDNO:80、SEQ ID NO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ IDNO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的序列的全部或一部分的区域。在本发明的另一个实施方式中,所述向导RNA靶向SEQ ID NO:63、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:109或SEQ ID NO:111所示的序列的全部或一部分的区域。在本发明的又一个实施方式中,所述向导RNA靶向包含SEQ ID NO:70、SEQID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的序列的全部或一部分的区域。在本发明的一个实施方式中,所述向导RNA靶向包含SEQ ID NO:83所示的序列的全部或一部分的区域。
在本发明的一个实施方式中,可以使用包含SEQ ID NO:157、SEQ ID NO:158、SEQID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQ ID NO:169、SEQID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQ ID NO:180、SEQID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:157、SEQID NO:158、SEQ ID NO:159、SEQ ID NO:160、SEQ ID NO:161、SEQ ID NO:162、SEQ ID NO:163、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:166、SEQ ID NO:167、SEQ ID NO:168、SEQID NO:169、SEQ ID NO:170、SEQ ID NO:171、SEQ ID NO:172、SEQ ID NO:173、SEQ ID NO:174、SEQ ID NO:175、SEQ ID NO:176、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:179、SEQID NO:180、SEQ ID NO:181、SEQ ID NO:182、SEQ ID NO:183、SEQ ID NO:184、SEQ ID NO:185或SEQ ID NO:186所示的碱基序列的向导RNA。在本发明的一个实施方式中,可以使用包含SEQ ID NO:161、SEQ ID NO:164、SEQ ID NO:165、SEQ ID NO:171、SEQ ID NO:177、SEQID NO:180、SEQ ID NO:181、SEQ ID NO:183或SEQ ID NO:184所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:161、SEQ ID NO:164、SEQ ID NO:165、SEQID NO:171、SEQ ID NO:177、SEQ ID NO:180、SEQ ID NO:181、SEQ ID NO:183或SEQ ID NO:184所示的碱基序列的向导RNA。在本发明的另一个实施方式中,可以使用包含SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ ID NO:177所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:164、SEQ ID NO:169、SEQ ID NO:171或SEQ IDNO:177所示的碱基序列的向导RNA。在本发明的又一个实施方式中,可以使用包含SEQ IDNO:171所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:171所示的碱基序列的向导RNA。
6.其他
本发明还提供了一种用于阻遏人类DMPK基因的表达的包含下述组分的组合物或药剂盒:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
本发明还提供了一种治疗或预防1型强直性肌营养不良症的方法,所述方法包括给药下述(e)和(f)的步骤:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸。
本发明还提供了下述(e)和(f)的用途:
(e)核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白或编码所述融合蛋白的多核苷酸,和
(f)靶向人类DMPK基因的表达调控区中SEQ ID NO:127、SEQ ID NO:46、SEQ IDNO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ ID NO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域的向导RNA或编码所述向导RNA的多核苷酸,
其用于制造治疗或预防DM1的药物组合物。
对于本发明中的核酸酶缺陷型CRISPR效应蛋白、转录阻遏物、向导RNA以及编码它们的多核苷酸和它们被携带在其中的载体而言,可以使用在上述“1.多核苷酸”、“2.载体”和“5.核糖核蛋白”章节中详细解释的那些。上述(e)和(f)的剂量、给药途径、对象、制剂等,与在“3.用于治疗或预防DM1的药物组合物”章节中解释的那些相同。
在下面的示例性实施方式的描述过程中,本发明的其他特点将变得显而易见,提供所述示例性实施方式是为了说明本发明,并且不打算限制本发明。
实施例
实施例1.使用iCM和iDM细胞筛选用于人类DMPK基因的gRNA(1)实验方法
DMPK靶向序列的选择
对人类DMPK基因的启动子区附近大约7.4kb的序列(Chr19:GRCh38.p12;45,777,342-45,784,715)进行检索,以寻找可以被与gRNA复合的核酸酶缺陷型SaCas9(D10A和N580A突变体;dSaCas9(SEQ ID NO:139))靶向的序列,其在本文中被定义为靶向序列。靶向序列最初由与具有序列NNGRRT的前间区序列邻近基序(PAM)相邻的19-21个核苷酸的区段(5’-19-21nt靶向序列-NNGRRT-3’)指定,并进行过滤以便只包括那些与食蟹猴(Macacafascicularis)基因组的相应区域具有完美匹配(靶向序列和PAM序列)的区段(在表1中列为“TRUE”)。还选择了另外的21个核苷酸的靶向序列,它们将RNP引导到在由ENCODE项目策划的DNase-Seq实验中表现出高的DNA酶敏感性的区域(The ENCODE Project Consortium,Nature.2012年9月6日;489(7414):57-74;https://www.encodeproject.org)。
慢病毒转移质粒(pED162)的构建
pLentiCRISPR v2购自Genscript(https://www.genscript.com)并做出了下述修改:将SpCas9 gRNA支架序列用SaCas9 gRNA支架序列(SEQ ID NO:150)代替;将SpCas9替换为融合到Krüppel结合框转录阻遏结构域(KRAB)的被两个NLS夹在中间的dSaCas9(SV40NLS-dSaCas9-NLS-KRAB[SEQ ID NO:151(DNA)和152(蛋白质)]);并将puroR表达盒用blastR表达盒[SEQ ID NO:153(DNA)和SEQ ID NO:154(蛋白质)]代替。dSaCas9在其N-端(氨基酸序列由SEQ ID NO:188示出,DNA序列由SEQ ID NO:189示出)和C-端(氨基酸序列由SEQ ID NO:190示出,DNA序列由SEQ ID NO:191示出)附连有两个核定位信号(NLS),以确保效应分子向细胞核的高效定位。当定位到启动子时,KRAB可以通过抑制转录来阻遏基因表达(Gilbert LA,等,Cell,2013年7月18日;154(2):442-51)。将KRAB连接到dSaCas9(D10A和N580A突变体)的C-端,其在后文中被称为dSaCas9-KRAB,并正如由靶向序列引导的靶向到人类DMPK启动子区(图1)。产生的质粒被命名为pED162。
gRNA克隆
将3个非靶向性对照靶向序列(表1,SEQ ID NO:1至3)和123个靶向序列(表1,SEQID NO:4至126)克隆到pED162中。由Integrated DNA Technologies以下述格式合成正向和反向寡核苷酸:正向:5’CACC(G)-19-21个碱基对的靶向序列-3’;和反向:5’AAAC–19-21个碱基对的反向互补靶向序列-(C)-3’,其中如果所述靶不始于G,则添加括号中的碱基。将寡核苷酸以100μM重悬浮在Tris-EDTA缓冲液(pH8.0)中。将1.5μl每种互补的寡核苷酸在NE缓冲液3.1(New England Biolabs(NEB)#B7203S)中合并在50μl反应中。将所述反应在1L H2O中加热至95℃并允许其冷却至25℃,由此将具有与克隆到pED162相容的粘性末端突出端的寡核苷酸退火。将退火的寡核苷酸与已用BsmBI消化并凝胶纯化的慢病毒转移质粒pED162合并,并用T4 DNA连接酶(NEB#M0202S)按照制造商的方案连接。按照制造商的方案将2μl连接反应转化到稳定感受态细胞(NEB#C3040I)。所得到的构建物通过U6启动子(SEQID NO:155)驱动包含由各个靶向序列编码的crRNA并在其3’末端融合有tracrRNA(GUUUUAGUACUCUGGAAACAGAAUCUACUAAAACAAGGCAAAAUGCCGUGUUUAUCACGUCAACUUGUUGGCGAGAUUUUUU)(SEQ ID NO:156)的sgRNA的表达,所述tracrRNA由添加有U6聚合酶的终止信号TTTTTT的SaCas9 gRNA支架序列编码。
慢病毒产生
将Lenti-Pac 293Ta细胞系(Genecopoeia#LT008)以0.8-1.0x106个细胞/孔接种在6孔细胞培养板(VWR#10062-892)中的2ml生长培养基(增补有10%FBS和2mM新鲜的L-谷氨酰胺、1mM丙酮酸钠和MEM非必需氨基酸的DMEM培养基(Thermo Fisher#11140050))中,并在37℃/5%CO2温育24小时。第二天,按照制造商的方案,使用1.5μg包装质粒混合物[1μg包装质粒(pCMV delta R8.2;addgene质粒号12263)和0.5μg包膜表达质粒(pCMV-VSV-G;addgene质粒号8454)]和1μg含有编码dSaCas9-KRAB和所指示的sgRNA的序列的转移质粒pED162,建立TransIT-转染反应(Mirus Bio#MIR6700)。在转染后48小时,通过将培养基上清液通过0.45μm PES滤膜(VWR#10218-488)来收获慢病毒。
iCM和iDM细胞的转导
永生化的非DM对照(对照)成肌细胞(被称为iCM)和永生化的DM1成肌细胞(被称为iDM)从Institut de Myologie获得,所述机构通过在整体通过参考并入本文的Dis ModelMech.2017年4月1日;10(4):487-497中描述的方法建立这些细胞系。对于转导来说,将细胞以0.05×106个细胞/孔接种在12孔细胞培养板(VWR#10062-894)中的1ml含有生长培养基[PromoCell骨骼肌细胞生长培养基;零件编号:C-23160(注:培养基增补有20%FBS而不是试剂盒指导的5%,和30μg/ml庆大霉素S)]的培养基中,并在37℃/5%CO2下温育24小时。第二天,将所述培养基更换为1ml增补有10μg/ml聚凝胺(Sigma#TR-1003-G)的生长培养基,并向每个孔添加对应于每种sgRNA的0.3ml慢病毒上清液(参见上文),所述sgRNA包含由各个靶向序列(表1)编码并与tracrRNA融合的crRNA。将细胞与慢病毒温育48小时,然后除去病毒培养基并更换为选择培养基[增补有10μg/ml杀稻瘟菌素(Thermo Fisher#A1113903)的生长培养基]。在选择培养基中温育48小时后,将三分之一的细胞(从12孔板)转移到新的孔中的生长培养基中。在允许细胞接种24小时后,将生长培养基更换为选择培养基。在选择培养基中培养48小时后,收获细胞并使用96试剂盒(Qiagen#74182)按照制造商的指导提取RNA。
基因表达分析
对于基因表达分析来说,按照High-Capacity cDNA反转录试剂盒(ThermoFisher#4368813)的流程在20μl体积中从0.2μg总RNA产生cDNA。将cDNA稀释10倍,并使用TaqmanTM Fast Advanced主混合物(Thermo Fisher#4444557)按照制造商的方案进行分析。Taqman探针(DMPK:测定Id Hs01094336_m1 FAM;HPRT:测定Id Hs99999909_m1 VIC_PL)从Thermo Fisher获得。基于Taqman探针的实时PCR反应通过QuantStudio 5实时PCR系统,按照Taqman Fast Advanced主混合物方案的指导进行处理和分析。
数据分析
对于每个样品和三个对照来说,通过从来自于DMPK探针的3个技术平行样的平均Ct值中减去来自于HPRT探针的3个技术平行样的平均Ct值(平均Ct DMPK-平均Ct HPRT)来计算ΔCt值。使用公式2-(ΔCt)为每个样品确定表达值。然后对于每个实验来说将样品表达值(表1;SEQ ID NO:4-126)归一化到3个对照表达值(表1;SEQ ID NO:1至3)的平均值,来确定每个样品的相对DMPK表达。分析每个细胞系的两个生物学平行样,并计算来自于所有实验的平均值(表1)。
(2)结果
DMPK基因表达被RNP的阻遏
产生了将用于dSaCas9-KRAB和针对每个靶向序列的sgRNA的表达盒递送到iCM和iDM细胞的慢病毒。对转导的细胞进行杀稻瘟菌素抗性的选择,并使用Taqman测定法对DMPK表达进行定量(表1)。将来自于每个样品的表达值归一化到用对照sgRNA(表1;SEQ ID NO:1、2和3)转导的细胞中DMPK表达的平均值。计算iCM和iDM细胞系的两份平行样之间的平均表达水平(表1,所有DMPK的平均值,以及图2)。
表1用于筛选DMPK基因的表达调控区的靶向序列
表1-1
表1-2
表1-3
表1-4
表1-5
在表1中,“坐标”指示了SEQ ID NO:4-126所示的每个序列的5’末端的坐标。
30个靶向序列显示出不少于50%的DMPK表达降低(SEQ ID NO:43、44、46、62、63、66、68、70、71、72、73、80、81、82、83、85、86、88、91、96、99、100、103、105、106、108、109、111、117和119),9个靶向序列显示出不少于75%的DMPK表达降低(SEQ ID NO:63、70、71、83、99、105、106、109和111),并且1个靶向序列显示出不少于80%的DMPK表达降低(SEQ ID NO:109)。
在上述系统抑制DMPK的表达的可能性的基础上对区域进行鉴定和表征。在1区(图3;Chr19:GRCh38.p12;45,777,342-45,778,884)中,我们发现靶向序列在调节DMPK的表达方面无效。然而,在2区(图3;GRCh38.p12;45,778,884-45,783,985)中,靶向dSaCas9-KRAB能够抑制DMPK表达。正如预期,这个区域涵盖了DMPK启动子和转录起始位点,表明靶向这个区域对DMPK表达具有最大影响。最后,3区(图3;Chr19:GRCh38.p12;45,783,985-45,784,715)对DMPK表达具有较少影响,并且更远离DMPK启动子区。
实施例2腺相关病毒(AAV)生产
(1)实验方法
用于dSaCas9-KRAB:gRNA的递送和表达以及AAV的生产的质粒的构建
pAAV-CMV购自Takara(#6230),并且EFS启动子序列(SEQ ID NO:204)和带有额外的末端终止密码子的SV40NLS-dSaCas9-NLS-KRAB(SEQ ID NO:151)[SEQ ID NO:200(DNA)和SEQ ID NO:152(蛋白质)]从pED162(参见实施例1)亚克隆。b珠蛋白polyA序列(SEQ IDNO:201)、U6启动子序列(SEQ ID NO:202)和SaCas9 gRNA支架序列(SEQ ID NO:150)从pED0001(SEQ ID NO:203)亚克隆,因此代替了ITR之间编码pAAV-CMV的所有功能性组分(即CMV启动子、β-珠蛋白内含子、MCS和hGH polyA)的序列。最后,通过限制性克隆(XhoI和AgeI)将EFS启动子用CK8启动子(SEQ ID NO:187)代替,得到质粒pED148。SEQ ID NO:83、70、81或99所示的靶向序列通过将pED148用BsaI消化,由此产生与退火的合成寡核苷酸相容的突出端来克隆。合成寡核苷酸被设计成使得正向引物在5’末端具有CACC(G)序列[5’CACC-(G)-靶向序列–3’],并且反向引物在5’末端含有额外的AAAC序列[5’AAAC–反向互补的靶向序列–(C)-3’]。将额外的G添加到靶向序列的起点,以增强从U6启动子的表达。产生的质粒分别被命名为pED148-h695(包含SEQ ID NO:83所示的靶向序列)、pED148-h245(包含SEQ ID NO:70所示的靶向序列)、pED148-h257(包含SEQ ID NO:81所示的靶向序列)和pED148-h269(包含SEQ ID NO:99所示的靶向序列)。
腺相关病毒(AAV)的生产
腺相关病毒血清型9(AAV9)粒子使用以每个Hyperflask(Corning#10030)0.86×107个细胞的密度接种并在增补有10%FBS的DMEM培养基(Sigma#D5796)中培养的293T细胞(ATCC#CRL-3216)产生。在接种后4天,将培养基更换成增补有2%FBS和63mM HEPES(Gibco#15630-080)的DMEM培养基。pRC9质粒如下所述来构建:将AAV9衣壳序列(参见JP5054975B)亚克隆到pRC2-mi342载体(Takara#6230)中,代替AAV2衣壳序列。每个Hyperflask使用388μl体外DNA转染试剂(Polyplus#115-010),将细胞用135μg pRC9质粒、121μg包含在无辅助载体系统中的pHelper载体(Takara#6230)和133μg pED148-h695之一转染。3天后,向Hyperflask添加0.2%TritonX-100并收获细胞。
在收获后,使用筒式过滤器(GE Healthcare#KGF-A-0506GG、KMP-HC9206GG)将其上清液和细胞裂解液澄清。在澄清后,使用750kD的XamplerTM超滤筒(GE Healthcare#UFP-750-C-6MA),使用中空纤维将它用切向流过滤进行超滤。在减小体积后,对样品进行亲和层析(POROSTM CaptureSelectTM AAVX亲和树脂(ThermoFisher Scientific#A36739))以纯化AAV。在亲和层析步骤后,对洗脱的样品进行密度梯度离心,以将AAV与中间体AAV粒子分离开。使用磷酸盐缓冲盐水的透析,对使用CsCl密度梯度离心分离的AAV粒子进行缓冲液交换。在缓冲液交换后,使用Ultra-4离心过滤装置(Merck millipore#UFC801024)将AAV样品浓缩,并使用0.22μm Millex-GV注射器过滤装置(Merck millipore#SLGV033RS)除菌。使用DNeasy血液和组织试剂盒(QIAGEN#69506)纯化AAV基因组。纯化的AAV基因组的滴度使用滴定试剂盒(用于实时PCR)(Takara#6233)来测量。得到的AAV被命名为AAV9-695。
使用pED148-h245、pED148-h257或pED148-h269如上所述制造AAV,并分别命名为AAV9-245、AAV9-257和AAV9-269。AAV9-695、AAV9-245和AAV9-257中的每一者被制造两次并用于体外和体内实验。
(2)结果
所述AAV的基因组滴度示出在表2中。
表2
AAV名称 | 浓度(vg/mL) | 批次号 |
AAV9-695 | 2.8x10<sup>12</sup> | 批次1 |
AAV9-245 | 3.6x10<sup>12</sup> | 批次1 |
AAV9-257 | 4.5x10<sup>12</sup> | 批次1 |
AAV9-269 | 5.5x10<sup>12</sup> | 批次1 |
AAV9-695 | 3.6x10<sup>13</sup> | 批次2 |
AAV9-245 | 3.7x10<sup>13</sup> | 批次2 |
AAV9-257 | 4.6x10<sup>13</sup> | 批次2 |
实施例3.带有编码dSaCas9、转录阻遏物和sgRNA的碱基序列的重组AAV9对DMPK基因阻遏的体外药理学评估
(1)实验方法
细胞培养和AAV感染
将iCM细胞悬浮在骨骼肌细胞培养基(Promocell#C23060)中(注:培养基增补有20%FBS而不是由试剂盒指导的5%以及50μg/ml庆大霉素S),并以每孔900μl培养基中20,000个细胞的密度接种在I型胶原蛋白包被的24孔板(IWAKI#4820-010)中。对于AAV感染来说,向培养基添加含有2.8、3.6、4.5或5.5×1012vg/ml的AAV9-695、AAV9-245、AAV9-257或AAV9-269的100μl含有0.001%PluronicTM F-68(GE healthcare#SH30594.01)的PBS,并在37℃/5%CO2下培养和温育2天。对于对照孔来说,向培养基添加100μl含有0.001%Pluronic F-68的PBS。所述实验进行一式三份。将培养基用更换为分化培养基(增补有10μg/ml胰岛素(Sigma#I9278)的DMEM培养基(Thermo Fisher#61965-026)),并将细胞在37℃和5%CO2下培养4天。在用500μl PBS清洗后,使用RNeasy Plus小量试剂盒(Qiagen#74134)按照制造商的说明书提取总RNA。将来自于未用AAV感染的细胞的RNA设置为对照,并在图4中显示为对照。
基因表达分析
对于Taqman qPCR来说,使用SuperScriptTM VILOTM cDNA合成试剂盒(ThermoFisher#11754250)在20μl反应体积中将80ng总RNA转变成cDNA。将所述cDNA用水稀释160倍,并将2μl用于qPCR。qPCR使用QuantStudioTM 12K Flex实时PCR系统(Thermo Fisher),在含有用于DMPK(Thermo Fisher#Hs01094329_m1,FAM)或GAPDH(Thermo Fisher#Hs99999905_m1,FAM)的Taqman探针和TaqmanTM基因表达主混合物(Thermo Fisher#4369016)的5μl终体积中运行。所述qPCR循环条件如下:50℃ 2分钟后95℃ 10分钟,然后是45个95℃ 15秒和60℃ 1分钟的循环。数据使用QuantStudioTM 12K Flex软件(ThermoFisher)进行分析。表达值使用每个基因的标准曲线来分析,并将DMPK基因的表达水平归一化到GAPDH基因的表达水平。
(2)结果
通过将AAV9-695、AAV9-245、AAV9-257或AAV9-269施加到iCM细胞中,发现了DMPKmRNA的下调,这表明带有dSaCas9、KRAB和包含由SEQ ID NO:83、70、81或99所示的靶向序列编码的crRNA的sgRNA的转入基因的AAV9在人类肌细胞中对DMPK下调具有药理作用(图4)。
实施例4.在DMSXL小鼠中抑制DMPK基因表达
(1)实验方法
动物处理
将AAV9-695、AAV9-245或AAV9-257静脉内注射到DMSXL纯合小鼠(被称为DMSXL小鼠),即带有人类DM1基因座和>1,000CTG的非常大的扩增的转基因小鼠(PLoS Genet.2012;8(11):e1003043)中(总共n=4,其中雄性n=2并且雌性n=2)。剂量如下:对于AAV9-695、AAV9-245和AAV9-257来说分别为1.5×1013vg/kg、5×1013vg/kg、1.5×1014vg/kg和5×1014vg/kg。作为对照,注射含有0.001%Pluronic F-68(GE healthcare#SH30594.01)的PBS。4周后将DMSXL小鼠处死并从这些小鼠收集样品(胫前肌(TA)、心脏和肝脏)。如下所述对这些样品进行基因表达分析。在进行RNA提取之前将样品储存在-80℃冷冻室中。
RNA提取和基因表达分析
使用TissueLyser II(Qiagen)将组织样品在1ml ISOGEN(NIPPON GENE#319-90211)中匀浆。在离心分离后,将700μl上清液转移到含有150μl氯仿(Wako#034-02603)的1.5ml管中。在涡旋振荡和离心后,将187μl水层添加到150μl异丙醇(WAKO#166-04836)中并混合。将所述RNA提取物转移到Plus小量试剂盒(QIAGEN#74134)的RNeasy离心柱,并按照制造商的方案进一步纯化。
对于Taqman qPCR来说,使用SuperScriptTM VILOTM cDNA合成试剂盒(ThermoFisher#11754250)在20μl反应体积中将700-1,000ng总RNA转变成cDNA。将所述cDNA用水稀释20倍,并将3-4μl用于qPCR。qPCR使用QuantStudioTM 12K Flex实时PCR系统(ThermoFisher),在含有用于DMPK(Thermo Fisher#Hs01094329_m1,FAM)或GAPDH(Thermo Fisher#Mm99999915_g1,FAM)的Taqman探针和TaqmanTM基因表达主混合物(Thermo Fisher#4369016)的10μl终体积中运行。所述qPCR循环条件如下:50℃ 2分钟后95℃ 10分钟,然后是40-45个95℃ 15秒和60℃ 1分钟的循环。数据使用QuantStudioTM 12K Flex软件(ThermoFisher)进行分析。表达值使用每个基因的标准曲线来分析,并将DMPK基因的表达水平归一化到GAPDH基因的表达水平。
(2)结果
AAV9-695、AAV9-245和AAV9-257在小鼠中表达每个转入基因。在肝脏中没有但在骨骼肌和心肌中发现DMPK mRNA下调,这表明带有dSaCas9、KRAB和包含由SEQ ID NO:83、70或81所示的靶向序列编码的crRNA的sgRNA的转入基因的AAV9在DMSXL小鼠中对DMPK下调具有药理作用(图5-7)。
实施例5.通过向DMSXL小鼠给药AAV9-695改善RNA灶点形成(1)实验方法
荧光原位杂交:FISH
AAV9-695(5×1014vg/kg)或作为对照的介质(含有0.001%Pluronic F-68的PBS)向DMSXL小鼠的给药如实施例4中所述来进行。在给药后4周,切下并收集DMSXL小鼠的胫前肌(TA)。在立即包埋在Tissue-O.C.T.Compound(Sakura Finetek Japan,#4583)中之后,将组织在液氮中预冷的冷异戊烷中冷冻并储存在-80℃。
通过低温切片机制备10μm的冷冻组织切片,并将所述薄切片置于玻璃载片上。将所述载片空气干燥,并用4%聚甲醛在室温固定15分钟,用PBS清洗两次,每次2分钟,并储存在4℃下。
在含有2%丙酮的PBS中在室温温育5分钟后,将所述载片在含有30%甲酰胺的2×盐水柠檬酸钠缓冲液(SSC)(300mM NaCl和30mM柠檬酸钠)中在室温下温育10分钟。将载片在探针溶液(0.02%牛血清白蛋白(SIGMA#A7030-100G),0.066mg/ml酵母tRNA(ThermoFisher#15401-011),2mM核糖核苷氧钒复合物(SIGMA#R3380-5ML)和1ng/μl Cy3-(CAG)5-2’-OMe探针(y_C(M)A(M)G(M)C(M)A(M)G(M)C(M)A(M)G(M)C(M)A(M)G(M)C(M)A(M)G(M),y意味着Cy3并且N(M)意味着2’-OMe RNA。该探针由GeneDesign,Inc.,Japan合成),在含有30%甲酰胺的2×SSC中)中在37℃温育2小时。在杂交后,除去所述探针溶液,并将载片在含有30%甲酰胺的2×SSC中在50℃温育30分钟。将载片用1×SSC清洗一次,并在1×SSC中在室温温育30分钟。将载片用PBS清洗三次,每次10分钟,并向载片添加含有DAPI的ProLongTMDiamond抗褪色封固剂(Thermo Fisher#P36971)。将载片用盖片覆盖并储存在4℃。
使用激光共聚焦显微镜LSM700(ZEISS)观察RNA灶点的形成。
(2)结果
用介质或AAV9-695给药的DMSXL小鼠的TA肌肉切片的典型图像示出在图8中。箭头指示RNA灶点(RNA灶点被定义为位于染为蓝色的核中的可清楚地检测的红点)。
与介质给药的DMSXL小鼠的TA肌肉相比,在AAV9-695给药的DMSXL小鼠的TA肌肉中观察到的RNA灶点数目较低,表明在DMSXL小鼠中AAV9-695给药改善RNA灶点形成。
实施例6.在表达hDMPK sgRNA的iDM细胞中DMPK基因表达的抑制
(1)实验方法
慢病毒产生
将Lenti-XTM 293T细胞(Takara#632180)以5×106个细胞/培养皿接种在I型胶原蛋白包被的100mm培养皿(IWAKI#4020-010)中的10ml增补有10%FBS和MEM非必需氨基酸溶液(Thermo Fisher#11140050)的DMEM(Thermo Fisher#10569-010)中,并在37℃/5%CO2下温育过夜。第二天,按照制造商的方案,使用7μg慢病毒高滴度包装混合物(Takara#6194)和5.5μg含有编码dSaCas9-KRAB和SEQ ID NO:1或83所示的指定靶向序列的序列的转移质粒pED162(实施例1),建立LipofectamineTM 3000转染试剂(Thermo Fisher#L3000008)。质粒如表3中所述命名。在转染后48小时,通过将培养基上清液通过0.45μm滤器,收获10ml含有慢病毒的培养基。为了浓缩病毒溶液,添加1/4体积的PEG-itTM病毒沉淀溶液(SBI#LV810A-1),并在4℃温育过夜。将上清液以1,500×g离心30分钟。在舍弃上清液后,向管添加200μlDMEM,将病毒溶液轻柔地重悬浮,并储存在-80℃下。
表3
质粒 | SEQ ID NO |
pED162-C1 | 1 |
pED162-695 | 83 |
使用RNA病毒(MACHEREY-NAGEL#740956.250)和Lenti-XTM qRT-PCR滴定试剂盒(Clontech#631235)测量,慢病毒滴度在5×1010至7×1010个粒子/ml的范围内。
iDM细胞的转导
将iDM细胞以50,000个细胞/孔接种在I型胶原蛋白包被的12孔板(IWAKI#4815-010)中的1ml含有生长培养基[PromoCell骨骼肌细胞生长培养基;零件编号:C-23160(注:培养基增补有20%FBS而不是试剂盒指导的5%,和50μg/ml庆大霉素S)]的培养基中,并在37℃/5%CO2下温育过夜。第二天,将所述培养基更换为1ml增补有5μg/ml聚凝胺(Sigma#TR-1003-G)的生长培养基,并向每个孔添加对应于每种sgRNA的0.3ml慢病毒上清液(参见上文),所述sgRNA包含由各个靶向序列(SEQ ID NO:1或83)编码并与tracrRNA融合的crRNA。将细胞与慢病毒温育48小时,然后除去病毒培养基并更换为选择培养基[增补有10μg/ml杀稻瘟菌素(Nacalai#03759-71)的生长培养基]。在选择培养基中温育24小时后,将三分之一的细胞转移到具有生长培养基的新的孔中。在允许细胞接种72小时后,将生长培养基更换为选择培养基。在选择培养基中培养48小时后,收获细胞并储用。
iCM细胞的转导
将iCM细胞以50,000个细胞/孔接种在I型胶原蛋白包被的6孔板(IWAKI#4810-010)中的2ml含有生长培养基[PromoCell骨骼肌细胞生长培养基;零件编号:C-23160(注:培养基增补有20%FBS而不是试剂盒指导的5%,和50μg/ml庆大霉素S)]的培养基中,并在37℃/5%CO2下温育过夜。第二天,将所述培养基更换为2ml增补有5μg/ml聚凝胺(Sigma#TR-1003-G)的生长培养基,并向每个孔添加对应于对照sgRNA的2×109vg慢病毒上清液(参见上文),所述对照sgRNA包含由单个靶向序列(SEQ ID NO:1)编码并与tracrRNA融合的crRNA。将细胞与慢病毒温育48小时,然后除去病毒培养基并更换为选择培养基[增补有10μg/ml杀稻瘟菌素(Nacalai#03759-71)的生长培养基]。在选择培养基中温育24小时后,将三分之二的细胞转移到具有生长培养基的I型胶原蛋白包被的100mm培养皿(iwaki#4020-010)中。在允许细胞接种72小时后,将生长培养基更换为选择培养基。在选择培养基中培养48小时后,收获细胞并储用。
细胞培养、RNA提取和cDNA制备
将分别被称为iDM-695细胞(iDM_695)、iDM-对照细胞(iDM_对照)和iCM-对照细胞(iCM_对照)的表达dSaCas9和包含由SEQ ID NO:83所示的靶向序列编码的crRNA的hDMPKsgRNA的iDM细胞、表达dSaCas9和包含由SEQ ID NO:1所示的靶向序列编码的crRNA的对照sgRNA的iDM细胞和表达dSaCas9和包含由SEQ ID NO:1所示的靶向序列编码的crRNA的对照sgRNA的iCM细胞,以每孔500μl中25,000个细胞或1ml中50,000个细胞的密度接种在I型胶原蛋白包被的24孔板(IWAKI#4820-010)中的增补有20%非热失活FBS的骨骼肌细胞生长培养基(Promocell#C23060)中,并在37℃/5%CO2下温育2天(以50,000个细胞/孔接种)或3天(以25,000个细胞/孔接种)。
在用200μL PBS清洗后,使用RNeasy小量试剂盒(Qiagen#74106)按照制造商的说明书提取总RNA。
按照制造商的说明书使用SuperScriptTM VILOTM cDNA合成试剂盒(ThermoFisher#11754-250)将500ng总RNA转变成cDNA。将所述cDNA储存在-20℃下。
基因表达分析
将cDNA用水稀释100倍,并将2μl用于qPCR。qPCR使用ViiA7实时PCR系统(ThermoFisher),在含有用于DMPK(Thermo Fisher#Hs01094329_m1,FAM)或GAPDH(Thermo Fisher#Hs99999905_m1,FAM)的Taqman探针和Taqman基因表达主混合物(Thermo Fisher#4369016)的10μl终体积中运行。qPCR条件如下:使用50℃ 2分钟和95℃ 10分钟预热,然后是45个循环的95℃ 15秒和60℃ 1分钟。表达值使用每个基因的标准曲线来分析,并将DMPK基因的表达水平归一化到GAPDH基因的表达水平。
(2)结果
iDM-695细胞和iDM-对照细胞中DMPK基因的表达示出在图9中。
在表达hDMPK sgRNA的iDM细胞中DMPK基因表达被抑制。实施例7.在表达hDMPKsgRNA的iDM细胞中RNA灶点形成的改善(1)实验方法
荧光原位杂交:FISH
将在实施例6中构建的iDM-695细胞、iDM-对照细胞和iCM-对照细胞以每孔2,500个细胞或5,000个细胞的密度一式四份接种在胶原蛋白包被的96孔板(Thermo FisherScientific#152038)中的增补有20%非热失活FBS的骨骼肌细胞生长培养基(Promocell#C23060)中,并在37℃/5%CO2下温育2天(以5,000个细胞/孔接种)或3天(以2,500个细胞/孔接种)。
将所述细胞用磷酸盐缓冲盐水(PBS)清洗两次,用4%聚甲醛在室温固定15分钟,用PBS清洗两次,并储存在4℃下。
在含有0.2%Triton X-100的PBS中在室温温育10分钟后,将细胞清洗并在含有40%甲酰胺的2×SSC中在室温温育10分钟。向每个孔添加50μl探针溶液(0.02%牛血清白蛋白(SIGMA#A7030-100G),0.066mg/ml酵母tRNA(Thermo Fisher Scientific#15401-011),2mM核糖核苷氧钒复合物(SIGMA#R3380-5ML)和0.1ng/μl Cy3-(CAG)5-LNA探针(y_5(L)A(L)G(L)cagcagcag5(L)A(L)G(L),y意味着Cy3,5(L)意味着LNA-mC,N(L)意味着LNA,并且小写字母意味着DNA。该探针由GeneDesign,Inc.合成),在含有40%甲酰胺的2×SSC中),并将细胞在37℃温育2小时。在杂交后,除去所述探针溶液,并将细胞在含有40%甲酰胺的2×SSC中在37℃温育30分钟。将细胞用1×SSC清洗一次,并在1×SSC中在室温温育30分钟。向每个孔添加50μl含有2μg/ml DAPI(Dojindo#340-07971)的PBS,并将细胞在室温温育30分钟。将细胞用PBS在室温下清洗两次,每次5分钟,并储存在4℃下。
使用IN Cell Analyzer 6000(GE healthcare)检测并分析RNA灶点的形成。捕获每个孔中9个点的图像,并对每个图像中RNA灶点阳性核的数目和核的总数进行计数。分析每个孔中灶点阳性核的比率并计算平均值。
(2)结果
iDM-695细胞和iDM-对照细胞的典型图像示出在图10A中。
每个孔中灶点阳性核的比率示出在图10B中。
与iDM-对照细胞相比,iDM-695细胞中RNA灶点阳性核的比率较低。
实施例8.在表达hDMPK sgRNA的iDM细胞中剪接缺陷的改善(1)实验方法
剪接分析
来自于iDM-695细胞、iDM-对照细胞和iCM-695细胞的cDNA的制备描述在实施例6中。
表4
PCR循环条件如下:35个循环的98℃ 10秒、60℃ 15秒和68℃ 30秒,然后72℃ 7分钟。
将PCR产物载样到Agilent DNA1000试剂盒(Agilent#5067-1504)上,电泳,并使用Agilent 2100 BioAnalyzer系统按照制造商的说明书进行分析。
测量正常和异常剪接产物的峰的AUC,并计算每种细胞中正常剪接产物的比率。
(2)结果
每个基因即DMD、MBNL1、KIF13A和TNNT2的凝胶图像和外显子模式示出在图11A中。
每种细胞中正常剪接产物的比率示出在图11B中,所述正常剪接产物在iCM细胞中更加丰富并且在iDM细胞中较少。
在iDM-695细胞中所有测试的基因的剪接缺陷得以改善。
在本文中陈述数值限度或范围的情况下,端点被包括在内。此外,在数值限度或范围之内的所有值和子范围被具体包含在内,如同被明确写出那样。
当在本文中使用时,单数指称带有“一个或多个”的含义。
显然,根据上述教导,本发明的大量修改和变化是可能的。因此应当理解,在权利要求书的范围内,本发明可以不同于本文中具体描述的方式实施。
上文中提到的所有专利和其他参考文献整体通过参考并入本文,如同被详细阐述的那样。
工业实用性
根据本发明,可以在源自于DM1患者的细胞和DM1模型小鼠中抑制DMPK基因的表达。因此,预计本发明对DM1的治疗和/或预防极其有用。
序列表
<110> 安斯泰来制药株式会社
摩大力斯医疗株式会社
<120> 通过靶向DMPK基因治疗肌营养不良症的方法
<130> 093012
<150> US62/853,373
<151> 2019-05-28
<150> US63/025,417
<151> 2020-05-15
<160> 204
<170> PatentIn 3.5版
<210> 1
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 非靶向性对照靶向序列
<400> 1
acggaggcta agcgtcgcaa 20
<210> 2
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 非靶向性对照靶向序列
<400> 2
cgcttccgcg gcccgttcaa 20
<210> 3
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 非靶向性对照靶向序列
<400> 3
gtaggcgcgc cgctctctac 20
<210> 4
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 4
cccttcttct ttggcctcga c 21
<210> 5
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 5
cccagtcgag gccaaagaag a 21
<210> 6
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 6
cccccggaga cacggctggg c 21
<210> 7
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 7
ctccggggga cacagcaacc g 21
<210> 8
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 8
ctgggacagg cagctacggg c 21
<210> 9
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 9
aggcccaccg cccacagcct g 21
<210> 10
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 10
gatctcgggg gacaggtagt c 21
<210> 11
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 11
gcccagagct ggtgggccca g 21
<210> 12
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 12
gatgtggcca cagcggtcca g 21
<210> 13
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 13
cagcggtcca gcaggatgtt g 21
<210> 14
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 14
gatagcaagc ttgttccctg g 21
<210> 15
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 15
gtagaagcgc gccatctcgg c 21
<210> 16
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 16
actgctgagc aagtttgggg a 21
<210> 17
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 17
ccccttctca gtacctggtc a 21
<210> 18
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 18
ctccatgacc aggtactgag a 21
<210> 19
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 19
ggtactgaga aggggttcgt c 21
<210> 20
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 20
tggggctgca ggtctctgcc c 21
<210> 21
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 21
tcccagaaca tctcaggggg a 21
<210> 22
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 22
tctttcccag aacatctcag g 21
<210> 23
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 23
ctgggaccta ttctctgggt g 21
<210> 24
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 24
ggagtgtctg ggacctattc t 21
<210> 25
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 25
aacggggcca ctgcctaggg a 21
<210> 26
<211> 20
<212> DNA
<213> 智人(Homo sapiens)
<400> 26
tccctaggca gtggccccgt 20
<210> 27
<211> 20
<212> DNA
<213> 智人(Homo sapiens)
<400> 27
ggggggtctc ttacctggaa 20
<210> 28
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 28
ggggccttcc ctatctgaga c 21
<210> 29
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 29
tgcggtaaga gggtgtgtca g 21
<210> 30
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 30
gggaaagtct gtgtgcggta a 21
<210> 31
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 31
gggagaggaa gctctaggat t 21
<210> 32
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 32
ggaggtgggg agaggaagct c 21
<210> 33
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 33
actacctggt gagctccggg c 21
<210> 34
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 34
gttggtgaat ggggaccggc g 21
<210> 35
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 35
cgtgaggaga gggacgtgtt g 21
<210> 36
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 36
cctgaggtcg agatagtgag a 21
<210> 37
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 37
tgggtgcaga agcgggtgga g 21
<210> 38
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 38
acggctgggt gcagaagcgg g 21
<210> 39
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 39
ggggcggggc cttagaaatt g 21
<210> 40
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 40
gactcgtggt gggcggggca t 21
<210> 41
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 41
aacccctatg ccccgcccac c 21
<210> 42
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 42
cgccccgcgt ccgatcacct t 21
<210> 43
<211> 19
<212> DNA
<213> 智人(Homo sapiens)
<400> 43
gtggtggggg aaaagaacc 19
<210> 44
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 44
agagagaagg ggagacagac a 21
<210> 45
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 45
tccaagggtg tgcaggatgg t 21
<210> 46
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 46
gggtgtgcag gatggttagg g 21
<210> 47
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 47
aggatggtta gggtggggta a 21
<210> 48
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 48
tttccttctc cccttgttct t 21
<210> 49
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 49
tcagggtgga aggtggaaga a 21
<210> 50
<211> 20
<212> DNA
<213> 智人(Homo sapiens)
<400> 50
cctgaactgt cacctggagt 20
<210> 51
<211> 20
<212> DNA
<213> 智人(Homo sapiens)
<400> 51
cctgaactgt cacctggagt 20
<210> 52
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 52
ggctgcacct gaactgtcac c 21
<210> 53
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 53
ggacattcta catgagaacg t 21
<210> 54
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 54
ccccaccccc acgttctcat g 21
<210> 55
<211> 19
<212> DNA
<213> 智人(Homo sapiens)
<400> 55
cgttctcatg tagaatgtc 19
<210> 56
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 56
tcctgggtaa cggcccagac g 21
<210> 57
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 57
tggttcaaga accacctgca t 21
<210> 58
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 58
tgaaccacac tttggaaaac c 21
<210> 59
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 59
cctgattgtc atctgtaaat a 21
<210> 60
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 60
agaagtcaag gtcctatgac t 21
<210> 61
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 61
gacagggagg cctggactcc c 21
<210> 62
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 62
aagtggaggc aggatggaga g 21
<210> 63
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 63
ggaggcagga tggagagagg g 21
<210> 64
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 64
gcctggggcg ggggaggggg c 21
<210> 65
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 65
agatgcagct cgggccacaa a 21
<210> 66
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 66
gcagggtaaa ctgagaccgg g 21
<210> 67
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 67
gggagaaggt ttttccagag g 21
<210> 68
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 68
gacatatgag ggccagaggg g 21
<210> 69
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 69
gggccagagg ggcagggtgc t 21
<210> 70
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 70
agaaaccagt gaccagtgag c 21
<210> 71
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 71
gcctggggcc agggctggaa g 21
<210> 72
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 72
tcagccacca ttcccgccac c 21
<210> 73
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 73
gtctcggttc ctatgagccg t 21
<210> 74
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 74
cgagactttg gggctggggg t 21
<210> 75
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 75
actttggggc tgggggtggg g 21
<210> 76
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 76
gagggacaac tttgggttct t 21
<210> 77
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 77
aagaacccaa agttgtccct c 21
<210> 78
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 78
tggttctccc acagggcccg c 21
<210> 79
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 79
catagagccc acttttgggg g 21
<210> 80
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 80
tgtgactcag agccatggct t 21
<210> 81
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 81
tgtgactcct aagccatggc t 21
<210> 82
<211> 19
<212> DNA
<213> 智人(Homo sapiens)
<400> 82
tgactcagag ccatggctt 19
<210> 83
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 83
cagagtaagg tcagcagagg c 21
<210> 84
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 84
gctgacctta ctctgcccct c 21
<210> 85
<211> 19
<212> DNA
<213> 智人(Homo sapiens)
<400> 85
gggctcctgc agatggggt 19
<210> 86
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 86
tggccgactt cttgcagtgg g 21
<210> 87
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 87
tacgtggccg acttcttgca g 21
<210> 88
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 88
ggggctccag ccccaggaag c 21
<210> 89
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 89
cccctccccg ggccgggggc t 21
<210> 90
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 90
ggcgggaggg gggctggacc a 21
<210> 91
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 91
attcccccta ctctagcact g 21
<210> 92
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 92
ctctgtgctc taccttttta t 21
<210> 93
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 93
ggagcaaggg ggtggttgct a 21
<210> 94
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 94
acagaggaag ccacaggtgt g 21
<210> 95
<211> 19
<212> DNA
<213> 智人(Homo sapiens)
<400> 95
gggaggccag ggcagcttc 19
<210> 96
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 96
agcttcttgg gtgactcaga g 21
<210> 97
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 97
tgtccgagtc ggaatccatc t 21
<210> 98
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 98
gctaaattta actgtccgag t 21
<210> 99
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 99
ctgagggcta aatttaactg t 21
<210> 100
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 100
ggtcacccct gttcaggctc t 21
<210> 101
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 101
gggcaccctc agagcctgaa c 21
<210> 102
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 102
ccaggggcca aggagctatt t 21
<210> 103
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 103
gtttctaatc ccagcctggg c 21
<210> 104
<211> 20
<212> DNA
<213> 智人(Homo sapiens)
<400> 104
agccacattc ctgcccaggc 20
<210> 105
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 105
attagaaaca gaaacatttc g 21
<210> 106
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 106
acagaaacat ttcggggggt g 21
<210> 107
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 107
ctcttggctc tcggagccgc a 21
<210> 108
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 108
cctgcggctc cgagagccaa g 21
<210> 109
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 109
gaaactagga ggcaaggacc g 21
<210> 110
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 110
atgaggggcc ctggctgcgg g 21
<210> 111
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 111
gcaggatgct cttctcccca a 21
<210> 112
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 112
ggtcctgctc cagccgctgg t 21
<210> 113
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 113
ccggcttttc ctgctatgaa a 21
<210> 114
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 114
gcagtccgct ttccccagag g 21
<210> 115
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 115
aaagcggact gcctagaacc a 21
<210> 116
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 116
ttgctattgt ctgtgctttt g 21
<210> 117
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 117
acagacaata gcaagggcag c 21
<210> 118
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 118
gaccagtcac atgctgggga c 21
<210> 119
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 119
tcacatgctg gggacaggga t 21
<210> 120
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 120
atggctagga ggctgggggc a 21
<210> 121
<211> 19
<212> DNA
<213> 智人(Homo sapiens)
<400> 121
ggcacagtgg tgtgaagcc 19
<210> 122
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 122
gtcaagggga gggagccagc c 21
<210> 123
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 123
gctgccaccc tggccttgcc c 21
<210> 124
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 124
tgagttcttt ccccagcacc t 21
<210> 125
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 125
gtggtagagg taggactgtc c 21
<210> 126
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<400> 126
cctacctcta ccactgactt g 21
<210> 127
<211> 89
<212> DNA
<213> 智人(Homo sapiens)
<400> 127
gtggtggggg aaaagaaccg agggtcacca gaaagggcac tggagacaag ggggaaagcc 60
ccaccctctg tctgtctccc cttctctct 89
<210> 128
<211> 25
<212> DNA
<213> 智人(Homo sapiens)
<400> 128
aagtggaggc aggatggaga gaggg 25
<210> 129
<211> 317
<212> DNA
<213> 智人(Homo sapiens)
<400> 129
cccggtctca gtttaccctg ccaacccaac ttcatcctct actaaaggga ggccaggaga 60
gtcattaggg gctgtgggag gttgggagaa ggtttttcca gaggctgaat ggcctggccc 120
ttcttggcct ccaccttccc atctgtgaaa tgggaggagg agggaaagga accaggcctg 180
ggtcagaaag agatggacac aagaggaagc aggggaaggc ggagaaacag gaaacaagtg 240
tcacacacag tttgttacac acaggccaaa aaccaaacac cagtcactga gggccagaca 300
tatgagggcc agagggg 317
<210> 130
<211> 243
<212> DNA
<213> 智人(Homo sapiens)
<400> 130
agaaaccagt gaccagtgag cccgagtcct gggcttgggg aggaggtggg cagacaaggc 60
agctggcaga agcggaagca tcctccttcc agccctggcc ccaggccctg gaaagccctt 120
gcaagaaggg gttttgtggg acagctggaa ggttggaggc cctgggccag gagaactaaa 180
ggacgcaggg acccggggtg gcgggaatgg tggctgaccc acacggctca taggaaccga 240
gac 243
<210> 131
<211> 61
<212> DNA
<213> 智人(Homo sapiens)
<400> 131
tgtgactcct aagccatggc tctgagtcac agccacccta gcctctgctg accttactct 60
g 61
<210> 132
<211> 57
<212> DNA
<213> 智人(Homo sapiens)
<400> 132
accccatctg caggagcccc gagggtaggc actcacccca ctgcaagaag tcggcca 57
<210> 133
<211> 189
<212> DNA
<213> 智人(Homo sapiens)
<400> 133
gggaggccag ggcagcttct tgggtgactc agagatggat tccgactcgg acagttaaat 60
ttagccctca ggctctctgc tttataccag cttttttttt tttttttttt ttttttccca 120
ggaggtgggg aaggggtggt gaggacagga ccaggagggg gcaccctcag agcctgaaca 180
ggggtgacc 189
<210> 134
<211> 40
<212> DNA
<213> 智人(Homo sapiens)
<400> 134
gcccaggctg ggattagaaa cagaaacatt tcggggggtg 52
<210> 135
<211> 28
<212> DNA
<213> 智人(Homo sapiens)
<400> 135
attagaaaca gaaacatttc ggggggtg 28
<210> 136
<211> 106
<212> DNA
<213> 智人(Homo sapiens)
<400> 136
agaaaccagt gaccagtgag cccgagtcct gggcttgggg aggaggtggg cagacaaggc 60
agctggcaga agcggaagca tcctccttcc agccctggcc ccaggc 106
<210> 137
<211> 363
<212> DNA
<213> 智人(Homo sapiens)
<400> 137
agccacattc ctgcccaggc tgggattaga aacagaaaca tttcgggggg tggagggtgg 60
cgcgggaaga cacactccct gcggctccga gagccaagag gaatctgagc tcttctttcc 120
agggtggacg gttctccaga gtggaagtcg gggcctccaa ccggcctggg gtgcctgggc 180
cttgagaggg ccaggcctga gaaactagga ggcaaggacc gaggagtccc agctgggcag 240
ggcctgggaa acgtggtcct gggcagttct gataatttaa aaaacaccga ggactttgat 300
gaggggccct ggctgcgggc aggatgctct tctccccaag agggtcctgc tccagccgct 360
ggt 363
<210> 138
<211> 19
<212> RNA
<213> 新凶手弗朗西丝菌(Francisella novicid)
<220>
<221> misc_feature
<222> (1)..(19)
<223> crRNA的5'-柄
<400> 138
aauuucuacu guuguagau 19
<210> 139
<211> 1053
<212> PRT
<213> 金黄色葡萄球菌(Staphylococcus aureus)
<220>
<221> 变体
<222> (10)..(10)
<223> 将Asp残基转变成Ala残基
<220>
<221> 变体
<222> (580)..(580)
<223> 将Asn残基转变成Ala残基
<400> 139
Met Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Ala Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Asn Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 140
<211> 1053
<212> PRT
<213> 金黄色葡萄球菌(Staphylococcus aureus)
<220>
<221> 变体
<222> (10)..(10)
<223> 将Asp残基转变成Ala残基
<220>
<221> 变体
<222> (557)..(557)
<223> 将His残基转变成Ala残基
<400> 140
Met Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp Ala Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
725 730 735
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
740 745 750
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
755 760 765
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile
770 775 780
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
785 790 795 800
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
805 810 815
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
820 825 830
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
835 840 845
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
850 855 860
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
865 870 875 880
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
885 890 895
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
900 905 910
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
915 920 925
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
930 935 940
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
945 950 955 960
Glu Phe Ile Ala Ser Phe Tyr Asn Asn Asp Leu Ile Lys Ile Asn Gly
965 970 975
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile
980 985 990
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met
995 1000 1005
Asn Asp Lys Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser Lys
1010 1015 1020
Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu
1025 1030 1035
Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1040 1045 1050
<210> 141
<211> 1028
<212> PRT
<213> 人工序列
<220>
<223> 氨基酸残基(dSaCas9的第721至第745位氨基酸)缺失突变体
<220>
<221> 变体
<222> (10)..(10)
<223> 将Asp残基转变成Ala残基
<220>
<221> 变体
<222> (580)..(580)
<223> 将Asn残基转变成Ala残基
<400> 141
Met Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Ala Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Thr Glu Gln Glu Tyr Lys Glu Ile Phe Ile Thr Pro His Gln Ile Lys
725 730 735
His Ile Lys Asp Phe Lys Asp Tyr Lys Tyr Ser His Arg Val Asp Lys
740 745 750
Lys Pro Asn Arg Glu Leu Ile Asn Asp Thr Leu Tyr Ser Thr Arg Lys
755 760 765
Asp Asp Lys Gly Asn Thr Leu Ile Val Asn Asn Leu Asn Gly Leu Tyr
770 775 780
Asp Lys Asp Asn Asp Lys Leu Lys Lys Leu Ile Asn Lys Ser Pro Glu
785 790 795 800
Lys Leu Leu Met Tyr His His Asp Pro Gln Thr Tyr Gln Lys Leu Lys
805 810 815
Leu Ile Met Glu Gln Tyr Gly Asp Glu Lys Asn Pro Leu Tyr Lys Tyr
820 825 830
Tyr Glu Glu Thr Gly Asn Tyr Leu Thr Lys Tyr Ser Lys Lys Asp Asn
835 840 845
Gly Pro Val Ile Lys Lys Ile Lys Tyr Tyr Gly Asn Lys Leu Asn Ala
850 855 860
His Leu Asp Ile Thr Asp Asp Tyr Pro Asn Ser Arg Asn Lys Val Val
865 870 875 880
Lys Leu Ser Leu Lys Pro Tyr Arg Phe Asp Val Tyr Leu Asp Asn Gly
885 890 895
Val Tyr Lys Phe Val Thr Val Lys Asn Leu Asp Val Ile Lys Lys Glu
900 905 910
Asn Tyr Tyr Glu Val Asn Ser Lys Cys Tyr Glu Glu Ala Lys Lys Leu
915 920 925
Lys Lys Ile Ser Asn Gln Ala Glu Phe Ile Ala Ser Phe Tyr Asn Asn
930 935 940
Asp Leu Ile Lys Ile Asn Gly Glu Leu Tyr Arg Val Ile Gly Val Asn
945 950 955 960
Asn Asp Leu Leu Asn Arg Ile Glu Val Asn Met Ile Asp Ile Thr Tyr
965 970 975
Arg Glu Tyr Leu Glu Asn Met Asn Asp Lys Arg Pro Pro Arg Ile Ile
980 985 990
Lys Thr Ile Ala Ser Lys Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp
995 1000 1005
Ile Leu Gly Asn Leu Tyr Glu Val Lys Ser Lys Lys His Pro Gln
1010 1015 1020
Ile Ile Lys Lys Gly
1025
<210> 142
<211> 6
<212> PRT
<213> 人工序列
<220>
<223> GGSGGS连接物
<400> 142
Gly Gly Ser Gly Gly Ser
1 5
<210> 143
<211> 1034
<212> PRT
<213> 人工序列
<220>
<223> 带有GGSGGS连接物的氨基酸残基(dSaCas9的第721至第745位氨基酸残基)缺失突变体
<220>
<221> 变体
<222> (10)..(10)
<223> 将Asp残基转变成Ala残基
<220>
<221> 变体
<222> (580)..(580)
<223> 将Asn残基转变成Ala残基
<220>
<221> MISC_FEATURE
<222> (721)..(726)
<223> GGSGGS连接物
<400> 143
Met Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
485 490 495
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
500 505 510
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
515 520 525
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
530 535 540
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
545 550 555 560
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
565 570 575
Gln Glu Glu Ala Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
580 585 590
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
595 600 605
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
610 615 620
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
625 630 635 640
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
645 650 655
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
660 665 670
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
675 680 685
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
690 695 700
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
705 710 715 720
Gly Gly Ser Gly Gly Ser Thr Glu Gln Glu Tyr Lys Glu Ile Phe Ile
725 730 735
Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp Tyr Lys Tyr
740 745 750
Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile Asn Asp Thr
755 760 765
Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu Ile Val Asn
770 775 780
Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu Lys Lys Leu
785 790 795 800
Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His Asp Pro Gln
805 810 815
Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly Asp Glu Lys
820 825 830
Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr Leu Thr Lys
835 840 845
Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile Lys Tyr Tyr
850 855 860
Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp Tyr Pro Asn
865 870 875 880
Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr Arg Phe Asp
885 890 895
Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val Lys Asn Leu
900 905 910
Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser Lys Cys Tyr
915 920 925
Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala Glu Phe Ile
930 935 940
Ala Ser Phe Tyr Asn Asn Asp Leu Ile Lys Ile Asn Gly Glu Leu Tyr
945 950 955 960
Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile Glu Val Asn
965 970 975
Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met Asn Asp Lys
980 985 990
Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser Lys Thr Gln Ser Ile
995 1000 1005
Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu Tyr Glu Val Lys
1010 1015 1020
Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
1025 1030
<210> 144
<211> 886
<212> PRT
<213> 人工序列
<220>
<223> 氨基酸残基(dSaCas9的第482至第648位氨基酸残基)缺失突变体
<220>
<221> 变体
<222> (10)..(10)
<223> 将Asp残基转变成Ala残基
<400> 144
Met Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Thr Arg Tyr Ala Thr Arg Gly Leu Met Asn Leu Leu Arg Ser Tyr
485 490 495
Phe Arg Val Asn Asn Leu Asp Val Lys Val Lys Ser Ile Asn Gly Gly
500 505 510
Phe Thr Ser Phe Leu Arg Arg Lys Trp Lys Phe Lys Lys Glu Arg Asn
515 520 525
Lys Gly Tyr Lys His His Ala Glu Asp Ala Leu Ile Ile Ala Asn Ala
530 535 540
Asp Phe Ile Phe Lys Glu Trp Lys Lys Leu Asp Lys Ala Lys Lys Val
545 550 555 560
Met Glu Asn Gln Met Phe Glu Glu Lys Gln Ala Glu Ser Met Pro Glu
565 570 575
Ile Glu Thr Glu Gln Glu Tyr Lys Glu Ile Phe Ile Thr Pro His Gln
580 585 590
Ile Lys His Ile Lys Asp Phe Lys Asp Tyr Lys Tyr Ser His Arg Val
595 600 605
Asp Lys Lys Pro Asn Arg Glu Leu Ile Asn Asp Thr Leu Tyr Ser Thr
610 615 620
Arg Lys Asp Asp Lys Gly Asn Thr Leu Ile Val Asn Asn Leu Asn Gly
625 630 635 640
Leu Tyr Asp Lys Asp Asn Asp Lys Leu Lys Lys Leu Ile Asn Lys Ser
645 650 655
Pro Glu Lys Leu Leu Met Tyr His His Asp Pro Gln Thr Tyr Gln Lys
660 665 670
Leu Lys Leu Ile Met Glu Gln Tyr Gly Asp Glu Lys Asn Pro Leu Tyr
675 680 685
Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr Leu Thr Lys Tyr Ser Lys Lys
690 695 700
Asp Asn Gly Pro Val Ile Lys Lys Ile Lys Tyr Tyr Gly Asn Lys Leu
705 710 715 720
Asn Ala His Leu Asp Ile Thr Asp Asp Tyr Pro Asn Ser Arg Asn Lys
725 730 735
Val Val Lys Leu Ser Leu Lys Pro Tyr Arg Phe Asp Val Tyr Leu Asp
740 745 750
Asn Gly Val Tyr Lys Phe Val Thr Val Lys Asn Leu Asp Val Ile Lys
755 760 765
Lys Glu Asn Tyr Tyr Glu Val Asn Ser Lys Cys Tyr Glu Glu Ala Lys
770 775 780
Lys Leu Lys Lys Ile Ser Asn Gln Ala Glu Phe Ile Ala Ser Phe Tyr
785 790 795 800
Asn Asn Asp Leu Ile Lys Ile Asn Gly Glu Leu Tyr Arg Val Ile Gly
805 810 815
Val Asn Asn Asp Leu Leu Asn Arg Ile Glu Val Asn Met Ile Asp Ile
820 825 830
Thr Tyr Arg Glu Tyr Leu Glu Asn Met Asn Asp Lys Arg Pro Pro Arg
835 840 845
Ile Ile Lys Thr Ile Ala Ser Lys Thr Gln Ser Ile Lys Lys Tyr Ser
850 855 860
Thr Asp Ile Leu Gly Asn Leu Tyr Glu Val Lys Ser Lys Lys His Pro
865 870 875 880
Gln Ile Ile Lys Lys Gly
885
<210> 145
<211> 892
<212> PRT
<213> 人工序列
<220>
<223> 带有GGSGGS连接物的氨基酸残基(dSaCas9的第482至第648位氨基酸残基)缺失突变体
<220>
<221> 变体
<222> (10)..(10)
<223> 将Asp残基转变成Ala残基
<220>
<221> MISC_FEATURE
<222> (482)..(487)
<223> GGSGGS连接物
<400> 145
Met Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
1 5 10 15
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
20 25 30
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
35 40 45
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
50 55 60
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
65 70 75 80
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
85 90 95
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
100 105 110
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
115 120 125
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
130 135 140
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
145 150 155 160
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
165 170 175
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
180 185 190
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
195 200 205
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
210 215 220
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
225 230 235 240
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
245 250 255
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
260 265 270
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
275 280 285
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
290 295 300
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
305 310 315 320
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
325 330 335
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
340 345 350
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
355 360 365
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
370 375 380
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
385 390 395 400
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
405 410 415
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
420 425 430
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
435 440 445
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
450 455 460
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
465 470 475 480
Glu Gly Gly Ser Gly Gly Ser Thr Arg Tyr Ala Thr Arg Gly Leu Met
485 490 495
Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys Val
500 505 510
Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp Lys
515 520 525
Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp Ala
530 535 540
Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys Leu
545 550 555 560
Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys Gln
565 570 575
Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu Ile
580 585 590
Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp Tyr
595 600 605
Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile Asn
610 615 620
Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu Ile
625 630 635 640
Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu Lys
645 650 655
Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His Asp
660 665 670
Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly Asp
675 680 685
Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr Leu
690 695 700
Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile Lys
705 710 715 720
Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp Tyr
725 730 735
Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr Arg
740 745 750
Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val Lys
755 760 765
Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser Lys
770 775 780
Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala Glu
785 790 795 800
Phe Ile Ala Ser Phe Tyr Asn Asn Asp Leu Ile Lys Ile Asn Gly Glu
805 810 815
Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile Glu
820 825 830
Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn Met Asn
835 840 845
Asp Lys Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser Lys Thr Gln
850 855 860
Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn Leu Tyr Glu Val
865 870 875 880
Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys Gly
885 890
<210> 146
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<220>
<221> misc_feature
<222> (1)..(21)
<223> 对应于靶向序列(SEQ ID NO:5)的crRNA
<400> 146
cccagucgag gccaaagaag a 21
<210> 147
<211> 21
<212> DNA
<213> 智人(Homo sapiens)
<220>
<221> misc_feature
<222> (1)..(21)
<223> 与靶向序列(SEQ ID NO:5)互补的序列
<400> 147
tcttctttgg cctcgactgg g 21
<210> 148
<211> 19
<212> DNA
<213> 新凶手弗朗西丝菌(Francisella novicid)
<220>
<221> misc_feature
<223> crRNA的5'-柄
<400> 148
aatttctact gttgtagat 19
<210> 149
<211> 83
<212> DNA
<213> 金黄色葡萄球菌(Staphylococcus aureus)
<220>
<221> misc_feature
<222> (1)..(83)
<223> 编码tracrRNA的序列
<400> 149
gttttagtac tctggaaaca gaatctacta aaacaaggca aaatgccgtg tttatctcgt 60
caacttgttg gcgagatttt ttt 83
<210> 150
<211> 76
<212> DNA
<213> 金黄色葡萄球菌(Staphylococcus aureus)
<220>
<221> misc_feature
<222> (1)..(76)
<223> SaCas9 gRNA支架序列
<400> 150
gttttagtac tctggaaaca gaatctacta aaacaaggca aaatgccgtg tttatctcgt 60
caacttgttg gcgaga 76
<210> 151
<211> 3477
<212> DNA
<213> 人工序列
<220>
<223> 融合到KRAB的dSaCas9(DNA)
<400> 151
atggccccaa agaagaagcg gaaggtcggt atccacggag tcccagcagc caagcggaac 60
tacatcctgg gcctggccat cggcatcacc agcgtgggct acggcatcat cgactacgag 120
acacgggacg tgatcgatgc cggcgtgcgg ctgttcaaag aggccaacgt ggaaaacaac 180
gagggcaggc ggagcaagag aggcgccaga aggctgaagc ggcggaggcg gcatagaatc 240
cagagagtga agaagctgct gttcgactac aacctgctga ccgaccacag cgagctgagc 300
ggcatcaacc cctacgaggc cagagtgaag ggcctgagcc agaagctgag cgaggaagag 360
ttctctgccg ccctgctgca cctggccaag agaagaggcg tgcacaacgt gaacgaggtg 420
gaagaggaca ccggcaacga gctgtccacc aaagagcaga tcagccggaa cagcaaggcc 480
ctggaagaga aatacgtggc cgaactgcag ctggaacggc tgaagaaaga cggcgaagtg 540
cggggcagca tcaacagatt caagaccagc gactacgtga aagaagccaa acagctgctg 600
aaggtgcaga aggcctacca ccagctggac cagagcttca tcgacaccta catcgacctg 660
ctggaaaccc ggcggaccta ctatgaggga cctggcgagg gcagcccctt cggctggaag 720
gacatcaaag aatggtacga gatgctgatg ggccactgca cctacttccc cgaggaactg 780
cggagcgtga agtacgccta caacgccgac ctgtacaacg ccctgaacga cctgaacaat 840
ctcgtgatca ccagggacga gaacgagaag ctggaatatt acgagaagtt ccagatcatc 900
gagaacgtgt tcaagcagaa gaagaagccc accctgaagc agatcgccaa agaaatcctc 960
gtgaacgaag aggatattaa gggctacaga gtgaccagca ccggcaagcc cgagttcacc 1020
aacctgaagg tgtaccacga catcaaggac attaccgccc ggaaagagat tattgagaac 1080
gccgagctgc tggatcagat tgccaagatc ctgaccatct accagagcag cgaggacatc 1140
caggaagaac tgaccaatct gaactccgag ctgacccagg aagagatcga gcagatctct 1200
aatctgaagg gctataccgg cacccacaac ctgagcctga aggccatcaa cctgatcctg 1260
gacgagctgt ggcacaccaa cgacaaccag atcgctatct tcaaccggct gaagctggtg 1320
cccaagaagg tggacctgtc ccagcagaaa gagatcccca ccaccctggt ggacgacttc 1380
atcctgagcc ccgtcgtgaa gagaagcttc atccagagca tcaaagtgat caacgccatc 1440
atcaagaagt acggcctgcc caacgacatc attatcgagc tggcccgcga gaagaactcc 1500
aaggacgccc agaaaatgat caacgagatg cagaagcgga accggcagac caacgagcgg 1560
atcgaggaaa tcatccggac caccggcaaa gagaacgcca agtacctgat cgagaagatc 1620
aagctgcacg acatgcagga aggcaagtgc ctgtacagcc tggaagccat ccctctggaa 1680
gatctgctga acaacccctt caactatgag gtggaccaca tcatccccag aagcgtgtcc 1740
ttcgacaaca gcttcaacaa caaggtgctc gtgaagcagg aagaagccag caagaagggc 1800
aaccggaccc cattccagta cctgagcagc agcgacagca agatcagcta cgaaaccttc 1860
aagaagcaca tcctgaatct ggccaagggc aagggcagaa tcagcaagac caagaaagag 1920
tatctgctgg aagaacggga catcaacagg ttctccgtgc agaaagactt catcaaccgg 1980
aacctggtgg ataccagata cgccaccaga ggcctgatga acctgctgcg gagctacttc 2040
agagtgaaca acctggacgt gaaagtgaag tccatcaatg gcggcttcac cagctttctg 2100
cggcggaagt ggaagtttaa gaaagagcgg aacaaggggt acaagcacca cgccgaggac 2160
gccctgatca ttgccaacgc cgatttcatc ttcaaagagt ggaagaaact ggacaaggcc 2220
aaaaaagtga tggaaaacca gatgttcgag gaaaagcagg ccgagagcat gcccgagatc 2280
gaaaccgagc aggagtacaa agagatcttc atcacccccc accagatcaa gcacattaag 2340
gacttcaagg actacaagta cagccaccgg gtggacaaga agcctaatag agagctgatt 2400
aacgacaccc tgtactccac ccggaaggac gacaagggca acaccctgat cgtgaacaat 2460
ctgaacggcc tgtacgacaa ggacaatgac aagctgaaaa agctgatcaa caagagcccc 2520
gaaaagctgc tgatgtacca ccacgacccc cagacctacc agaaactgaa gctgattatg 2580
gaacagtacg gcgacgagaa gaatcccctg tacaagtact acgaggaaac cgggaactac 2640
ctgaccaagt actccaaaaa ggacaacggc cccgtgatca agaagattaa gtattacggc 2700
aacaaactga acgcccatct ggacatcacc gacgactacc ccaacagcag aaacaaggtc 2760
gtgaagctgt ccctgaagcc ctacagattc gacgtgtacc tggacaatgg cgtgtacaag 2820
ttcgtgaccg tgaagaatct ggatgtgatc aaaaaagaaa actactacga agtgaatagc 2880
aagtgctatg aggaagctaa gaagctgaag aagatcagca accaggccga gtttatcgcc 2940
tccttctaca acaacgatct gatcaagatc aacggcgagc tgtatagagt gatcggcgtg 3000
aacaacgacc tgctgaaccg gatcgaagtg aacatgatcg acatcaccta ccgcgagtac 3060
ctggaaaaca tgaacgacaa gaggcccccc aggatcatta agacaatcgc ctccaagacc 3120
cagagcatta agaagtacag cacagacatt ctgggcaacc tgtatgaagt gaaatctaag 3180
aagcaccctc agatcatcaa aaagggcaaa aggccggcgg ccacgaaaaa ggccggccag 3240
gcaaaaaaga aaaagggatc catggatgct aagtcactaa ctgcctggtc ccggacactg 3300
gtgaccttca aggatgtatt tgtggacttc accagggagg agtggaagct gctggacact 3360
gctcagcaga tcgtgtacag aaatgtgatg ctggagaact ataagaacct ggtttccttg 3420
ggttatcagc ttactaagcc agatgtgatc ctccggttgg agaagggaga agagccc 3477
<210> 152
<211> 1159
<212> PRT
<213> 人工序列
<220>
<223> 融合到KRAB的dSaCas9(蛋白质)
<400> 152
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
20 25 30
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
35 40 45
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
50 55 60
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
65 70 75 80
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
85 90 95
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
100 105 110
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
115 120 125
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
130 135 140
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
145 150 155 160
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
165 170 175
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
180 185 190
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
195 200 205
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
210 215 220
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
225 230 235 240
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
245 250 255
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
260 265 270
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
275 280 285
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
290 295 300
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
305 310 315 320
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
325 330 335
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
340 345 350
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
355 360 365
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
370 375 380
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
385 390 395 400
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
405 410 415
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
420 425 430
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
435 440 445
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
450 455 460
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
465 470 475 480
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
485 490 495
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
500 505 510
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
515 520 525
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
530 535 540
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
545 550 555 560
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
565 570 575
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
580 585 590
Gln Glu Glu Ala Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
595 600 605
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
610 615 620
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
625 630 635 640
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
645 650 655
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
660 665 670
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
675 680 685
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
690 695 700
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
705 710 715 720
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
725 730 735
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
740 745 750
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
755 760 765
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
770 775 780
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Glu Leu Ile
785 790 795 800
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
805 810 815
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
820 825 830
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
835 840 845
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
850 855 860
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
865 870 875 880
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
885 890 895
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
900 905 910
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
915 920 925
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
930 935 940
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
945 950 955 960
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
965 970 975
Glu Phe Ile Ala Ser Phe Tyr Asn Asn Asp Leu Ile Lys Ile Asn Gly
980 985 990
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile
995 1000 1005
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn
1010 1015 1020
Met Asn Asp Lys Arg Pro Pro Arg Ile Ile Lys Thr Ile Ala Ser
1025 1030 1035
Lys Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn
1040 1045 1050
Leu Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys
1055 1060 1065
Gly Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys
1070 1075 1080
Lys Lys Gly Ser Met Asp Ala Lys Ser Leu Thr Ala Trp Ser Arg
1085 1090 1095
Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg Glu
1100 1105 1110
Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Val Tyr Arg Asn
1115 1120 1125
Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln
1130 1135 1140
Leu Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu
1145 1150 1155
Pro
<210> 153
<211> 399
<212> DNA
<213> 人工序列
<220>
<223> blastR表达盒(DNA)
<400> 153
atggccaagc ctttgtctca agaagaatcc accctcattg aaagagcaac ggctacaatc 60
aacagcatcc ccatctctga agactacagc gtcgccagcg cagctctctc tagcgacggc 120
cgcatcttca ctggtgtcaa tgtatatcat tttactgggg gaccttgtgc agaactcgtg 180
gtgctgggca ctgctgctgc tgcggcagct ggcaacctga cttgtatcgt cgcgatcgga 240
aatgagaaca ggggcatctt gagcccctgc ggacggtgcc gacaggtgct tctcgatctg 300
catcctggga tcaaagccat agtgaaggac agtgatggac agccgacggc agttgggatt 360
cgtgaattgc tgccctctgg ttatgtgtgg gagggctaa 399
<210> 154
<211> 132
<212> PRT
<213> 人工序列
<220>
<223> blastR表达盒(蛋白质)
<400> 154
Met Ala Lys Pro Leu Ser Gln Glu Glu Ser Thr Leu Ile Glu Arg Ala
1 5 10 15
Thr Ala Thr Ile Asn Ser Ile Pro Ile Ser Glu Asp Tyr Ser Val Ala
20 25 30
Ser Ala Ala Leu Ser Ser Asp Gly Arg Ile Phe Thr Gly Val Asn Val
35 40 45
Tyr His Phe Thr Gly Gly Pro Cys Ala Glu Leu Val Val Leu Gly Thr
50 55 60
Ala Ala Ala Ala Ala Ala Gly Asn Leu Thr Cys Ile Val Ala Ile Gly
65 70 75 80
Asn Glu Asn Arg Gly Ile Leu Ser Pro Cys Gly Arg Cys Arg Gln Val
85 90 95
Leu Leu Asp Leu His Pro Gly Ile Lys Ala Ile Val Lys Asp Ser Asp
100 105 110
Gly Gln Pro Thr Ala Val Gly Ile Arg Glu Leu Leu Pro Ser Gly Tyr
115 120 125
Val Trp Glu Gly
130
<210> 155
<211> 241
<212> DNA
<213> 人工序列
<220>
<223> U6启动子
<400> 155
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattagaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
c 241
<210> 156
<211> 82
<212> RNA
<213> 金黄色葡萄球菌(Staphylococcus aureus)
<220>
<221> misc_feature
<222> (1)..(82)
<223> tracrRNA
<400> 156
guuuuaguac ucuggaaaca gaaucuacua aaacaaggca aaaugccgug uuuaucacgu 60
caacuuguug gcgagauuuu uu 82
<210> 157
<211> 19
<212> RNA
<213> 智人(Homo sapiens)
<400> 157
gugguggggg aaaagaacc 19
<210> 158
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 158
agagagaagg ggagacagac a 21
<210> 159
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 159
gggugugcag gaugguuagg g 21
<210> 160
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 160
aaguggaggc aggauggaga g 21
<210> 161
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 161
ggaggcagga uggagagagg g 21
<210> 162
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 162
gcaggguaaa cugagaccgg g 21
<210> 163
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 163
gacauaugag ggccagaggg g 21
<210> 164
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 164
agaaaccagu gaccagugag c 21
<210> 165
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 165
gccuggggcc agggcuggaa g 21
<210> 166
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 166
ucagccacca uucccgccac c 21
<210> 167
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 167
gucucgguuc cuaugagccg u 21
<210> 168
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 168
ugugacucag agccauggcu u 21
<210> 169
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 169
ugugacuccu aagccauggc u 21
<210> 170
<211> 19
<212> RNA
<213> 智人(Homo sapiens)
<400> 170
ugacucagag ccauggcuu 19
<210> 171
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 171
cagaguaagg ucagcagagg c 21
<210> 172
<211> 19
<212> RNA
<213> 智人(Homo sapiens)
<400> 172
gggcuccugc agauggggu 19
<210> 173
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 173
uggccgacuu cuugcagugg g 21
<210> 174
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 174
ggggcuccag ccccaggaag c 21
<210> 175
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 175
auucccccua cucuagcacu g 21
<210> 176
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 176
agcuucuugg gugacucaga g 21
<210> 177
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 177
cugagggcua aauuuaacug u 21
<210> 178
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 178
ggucaccccu guucaggcuc u 21
<210> 179
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 179
guuucuaauc ccagccuggg c 21
<210> 180
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 180
auuagaaaca gaaacauuuc g 21
<210> 181
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 181
acagaaacau uucggggggu g 21
<210> 182
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 182
ccugcggcuc cgagagccaa g 21
<210> 183
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 183
gaaacuagga ggcaaggacc g 21
<210> 184
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 184
gcaggaugcu cuucucccca a 21
<210> 185
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 185
acagacaaua gcaagggcag c 21
<210> 186
<211> 21
<212> RNA
<213> 智人(Homo sapiens)
<400> 186
ucacaugcug gggacaggga u 21
<210> 187
<211> 450
<212> DNA
<213> 人工序列
<220>
<223> CK8启动子
<400> 187
ctagactagc atgctgccca tgtaaggagg caaggcctgg ggacacccga gatgcctggt 60
tataattaac ccagacatgt ggctgccccc ccccccccaa cacctgctgc ctctaaaaat 120
aaccctgcat gccatgttcc cggcgaaggg ccagctgtcc cccgccagct agactcagca 180
cttagtttag gaaccagtga gcaagtcagc ccttggggca gcccatacaa ggccatgggg 240
ctgggcaagc tgcacgcctg ggtccggggt gggcacggtg cccgggcaac gagctgaaag 300
ctcatctgct ctcaggggcc cctccctggg gacagcccct cctggctagt cacaccctgt 360
aggctcctct atataaccca ggggcacagg ggctgccctc attctaccac cacctccaca 420
gcacagacag acactcagga gccagccagc 450
<210> 188
<211> 16
<212> PRT
<213> 人工序列
<220>
<223> 核定位信号
<400> 188
Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala Ala
1 5 10 15
<210> 189
<211> 48
<212> DNA
<213> 人工序列
<220>
<223> 编码SEQ ID NO 188的NLS的DNA序列
<400> 189
gccccaaaga agaagcggaa ggtcggtatc cacggagtcc cagcagcc 48
<210> 190
<211> 16
<212> PRT
<213> 人工序列
<220>
<223> 核定位信号
<400> 190
Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys
1 5 10 15
<210> 191
<211> 48
<212> DNA
<213> 人工序列
<220>
<223> 编码SEQ ID NO 190的NLS的DNA序列
<400> 191
aaaaggccgg cggccacgaa aaaggccggc caggcaaaaa agaaaaag 48
<210> 192
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> DMD外显子78正向引物
<400> 192
ttagaggagg tgatggagca 20
<210> 193
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> DMD外显子78反向引物
<400> 193
gatactaagg actccatcgc 20
<210> 194
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> MBNL1外显子7正向引物
<400> 194
gctgcccaat accaggtcaa c 21
<210> 195
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> MBNL1外显子7反向引物
<400> 195
tggtgggaga aatgctgtat gc 22
<210> 196
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> KIF13A外显子21正向引物
<400> 196
acctgtgcag cattcaggga cac 23
<210> 197
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> KIF13A外显子21反向引物
<400> 197
ctcgtcgttt aatgagtgca tctg 24
<210> 198
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> TNNT2外显子5正向引物
<400> 198
atagaagagg tggtggaaga gtac 24
<210> 199
<211> 24
<212> DNA
<213> 人工序列
<220>
<223> TNNT2外显子5反向引物
<400> 199
gtctcagcct ctgcttcagc atcc 24
<210> 200
<211> 3480
<212> DNA
<213> 人工序列
<220>
<223> 带有末端终止密码子的融合到KRAB的dSaCas9(DNA)
<400> 200
atggccccaa agaagaagcg gaaggtcggt atccacggag tcccagcagc caagcggaac 60
tacatcctgg gcctggccat cggcatcacc agcgtgggct acggcatcat cgactacgag 120
acacgggacg tgatcgatgc cggcgtgcgg ctgttcaaag aggccaacgt ggaaaacaac 180
gagggcaggc ggagcaagag aggcgccaga aggctgaagc ggcggaggcg gcatagaatc 240
cagagagtga agaagctgct gttcgactac aacctgctga ccgaccacag cgagctgagc 300
ggcatcaacc cctacgaggc cagagtgaag ggcctgagcc agaagctgag cgaggaagag 360
ttctctgccg ccctgctgca cctggccaag agaagaggcg tgcacaacgt gaacgaggtg 420
gaagaggaca ccggcaacga gctgtccacc aaagagcaga tcagccggaa cagcaaggcc 480
ctggaagaga aatacgtggc cgaactgcag ctggaacggc tgaagaaaga cggcgaagtg 540
cggggcagca tcaacagatt caagaccagc gactacgtga aagaagccaa acagctgctg 600
aaggtgcaga aggcctacca ccagctggac cagagcttca tcgacaccta catcgacctg 660
ctggaaaccc ggcggaccta ctatgaggga cctggcgagg gcagcccctt cggctggaag 720
gacatcaaag aatggtacga gatgctgatg ggccactgca cctacttccc cgaggaactg 780
cggagcgtga agtacgccta caacgccgac ctgtacaacg ccctgaacga cctgaacaat 840
ctcgtgatca ccagggacga gaacgagaag ctggaatatt acgagaagtt ccagatcatc 900
gagaacgtgt tcaagcagaa gaagaagccc accctgaagc agatcgccaa agaaatcctc 960
gtgaacgaag aggatattaa gggctacaga gtgaccagca ccggcaagcc cgagttcacc 1020
aacctgaagg tgtaccacga catcaaggac attaccgccc ggaaagagat tattgagaac 1080
gccgagctgc tggatcagat tgccaagatc ctgaccatct accagagcag cgaggacatc 1140
caggaagaac tgaccaatct gaactccgag ctgacccagg aagagatcga gcagatctct 1200
aatctgaagg gctataccgg cacccacaac ctgagcctga aggccatcaa cctgatcctg 1260
gacgagctgt ggcacaccaa cgacaaccag atcgctatct tcaaccggct gaagctggtg 1320
cccaagaagg tggacctgtc ccagcagaaa gagatcccca ccaccctggt ggacgacttc 1380
atcctgagcc ccgtcgtgaa gagaagcttc atccagagca tcaaagtgat caacgccatc 1440
atcaagaagt acggcctgcc caacgacatc attatcgagc tggcccgcga gaagaactcc 1500
aaggacgccc agaaaatgat caacgagatg cagaagcgga accggcagac caacgagcgg 1560
atcgaggaaa tcatccggac caccggcaaa gagaacgcca agtacctgat cgagaagatc 1620
aagctgcacg acatgcagga aggcaagtgc ctgtacagcc tggaagccat ccctctggaa 1680
gatctgctga acaacccctt caactatgag gtggaccaca tcatccccag aagcgtgtcc 1740
ttcgacaaca gcttcaacaa caaggtgctc gtgaagcagg aagaagccag caagaagggc 1800
aaccggaccc cattccagta cctgagcagc agcgacagca agatcagcta cgaaaccttc 1860
aagaagcaca tcctgaatct ggccaagggc aagggcagaa tcagcaagac caagaaagag 1920
tatctgctgg aagaacggga catcaacagg ttctccgtgc agaaagactt catcaaccgg 1980
aacctggtgg ataccagata cgccaccaga ggcctgatga acctgctgcg gagctacttc 2040
agagtgaaca acctggacgt gaaagtgaag tccatcaatg gcggcttcac cagctttctg 2100
cggcggaagt ggaagtttaa gaaagagcgg aacaaggggt acaagcacca cgccgaggac 2160
gccctgatca ttgccaacgc cgatttcatc ttcaaagagt ggaagaaact ggacaaggcc 2220
aaaaaagtga tggaaaacca gatgttcgag gaaaagcagg ccgagagcat gcccgagatc 2280
gaaaccgagc aggagtacaa agagatcttc atcacccccc accagatcaa gcacattaag 2340
gacttcaagg actacaagta cagccaccgg gtggacaaga agcctaatag agagctgatt 2400
aacgacaccc tgtactccac ccggaaggac gacaagggca acaccctgat cgtgaacaat 2460
ctgaacggcc tgtacgacaa ggacaatgac aagctgaaaa agctgatcaa caagagcccc 2520
gaaaagctgc tgatgtacca ccacgacccc cagacctacc agaaactgaa gctgattatg 2580
gaacagtacg gcgacgagaa gaatcccctg tacaagtact acgaggaaac cgggaactac 2640
ctgaccaagt actccaaaaa ggacaacggc cccgtgatca agaagattaa gtattacggc 2700
aacaaactga acgcccatct ggacatcacc gacgactacc ccaacagcag aaacaaggtc 2760
gtgaagctgt ccctgaagcc ctacagattc gacgtgtacc tggacaatgg cgtgtacaag 2820
ttcgtgaccg tgaagaatct ggatgtgatc aaaaaagaaa actactacga agtgaatagc 2880
aagtgctatg aggaagctaa gaagctgaag aagatcagca accaggccga gtttatcgcc 2940
tccttctaca acaacgatct gatcaagatc aacggcgagc tgtatagagt gatcggcgtg 3000
aacaacgacc tgctgaaccg gatcgaagtg aacatgatcg acatcaccta ccgcgagtac 3060
ctggaaaaca tgaacgacaa gaggcccccc aggatcatta agacaatcgc ctccaagacc 3120
cagagcatta agaagtacag cacagacatt ctgggcaacc tgtatgaagt gaaatctaag 3180
aagcaccctc agatcatcaa aaagggcaaa aggccggcgg ccacgaaaaa ggccggccag 3240
gcaaaaaaga aaaagggatc catggatgct aagtcactaa ctgcctggtc ccggacactg 3300
gtgaccttca aggatgtatt tgtggacttc accagggagg agtggaagct gctggacact 3360
gctcagcaga tcgtgtacag aaatgtgatg ctggagaact ataagaacct ggtttccttg 3420
ggttatcagc ttactaagcc agatgtgatc ctccggttgg agaagggaga agagccctga 3480
<210> 201
<211> 208
<212> DNA
<213> 人工序列
<220>
<223> b珠蛋白polyA序列
<400> 201
ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 60
tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc 120
tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt 180
gggaagagaa tagcaggcat gctgggga 208
<210> 202
<211> 242
<212> DNA
<213> 人工序列
<220>
<223> U6聚合物
<400> 202
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cg 242
<210> 203
<211> 8241
<212> DNA
<213> 人工序列
<220>
<223> pED0001
<400> 203
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 60
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 120
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 180
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 240
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 300
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 360
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 420
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 480
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 540
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 600
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 660
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 720
gttcctggcc ttttgctggc cttttgctca catgtcctgc aggcagctgc gcgctcgctc 780
gctcactgag gccgcccggg cgtcgggcga cctttggtcg cccggcctca gtgagcgagc 840
gagcgcgcag agagggagtg gccaactcca tcactagggg ttcctgcggc ctctagactc 900
gaggcgttga cattgattat tgactagtta ttaatagtaa tcaattacgg ggtcattagt 960
tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg 1020
accgcccaac gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc 1080
aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc 1140
agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg 1200
gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat 1260
ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg 1320
tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag 1380
tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt 1440
gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat ataagcagag ctctctggct 1500
aactaccggt gccaccatgg ccccaaagaa gaagcggaag gtcggtatcc acggagtccc 1560
agcagccaag cggaactaca tcctgggcct ggccatcggc atcaccagcg tgggctacgg 1620
catcatcgac tacgagacac gggacgtgat cgatgccggc gtgcggctgt tcaaagaggc 1680
caacgtggaa aacaacgagg gcaggcggag caagagaggc gccagaaggc tgaagcggcg 1740
gaggcggcat agaatccaga gagtgaagaa gctgctgttc gactacaacc tgctgaccga 1800
ccacagcgag ctgagcggca tcaaccccta cgaggccaga gtgaagggcc tgagccagaa 1860
gctgagcgag gaagagttct ctgccgccct gctgcacctg gccaagagaa gaggcgtgca 1920
caacgtgaac gaggtggaag aggacaccgg caacgagctg tccaccaaag agcagatcag 1980
ccggaacagc aaggccctgg aagagaaata cgtggccgaa ctgcagctgg aacggctgaa 2040
gaaagacggc gaagtgcggg gcagcatcaa cagattcaag accagcgact acgtgaaaga 2100
agccaaacag ctgctgaagg tgcagaaggc ctaccaccag ctggaccaga gcttcatcga 2160
cacctacatc gacctgctgg aaacccggcg gacctactat gagggacctg gcgagggcag 2220
ccccttcggc tggaaggaca tcaaagaatg gtacgagatg ctgatgggcc actgcaccta 2280
cttccccgag gaactgcgga gcgtgaagta cgcctacaac gccgacctgt acaacgccct 2340
gaacgacctg aacaatctcg tgatcaccag ggacgagaac gagaagctgg aatattacga 2400
gaagttccag atcatcgaga acgtgttcaa gcagaagaag aagcccaccc tgaagcagat 2460
cgccaaagaa atcctcgtga acgaagagga tattaagggc tacagagtga ccagcaccgg 2520
caagcccgag ttcaccaacc tgaaggtgta ccacgacatc aaggacatta ccgcccggaa 2580
agagattatt gagaacgccg agctgctgga tcagattgcc aagatcctga ccatctacca 2640
gagcagcgag gacatccagg aagaactgac caatctgaac tccgagctga cccaggaaga 2700
gatcgagcag atctctaatc tgaagggcta taccggcacc cacaacctga gcctgaaggc 2760
catcaacctg atcctggacg agctgtggca caccaacgac aaccagatcg ctatcttcaa 2820
ccggctgaag ctggtgccca agaaggtgga cctgtcccag cagaaagaga tccccaccac 2880
cctggtggac gacttcatcc tgagccccgt cgtgaagaga agcttcatcc agagcatcaa 2940
agtgatcaac gccatcatca agaagtacgg cctgcccaac gacatcatta tcgagctggc 3000
ccgcgagaag aactccaagg acgcccagaa aatgatcaac gagatgcaga agcggaaccg 3060
gcagaccaac gagcggatcg aggaaatcat ccggaccacc ggcaaagaga acgccaagta 3120
cctgatcgag aagatcaagc tgcacgacat gcaggaaggc aagtgcctgt acagcctgga 3180
agccatccct ctggaagatc tgctgaacaa ccccttcaac tatgaggtgg accacatcat 3240
ccccagaagc gtgtccttcg acaacagctt caacaacaag gtgctcgtga agcaggaaga 3300
agccagcaag aagggcaacc ggaccccatt ccagtacctg agcagcagcg acagcaagat 3360
cagctacgaa accttcaaga agcacatcct gaatctggcc aagggcaagg gcagaatcag 3420
caagaccaag aaagagtatc tgctggaaga acgggacatc aacaggttct ccgtgcagaa 3480
agacttcatc aaccggaacc tggtggatac cagatacgcc accagaggcc tgatgaacct 3540
gctgcggagc tacttcagag tgaacaacct ggacgtgaaa gtgaagtcca tcaatggcgg 3600
cttcaccagc tttctgcggc ggaagtggaa gtttaagaaa gagcggaaca aggggtacaa 3660
gcaccacgcc gaggacgccc tgatcattgc caacgccgat ttcatcttca aagagtggaa 3720
gaaactggac aaggccaaaa aagtgatgga aaaccagatg ttcgaggaaa agcaggccga 3780
gagcatgccc gagatcgaaa ccgagcagga gtacaaagag atcttcatca ccccccacca 3840
gatcaagcac attaaggact tcaaggacta caagtacagc caccgggtgg acaagaagcc 3900
taatagagag ctgattaacg acaccctgta ctccacccgg aaggacgaca agggcaacac 3960
cctgatcgtg aacaatctga acggcctgta cgacaaggac aatgacaagc tgaaaaagct 4020
gatcaacaag agccccgaaa agctgctgat gtaccaccac gacccccaga cctaccagaa 4080
actgaagctg attatggaac agtacggcga cgagaagaat cccctgtaca agtactacga 4140
ggaaaccggg aactacctga ccaagtactc caaaaaggac aacggccccg tgatcaagaa 4200
gattaagtat tacggcaaca aactgaacgc ccatctggac atcaccgacg actaccccaa 4260
cagcagaaac aaggtcgtga agctgtccct gaagccctac agattcgacg tgtacctgga 4320
caatggcgtg tacaagttcg tgaccgtgaa gaatctggat gtgatcaaaa aagaaaacta 4380
ctacgaagtg aatagcaagt gctatgagga agctaagaag ctgaagaaga tcagcaacca 4440
ggccgagttt atcgcctcct tctacaacaa cgatctgatc aagatcaacg gcgagctgta 4500
tagagtgatc ggcgtgaaca acgacctgct gaaccggatc gaagtgaaca tgatcgacat 4560
cacctaccgc gagtacctgg aaaacatgaa cgacaagagg ccccccagga tcattaagac 4620
aatcgcctcc aagacccaga gcattaagaa gtacagcaca gacattctgg gcaacctgta 4680
tgaagtgaaa tctaagaagc accctcagat catcaaaaag ggcaaaaggc cggcggccac 4740
gaaaaaggcc ggccaggcaa aaaagaaaaa gggatccatg gatgctaagt cactaactgc 4800
ctggtcccgg acactggtga ccttcaagga tgtatttgtg gacttcacca gggaggagtg 4860
gaagctgctg gacactgctc agcagatcgt gtacagaaat gtgatgctgg agaactataa 4920
gaacctggtt tccttgggtt atcagcttac taagccagat gtgatcctcc ggttggagaa 4980
gggagaagag cccggaagcg gtgctactaa cttcagcctg ctgaagcagg ctggagacgt 5040
ggaggagaac cctggaccta ccgagtacaa gcccacggtg cgcctcgcca cccgcgacga 5100
cgtccccagg gccgtacgca ccctcgccgc cgcgttcgcc gactaccccg ccacgcgcca 5160
caccgtcgat ccggaccgcc acatcgagcg ggtcaccgag ctgcaagaac tcttcctcac 5220
gcgcgtcggg ctcgacatcg gcaaggtgtg ggtcgcggac gacggcgccg cggtggcggt 5280
ctggaccacg ccggagagcg tcgaagcggg ggcggtgttc gccgagatcg gcccgcgcat 5340
ggccgagttg agcggttccc ggctggccgc gcagcaacag atggaaggcc tcctggcgcc 5400
gcaccggccc aaggagcccg cgtggttcct ggccaccgtc ggagtctcgc ccgaccacca 5460
gggcaagggt ctgggcagcg ccgtcgtgct ccccggagtg gaggcggccg agcgcgccgg 5520
ggtgcccgcc ttcctggaaa cctccgcgcc ccgcaacctc cccttctacg agcggctcgg 5580
cttcaccgtc accgccgacg tcgaggtgcc cgaaggaccg cgcacctggt gcatgacccg 5640
caagcccggt gcctgagaat tcctagagct cgctgatcag cctcgactgt gccttctagt 5700
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 5760
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 5820
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agagaatagc 5880
aggcatgctg gggaggtacc gagggcctat ttcccatgat tccttcatat ttgcatatac 5940
gatacaaggc tgttagagag ataattggaa ttaatttgac tgtaaacaca aagatattag 6000
tacaaaatac gtgacgtaga aagtaataat ttcttgggta gtttgcagtt ttaaaattat 6060
gttttaaaat ggactatcat atgcttaccg taacttgaaa gtatttcgat ttcttggctt 6120
tatatatctt gtggaaagga cgaaacaccg gagaccacgg caggtctcag ttttagtact 6180
ctggaaacag aatctactaa aacaaggcaa aatgccgtgt ttatctcgtc aacttgttgg 6240
cgagattttt gcggccgcag gaacccctag tgatggagtt ggccactccc tctctgcgcg 6300
ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg acgcccgggc tttgcccggg 6360
cggcctcagt gagcgagcga gcgcgcagct gcctgcaggg gcgcctgatg cggtattttc 6420
tccttacgca tctgtgcggt atttcacacc gcatacgtca aagcaaccat agtacgcgcc 6480
ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact 6540
tgccagcgcc ttagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc 6600
cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat ttagtgcttt 6660
acggcacctc gaccccaaaa aacttgattt gggtgatggt tcacgtagtg ggccatcgcc 6720
ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt 6780
gttccaaact ggaacaacac tcaactctat ctcgggctat tcttttgatt tataagggat 6840
tttgccgatt tcggtctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa 6900
ttttaacaaa atattaacgt ttacaatttt atggtgcact ctcagtacaa tctgctctga 6960
tgccgcatag ttaagccagc cccgacaccc gccaacaccc gctgacgcgc cctgacgggc 7020
ttgtctgctc ccggcatccg cttacagaca agctgtgacc gtctccggga gctgcatgtg 7080
tcagaggttt tcaccgtcat caccgaaacg cgcgagacga aagggcctcg tgatacgcct 7140
atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg 7200
gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc 7260
gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag 7320
tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt 7380
tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 7440
gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga 7500
acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat 7560
tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga 7620
gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag 7680
tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg 7740
accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg 7800
ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt 7860
agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg 7920
gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc 7980
ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg gaagccgcgg 8040
tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac 8100
ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact 8160
gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa 8220
acttcatttt taatttaaaa g 8241
<210> 204
<211> 282
<212> DNA
<213> 人工序列
<220>
<223> EFS启动子
<400> 204
ctagctaggt cttgaaagga gtgggaattg gctccggtgc ccgtcagtgg gcagagcgca 60
catcgcccac agtccccgag aagttggggg gaggggtcgg caattgatcc ggtgcctaga 120
gaaggtggcg cggggtaaac tgggaaagtg atgtcgtgta ctggctccgc ctttttcccg 180
agggtggggg agaaccgtat ataagtgcag tagtcgccgt gaacgttctt tttcgcaacg 240
ggtttgccgc cagaacacag gaccggttct agagcgctgc ca 282
Claims (26)
1.一种多核苷酸,其包含下述碱基序列:
(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和
(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQ IDNO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:129、SEQ ID NO:130、SEQ ID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:133、SEQ ID NO:137、SEQ IDNO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域。
2.根据权利要求1所述的多核苷酸,其包含下述碱基序列:
(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和
(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQ IDNO:127、SEQ ID NO:46、SEQ ID NO:128、SEQ ID NO:66、SEQ ID NO:68、SEQ ID NO:130、SEQID NO:131、SEQ ID NO:132、SEQ ID NO:88、SEQ ID NO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:134、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ IDNO:117或SEQ ID NO:119所示的区域中长度为18至24个核苷酸的连续区域。
3.根据权利要求1或2所述的多核苷酸,其包含下述碱基序列:
(a)编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列,和
(b)编码向导RNA的碱基序列,所述向导RNA靶向人类DMPK基因的表达调控区中SEQ IDNO:63、SEQ ID NO:136、SEQ ID NO:83、SEQ ID NO:99、SEQ ID NO:135、SEQ ID NO:109或SEQ ID NO:111所示的区域中长度为18至24个核苷酸的连续区域。
4.根据权利要求1所述的多核苷酸,其中所述编码向导RNA的碱基序列包含SEQ ID NO:43、SEQ ID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ IDNO:68、SEQ ID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ IDNO:81、SEQ ID NO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ IDNO:91、SEQ ID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:43、SEQID NO:44、SEQ ID NO:46、SEQ ID NO:62、SEQ ID NO:63、SEQ ID NO:66、SEQ ID NO:68、SEQID NO:70、SEQ ID NO:71、SEQ ID NO:72、SEQ ID NO:73、SEQ ID NO:80、SEQ ID NO:81、SEQID NO:82、SEQ ID NO:83、SEQ ID NO:85、SEQ ID NO:86、SEQ ID NO:88、SEQ ID NO:91、SEQID NO:96、SEQ ID NO:99、SEQ ID NO:100、SEQ ID NO:103、SEQ ID NO:105、SEQ ID NO:106、SEQ ID NO:108、SEQ ID NO:109、SEQ ID NO:111、SEQ ID NO:117或SEQ ID NO:119所示的碱基序列。
5.根据权利要求1至4中的任一项所述的多核苷酸,其包含编码向导RNA的至少两个碱基序列,其中所述至少两个碱基序列是不同的。
6.根据权利要求1至5中的任一项所述的多核苷酸,其中所述转录阻遏物选自KRAB、MeCP2、SIN3A、HDT1、MBD2B、NIPP1和HP1A。
7.根据权利要求6所述的多核苷酸,其中所述转录阻遏物是KRAB。
8.根据权利要求1至7中的任一项所述的多核苷酸,其中所述核酸酶缺陷型CRISPR效应蛋白是dCas9。
9.根据权利要求8所述的多核苷酸,其中所述dCas9源自于金黄色葡萄球菌(Staphylococcus aureus)。
10.根据权利要求1至9中的任一项所述的多核苷酸,其还包含用于所述编码向导RNA的碱基序列的启动子序列和/或用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的启动子序列。
11.根据权利要求10所述的多核苷酸,其中用于所述编码向导RNA的碱基序列的启动子序列选自U6启动子、SNR6启动子、SNR52启动子、SCR1启动子、RPR1启动子、U3启动子和H1启动子。
12.根据权利要求11所述的多核苷酸,其中用于所述编码向导RNA的碱基序列的启动子序列是U6启动子。
13.根据权利要求10至12中的任一项所述的多核苷酸,其中用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的启动子序列是遍在启动子或肌肉特异性启动子。
14.根据权利要求13所述的多核苷酸,其中所述遍在启动子选自EFS启动子、CMV启动子和CAG启动子。
15.根据权利要求13所述的多核苷酸,其中所述肌肉特异性启动子选自CK8启动子、肌球蛋白重链激酶(MHCK)启动子、肌肉肌酸激酶(MCK)启动子、合成C5-12(Syn)启动子和Des启动子。
16.根据权利要求15所述的多核苷酸,其中所述肌肉特异性启动子是CK8启动子。
17.根据权利要求10至16中的任一项所述的多核苷酸,
其中所述编码向导RNA的碱基序列包含SEQ ID NO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ IDNO:70、SEQ ID NO:81、SEQ ID NO:83或SEQ ID NO:99所示的碱基序列,
所述转录阻遏物是KRAB,
所述核酸酶缺陷型CRISPR效应蛋白是源自于金黄色葡萄球菌的dCas9,
用于所述编码向导RNA的碱基序列的启动子序列是U6启动子,并且
用于所述编码核酸酶缺陷型CRISPR效应蛋白与转录阻遏物的融合蛋白的碱基序列的启动子序列是CK8启动子。
18.根据权利要求17所述的多核苷酸,
其中所述编码向导RNA的碱基序列包含SEQ ID NO:83所示的碱基序列或其中缺失、替换、插入和/或添加了1至3个碱基的SEQ ID NO:83所示的碱基序列。
19.一种载体,其包含根据权利要求1至18中的任一项所述的多核苷酸。
20.根据权利要求19所述的载体,其中所述载体是质粒载体或病毒载体。
21.根据权利要求20所述的载体,其中所述病毒载体选自腺相关病毒(AAV)载体、腺病毒载体和慢病毒载体。
22.根据权利要求21所述的载体,其中所述AAV载体选自AAV1、AAV2、AAV6、AAV7、AAV8、AAV9、Anc80、AAV587MTP、AAV588MTP、AAV-B1、AAVM41和AAVrh74。
23.根据权利要求22所述的载体,其中所述AAV载体是AAV9。
24.一种药物组合物,其包含根据权利要求1至18中的任一项所述的多核苷酸或根据权利要求19至23中的任一项所述的载体。
25.根据权利要求24所述的药物组合物,其用于治疗或预防1型强直性肌营养不良症。
26.一种用于治疗或预防1型强直性肌营养不良症的方法,所述方法包括向需要的对象给药根据权利要求1至18中的任一项所述的多核苷酸或根据权利要求19至23中的任一项所述的载体。
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962853373P | 2019-05-28 | 2019-05-28 | |
US62/853,373 | 2019-05-28 | ||
US202063025417P | 2020-05-15 | 2020-05-15 | |
US63/025,417 | 2020-05-15 | ||
PCT/JP2020/021851 WO2020241903A1 (en) | 2019-05-28 | 2020-05-27 | Method for treating muscular dystrophy by targeting dmpk gene |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113785066A true CN113785066A (zh) | 2021-12-10 |
CN113785066B CN113785066B (zh) | 2024-06-18 |
Family
ID=71787010
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080032034.2A Active CN113785066B (zh) | 2019-05-28 | 2020-05-27 | 通过靶向dmpk基因治疗肌营养不良症的方法 |
Country Status (14)
Country | Link |
---|---|
US (1) | US20220233721A1 (zh) |
EP (1) | EP3976785A1 (zh) |
JP (1) | JP7565620B2 (zh) |
KR (1) | KR20220012926A (zh) |
CN (1) | CN113785066B (zh) |
AU (1) | AU2020283372A1 (zh) |
BR (1) | BR112021023899A2 (zh) |
CA (1) | CA3135271A1 (zh) |
CO (1) | CO2021014746A2 (zh) |
IL (1) | IL287178A (zh) |
MX (1) | MX2021014478A (zh) |
SG (1) | SG11202111427RA (zh) |
TW (1) | TW202111122A (zh) |
WO (1) | WO2020241903A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022114243A1 (en) * | 2020-11-25 | 2022-06-02 | Astellas Pharma Inc. | Method for treating muscular dystrophy by targeting dmpk gene |
WO2022234519A1 (en) * | 2021-05-05 | 2022-11-10 | Crispr Therapeutics Ag | Compositions and methods for using sacas9 scaffold sequences |
WO2023018637A1 (en) * | 2021-08-09 | 2023-02-16 | Vertex Pharmaceuticals Incorporated | Gene editing of regulatory elements |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103167883A (zh) * | 2010-07-19 | 2013-06-19 | F·C·贝内特 | 肌强直性营养障碍蛋白激酶(dmpk)表达的调节 |
CN104837997A (zh) * | 2012-09-25 | 2015-08-12 | 建新公司 | 用于肌强直性营养不良治疗的肽连接吗啉代反义寡核苷酸 |
WO2018081806A2 (en) * | 2016-10-31 | 2018-05-03 | University Of Florida Research Foundation, Inc. | Compositions and methods for impeding transcription of expanded microsatellite repeats |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1390490B1 (en) | 2001-05-24 | 2009-04-15 | Genzyme Corporation | Muscle-specific expression vectors |
PL222683B1 (pl) | 2001-11-13 | 2016-08-31 | Univ Pennsylvania | Rekombinowane wirusy stowarzyszone z adenowirusem (AAV), sposoby ich wytwarzania, pakująca komórka gospodarza, kompozycja zawierająca wirus, zastosowanie wirusa, wyizolowane wirusy AAV, białka,sztuczne białka kapsydu, cząsteczki, komórki gospodarza, sposoby dostarczania transgenu, sposób identyfikacji serotypu, zestaw diagnostyczny, sposób izolacji nowych wirusów, rekombinowana komórka |
CN102199626B (zh) | 2003-09-30 | 2015-06-24 | 宾夕法尼亚大学托管会 | 腺伴随病毒(aav)进化支、序列、含有这些序列的载体及它们的应用 |
JP5704361B2 (ja) | 2010-10-27 | 2015-04-22 | 学校法人自治医科大学 | 神経系細胞への遺伝子導入のためのアデノ随伴ウイルスビリオン |
JP6910947B2 (ja) | 2015-03-24 | 2021-07-28 | 協和キリン株式会社 | 核酸含有脂質ナノ粒子 |
MX2017016289A (es) | 2015-06-18 | 2018-08-15 | Broad Inst Inc | Mutaciones de la enzima crispr que reducen los efectos fuera del blanco. |
EP3478829A1 (en) | 2016-06-29 | 2019-05-08 | Crispr Therapeutics AG | Materials and methods for treatment of myotonic dystrophy type 1 (dm1) and other related disorders |
AU2019281158A1 (en) | 2018-06-08 | 2021-01-14 | Modalis Therapeutics Corporation | Modified Cas9 protein and use thereof |
AU2019365440A1 (en) | 2018-10-24 | 2021-05-20 | Modalis Therapeutics Corporation | Modified Cas9 protein, and use thereof |
-
2020
- 2020-05-27 SG SG11202111427RA patent/SG11202111427RA/en unknown
- 2020-05-27 AU AU2020283372A patent/AU2020283372A1/en not_active Abandoned
- 2020-05-27 KR KR1020217042397A patent/KR20220012926A/ko not_active Application Discontinuation
- 2020-05-27 WO PCT/JP2020/021851 patent/WO2020241903A1/en unknown
- 2020-05-27 US US17/614,461 patent/US20220233721A1/en active Pending
- 2020-05-27 CA CA3135271A patent/CA3135271A1/en active Pending
- 2020-05-27 MX MX2021014478A patent/MX2021014478A/es unknown
- 2020-05-27 CN CN202080032034.2A patent/CN113785066B/zh active Active
- 2020-05-27 TW TW109117727A patent/TW202111122A/zh unknown
- 2020-05-27 JP JP2022518586A patent/JP7565620B2/ja active Active
- 2020-05-27 BR BR112021023899A patent/BR112021023899A2/pt unknown
- 2020-05-27 EP EP20746301.9A patent/EP3976785A1/en active Pending
-
2021
- 2021-10-11 IL IL287178A patent/IL287178A/en unknown
- 2021-11-02 CO CONC2021/0014746A patent/CO2021014746A2/es unknown
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103167883A (zh) * | 2010-07-19 | 2013-06-19 | F·C·贝内特 | 肌强直性营养障碍蛋白激酶(dmpk)表达的调节 |
CN104837997A (zh) * | 2012-09-25 | 2015-08-12 | 建新公司 | 用于肌强直性营养不良治疗的肽连接吗啉代反义寡核苷酸 |
WO2018081806A2 (en) * | 2016-10-31 | 2018-05-03 | University Of Florida Research Foundation, Inc. | Compositions and methods for impeding transcription of expanded microsatellite repeats |
Non-Patent Citations (5)
Title |
---|
ANTONIA A. DOMINGUEZ等: "Beyond editing: repurposing CRISPR–Cas9 for precision genome regulation and interrogation", 《NATURE REVIEWS MOLECULAR CELL BIOLOGY》, vol. 17, no. 1, pages 5 - 15, XP055345154, DOI: 10.1038/nrm.2015.2 * |
ASHISH N. RAO等: "A Therapeutic Double Whammy: Transcriptional or Post-transcriptional Suppression of Microsatellite Repeat Toxicity by Cas9", 《MOLECULAR CELL》, vol. 68, no. 3, pages 473 - 475, XP085274741, DOI: 10.1016/j.molcel.2017.10.023 * |
BELINDA S. PINTO等: "Impeding Transcription of Expanded Microsatellite Repeats by Deactivated Cas9", 《MOLECULAR CELL》, vol. 68, no. 3, pages 480 * |
HANHUI MA等: "A CRISPR-Based Selective Gene Inhibition Method Reveals Dynamic Features of a Cell Nucleus Nanobody Related to the Disease Myotonic Dystrophy", 《SMALL METHODS》, vol. 2, no. 9, pages 3 - 4 * |
LUKE A. GILBERT等: "Genome-Scale CRISPR-Mediated Control of Gene Repression and Activation", 《CELL》, vol. 159, no. 3, pages 648 * |
Also Published As
Publication number | Publication date |
---|---|
AU2020283372A1 (en) | 2021-11-18 |
JP2022534627A (ja) | 2022-08-02 |
KR20220012926A (ko) | 2022-02-04 |
CN113785066B (zh) | 2024-06-18 |
EP3976785A1 (en) | 2022-04-06 |
SG11202111427RA (en) | 2021-11-29 |
MX2021014478A (es) | 2022-01-06 |
IL287178A (en) | 2021-12-01 |
WO2020241903A1 (en) | 2020-12-03 |
US20220233721A1 (en) | 2022-07-28 |
TW202111122A (zh) | 2021-03-16 |
JP7565620B2 (ja) | 2024-10-11 |
BR112021023899A2 (pt) | 2022-01-18 |
CA3135271A1 (en) | 2020-12-03 |
CO2021014746A2 (es) | 2021-11-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113785066B (zh) | 通过靶向dmpk基因治疗肌营养不良症的方法 | |
KR20210143230A (ko) | 뉴클레오티드 서열을 편집하기 위한 방법 및 조성물 | |
KR20230019843A (ko) | 표적 이중 가닥 뉴클레오티드 서열의 두 가닥의 동시 편집을 위한 방법 및 조성물 | |
JP7416451B2 (ja) | CRISPR-Casによる標的化された核内RNA切断及びポリアデニル化 | |
JP2008539698A (ja) | 転写後レベルでの核酸発現調節のための方法および組成物 | |
CA2222880A1 (en) | Recombinant adenovirus and adeno-associated virus, cell lines, and methods of production and use thereof | |
BR112021009481A2 (pt) | método para tratamento da distrofia muscular direcionando gene de utrofina. | |
KR20210125560A (ko) | 유전성 질환의 치료를 위한 것을 포함하는, 아데노신 데아미나제 염기 편집기를 사용한 질환-관련 유전자의 스플라이스 수용체 부위 파괴 | |
EP4100533A1 (en) | Ribozyme-mediated rna assembly and expression | |
KR20230003511A (ko) | 안면견갑상완 근이영양증에 대한 crispr-억제 | |
WO2020210724A1 (en) | Htra1 modulation for treatment of amd | |
WO2022045366A1 (en) | Method for treating facioscapulohumeral muscular dystrophy (fshd) by targeting dux4 gene | |
JP2022513376A (ja) | レトロウイルスインテグラーゼ-Cas9融合タンパク質を使用した指向性非相同DNA挿入によるゲノム編集 | |
KR20230142776A (ko) | Rna 아데노 관련 바이러스(raav) 벡터 및 그 용도 | |
KR20230026504A (ko) | 폴리뉴클레오티드 | |
TW202112797A (zh) | 經由靶向lama1基因治療肌肉萎縮症的方法 | |
WO2016037039A1 (en) | Compositions for increasing survival of motor neuron protein (smn) levels in target cells and methods of use thereof for the treatment of spinal muscular atrophy | |
CN115044583A (zh) | 用于基因编辑的rna框架和基因编辑方法 | |
WO2022145495A1 (en) | Method for treating spinocerebellar ataxias (sca) by targeting atxn7 gene | |
WO2022176859A1 (en) | Method for treating muscular dystrophy by targeting lama1 gene | |
RU2809065C1 (ru) | Нуклеиновая кислота для аллотопической экспрессии гена MT-ND4 | |
WO2023190935A1 (en) | Method for treating myopathies by targeting titin gene | |
KR20230037586A (ko) | Mapt 유전자를 표적으로 하는 알츠하이머병의 치료 방법 | |
CN118159650A (zh) | 一种CRISPR-Cas13系统及其应用 | |
JP2022548911A (ja) | 目的遺伝子の相同遺伝子を転写活性化する方法および生体外診断方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40062995 Country of ref document: HK |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20231122 Address after: Tokyo, Japan Applicant after: Morris Medical Co.,Ltd. Address before: Tokyo, Japan Applicant before: ASTELLAS PHARMA Inc. Applicant before: Morris Medical Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |