CN111304242A - 一种基于SaKKHn-pBE系统制备单突变体的方法 - Google Patents
一种基于SaKKHn-pBE系统制备单突变体的方法 Download PDFInfo
- Publication number
- CN111304242A CN111304242A CN201911202341.6A CN201911202341A CN111304242A CN 111304242 A CN111304242 A CN 111304242A CN 201911202341 A CN201911202341 A CN 201911202341A CN 111304242 A CN111304242 A CN 111304242A
- Authority
- CN
- China
- Prior art keywords
- sequence
- sakkhn
- protein
- pbe
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 43
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 110
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 73
- 235000007164 Oryza sativa Nutrition 0.000 claims abstract description 27
- 235000009566 rice Nutrition 0.000 claims abstract description 27
- 239000002773 nucleotide Substances 0.000 claims description 74
- 125000003729 nucleotide group Chemical group 0.000 claims description 74
- 108020004414 DNA Proteins 0.000 claims description 40
- 102000053602 DNA Human genes 0.000 claims description 28
- 241000209094 Oryza Species 0.000 claims description 26
- 210000004027 cell Anatomy 0.000 claims description 25
- 239000002299 complementary DNA Substances 0.000 claims description 18
- 241000196324 Embryophyta Species 0.000 claims description 16
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 15
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 12
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 12
- 125000000539 amino acid group Chemical group 0.000 claims description 7
- 108020001507 fusion proteins Proteins 0.000 claims description 7
- 102000037865 fusion proteins Human genes 0.000 claims description 7
- 210000004899 c-terminal region Anatomy 0.000 claims description 6
- 238000006467 substitution reaction Methods 0.000 claims description 6
- 238000012217 deletion Methods 0.000 claims description 5
- 230000037430 deletion Effects 0.000 claims description 5
- 241001465754 Metazoa Species 0.000 claims description 4
- 238000013518 transcription Methods 0.000 claims description 3
- 230000035897 transcription Effects 0.000 claims description 3
- 210000004102 animal cell Anatomy 0.000 claims description 2
- 230000035772 mutation Effects 0.000 abstract description 22
- 238000002360 preparation method Methods 0.000 abstract description 3
- 240000007594 Oryza sativa Species 0.000 abstract 1
- 239000013604 expression vector Substances 0.000 description 136
- 238000003259 recombinant expression Methods 0.000 description 136
- 238000012408 PCR amplification Methods 0.000 description 52
- 235000018102 proteins Nutrition 0.000 description 47
- 239000013598 vector Substances 0.000 description 24
- 108091026890 Coding region Proteins 0.000 description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 230000037429 base substitution Effects 0.000 description 11
- 239000001963 growth medium Substances 0.000 description 9
- 208000015181 infectious disease Diseases 0.000 description 9
- 206010020649 Hyperkeratosis Diseases 0.000 description 8
- 241000589158 Agrobacterium Species 0.000 description 7
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 6
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 230000008439 repair process Effects 0.000 description 6
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- 239000007787 solid Substances 0.000 description 5
- 108091033409 CRISPR Proteins 0.000 description 4
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 3
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 3
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 3
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 3
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 3
- 108091027974 Mature messenger RNA Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 229940104302 cytosine Drugs 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 230000006780 non-homologous end joining Effects 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 238000002791 soaking Methods 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 108010031325 Cytidine deaminase Proteins 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- CGWHAXBNGYQBBK-JBACZVJFSA-N Glu-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)C1=CC=C(O)C=C1 CGWHAXBNGYQBBK-JBACZVJFSA-N 0.000 description 2
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- 108020005004 Guide RNA Proteins 0.000 description 2
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 108020005093 RNA Precursors Proteins 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- 108010028230 Trp-Ser- His-Pro-Gln-Phe-Glu-Lys Proteins 0.000 description 2
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 2
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 229960002989 glutamic acid Drugs 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000005286 illumination Methods 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 230000008263 repair mechanism Effects 0.000 description 2
- 239000012883 rooting culture medium Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 101150097623 1B gene Proteins 0.000 description 1
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 1
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- UEONJSPBTSWKOI-CIUDSAMLSA-N Asn-Gln-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O UEONJSPBTSWKOI-CIUDSAMLSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- 102100040397 C->U-editing enzyme APOBEC-1 Human genes 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 1
- ZOMMHASZJQRLFS-IHRRRGAJSA-N Cys-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N ZOMMHASZJQRLFS-IHRRRGAJSA-N 0.000 description 1
- 102000005381 Cytidine Deaminase Human genes 0.000 description 1
- 102100026846 Cytidine deaminase Human genes 0.000 description 1
- 102000000311 Cytosine Deaminase Human genes 0.000 description 1
- 108010080611 Cytosine Deaminase Proteins 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 230000008265 DNA repair mechanism Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- GNDJOCGXGLNCKY-ACZMJKKPSA-N Gln-Cys-Cys Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O GNDJOCGXGLNCKY-ACZMJKKPSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- ZMVCLTGPGWJAEE-JYJNAYRXSA-N Glu-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)O ZMVCLTGPGWJAEE-JYJNAYRXSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- XKJUFUPCHARJKX-UWVGGRQHSA-N Met-Gly-His Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 XKJUFUPCHARJKX-UWVGGRQHSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- TWEWRDAAIYBJTO-ULQDDVLXSA-N Met-Tyr-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N TWEWRDAAIYBJTO-ULQDDVLXSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 101001027316 Oryza sativa subsp. japonica Growth-regulating factor 4 Proteins 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 102000002150 RNase Z Human genes 0.000 description 1
- 108010001294 RNase Z Proteins 0.000 description 1
- 102000004167 Ribonuclease P Human genes 0.000 description 1
- 108090000621 Ribonuclease P Proteins 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 239000005708 Sodium hypochlorite Substances 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- GHXXDFDIDHIEIL-WFBYXXMGSA-N Trp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GHXXDFDIDHIEIL-WFBYXXMGSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- ZJPSMXCFEKMZFE-IHPCNDPISA-N Trp-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O ZJPSMXCFEKMZFE-IHPCNDPISA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 1
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- 101710172430 Uracil-DNA glycosylase inhibitor Proteins 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000006481 deamination reaction Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- XEYBHCRIKKKOSS-UHFFFAOYSA-N disodium;azanylidyneoxidanium;iron(2+);pentacyanide Chemical compound [Na+].[Na+].[Fe+2].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-].[O+]#N XEYBHCRIKKKOSS-UHFFFAOYSA-N 0.000 description 1
- 230000005782 double-strand break Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- IDGUHHHQCWSQLU-UHFFFAOYSA-N ethanol;hydrate Chemical compound O.CCO IDGUHHHQCWSQLU-UHFFFAOYSA-N 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 230000033607 mismatch repair Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 150000007523 nucleic acids Chemical group 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 238000006303 photolysis reaction Methods 0.000 description 1
- 230000015843 photosynthesis, light reaction Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 1
- 229960001225 rifampicin Drugs 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 1
- SUKJFIGYRHOWBL-UHFFFAOYSA-N sodium hypochlorite Chemical compound [Na+].Cl[O-] SUKJFIGYRHOWBL-UHFFFAOYSA-N 0.000 description 1
- 229940083618 sodium nitroprusside Drugs 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 229940027257 timentin Drugs 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 108010068794 tyrosyl-tyrosyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Cell Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了一种基于SaKKHn‑pBE系统制备单突变体的方法。所述单突变体的制备方法,包括如下步骤:将SaKKHn‑pBE碱基编辑系统导入生物体或生物细胞内,通过所述SaKKHn‑pBE碱基编辑系统实现基因组靶点序列的编辑,获得所述单突变体;所述单突变体为所述靶点序列中单个位点C突变为T的生物突变体;所述SaKKHn‑pBE碱基编辑系统包括SaKKHn蛋白质、tRNA‑sgRNA和PmCDA1蛋白质。通过本发明的方法对23个水稻基因组靶点序列进行测试,结果发现几乎所有靶点均能够获得突变效率最高的单位点C的单突变体,且其单突变体所占比例大多超过50%。
Description
技术领域
本发明属于生物技术领域,具体涉及一种基于SaKKHn-pBE系统制备单突变体的方法。
背景技术
CRISPR-Cas9技术已经成为强有力的基因组编辑手段,被广泛应用到很多组织和细胞中。CRISPR/Cas9 protein-RNA复合物通过向导RNA(guide RNA)定位于靶点上,切割产生DNA双链断裂(dsDNA break,DSB),而后生物体会本能的启动DNA修复机制修复DSB。修复机制一般有两种,一种是非同源末端连接(non-homologous end joining,NHEJ),另一种是同源重组(homology-directed repair,HDR)。通常情况下NHEJ占大多数,因此修复产生的随机的indels(insertions or deletions)比精确修复高很多。对于碱基精确替换,因为HDR效率低以及需要DNA模板,所以使用HDR实现碱基精确替换的应用受到很大的限制。
2016年,David Liu和Akihiko Kondo两个实验室分别独立报道了两种不同类型的胞嘧啶碱基编辑器(cytosine base editor,CBE),分别使用了两种不同的胞苷脱氨酶rAPOBEC1(rat APOBEC1)和PmCDA1(activation-induced cytidine deaminase(AID)ortholog from sea lamprey),原理都是通过使用胞苷脱氨酶直接实现对单个胞嘧啶(Cytosine,C)碱基进行编辑,而不再通过产生DSB和启动HDR修复,大大提高了C替换为胸腺嘧啶(Thymine,T)的碱基编辑效率。具体为dead Cas9(dCas9)或the Cas9 nickase(Cas9n)连带着rAPOBEC1或PmCDA1通过sgRNA定位到靶点,rAPOBEC1或PmCDA1催化非配对的单链DNA上的C发生胞嘧啶脱氨反应变成尿嘧啶(Uracil,U),通过DNA的修复使得U与腺嘌呤(Adenine,A)配对,又通过DNA复制,最终使得T与A配对,从而实现了C到T的转换。在所测试的编辑器中,SpCas9n(D10A)&rAPOBEC1/PmCDA1&UGI碱基编辑系统(其含有尿嘧啶DNA糖化酶抑制剂(uracil DNA glycosylase inhibitor,UGI))的平均突变率较高,原因有二:一是UGI可以抑制尿嘧啶DNA糖化酶(uracil DNA glycosylase,UDG)催化清除DNA中U,二是SpCas9n(D10A)在非编辑链上产生切口,诱导真核错配修复机制或long-patch BER(base-excision repair)修复机制,促使U:G错配更多的偏好性修复成U:A。
目前已经在动物和植物中开发了很多CBE,每个CBE都有其碱基编辑窗口,碱基编辑窗口内存在的C都有可能突变为T,即所获得的突变体群体中可能只是某一个位置C发生单突变,或是包含不同位置的C发生的多种单突变,也可能是不同位置的C共突变,也可能是单突变和多突变都包含。无论动物还是植物,实际应用中,为避免临近碱基突变所带来的潜在的、不定的负面影响,均期望能够获得所需突变碱基的精确突变,尽可能减少临近碱基的同时突变。目前关于植物中CBE产生何种突变类型的报道非常有限,已有报道中,并非所有的CBE对其编辑的靶点均能够获得单突变(Wu,Y.et al.(2019)Increasing cytosine baseediting scope and efficiency with engineered Cas9-PmCDA1 fusions and themodified sgRNA in rice.Front.Genet.10,379.),或是有的靶点即使获得单突变,单突变体所占比例也不高(Zhang,C.et al.(2019)Expanding the base editing scope to GAand relaxed NG PAM sites by improved xCas9 system.Plant Biotechnol.J.doi:10.1111/pbi.13259.[Epub ahead of print];Zong,Y.(2018)Efficient C-to-T baseediting in plants using a fusion of nCas9 and humanAPOBEC3A.Nat.Biotechnol.36,950-953.)。
发明内容
本发明的目的是提供一种单突变体的制备方法。
为了实现上述目的,本发明提供了一种单突变体的制备方法。
本发明提供的单突变体的制备方法包括如下步骤:将SaKKHn-pBE碱基编辑系统导入生物体或生物细胞内,通过所述SaKKHn-pBE碱基编辑系统实现基因组靶点序列的编辑,获得所述单突变体;
所述单突变体为所述靶点序列中单个位点C突变为T的生物突变体;
所述SaKKHn-pBE碱基编辑系统包括SaKKHn蛋白质、tRNA-sgRNA和PmCDA1蛋白质;
所述tRNA-sgRNA靶向所述靶点序列;
所述tRNA-sgRNA如式I所示:tRNA-所述靶点序列转录的RNA-sgRNA骨架(式I);
所述tRNA为m1)或m2)或m3):
m1)将序列1第474-550位中的T替换为U得到的RNA分子;
m2)将m1)所示的RNA分子经过一个或几个核苷酸的取代和/或缺失和/或添加且具有相同功能的RNA分子;
m3)与m1)或m2)限定的核苷酸序列具有75%或75%以上同一性且具有相同功能的RNA分子;
所述sgRNA骨架为n1)或n2)或n3):
n1)将序列1第571-647位中的T替换为U得到的RNA分子;
n2)将n1)所示的RNA分子经过一个或几个核苷酸的取代和/或缺失和/或添加且具有相同功能的RNA分子;
n3)与n1)或n2)限定的核苷酸序列具有75%或75%以上同一性且具有相同功能的RNA分子。
上述方法中,所述SaKKHn-pBE碱基编辑系统还包括UGI蛋白质。
所述方法可为方法(一)或方法(二)或方法(三)或方法(四):
所述方法(一)包括如下步骤:将SaKKHn蛋白质的编码基因、转录tRNA-sgRNA的DNA分子和PmCDA1蛋白质的编码基因导入生物体或生物细胞内,使所述SaKKHn蛋白质、所述tRNA-sgRNA和所述PmCDA1蛋白质的编码基因均得到表达,实现基因组靶点序列的编辑,获得所述单突变体;
所述方法(二)包括如下步骤:将SaKKHn蛋白质的编码基因、转录tRNA-sgRNA的DNA分子、PmCDA1蛋白质的编码基因和UGI蛋白质的编码基因导入生物体或生物细胞内,使所述SaKKHn蛋白质、所述tRNA-sgRNA、所述PmCDA1蛋白质和所述UGI蛋白质的编码基因均得到表达,实现基因组靶点序列的编辑,获得所述单突变体;
所述方法(三)包括如下步骤:将SaKKHn蛋白质、tRNA-sgRNA和PmCDA1蛋白质导入生物体或生物细胞内,实现基因组靶点序列的编辑,获得所述单突变体;
所述方法(四)包括如下步骤:将SaKKHn蛋白质、tRNA-sgRNA、PmCDA1蛋白质和UGI蛋白质导入生物体或生物细胞内,实现基因组靶点序列的编辑,获得所述单突变体。
进一步的,所述SaKKHn蛋白质为A1)或A2)或A3):
A1)氨基酸序列是序列2所示的蛋白质;
A2)将序列表中序列2所示的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且具有相同功能的蛋白质;
A3)在A1)或A2)的N端或/和C端连接标签得到的融合蛋白质;
所述PmCDA1蛋白质为C1)或C2)或C3):
C1)氨基酸序列是序列3所示的蛋白质;
C2)将序列表中序列3所示的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且具有相同功能的蛋白质;
C3)在C1)或C2)的N端或/和C端连接标签得到的融合蛋白质;
所述UGI蛋白质为E1)或E2)或E3):
E1)氨基酸序列是序列4所示的蛋白质;
E2)将序列表中序列4所示的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且具有相同功能的蛋白质;
E3)在E1)或E2)的N端或/和C端连接标签得到的融合蛋白质。
为了使A1)、C1)或E1)中的蛋白质便于纯化,可在由序列表中序列2或序列3或序列4所示的氨基酸序列组成的蛋白质的氨基末端或羧基末端连接上如下表所示的标签。
表、标签的序列
标签 | 残基 | 序列 |
Poly-Arg | 5-6(通常为5个) | RRRRR |
Poly-His | 2-10(通常为6个) | HHHHHH |
FLAG | 8 | DYKDDDDK |
Strep-tag II | 8 | WSHPQFEK |
c-myc | 10 | EQKLISEEDL |
上述A2)、C2)或E2)中的蛋白质,为与序列2或序列3或序列4所示蛋白质的氨基酸序列具有75%或75%以上同一性且具有相同功能的蛋白质。所述具有75%或75%以上同一性为具有75%、具有80%、具有85%、具有90%、具有95%、具有96%、具有97%、具有98%或具有99%的同一性。
上述A2)、C2)或E2)中的蛋白质可人工合成,也可先合成其编码基因,再进行生物表达得到。
上述A2)、C2)或E2)中的蛋白质的编码基因可通过将序列1的第3013-6225位(编码序列2所示的蛋白质)、第6511-7134位(编码序列3所示的蛋白质)或第7156-7452位(编码序列4所示的蛋白质)所示的DNA序列中缺失一个或几个氨基酸残基的密码子,和/或进行一个或几个碱基对的错义突变,和/或在其5′端和/或3′端连接上表所示的标签的编码序列得到。
更进一步的,所述SaKKHn蛋白质的编码基因为b1)或b2)或b3):
b1)序列表中序列1第3013-6225位所示的cDNA分子或DNA分子;
b2)与b1)限定的核苷酸序列具有75%或75%以上同一性,且编码所述SaKKHn的cDNA分子或DNA分子;
b3)在严格条件下与b1)或b2)限定的核苷酸序列杂交,且编码所述SaKKHn的cDNA分子或DNA分子;
所述PmCDA1蛋白质的编码基因为d1)或d2)或d3):
d1)序列表中序列1第6511-7134位所示的cDNA分子或DNA分子;
d2)与d1)限定的核苷酸序列具有75%或75%以上同一性,且编码所述PmCDA1的cDNA分子或DNA分子;
d3)在严格条件下与d1)或d2)限定的核苷酸序列杂交,且编码所述PmCDA1的cDNA分子或DNA分子;
所述UGI蛋白质的编码基因为f1)或f2)或f3):
f1)序列表中序列1第7156-7452位所示的cDNA分子或DNA分子;
f2)与f1)限定的核苷酸序列具有75%或75%以上同一性,且编码所述UGI的cDNA分子或DNA分子;
f3)在严格条件下与f1)或f2)限定的核苷酸序列杂交,且编码所述UGI的cDNA分子或DNA分子。
本领域普通技术人员可以很容易地采用已知的方法,例如定向进化和点突变的方法,对本发明的编码所述SaKKHn、所述PmCDA1或所述UGI的核苷酸序列进行突变。那些经过人工修饰的,具有与本发明的所述SaKKHn、所述PmCDA1或所述UGI的核苷酸序列75%或者更高同一性的核苷酸,只要编码所述SaKKHn、所述PmCDA1或所述UGI且具有相同功能,均是衍生于本发明的核苷酸序列并且等同于本发明的序列。
这里使用的术语“同一性”指与天然核酸序列的序列相似性。“同一性”包括与本发明的编码序列2、3或4所示的氨基酸序列组成的蛋白质的核苷酸序列具有75%或更高,或85%或更高,或90%或更高,或95%或更高同一性的核苷酸序列。同一性可以用肉眼或计算机软件进行评价。使用计算机软件,两个或多个序列之间的同一性可以用百分比(%)表示,其可以用来评价相关序列之间的同一性。
所述严格条件是在2×SSC,0.1%SDS的溶液中,在68℃下杂交并洗膜2次,每次5min,又于0.5×SSC,0.1%SDS的溶液中,在68℃下杂交并洗膜2次,每次15min;或,0.1×SSPE(或0.1×SSC)、0.1%SDS的溶液中,65℃条件下杂交并洗膜。
上述75%或75%以上同一性,可为80%、85%、90%或95%以上的同一性。
上述方法中,所述转录tRNA-esgRNA的DNA分子转录后得到的所述tRNA-esgRNA为不成熟的RNA前体,该RNA前体中的tRNA会被两种酶(RNase P和RNase Z)切割掉后得到成熟的RNA。一个重组表达载体中有多少个靶点,就会得到多少个独立的成熟的RNA,每个成熟的RNA依次由所述靶点序列转录的RNA和所述sgRNA骨架组成,或依次由所述靶点序列转录的RNA、所述sgRNA骨架和所述tRNA残留的个别碱基组成。
上述方法中,所述方法(二)中,所述UGI的个数可为一个或两个或多个。在本发明的具体实施例中,所述UGI的个数具体为一个。
上述方法中,所述方法(二)中,所述SaKKHn蛋白质的编码基因、所述转录tRNA-esgRNA的DNA分子、所述PmCDA1蛋白质的编码基因和所述UGI蛋白质的编码基因通过重组表达载体导入生物体或生物细胞内。
所述SaKKHn蛋白质的编码基因、所述转录tRNA-esgRNA的DNA分子、所述PmCDA1蛋白质的编码基因和所述UGI蛋白质的编码基因可通过同一个重组表达载体导入生物体或生物细胞内,也可通过两个或者多个重组表达载体共同导入生物体或生物细胞内。
在本发明的具体实施例中,所述SaKKHn蛋白质的编码基因、所述转录tRNA-esgRNA的DNA分子、所述PmCDA1蛋白质的编码基因和所述UGI蛋白质的编码基因通过同一个重组表达载体导入生物体或生物细胞内。所述重组表达载体含有表达盒甲和表达盒乙;所述表达盒甲表达上述tRNA-sgRNA,所述表达盒乙表达由上述SaKKHn蛋白质、上述PmCDA1蛋白质和上述UGI组成的融合蛋白。
所述重组表达载体具体为SaKKHn-pBE-1重组表达载体、SaKKHn-pBE-2重组表达载体、SaKKHn-pBE-3重组表达载体、SaKKHn-pBE-4重组表达载体、SaKKHn-pBE-5重组表达载体、SaKKHn-pBE-6重组表达载体、SaKKHn-pBE-7重组表达载体、SaKKHn-pBE-8重组表达载体、SaKKHn-pBE-9重组表达载体、SaKKHn-pBE-10重组表达载体、SaKKHn-pBE-11重组表达载体、SaKKHn-pBE-12重组表达载体、SaKKHn-pBE-13重组表达载体、SaKKHn-pBE-14重组表达载体、SaKKHn-pBE-15重组表达载体、SaKKHn-pBE-16重组表达载体、SaKKHn-pBE-17重组表达载体、SaKKHn-pBE-18重组表达载体、SaKKHn-pBE-19重组表达载体、SaKKHn-pBE-20重组表达载体或SaKKHn-pBE-21重组表达载体。
所述SaKKHn-pBE-20重组表达载体的核苷酸序列为序列表中的序列1。
所述SaKKHn-pBE-17重组表达载体的核苷酸序列为将序列1第474-995位替换为序列5,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-7重组表达载体的核苷酸序列为将序列1第474-995位替换为序列6,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-1重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T1靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-2重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T2靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-3重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T3靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-4重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T4靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-5重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T5靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-6重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T6靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-8重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T8靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-9重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T9靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-10重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T10靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-11重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T11靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-12重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T12靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-13重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T13靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-14重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T14靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-15重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T15靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-16重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T16靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-18重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T19靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-19重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T20靶点序列,且保持其他序列不变后得到的序列。
所述SaKKHn-pBE-21重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T23靶点序列,且保持其他序列不变后得到的序列。
上述各个重组表达载体中的靶点序列见表1。
上述方法中,所述靶点序列中单个位点C突变为T中的“单个位点”可为靶点序列中任意位置上的C。
上述方法中,所述基因组靶点序列的编辑为将所述靶点序列中的C突变为T。所述靶点序列的PAM序列为NNNRRT。
上述方法中,所述生物体为p1)或p2)或p3)或p4):p1)植物或动物;p2)单子叶植物或双子叶植物;p3)禾本科植物;p4)水稻。
所述生物细胞为q1)或q2)或q3)或q4):q1)植物细胞或动物细胞;q2)单子叶植物细胞或双子叶植物细胞;q3)禾本科植物细胞;q4)水稻细胞。
为了实现上述目的,本发明还提供了一种制备单突变体的产品。
本发明提供的制备单突变体的产品包括上述SaKKHn-pBE碱基编辑系统。
上述方法或产品在提高单突变体比例中的应用也属于本发明的保护范围。
所述单突变体为利用上述SaKKHn-pBE碱基编辑系统进行基因组靶点序列编辑获得的所述靶点序列中单个位点C突变为T的生物突变体;
所述单突变体比例为利用上述SaKKHn-pBE碱基编辑系统进行基因组靶点序列编辑获得的所述靶点序列中单个位点C突变为T的生物突变体占获得的所有生物突变体的比例。
本发明基于SaKKHn-pBE碱基编辑系统对23个水稻基因组靶点序列进行测试,结果发现几乎所有靶点均能够获得突变效率最高的单位点C的单突变体,且其单突变体所占比例大多超过50%。
附图说明
图1为重组表达载体的结构示意图。
图2为单位点C的C·T碱基替换效率。其中,红圈中所对应的为单位点C·T碱基替换效率最高的位点C。
具体实施方式
下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。下述实施例中的实验方法,如无特殊说明,均为常规方法。下述实施例中所用的材料、试剂、仪器等,如无特殊说明,均可从商业途径得到。下述实施例中,如无特殊说明,序列表中各核苷酸序列的第1位均为相应DNA/RNA的5′末端核苷酸,末位均为相应DNA/RNA的3′末端核苷酸。
引物对T1由引物T1-F:5’-atggatactggctgcctgtcatc-3’和引物T1-R:5’-cagcaatcacgacctgtaatgg-3’组成,用于扩增靶点T1和T4。
引物对T2由引物T2-F:5’-cagtgtcaaggttttgtgcaag-3’和引物T2-R:5’-acttaaaacgttcataaaaaggacc-3’组成,用于扩增靶点T2。
引物对T3由引物T3-F:5’-gagcccttatgaatattttcttcc-3’和引物T3-R:5’-gaatgttttgctactatcgtacgg-3’组成,用于扩增靶点T3。
引物对T5由引物T5-F:5’-tggatcggatatggacttctc-3’和引物T5-R:5’-gaaatgaacaatcacctgagatctttg-3’组成,用于扩增靶点T5和T18。
引物对T6由引物T6-F:5’-tcatggtgatctctcctcgg-3’和引物T6-R:5’-actatggtaaactcgtgatgcttc-3’组成,用于扩增靶点T6。
引物对T7由引物T7-F:5’-cgagctacctgaagaacaactacc-3’和引物T7-R:5’-cctcgattgcctgaaatttg-3’组成,用于扩增靶点T7。
引物对T8由引物T8-F:5’-tgcgagctcgacaacatcatg-3’和引物T8-R:5’-gacggcccatgtggaaacc-3’组成,用于扩增靶点T8。
引物对T9由引物T9-F:5’-tcatcaccaaccacctcttcc-3’和引物T9-R:5’-ataccagtcagctcaacaaacca-3’组成,用于扩增靶点T9。
引物对T10由引物T10-F:5’-gacactgaaatcatcgaagcaac-3’和引物T10-R:5’-acttttcaggctcctacgagatag-3’组成,用于扩增靶点T10。
引物对T11由引物T11-F:5’-aggtacgtgcagtacaacgtg-3’和引物T11-R:5’-acatcattgaagttctccctcc-3’组成,用于扩增靶点T11。
引物对T12由引物T12-F:5’-attctgattgctttaaaccgatttc-3’和引物T12-R:5’-aacaggaactcccaccaacttc-3’组成,用于扩增靶点T12。
引物对T13由引物T13-F:5’-cctcatccaatcgactgacac-3’和引物T13-R:5’-gtaattgtgcttggtgatggag-3’组成,用于扩增靶点T13和T16。
引物对T14由引物T14-F:5’-gacgcccatagtcgaggtc-3’和引物T14-R:5’-ctctgctggatcaatgtcaatg-3’组成,用于扩增靶点T14。
引物对T15由引物T15-F:5’-tgcttctcaaatggcctagc-3’和引物T15-R:5’-atatgtcggtgggataggatg-3’组成,用于扩增靶点T15。
引物对T17由引物T17-F:5’-aacacggtcaccaacttcatc-3’和引物T17-R:5’-acaacctggcttgctatatatgc-3’组成,用于扩增靶点T17。
引物对T19由引物T19-F:5’-ttcaaattctaatccccaatcc-3’和引物T19-R:5’-tcgtacctgtctgcaaccttg-3’组成,用于扩增靶点T19和T21。
引物对T20由引物T20-F:5’-gctttagatgatttgttacatttcgc-3’和引物T20-R:5’-tgagttggtatggcaagaacaag-3’组成,用于扩增靶点T20和T22。
引物对T23由引物T23-F:5’-tacgcgagtgcatgcagatg-3’和引物T23-R:5’-aacattgatcagcctaaccaaac-3’组成,用于扩增靶点T23。
以下实施例中,C·T碱基替换是指靶点序列中任何位置的C突变为T。
C·T碱基替换效率=发生C·T碱基替换的阳性T0苗数/分析的总阳性T0苗数×100%。
单位点C的C·T碱基替换效率=目标单位点发生C·T碱基替换的阳性T0苗数/分析的总阳性T0苗数×100%。
日本晴水稻:参考文献:梁卫红,王高华,杜京尧,等.硝普钠及其光解产物对日本晴水稻幼苗生长和5种激素标记基因表达的影响[J].河南师范大学学报(自然版),2017(2):48-52.;公众可以从北京市农林科学院获得。
恢复培养基:含有200mg/L特美汀的N6固体培养基。
筛选培养基:含有50mg/L潮霉素的N6固体培养基。
分化培养基:含有2mg/L KT、0.2mg/L NAA、0.5g/L谷氨酸、0.5g/L脯氨酸的N6固体培养基。
生根培养基:含有0.2mg/L NAA、0.5g/L谷氨酸、0.5g/L脯氨酸的N6固体培养基。
实施例1、碱基编辑系统SaKKHn-pBE及其在水稻基因组编辑中的应用
一、重组表达载体的构建
重组表达载体结构示意图如图1所示。
人工合成如下重组表达载体,各载体均为环状质粒:
SaKKHn-pBE-1重组表达载体、SaKKHn-pBE-2重组表达载体、SaKKHn-pBE-3重组表达载体、SaKKHn-pBE-4重组表达载体、SaKKHn-pBE-5重组表达载体、SaKKHn-pBE-6重组表达载体、SaKKHn-pBE-7重组表达载体、SaKKHn-pBE-8重组表达载体、SaKKHn-pBE-9重组表达载体、SaKKHn-pBE-10重组表达载体、SaKKHn-pBE-11重组表达载体、SaKKHn-pBE-12重组表达载体、SaKKHn-pBE-13重组表达载体、SaKKHn-pBE-14重组表达载体、SaKKHn-pBE-15重组表达载体、SaKKHn-pBE-16重组表达载体、SaKKHn-pBE-17重组表达载体、SaKKHn-pBE-18重组表达载体、SaKKHn-pBE-19重组表达载体、SaKKHn-pBE-20重组表达载体和SaKKHn-pBE-21重组表达载体。
SaKKHn-pBE-20重组表达载体的核苷酸序列为序列表中的序列1。其中,序列1的第131-467位为OsU3启动子的核苷酸序列,第474-550位和第648-724位均为tRNA的核苷酸序列,第551-647位和第725-821位分别为靶向OsWaxy基因的两个sgRNA的核苷酸序列,这两个sgRNA的共同sgRNA骨架的核苷酸序列为序列1的第571-647位或序列1的第745-821位,第996-1286位为OsU3终止子的核苷酸序列;序列1的第1293-3006位为OsUbq3启动子的核苷酸序列,第3013-6225位为SaKKHn蛋白质的编码序列(不含有终止密码子),编码序列2所示的SaKKHn蛋白质;序列1的第6511-7134位为PmCDA1蛋白质的编码序列(不含有终止密码子),编码序列3所示的PmCDA1蛋白质;序列1的第7156-7452位为UGI蛋白质的编码序列,编码序列4所示的UGI蛋白质;序列1的第7459-7653位为35S终止子的核苷酸序列;序列1的第7728-9720位为ZmUbi1启动子的核苷酸序列,第9727-10749位为潮霉素磷酸转移酶的编码序列,第10779-10994位为CaMV35S polyA的核苷酸序列。SaKKHn-pBE-20重组表达载体中的两个靶点分别为T21和T22,序列见表1。
SaKKHn-pBE-17重组表达载体的核苷酸序列为将序列1第474-995位替换为序列5,且保持其他序列不变后得到的序列。其中,序列5的第1-77位和第175-251位均为tRNA的核苷酸序列,第78-174位和第252-348位分别为靶向OsNRT1.1B基因和OsGRF4基因的两个sgRNA的核苷酸序列。第98-174位和第272-348位为sgRNA骨架的核苷酸序列。SaKKHn-pBE-17重组表达载体中的两个靶点分别为T17和T18,序列见表1。
SaKKHn-pBE-7重组表达载体的核苷酸序列为将序列1第474-995位替换为序列6,且保持其他序列不变后得到的序列。其中,序列6的第1-77位为tRNA的核苷酸序列,第78-174位为靶向OsWaxy基因的sgRNA的核苷酸序列,第98-174位为sgRNA骨架的核苷酸序列。SaKKHn-pBE-7重组表达载体中的靶点为T7,序列见表1。
SaKKHn-pBE-1重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T1靶点序列,且保持其他序列不变后得到的序列。T1靶点序列见表1。
SaKKHn-pBE-2重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T2靶点序列,且保持其他序列不变后得到的序列。T2靶点序列见表1。
SaKKHn-pBE-3重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T3靶点序列,且保持其他序列不变后得到的序列。T3靶点序列见表1。
SaKKHn-pBE-4重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T4靶点序列,且保持其他序列不变后得到的序列。T4靶点序列见表1。
SaKKHn-pBE-5重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T5靶点序列,且保持其他序列不变后得到的序列。T5靶点序列见表1。
SaKKHn-pBE-6重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T6靶点序列,且保持其他序列不变后得到的序列。T6靶点序列见表1。
SaKKHn-pBE-8重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T8靶点序列,且保持其他序列不变后得到的序列。T8靶点序列见表1。
SaKKHn-pBE-9重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T9靶点序列,且保持其他序列不变后得到的序列。T9靶点序列见表1。
SaKKHn-pBE-10重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T10靶点序列,且保持其他序列不变后得到的序列。T10靶点序列见表1。
SaKKHn-pBE-11重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T11靶点序列,且保持其他序列不变后得到的序列。T11靶点序列见表1。
SaKKHn-pBE-12重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T12靶点序列,且保持其他序列不变后得到的序列。T12靶点序列见表1。
SaKKHn-pBE-13重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T13靶点序列,且保持其他序列不变后得到的序列。T13靶点序列见表1。
SaKKHn-pBE-14重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T14靶点序列,且保持其他序列不变后得到的序列。T14靶点序列见表1。
SaKKHn-pBE-15重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T15靶点序列,且保持其他序列不变后得到的序列。T15靶点序列见表1。
SaKKHn-pBE-16重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T16靶点序列,且保持其他序列不变后得到的序列。T16靶点序列见表1。
SaKKHn-pBE-18重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T19靶点序列,且保持其他序列不变后得到的序列。T19靶点序列见表1。
SaKKHn-pBE-19重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T20靶点序列,且保持其他序列不变后得到的序列。T20靶点序列见表1。
SaKKHn-pBE-21重组表达载体的核苷酸序列为将SaKKHn-pBE-7重组表达载体序列中的T7靶点序列替换为T23靶点序列,且保持其他序列不变后得到的序列。T23靶点序列见表1。
各载体的靶点核苷酸序列及相应的PAM序列如表1所示。
表1
二、水稻阳性T0苗的获得
将步骤一获得的SaKKHn-pBE-1载体,SaKKHn-pBE-2载体,SaKKHn-pBE-3载体,SaKKHn-pBE-4载体,SaKKHn-pBE-5载体,SaKKHn-pBE-6载体,SaKKHn-pBE-7载体,SaKKHn-pBE-8载体,SaKKHn-pBE-9载体,SaKKHn-pBE-10载体,SaKKHn-pBE-11载体,SaKKHn-pBE-12载体,SaKKHn-pBE-13载体,SaKKHn-pBE-14载体,SaKKHn-pBE-15载体,SaKKHn-pBE-16载体,SaKKHn-pBE-17载体,SaKKHn-pBE-18载体,SaKKHn-pBE-19载体,SaKKHn-pBE-20载体和SaKKHn-pBE-21载体分别按照如下步骤1-9进行操作:
1、将载体导入农杆菌EHA105(上海唯地生物技术有限公司的产品,CAT#:AC1010),得到重组农杆菌。
2、采用培养基(含50μg/ml卡那霉素和25μg/ml利福平的YEP培养基)培养重组农杆菌,28℃,150rpm震荡培养至OD600为1.0-2.0,室温条件下,10000rpm离心1min,用侵染液(将N6液体培养基中的糖替换为葡萄糖和蔗糖,葡萄糖和蔗糖在侵染液中的浓度分别为10g/L和20g/L)重悬菌体并稀释至OD600为0.2,得到农杆菌侵染液。
3、水稻品种日本晴成熟种子去壳脱粒,置于100mL三角瓶中,加入70%(v/v)乙醇水溶液浸泡30sec,再置于25%(v/v)次氯酸钠水溶液中,120rpm震荡灭菌30min,无菌水冲洗3次,用滤纸吸干水分,然后将种子胚朝下置于N6固体培养基上,28℃暗培养4-6周,得到水稻愈伤。
4、完成步骤3后,将水稻愈伤浸泡置于农杆菌侵染液甲(农杆菌侵染液甲为向农杆菌侵染液中加入乙酰丁香酮得到的液体,乙酰丁香酮的添加量满足乙酰丁香酮与农杆菌侵染液的体积比为25μl:50ml)中浸泡10min,然后,放在铺有两层灭菌滤纸的培养皿(内含约200ml不含农杆菌的侵染液)上,21℃暗培养1天。
5、取步骤4得到的水稻愈伤放入恢复培养基上,25-28℃暗培养3天。
6、取步骤5得到的水稻愈伤,置于筛选培养基上,28℃暗培养2周。
7、取步骤6得到的水稻愈伤,再次置于筛选培养基上,28℃暗培养2周,得到水稻抗性愈伤。
8、取步骤7得到的水稻抗性愈伤放入分化培养基上,25℃光照培养1个月左右,将分化出来的小苗移至生根培养基上,25℃光照培养2周,获取水稻T0苗。
9、提取水稻T0苗的基因组DNA并以其作为模板,采用引物F(5’-attatgtagcttgtgcgtttcg-3’)和引物R(5’-ctccacctcattgacattatgc-3’)组成的引物对进行PCR扩增,得到PCR扩增产物;将该PCR扩增产物进行琼脂糖凝胶电泳,然后进行如下判断:如果PCR扩增产物中含有约898bp的DNA片段,则相应的水稻T0苗为水稻阳性T0苗;如果PCR扩增产物中不含有约898bp的DNA片段,则相应的水稻T0苗不为水稻阳性T0苗。
三、结果分析
1、每载体分别取步骤二所获得的水稻阳性T0苗的基因组DNA作为模板,对于T1靶点,采用引物对T1进行PCR扩增,得到PCR扩增产物;对于T2靶点,采用引物对T2进行PCR扩增,得到PCR扩增产物;对于T3靶点,采用引物对T3进行PCR扩增,得到PCR扩增产物;对于T4靶点,采用引物对T1进行PCR扩增,得到PCR扩增产物;对于T5靶点,采用引物对T5进行PCR扩增,得到PCR扩增产物;对于T6靶点,采用引物对T6进行PCR扩增,得到PCR扩增产物;对于T7靶点,采用引物对T7进行PCR扩增,得到PCR扩增产物;对于T8靶点,采用引物对T8进行PCR扩增,得到PCR扩增产物;对于T9靶点,采用引物对T9进行PCR扩增,得到PCR扩增产物;对于T10靶点,采用引物对T10进行PCR扩增,得到PCR扩增产物;对于T11靶点,采用引物对T11进行PCR扩增,得到PCR扩增产物;对于T12靶点,采用引物对T12进行PCR扩增,得到PCR扩增产物;对于T13靶点,采用引物对T13进行PCR扩增,得到PCR扩增产物;对于T14靶点,采用引物对T14进行PCR扩增,得到PCR扩增产物;对于T15靶点,采用引物对T15进行PCR扩增,得到PCR扩增产物;对于T16靶点,采用引物对T13进行PCR扩增,得到PCR扩增产物;对于T17靶点,采用引物对T17进行PCR扩增,得到PCR扩增产物;对于T18靶点,采用引物对T5进行PCR扩增,得到PCR扩增产物;对于T19靶点,采用引物对T19进行PCR扩增,得到PCR扩增产物;对于T20靶点,采用引物对T20进行PCR扩增,得到PCR扩增产物;对于T21靶点,采用引物对T19进行PCR扩增,得到PCR扩增产物;对于T22靶点,采用引物对T20进行PCR扩增,得到PCR扩增产物;对于T23靶点,采用引物对T23进行PCR扩增,得到PCR扩增产物。
2、将步骤1得到的PCR扩增产物进行Sanger测序及分析。测序结果只针对各靶点区进行分析。分别统计T1、T2、T3、T4、T5、T6、T7、T8、T9、T10、T11、T12、T13、T14、T15、T16、T17、T18、T19、T20、T21、T22和T23的发生C·T碱基替换的阳性T0苗数,计算得出C·T碱基替换效率及单位点C的C·T碱基替换效率。
结果表明,碱基编辑系统SaKKHn-pBE能够对PAM为NNNRRT(R=A或G)的靶点进行碱基编辑(表2)。除靶点T20中的C6外,图2中红圈中所对应的单位点C·T碱基替换效率最高的C均获得了T0单突变体(表2和表3)。绝大多数靶点的单位点C·T碱基替换效率最高的C的单突变比例大于等于50%,只有靶点T8、T13和T20的单位点C·T碱基替换效率最高的C的单突变比例低于50%,分别为46.7%、42.9%和16.7%(表3)。
表2
注:靶序列中粉红色C表示在T0苗中突变为T。
表3
以上对本发明进行了详述。对于本领域技术人员来说,在不脱离本发明的宗旨和范围,以及无需进行不必要的实验情况下,可在等同参数、浓度和条件下,在较宽范围内实施本发明。虽然本发明给出了特殊的实施例,应该理解为,可以对本发明作进一步的改进。总之,按本发明的原理,本申请欲包括任何变更、用途或对本发明的改进,包括脱离了本申请中已公开范围,而用本领域已知的常规技术进行的改变。按以下附带的权利要求的范围,可以进行一些基本特征的应用。
序列表
<110>北京市农林科学院
<120>一种基于SaKKHn-pBE系统制备单突变体的方法
<160>6
<170>PatentIn version 3.5
<210>1
<211>17400
<212>DNA
<213>Artificial Sequence
<400>1
ggtggcagga tatattgtgg tgtaaacatg gcactagcct caccgtcttc gcagacgagg 60
ccgctaagtc gcagctacgc tctcaacggc actgactagg tagtttaaac gtgcacttaa 120
ttaaggtacc gaagcaactt aaagttatca ggcatgcatg gatcttggag gaatcagatg 180
tgcagtcagg gaccatagca caagacaggc gtcttctact ggtgctacca gcaaatgctg 240
gaagccggga acactgggta cgttggaaac cacgtgatgt gaagaagtaa gataaactgt 300
aggagaaaag catttcgtag tgggccatga agcctttcag gacatgtatt gcagtatggg 360
ccggcccatt acgcaattgg acgacaacaa agactagtat tagtaccacc tcggctatcc 420
acatagatca aagctgattt aaaagagttg tgcagatgat ccgtggcgga tccaacaaag 480
caccagtggt ctagtggtag aatagtaccc tgccacggta cagacccggg ttcgattccc 540
ggctggtgca catccaatgc gatgatcaag gttttagtac tctggaaaca gaatctacta 600
aaacaaggca aaatgccgtg tttatctcgt caacttgttg gcgagataac aaagcaccag 660
tggtctagtg gtagaatagt accctgccac ggtacagacc cgggttcgat tcccggctgg 720
tgcaaatcac cagtggaagc taaggtttta gtactctgga aacagaatct actaaaacaa 780
ggcaaaatgc cgtgtttatc tcgtcaactt gttggcgaga taacaaagca ccagtggtct 840
agtggtagaa tagtaccctg ccacggtaca gacccgggtt cgattcccgg ctggtgcaac 900
cggatttgaa cgatggacgt tttagtactc tggaaacaga atctactaaa acaaggcaaa 960
atgccgtgtt tatctcgtca acttgttggc gagatttttt tttttcgttt tgcattgagt 1020
tttctccgtc gcatgtttgc agttttattt tccgttttgc attgaaattt ctccgtctca 1080
tgtttgcagc gtgttcaaaa agtacgcagc tgtatttcac ttatttacgg cgccacattt 1140
tcatgccgtt tgtgccaact atcccgagct agtgaataca gcttggcttc acacaacact 1200
ggtgacccgc tgacctgctc gtacctcgta ccgtcgtacg gcacagcatt tggaattaaa 1260
gggtgtgatc gatactgctt gctgctaagc ttacaaattc gggtcaaggc ggaagccagc 1320
gcgccacccc acgtcagcaa atacggaggc gcggggttga cggcgtcacc cggtcctaac 1380
ggcgaccaac aaaccagcca gaagaaatta cagtaaaaaa aaagtaaatt gcactttgat 1440
ccacctttta ttacctaagt ctcaatttgg atcaccctta aacctatctt ttcaatttgg 1500
gccgggttgt ggtttggact accatgaaca acttttcgtc atgtctaact tccctttcag 1560
caaacatatg aaccatatat agaggagatc ggccgtatac tagagctgat gtgtttaagg 1620
tcgttgattg cacgagaaaa aaaaatccaa atcgcaacaa tagcaaattt atctggttca 1680
aagtgaaaag atatgtttaa aggtagtcca aagtaaaact tatagataat aaaatgtggt 1740
ccaaagcgta attcactcaa aaaaaatcaa cgagacgtgt accaaacgga gacaaacggc 1800
atcttctcga aatttcccaa ccgctcgctc gcccgcctcg tcttcccgga aaccgcggtg 1860
gtttcagcgt ggcggattct ccaagcagac ggagacgtca cggcacggga ctcctcccac 1920
cacccaaccg ccataaatac cagccccctc atctcctctc ctcgcatcag ctccaccccc 1980
gaaaaatttc tccccaatct cgcgaggctc tcgtcgtcga atcgaatcct ctcgcgtcct 2040
caaggtacgc tgcttctcct ctcctcgctt cgtttcgatt cgatttcgga cgggtgaggt 2100
tgttttgttg ctagatccga ttggtggtta gggttgtcga tgtgattatc gtgagatgtt 2160
taggggttgt agatctgatg gttgtgattt gggcacggtt ggttcgatag gtggaatcgt 2220
ggttaggttt tgggattgga tgttggttct gatgattggg gggaattttt acggttagat 2280
gaattgttgg atgattcgat tggggaaatc ggtgtagatc tgttggggaa ttgtggaact 2340
agtcatgcct gagtgattgg tgcgatttgt agcgtgttcc atcttgtagg ccttgttgcg 2400
agcatgttca gatctactgt tccgctcttg attgagttat tggtgccatg ggttggtgca 2460
aacacaggct ttaatatgtt atatctgttt tgtgtttgat gtagatctgt agggtagttc 2520
ttcttagaca tggttcaatt atgtagcttg tgcgtttcga tttgatttca tatgttcaca 2580
gattagataa tgatgaactc ttttaattaa ttgtcaatgg taaataggaa gtcttgtcgc 2640
tatatctgtc ataatgatct catgttacta tctgccagta atttatgcta agaactatat 2700
tagaatatca tgttacaatc tgtagtaata tcatgttaca atctgtagtt catctatata 2760
atctattgtg gtaatttctt tttactatct gtgtgaagat tattgccact agttcattct 2820
acttatttct gaagttcagg atacgtgtgc tgttactacc tatctgaata catgtgtgat 2880
gtgcctgtta ctatcttttt gaatacatgt atgttctgtt ggaatatgtt tgctgtttga 2940
tccgttgttg tgtccttaat cttgtgctag ttcttaccct atctgtttgg tgattatttc 3000
ttgcagtacg taatggctcc taagaagaag cggaaggttg gcatccacgg tgtcccggcg 3060
gcaaagagaa actacatcct gggtctggcc atcggtatta catcggtggg ctacggcatc 3120
atcgactacg agacaaggga tgtcatcgat gccggcgtcc ggctcttcaa ggaggccaac 3180
gtggagaata acgagggcag gcgctccaag cgcggcgcgc ggaggctgaa gcgcaggcgg 3240
aggcatcgca tccagcgggt gaagaagctc ctcttcgact acaatctgct cacggatcat 3300
tccgagctgt ctggcatcaa cccatacgag gcgcgggtga agggcctgtc ccagaagctc 3360
tcggaggagg agttctcggc ggccctgctg catctcgcga agaggcgcgg cgtgcataat 3420
gtcaatgagg tggaggagga taccggcaat gagctgtcaa ccaaggagca gatcagcagg 3480
aactccaagg cgctggagga gaagtatgtg gcggagctcc agctcgagag gctgaagaag 3540
gatggcgagg tccggggctc catcaatagg ttcaagacat cggactacgt gaaggaggcc 3600
aagcagctcc tgaaggtgca gaaggcgtac caccagctgg accagagctt catcgacacc 3660
tacatcgatc tgctcgagac acgccggacg tactacgagg gcccgggcga gggctcaccg 3720
ttcggctgga aggacatcaa ggagtggtac gagatgctga tgggccactg cacctacttc 3780
cctgaggagc tgaggagcgt gaagtacgcg tacaatgcgg acctctacaa cgccctgaac 3840
gacctcaata acctcgtgat cacgcgcgac gagaatgaga agctcgagta ctacgagaag 3900
ttccagatca tcgagaacgt gttcaagcag aagaagaagc cgaccctcaa gcagatcgcc 3960
aaggagatcc tcgtcaatga ggaggacatc aagggctaca gggtgacctc gaccggcaag 4020
ccagagttca ccaacctgaa ggtctaccac gacatcaagg atatcaccgc ccgcaaggag 4080
atcatcgaga atgcggagct cctggatcag atcgcgaaga tcctcaccat ctaccagtcc 4140
agcgaggaca tccaggagga gctcacgaac ctgaatagcg agctgaccca ggaggagatc 4200
gagcagatct ccaacctcaa gggctacacc ggcacgcaca atctgagcct caaggcgatc 4260
aatctcatcc tcgatgagct ctggcataca aatgataacc agatcgccat cttcaatcgc 4320
ctcaagctgg tcccaaagaa ggtcgatctg tcgcagcaga aggagatccc aacgacactg 4380
gtcgatgact tcatcctctc acctgtcgtg aagaggtcgt tcatccagtc gatcaaggtc 4440
atcaatgcga tcatcaagaa gtacggcctc cctaatgata tcatcatcga gctggcccgc 4500
gagaagaatt caaaggacgc gcagaagatg atcaacgaga tgcagaagag gaatcggcag 4560
acaaacgagc gcatcgagga gatcatccgc acaaccggca aggagaatgc caagtacctg 4620
atcgagaaga tcaagctgca tgacatgcag gagggcaagt gcctctactc actggaggcc 4680
atcccactcg aggacctgct gaataaccca ttcaattacg aggtcgacca tatcatcccg 4740
cgctccgtgt cgttcgacaa ttccttcaat aacaaggtcc tcgtcaagca ggaggagaac 4800
tccaagaagg gcaatcgcac cccgttccag tacctgtcct cttcggacag caagatctct 4860
tacgagacat tcaagaagca catcctcaac ctggccaagg gcaagggccg gatctccaag 4920
accaagaagg agtacctcct ggaggagagg gatatcaacc ggttcagcgt gcagaaggac 4980
ttcatcaatc gcaacctggt cgatacccgg tacgccacca ggggcctcat gaacctgctc 5040
cggtcctact tccgggtgaa caatctcgac gtgaaggtca agagcatcaa cggcggcttc 5100
acctcgttcc tcaggcggaa gtggaagttc aagaaggagc ggaacaaggg ctacaagcac 5160
catgccgagg acgccctcat catcgcgaac gcggacttca tcttcaagga gtggaagaag 5220
ctcgataagg cgaagaaggt catggagaac cagatgttcg aggagaagca ggccgagtcg 5280
atgccagaga tcgagacaga gcaggagtac aaggagatct tcatcacccc gcaccagatc 5340
aagcacatca aggacttcaa ggactacaag tactcccatc gggtcgataa gaagccaaat 5400
cggaagctca tcaatgatac cctctactcg acacgcaagg atgacaaggg caacaccctg 5460
atcgtcaata acctcaatgg cctctacgac aaggataacg acaagctgaa gaagctcatc 5520
aacaagagcc cagagaagct cctcatgtac caccacgatc cgcagacata ccagaagctc 5580
aagctgatca tggagcagta cggcgacgag aagaacccac tctacaagta ctacgaggag 5640
acaggcaact acctgaccaa gtactccaag aaggacaatg gcccagtgat caagaagatc 5700
aagtactacg gcaataagct gaacgcccac ctcgatatca cggacgatta ccctaacagc 5760
cggaataagg tggtcaagct gtccctcaag ccgtaccgct tcgacgtcta cctggataac 5820
ggcgtctaca agttcgtgac agtcaagaat ctcgacgtca tcaagaagga gaactactac 5880
gaggtcaatt ctaagtgcta cgaggaggcc aagaagctca agaagatcag caaccaggcc 5940
gagttcatcg ccagcttcta caagaacgat ctgatcaaga tcaacggcga gctctacagg 6000
gtcatcggcg tgaacaatga cctgctcaat aggatcgagg tgaacatgat cgacatcacc 6060
taccgcgagt acctcgagaa catgaacgat aagcggcctc cacacatcat caagacaatc 6120
gcctctaaga cccagtccat caagaagtac tccacggata tcctcggcaa cctctacgag 6180
gtgaagtcaa agaagcaccc gcagatcatc aagaagggct cggctggagg aggaggcacg 6240
ggaggaggag gctccgccga gtatgtgcgc gcgctcttcg acttcaacgg caatgacgag 6300
gaggatctcc ctttcaagaa gggcgacatc ctccgcatcc gcgataagcc ggaggagcag 6360
tggtggaacg cagaggactc cgagggcaag cggggcatga tcctggtgcc atacgtcgag 6420
aagtacagcg gcgattacaa ggaccacgat ggcgactaca aggatcatga catcgattac 6480
aaggacgatg acgataagtc cggcgtcgac atgacggacg cggagtatgt gcgcatccac 6540
gagaagctcg atatctacac cttcaagaag cagttcttca acaataagaa gtcggtgtcc 6600
catcggtgct acgtcctctt cgagctgaag cgcaggggag agcgccgcgc ctgcttctgg 6660
ggctacgcgg tgaataagcc gcagtcaggc acagagcgcg gcatccacgc cgagatcttc 6720
tcgatccgga aggtcgagga gtacctccgc gacaacccag gccagttcac gatcaattgg 6780
tactccagct ggtccccttg cgcagattgc gcagagaaga tcctcgagtg gtacaaccag 6840
gagctgaggg gcaatggcca taccctcaag atctgggcct gcaagctgta ctacgagaag 6900
aacgcgagga atcagatcgg cctctggaac ctgcgggata atggcgtggg cctcaacgtg 6960
atggtgtccg agcactacca gtgctgccgc aagatcttca tccagtcctc ccacaatcag 7020
ctgaacgaga ataggtggct cgaaaagacc ctgaagcgcg ccgagaagtg gaggagcgag 7080
ctgtctatca tgatccaggt caagatcctg cacaccacaa agtcaccggc ggtgggcggc 7140
ggcggcagcg aattctccgg cggcagcacg aacctcagcg acatcatcga gaaggagaca 7200
ggcaagcagc tcgtgatcca ggagtctatc ctcatgctgc ctgaggaggt ggaggaggtc 7260
atcggcaaca agccggagtc cgatatcctc gtgcacaccg cctacgacga gtcgacagat 7320
gagaatgtca tgctcctgac ctccgacgca ccagagtaca agccatgggc gctcgtgatc 7380
caggattcca acggcgagaa taagatcaag atgctgtctg gcggctcccc gaagaagaag 7440
cgcaaggtct agactagtct gaaatcacca gtctctctct acaaatctat ctctctctat 7500
aataatgtgt gagtagttcc cagataaggg aattagggtt cttatagggt ttcgctcatg 7560
tgttgagcat ataagaaacc cttagtatgt atttgtattt gtaaaatact tctatcaata 7620
aaatttctaa ttcctaaaac caaaatccag tggggcgccc gacctgtact cgcgaaggtt 7680
aacttacaga gagtgtccgg gcgcgcctgg tggatcgtcc gcctaggctg cagtgcagcg 7740
tgacccggtc gtgcccctct ctagagataa tgagcattgc atgtctaagt tataaaaaat 7800
taccacatat tttttttgtc acacttgttt gaagtgcagt ttatctatct ttatacatat 7860
atttaaactt tactctacga ataatataat ctatagtact acaataatat cagtgtttta 7920
gagaatcata taaatgaaca gttagacatg gtctaaagga caattgagta ttttgacaac 7980
aggactctac agttttatct ttttagtgtg catgtgttct cctttttttt tgcaaatagc 8040
ttcacctata taatacttca tccattttat tagtacatcc atttagggtt tagggttaat 8100
ggtttttata gactaatttt tttagtacat ctattttatt ctattttagc ctctaaatta 8160
agaaaactaa aactctattt tagttttttt atttaataat ttagatataa aatagaataa 8220
aataaagtga ctaaaaatta aacaaatacc ctttaagaaa ttaaaaaaac taaggaaaca 8280
tttttcttgt ttcgagtaga taatgccagc ctgttaaacg ccgtcgacga gtctaacgga 8340
caccaaccag cgaaccagca gcgtcgcgtc gggccaagcg aagcagacgg cacggcatct 8400
ctgtcgctgc ctctggaccc ctctcgagag ttccgctcca ccgttggact tgctccgctg 8460
tcggcatcca gaaattgcgt ggcggagcgg cagacgtgag ccggcacggc aggcggcctc 8520
ctcctcctct cacggcaccg gcagctacgg gggattcctt tcccaccgct ccttcgcttt 8580
cccttcctcg cccgccgtaa taaatagaca ccccctccac accctctttc cccaacctcg 8640
tgttgttcgg agcgcacaca cacacaacca gatctccccc aaatccaccc gtcggcacct 8700
ccgcttcaag gtacgccgct cgtcctcccc ccccccccct ctctaccttc tctagatcgg 8760
cgttccggtc catggttagg gcccggtagt tctacttctg ttcatgtttg tgttagatcc 8820
gtgtttgtgt tagatccgtg ctgctagcgt tcgtacacgg atgcgacctg tacgtcagac 8880
acgttctgat tgctaacttg ccagtgtttc tctttgggga atcctgggat ggctctagcc 8940
gttccgcaga cgggatcgat ttcatgattt tttttgtttc gttgcatagg gtttggtttg 9000
cccttttcct ttatttcaat atatgccgtg cacttgtttg tcgggtcatc ttttcatgct 9060
tttttttgtc ttggttgtga tgatgtggtc tggttgggcg gtcgttctag atcggagtag 9120
aattctgttt caaactacct ggtggattta ttaattttgg atctgtatgt gtgtgccata 9180
catattcata gttacgaatt gaagatgatg gatggaaata tcgatctagg ataggtatac 9240
atgttgatgc gggttttact gatgcatata cagagatgct ttttgttcgc ttggttgtga 9300
tgatgtggtg tggttgggcg gtcgttcatt cgttctagat cggagtagaa tactgtttca 9360
aactacctgg tgtatttatt aattttggaa ctgtatgtgt gtgtcataca tcttcatagt 9420
tacgagttta agatggatgg aaatatcgat ctaggatagg tatacatgtt gatgtgggtt 9480
ttactgatgc atatacatga tggcatatgc agcatctatt catatgctct aaccttgagt 9540
acctatctat tataataaac aagtatgttt tataattatt ttgatcttga tatacttgga 9600
tgatggcata tgcagcagct atatgtggat ttttttagcc ctgccttcat acgctattta 9660
tttgcttggt actgtttctt ttgtcgatgc tcaccctgtt gtttggtgtt acttctgcag 9720
gagctcatga aaaagcctga actcaccgcg acgtctgtcg agaagtttct gatcgaaaag 9780
ttcgacagcg tctccgacct gatgcagctc tcggagggcg aagaatctcg tgctttcagc 9840
ttcgatgtag gagggcgtgg atatgtcctg cgggtaaata gctgcgccga tggtttctac 9900
aaagatcgtt atgtttatcg gcactttgca tcggccgcgc tcccgattcc ggaagtgctt 9960
gacattgggg agtttagcga gagcctgacc tattgcatct cccgccgttc acagggtgtc 10020
acgttgcaag acctgcctga aaccgaactg cccgctgttc tacaaccggt cgcggaggct 10080
atggatgcga tcgctgcggc cgatcttagc cagacgagcg ggttcggccc attcggaccg 10140
caaggaatcg gtcaatacac tacatggcgt gatttcatat gcgcgattgc tgatccccat 10200
gtgtatcact ggcaaactgt gatggacgac accgtcagtg cgtccgtcgc gcaggctctc 10260
gatgagctga tgctttgggc cgaggactgc cccgaagtcc ggcacctcgt gcacgcggat 10320
ttcggctcca acaatgtcct gacggacaat ggccgcataa cagcggtcat tgactggagc 10380
gaggcgatgt tcggggattc ccaatacgag gtcgccaaca tcttcttctg gaggccgtgg 10440
ttggcttgta tggagcagca gacgcgctac ttcgagcgga ggcatccgga gcttgcagga 10500
tcgccacgac tccgggcgta tatgctccgc attggtcttg accaactcta tcagagcttg 10560
gttgacggca atttcgatga tgcagcttgg gcgcagggtc gatgcgacgc aatcgtccga 10620
tccggagccg ggactgtcgg gcgtacacaa atcgcccgca gaagcgcggc cgtctggacc 10680
gatggctgtg tagaagtact cgccgatagt ggaaaccgac gccccagcac tcgtccgagg 10740
gcaaagaaat agagtagatg ccgaccggga tctgtcgatc gacaagctcg agtttctcca 10800
taataatgtg tgagtagttc ccagataagg gaattagggt tcctataggg tttcgctcat 10860
gtgttgagca tataagaaac ccttagtatg tatttgtatt tgtaaaatac ttctatcaat 10920
aaaatttcta attcctaaaa ccaaaatcca gtactaaaat ccagatcccc cgaattaatt 10980
cggcgttaat tcagcctgca ggacgcgttt aattaagtgc acgcggccgc ctacttagtc 11040
aagagcctcg cacgcgactg tcacgcggcc aggatcgcct cgtgagcctc gcaatctgta 11100
cctagtgttt aaactatcag tgtttgacag gatatattgg cgggtaaacc taagagaaaa 11160
gagcgtttat tagaataacg gatatttaaa agggcgtgaa aaggtttatc cgttcgtcca 11220
tttgtatgtg catgccaacc acagggttcc cctcgggatc aaagtacttt gatccaaccc 11280
ctccgctgct atagtgcagt cggcttctga cgttcagtgc agccgtcttc tgaaaacgac 11340
atgtcgcaca agtcctaagt tacgcgacag gctgccgccc tgcccttttc ctggcgtttt 11400
cttgtcgcgt gttttagtcg cataaagtag aatacttgcg actagaaccg gagacattac 11460
gccatgaaca agagcgccgc cgctggcctg ctgggctatg cccgcgtcag caccgacgac 11520
caggacttga ccaaccaacg ggccgaactg cacgcggccg gctgcaccaa gctgttttcc 11580
gagaagatca ccggcaccag gcgcgaccgc ccggagctgg ccaggatgct tgaccaccta 11640
cgccctggcg acgttgtgac agtgaccagg ctagaccgcc tggcccgcag cacccgcgac 11700
ctactggaca ttgccgagcg catccaggag gccggcgcgg gcctgcgtag cctggcagag 11760
ccgtgggccg acaccaccac gccggccggc cgcatggtgt tgaccgtgtt cgccggcatt 11820
gccgagttcg agcgttccct aatcatcgac cgcacccgga gcgggcgcga ggccgccaag 11880
gcccgaggcg tgaagtttgg cccccgccct accctcaccc cggcacagat cgcgcacgcc 11940
cgcgagctga tcgaccagga aggccgcacc gtgaaagagg cggctgcact gcttggcgtg 12000
catcgctcga ccctgtaccg cgcacttgag cgcagcgagg aagtgacgcc caccgaggcc 12060
aggcggcgcg gtgccttccg tgaggacgca ttgaccgagg ccgacgccct ggcggccgcc 12120
gagaatgaac gccaagagga acaagcatga aaccgcacca ggacggccag gacgaaccgt 12180
ttttcattac cgaagagatc gaggcggaga tgatcgcggc cgggtacgtg ttcgagccgc 12240
ccgcgcacgt ctcaaccgtg cggctgcatg aaatcctggc cggtttgtct gatgccaagc 12300
tggcggcctg gccggccagc ttggccgctg aagaaaccga gcgccgccgt ctaaaaaggt 12360
gatgtgtatt tgagtaaaac agcttgcgtc atgcggtcgc tgcgtatatg atgcgatgag 12420
taaataaaca aatacgcaag gggaacgcat gaaggttatc gctgtactta accagaaagg 12480
cgggtcaggc aagacgacca tcgcaaccca tctagcccgc gccctgcaac tcgccggggc 12540
cgatgttctg ttagtcgatt ccgatcccca gggcagtgcc cgcgattggg cggccgtgcg 12600
ggaagatcaa ccgctaaccg ttgtcggcat cgaccgcccg acgattgacc gcgacgtgaa 12660
ggccatcggc cggcgcgact tcgtagtgat cgacggagcg ccccaggcgg cggacttggc 12720
tgtgtccgcg atcaaggcag ccgacttcgt gctgattccg gtgcagccaa gcccttacga 12780
catatgggcc accgccgacc tggtggagct ggttaagcag cgcattgagg tcacggatgg 12840
aaggctacaa gcggcctttg tcgtgtcgcg ggcgatcaaa ggcacgcgca tcggcggtga 12900
ggttgccgag gcgctggccg ggtacgagct gcccattctt gagtcccgta tcacgcagcg 12960
cgtgagctac ccaggcactg ccgccgccgg cacaaccgtt cttgaatcag aacccgaggg 13020
cgacgctgcc cgcgaggtcc aggcgctggc cgctgaaatt aaatcaaaac tcatttgagt 13080
taatgaggta aagagaaaat gagcaaaagc acaaacacgc taagtgccgg ccgtccgagc 13140
gcacgcagca gcaaggctgc aacgttggcc agcctggcag acacgccagc catgaagcgg 13200
gtcaactttc agttgccggc ggaggatcac accaagctga agatgtacgc ggtacgccaa 13260
ggcaagacca ttaccgagct gctatctgaa tacatcgcgc agctaccaga gtaaatgagc 13320
aaatgaataa atgagtagat gaattttagc ggctaaagga ggcggcatgg aaaatcaaga 13380
acaaccaggc accgacgccg tggaatgccc catgtgtgga ggaacgggcg gttggccagg 13440
cgtaagcggc tgggttgtct gccggccctg caatggcact ggaaccccca agcccgagga 13500
atcggcgtga cggtcgcaaa ccatccggcc cggtacaaat cggcgcggcg ctgggtgatg 13560
acctggtgga gaagttgaag gccgcgcagg ccgcccagcg gcaacgcatc gaggcagaag 13620
cacgccccgg tgaatcgtgg caagcggccg ctgatcgaat ccgcaaagaa tcccggcaac 13680
cgccggcagc cggtgcgccg tcgattagga agccgcccaa gggcgacgag caaccagatt 13740
ttttcgttcc gatgctctat gacgtgggca cccgcgatag tcgcagcatc atggacgtgg 13800
ccgttttccg tctgtcgaag cgtgaccgac gagctggcga ggtgatccgc tacgagcttc 13860
cagacgggca cgtagaggtt tccgcagggc cggccggcat ggccagtgtg tgggattacg 13920
acctggtact gatggcggtt tcccatctaa ccgaatccat gaaccgatac cgggaaggga 13980
agggagacaa gcccggccgc gtgttccgtc cacacgttgc ggacgtactc aagttctgcc 14040
ggcgagccga tggcggaaag cagaaagacg acctggtaga aacctgcatt cggttaaaca 14100
ccacgcacgt tgccatgcag cgtacgaaga aggccaagaa cggccgcctg gtgacggtat 14160
ccgagggtga agccttgatt agccgctaca agatcgtaaa gagcgaaacc gggcggccgg 14220
agtacatcga gatcgagcta gctgattgga tgtaccgcga gatcacagaa ggcaagaacc 14280
cggacgtgct gacggttcac cccgattact ttttgatcga tcccggcatc ggccgttttc 14340
tctaccgcct ggcacgccgc gccgcaggca aggcagaagc cagatggttg ttcaagacga 14400
tctacgaacg cagtggcagc gccggagagt tcaagaagtt ctgtttcacc gtgcgcaagc 14460
tgatcgggtc aaatgacctg ccggagtacg atttgaagga ggaggcgggg caggctggcc 14520
cgatcctagt catgcgctac cgcaacctga tcgagggcga agcatccgcc ggttcctaat 14580
gtacggagca gatgctaggg caaattgccc tagcagggga aaaaggtcga aaaggtctct 14640
ttcctgtgga tagcacgtac attgggaacc caaagccgta cattgggaac cggaacccgt 14700
acattgggaa cccaaagccg tacattggga accggtcaca catgtaagtg actgatataa 14760
aagagaaaaa aggcgatttt tccgcctaaa actctttaaa acttattaaa actcttaaaa 14820
cccgcctggc ctgtgcataa ctgtctggcc agcgcacagc cgaagagctg caaaaagcgc 14880
ctacccttcg gtcgctgcgc tccctacgcc ccgccgcttc gcgtcggcct atcgcggccg 14940
ctggccgctc aaaaatggct ggcctacggc caggcaatct accagggcgc ggacaagccg 15000
cgccgtcgcc actcgaccgc cggcgcccac atcaaggcac cctgcctcgc gcgtttcggt 15060
gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa 15120
gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg 15180
ggcgcagcca tgacccagtc acgtagcgat agcggagtgt atactggctt aactatgcgg 15240
catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg 15300
taaggagaaa ataccgcatc aggcgctctt ccgcttcctc gctcactgac tcgctgcgct 15360
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 15420
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 15480
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 15540
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 15600
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 15660
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 15720
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 15780
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 15840
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 15900
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg 15960
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 16020
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 16080
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 16140
acgaaaactc acgttaaggg attttggtca tgcattctag gtactaaaac aattcatcca 16200
gtaaaatata atattttatt ttctcccaat caggcttgat ccccagtaag tcaaaaaata 16260
gctcgacata ctgttcttcc ccgatatcct ccctgatcga ccggacgcag aaggcaatgt 16320
cataccactt gtccgccctg ccgcttctcc caagatcaat aaagccactt actttgccat 16380
ctttcacaaa gatgttgctg tctcccaggt cgccgtggga aaagacaagt tcctcttcgg 16440
gcttttccgt ctttaaaaaa tcatacagct cgcgcggatc tttaaatgga gtgtcttctt 16500
cccagttttc gcaatccaca tcggccagat cgttattcag taagtaatcc aattcggcta 16560
agcggctgtc taagctattc gtatagggac aatccgatat gtcgatggag tgaaagagcc 16620
tgatgcactc cgcatacagc tcgataatct tttcagggct ttgttcatct tcatactctt 16680
ccgagcaaag gacgccatcg gcctcactca tgagcagatt gctccagcca tcatgccgtt 16740
caaagtgcag gacctttgga acaggcagct ttccttccag ccatagcatc atgtcctttt 16800
cccgttccac atcataggtg gtccctttat accggctgtc cgtcattttt aaatataggt 16860
tttcattttc tcccaccagc ttatatacct tagcaggaga cattccttcc gtatctttta 16920
cgcagcggta tttttcgatc agttttttca attccggtga tattctcatt ttagccattt 16980
attatttcct tcctcttttc tacagtattt aaagataccc caagaagcta attataacaa 17040
gacgaactcc aattcactgt tccttgcatt ctaaaacctt aaataccaga aaacagcttt 17100
ttcaaagttg ttttcaaagt tggcgtataa catagtatcg acggagccga ttttgaaacc 17160
gcggtgatca caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga 17220
gatcatccgt gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac 17280
atgagcaaag tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg 17340
ctgcctgtat cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct 17400
<210>2
<211>1071
<212>PRT
<213>Artificial Sequence
<400>2
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Lys Arg Asn Tyr Ile Leu Gly Leu Ala Ile Gly Ile Thr Ser Val
20 25 30
Gly Tyr Gly Ile Ile Asp Tyr Glu Thr Arg Asp Val Ile Asp Ala Gly
35 40 45
Val Arg Leu Phe Lys Glu Ala Asn Val Glu Asn Asn Glu Gly Arg Arg
50 55 60
Ser Lys Arg Gly Ala Arg Arg Leu Lys Arg Arg Arg Arg His Arg Ile
65 70 75 80
Gln Arg Val Lys Lys Leu Leu Phe Asp Tyr Asn Leu Leu Thr Asp His
85 90 95
Ser Glu Leu Ser Gly Ile Asn Pro Tyr Glu Ala Arg Val Lys Gly Leu
100 105 110
Ser Gln Lys Leu Ser Glu Glu Glu Phe Ser Ala Ala Leu Leu His Leu
115 120 125
Ala Lys Arg Arg Gly Val His Asn Val Asn Glu Val Glu Glu Asp Thr
130 135 140
Gly Asn Glu Leu Ser Thr Lys Glu Gln Ile Ser Arg Asn Ser Lys Ala
145 150 155 160
Leu Glu Glu Lys Tyr Val Ala Glu Leu Gln Leu Glu Arg Leu Lys Lys
165 170 175
Asp Gly Glu Val Arg Gly Ser Ile Asn Arg Phe Lys Thr Ser Asp Tyr
180 185 190
Val Lys Glu Ala Lys Gln Leu Leu Lys Val Gln Lys Ala Tyr His Gln
195 200 205
Leu Asp Gln Ser Phe Ile Asp Thr Tyr Ile Asp Leu Leu Glu Thr Arg
210 215 220
Arg Thr Tyr Tyr Glu Gly Pro Gly Glu Gly Ser Pro Phe Gly Trp Lys
225 230 235 240
Asp Ile Lys Glu Trp Tyr Glu Met Leu Met Gly His Cys Thr Tyr Phe
245 250 255
Pro Glu Glu Leu Arg Ser Val Lys Tyr Ala Tyr Asn Ala Asp Leu Tyr
260 265 270
Asn Ala Leu Asn Asp Leu Asn Asn Leu Val Ile Thr Arg Asp Glu Asn
275 280 285
Glu Lys Leu Glu Tyr Tyr Glu Lys Phe Gln Ile Ile Glu Asn Val Phe
290 295 300
Lys Gln Lys Lys Lys Pro Thr Leu Lys Gln Ile Ala Lys Glu Ile Leu
305 310 315 320
Val Asn Glu Glu Asp Ile Lys Gly Tyr Arg Val Thr Ser Thr Gly Lys
325 330 335
Pro Glu Phe Thr Asn Leu Lys Val Tyr His Asp Ile Lys Asp Ile Thr
340 345 350
Ala Arg Lys Glu Ile Ile Glu Asn Ala Glu Leu Leu Asp Gln Ile Ala
355 360 365
Lys Ile Leu Thr Ile Tyr Gln Ser Ser Glu Asp Ile Gln Glu Glu Leu
370 375 380
Thr Asn Leu Asn Ser Glu Leu Thr Gln Glu Glu Ile Glu Gln Ile Ser
385 390 395 400
Asn Leu Lys Gly Tyr Thr Gly Thr His Asn Leu Ser Leu Lys Ala Ile
405 410 415
Asn Leu Ile Leu Asp Glu Leu Trp His Thr Asn Asp Asn Gln Ile Ala
420 425 430
Ile Phe Asn Arg Leu Lys Leu Val Pro Lys Lys Val Asp Leu Ser Gln
435 440 445
Gln Lys Glu Ile Pro Thr Thr Leu Val Asp Asp Phe Ile Leu Ser Pro
450 455 460
Val Val Lys Arg Ser Phe Ile Gln Ser Ile Lys Val Ile Asn Ala Ile
465 470 475 480
Ile Lys Lys Tyr Gly Leu Pro Asn Asp Ile Ile Ile Glu Leu Ala Arg
485 490 495
Glu Lys Asn Ser Lys Asp Ala Gln Lys Met Ile Asn Glu Met Gln Lys
500 505 510
Arg Asn Arg Gln Thr Asn Glu Arg Ile Glu Glu Ile Ile Arg Thr Thr
515 520 525
Gly Lys Glu Asn Ala Lys Tyr Leu Ile Glu Lys Ile Lys Leu His Asp
530 535 540
Met Gln Glu Gly Lys Cys Leu Tyr Ser Leu Glu Ala Ile Pro Leu Glu
545 550 555 560
Asp Leu Leu Asn Asn Pro Phe Asn Tyr Glu Val Asp His Ile Ile Pro
565 570 575
Arg Ser Val Ser Phe Asp Asn Ser Phe Asn Asn Lys Val Leu Val Lys
580 585 590
Gln Glu Glu Asn Ser Lys Lys Gly Asn Arg Thr Pro Phe Gln Tyr Leu
595 600 605
Ser Ser Ser Asp Ser Lys Ile Ser Tyr Glu Thr Phe Lys Lys His Ile
610 615 620
Leu Asn Leu Ala Lys Gly Lys Gly Arg Ile Ser Lys Thr Lys Lys Glu
625 630 635 640
Tyr Leu Leu Glu Glu Arg Asp Ile Asn Arg Phe Ser Val Gln Lys Asp
645 650 655
Phe Ile Asn Arg Asn Leu Val Asp Thr Arg Tyr Ala Thr Arg Gly Leu
660 665 670
Met Asn Leu Leu Arg Ser Tyr Phe Arg Val Asn Asn Leu Asp Val Lys
675 680 685
Val Lys Ser Ile Asn Gly Gly Phe Thr Ser Phe Leu Arg Arg Lys Trp
690 695 700
Lys Phe Lys Lys Glu Arg Asn Lys Gly Tyr Lys His His Ala Glu Asp
705 710 715 720
Ala Leu Ile Ile Ala Asn Ala Asp Phe Ile Phe Lys Glu Trp Lys Lys
725 730 735
Leu Asp Lys Ala Lys Lys Val Met Glu Asn Gln Met Phe Glu Glu Lys
740 745 750
Gln Ala Glu Ser Met Pro Glu Ile Glu Thr Glu Gln Glu Tyr Lys Glu
755 760 765
Ile Phe Ile Thr Pro His Gln Ile Lys His Ile Lys Asp Phe Lys Asp
770 775 780
Tyr Lys Tyr Ser His Arg Val Asp Lys Lys Pro Asn Arg Lys Leu Ile
785 790 795 800
Asn Asp Thr Leu Tyr Ser Thr Arg Lys Asp Asp Lys Gly Asn Thr Leu
805 810 815
Ile Val Asn Asn Leu Asn Gly Leu Tyr Asp Lys Asp Asn Asp Lys Leu
820 825 830
Lys Lys Leu Ile Asn Lys Ser Pro Glu Lys Leu Leu Met Tyr His His
835 840 845
Asp Pro Gln Thr Tyr Gln Lys Leu Lys Leu Ile Met Glu Gln Tyr Gly
850 855 860
Asp Glu Lys Asn Pro Leu Tyr Lys Tyr Tyr Glu Glu Thr Gly Asn Tyr
865 870 875 880
Leu Thr Lys Tyr Ser Lys Lys Asp Asn Gly Pro Val Ile Lys Lys Ile
885 890 895
Lys Tyr Tyr Gly Asn Lys Leu Asn Ala His Leu Asp Ile Thr Asp Asp
900 905 910
Tyr Pro Asn Ser Arg Asn Lys Val Val Lys Leu Ser Leu Lys Pro Tyr
915 920 925
Arg Phe Asp Val Tyr Leu Asp Asn Gly Val Tyr Lys Phe Val Thr Val
930 935 940
Lys Asn Leu Asp Val Ile Lys Lys Glu Asn Tyr Tyr Glu Val Asn Ser
945 950 955 960
Lys Cys Tyr Glu Glu Ala Lys Lys Leu Lys Lys Ile Ser Asn Gln Ala
965 970 975
Glu Phe Ile Ala Ser Phe Tyr Lys Asn Asp Leu Ile Lys Ile Asn Gly
980 985 990
Glu Leu Tyr Arg Val Ile Gly Val Asn Asn Asp Leu Leu Asn Arg Ile
995 1000 1005
Glu Val Asn Met Ile Asp Ile Thr Tyr Arg Glu Tyr Leu Glu Asn
1010 1015 1020
Met Asn Asp Lys Arg Pro Pro His Ile Ile Lys Thr Ile Ala Ser
1025 1030 1035
Lys Thr Gln Ser Ile Lys Lys Tyr Ser Thr Asp Ile Leu Gly Asn
1040 1045 1050
Leu Tyr Glu Val Lys Ser Lys Lys His Pro Gln Ile Ile Lys Lys
1055 1060 1065
Gly Ser Ala
1070
<210>3
<211>208
<212>PRT
<213>Artificial Sequence
<400>3
Met Thr Asp Ala Glu Tyr Val Arg Ile His Glu Lys Leu Asp Ile Tyr
1 5 10 15
Thr Phe Lys Lys Gln Phe Phe Asn Asn Lys Lys Ser Val Ser His Arg
20 25 30
Cys Tyr Val Leu Phe Glu Leu Lys Arg Arg Gly Glu Arg Arg Ala Cys
35 40 45
Phe Trp Gly Tyr Ala Val Asn Lys Pro Gln Ser Gly Thr Glu Arg Gly
50 55 60
Ile His Ala Glu Ile Phe Ser Ile Arg Lys Val Glu Glu Tyr Leu Arg
65 70 75 80
Asp Asn Pro Gly Gln Phe Thr Ile Asn Trp Tyr Ser Ser Trp Ser Pro
85 90 95
Cys Ala Asp Cys Ala Glu Lys Ile Leu Glu Trp Tyr Asn Gln Glu Leu
100 105 110
Arg Gly Asn Gly His Thr Leu Lys Ile Trp Ala Cys Lys Leu Tyr Tyr
115 120 125
Glu Lys Asn Ala Arg Asn Gln Ile Gly Leu Trp Asn Leu Arg Asp Asn
130 135 140
Gly Val Gly Leu Asn Val Met Val Ser Glu His Tyr Gln Cys Cys Arg
145 150 155 160
Lys Ile Phe Ile Gln Ser Ser His Asn Gln Leu Asn Glu Asn Arg Trp
165 170 175
Leu Glu Lys Thr Leu Lys Arg Ala Glu Lys Trp Arg Ser Glu Leu Ser
180 185 190
Ile Met Ile Gln Val Lys Ile Leu His Thr Thr Lys Ser Pro Ala Val
195 200 205
<210>4
<211>98
<212>PRT
<213>Artificial Sequence
<400>4
Ser Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly
1 5 10 15
Lys Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val
20 25 30
Glu Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr
35 40 45
Ala Tyr Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp
50 55 60
Ala Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly
65 70 75 80
Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys Arg
85 90 95
Lys Val
<210>5
<211>522
<212>DNA
<213>Artificial Sequence
<400>5
aacaaagcac cagtggtcta gtggtagaat agtaccctgc cacggtacag acccgggttc 60
gattcccggc tggtgcagat gccacacagc aaggagtgtt ttagtactct ggaaacagaa 120
tctactaaaa caaggcaaaa tgccgtgttt atctcgtcaa cttgttggcg agataacaaa 180
gcaccagtgg tctagtggta gaatagtacc ctgccacggt acagacccgg gttcgattcc 240
cggctggtgc acagaaccga caacagatga ggttttagta ctctggaaac agaatctact 300
aaaacaaggc aaaatgccgt gtttatctcg tcaacttgtt ggcgagataa caaagcacca 360
gtggtctagt ggtagaatag taccctgcca cggtacagac ccgggttcga ttcccggctg 420
gtgcaccagc tcatttggct cggcggtttt agtactctgg aaacagaatc tactaaaaca 480
aggcaaaatg ccgtgtttat ctcgtcaact tgttggcgag at 522
<210>6
<211>174
<212>DNA
<213>Artificial Sequence
<400>6
aacaaagcac cagtggtcta gtggtagaat agtaccctgc cacggtacag acccgggttc 60
gattcccggc tggtgcatcc tcggcgtagt acgggctgtt ttagtactct ggaaacagaa 120
tctactaaaa caaggcaaaa tgccgtgttt atctcgtcaa cttgttggcg agat 174
Claims (10)
1.一种单突变体的制备方法,包括如下步骤:将SaKKHn-pBE碱基编辑系统导入生物体或生物细胞内,通过所述SaKKHn-pBE碱基编辑系统实现基因组靶点序列的编辑,获得所述单突变体;
所述单突变体为所述靶点序列中单个位点C突变为T的生物突变体;
所述SaKKHn-pBE碱基编辑系统包括SaKKHn蛋白质、tRNA-sgRNA和PmCDA1蛋白质;
所述tRNA-sgRNA靶向所述靶点序列;
所述tRNA-sgRNA如式I所示:tRNA-所述靶点序列转录的RNA-sgRNA骨架(式I);
所述tRNA为m1)或m2)或m3):
m1)将序列1第474-550位中的T替换为U得到的RNA分子;
m2)将m1)所示的RNA分子经过一个或几个核苷酸的取代和/或缺失和/或添加且具有相同功能的RNA分子;
m3)与m1)或m2)限定的核苷酸序列具有75%或75%以上同一性且具有相同功能的RNA分子;
所述sgRNA骨架为n1)或n2)或n3):
n1)将序列1第571-647位中的T替换为U得到的RNA分子;
n2)将n1)所示的RNA分子经过一个或几个核苷酸的取代和/或缺失和/或添加且具有相同功能的RNA分子;
n3)与n1)或n2)限定的核苷酸序列具有75%或75%以上同一性且具有相同功能的RNA分子。
2.根据权利要求1所述的方法,其特征在于:所述SaKKHn-pBE碱基编辑系统还包括UGI蛋白质。
3.根据权利要求1或2所述的方法,其特征在于:所述方法为方法(一)或方法(二)或方法(三)或方法(四):
所述方法(一)包括如下步骤:将SaKKHn蛋白质的编码基因、转录tRNA-sgRNA的DNA分子和PmCDA1蛋白质的编码基因导入生物体或生物细胞内,使所述SaKKHn蛋白质、所述tRNA-sgRNA和所述PmCDA1蛋白质的编码基因均得到表达,实现基因组靶点序列的编辑,获得所述单突变体;
所述方法(二)包括如下步骤:将SaKKHn蛋白质的编码基因、转录tRNA-sgRNA的DNA分子、PmCDA1蛋白质的编码基因和UGI的编码基因导入生物体或生物细胞内,使所述SaKKHn蛋白质、所述tRNA-sgRNA、所述PmCDA1蛋白质和所述UGI蛋白质的编码基因均得到表达,实现基因组靶点序列的编辑,获得所述单突变体;
所述方法(三)包括如下步骤:将SaKKHn蛋白质、tRNA-sgRNA和PmCDA1蛋白质导入生物体或生物细胞内,实现基因组靶点序列的编辑,获得所述单突变体;
所述方法(四)包括如下步骤:将SaKKHn蛋白质、tRNA-sgRNA、PmCDA1蛋白质和UGI蛋白质导入生物体或生物细胞内,实现基因组靶点序列的编辑,获得所述单突变体。
4.根据权利要求1-3任一所述的方法,其特征在于:
所述SaKKHn蛋白质为A1)或A2)或A3):
A1)氨基酸序列是序列2所示的蛋白质;
A2)将序列表中序列2所示的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且具有相同功能的蛋白质;
A3)在A1)或A2)的N端或/和C端连接标签得到的融合蛋白质;
和/或,所述PmCDA1蛋白质为C1)或C2)或C3):
C1)氨基酸序列是序列3所示的蛋白质;
C2)将序列表中序列3所示的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且具有相同功能的蛋白质;
C3)在C1)或C2)的N端或/和C端连接标签得到的融合蛋白质;
和/或,所述UGI蛋白质为E1)或E2)或E3):
E1)氨基酸序列是序列4所示的蛋白质;
E2)将序列表中序列4所示的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且具有相同功能的蛋白质;
E3)在E1)或E2)的N端或/和C端连接标签得到的融合蛋白质。
5.根据权利要求1-4任一所述的方法,其特征在于:
所述SaKKHn蛋白质的编码基因为b1)或b2)或b3):
b1)序列表中序列1第3013-6225位所示的cDNA分子或DNA分子;
b2)与b1)限定的核苷酸序列具有75%或75%以上同一性,且编码所述SaKKHn的cDNA分子或DNA分子;
b3)在严格条件下与b1)或b2)限定的核苷酸序列杂交,且编码所述SaKKHn的cDNA分子或DNA分子;
和/或,所述PmCDA1蛋白质的编码基因为d1)或d2)或d3):
d1)序列表中序列1第6511-7134位所示的cDNA分子或DNA分子;
d2)与d1)限定的核苷酸序列具有75%或75%以上同一性,且编码所述PmCDA1的cDNA分子或DNA分子;
d3)在严格条件下与d1)或d2)限定的核苷酸序列杂交,且编码所述PmCDA1的cDNA分子或DNA分子;
和/或,所述UGI蛋白质的编码基因为f1)或f2)或f3):
f1)序列表中序列1第7156-7452位所示的cDNA分子或DNA分子;
f2)与f1)限定的核苷酸序列具有75%或75%以上同一性,且编码所述UGI的cDNA分子或DNA分子;
f3)在严格条件下与f1)或f2)限定的核苷酸序列杂交,且编码所述UGI的cDNA分子或DNA分子。
6.根据权利要求1-5任一所述的方法,其特征在于:所述基因组靶点序列的编辑为将所述靶点序列中的C突变为T。
7.根据权利要求1-6任一所述的方法,其特征在于:
所述生物体为p1)或p2)或p3)或p4):
p1)植物或动物;
p2)单子叶植物或双子叶植物;
p3)禾本科植物;
p4)水稻;
和/或,所述生物细胞为q1)或q2)或q3)或q4):
q1)植物细胞或动物细胞;
q2)单子叶植物细胞或双子叶植物细胞;
q3)禾本科植物细胞;
q4)水稻细胞。
8.一种制备单突变体的产品,其包括权利要求1-7中任一所述的SaKKHn-pBE碱基编辑系统。
9.权利要求1-7任一所述的方法或权利要求8所述的产品在提高单突变体比例中的应用。
10.根据权利要求8所述的产品或权利要求9所述的应用,其特征在于:
所述单突变体为利用权利要求1-7中任一所述的SaKKHn-pBE碱基编辑系统进行基因组靶点序列编辑获得的所述靶点序列中单个位点C突变为T的生物突变体;
所述单突变体比例为利用权利要求1-7中任一所述的SaKKHn-pBE碱基编辑系统进行基因组靶点序列编辑获得的所述靶点序列中单个位点C突变为T的生物突变体占获得的所有生物突变体的比例。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911202341.6A CN111304242A (zh) | 2019-11-29 | 2019-11-29 | 一种基于SaKKHn-pBE系统制备单突变体的方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911202341.6A CN111304242A (zh) | 2019-11-29 | 2019-11-29 | 一种基于SaKKHn-pBE系统制备单突变体的方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111304242A true CN111304242A (zh) | 2020-06-19 |
Family
ID=71156151
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911202341.6A Pending CN111304242A (zh) | 2019-11-29 | 2019-11-29 | 一种基于SaKKHn-pBE系统制备单突变体的方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111304242A (zh) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109456973A (zh) * | 2018-12-28 | 2019-03-12 | 北京市农林科学院 | SpCas9n&PmCDA1&UGI碱基编辑系统在植物基因编辑中的应用 |
CN109652440A (zh) * | 2018-12-28 | 2019-04-19 | 北京市农林科学院 | VQRn-Cas9&PmCDA1&UGI碱基编辑系统在植物基因编辑中的应用 |
-
2019
- 2019-11-29 CN CN201911202341.6A patent/CN111304242A/zh active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109456973A (zh) * | 2018-12-28 | 2019-03-12 | 北京市农林科学院 | SpCas9n&PmCDA1&UGI碱基编辑系统在植物基因编辑中的应用 |
CN109652440A (zh) * | 2018-12-28 | 2019-04-19 | 北京市农林科学院 | VQRn-Cas9&PmCDA1&UGI碱基编辑系统在植物基因编辑中的应用 |
Non-Patent Citations (4)
Title |
---|
BENJAMIN P KLEINSTIVER 等: "Broadening the targeting range of Staphylococcus aureus CrIsPr-Cas9 by modifying PAM recognition", 《NATURE BIOTECHNOLOGY》 * |
F. ANN RAN 等: "In vivo genome editing using Staphylococcus aureus Cas9", 《NATURE》 * |
RUIYING QIN 等: "Developing a highly efficient and wildly adaptive CRISPR-SaCas9 toolset for plant genome editing", 《PLANT BIOTECHNOLOGY JOURNAL》 * |
YING WU 等: "Increasing Cytosine Base Editing Scope and Efficiency With Engineered Cas9-PmCDA1 Fusions and the Modified sgRNA in Rice", 《FRONTIERS IN GENETICS》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108707621B (zh) | 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 | |
CN101855233B (zh) | 合成5’utr、表达载体以及增强转基因表达的方法 | |
CN107418954B (zh) | 毛白杨基因PtomiR390a及其应用 | |
CN108085287B (zh) | 一种重组谷氨酸棒状杆菌、其制备方法及其应用 | |
CN109206496B (zh) | 蛋白质GhFLS1在调控植物耐热性中的应用 | |
CN108342409B (zh) | 一种植物RNAi表达载体及其构建方法和应用 | |
CN110835631B (zh) | 一种改造的sgRNA及其在提高碱基编辑效率中的应用 | |
CN110835630B (zh) | 一种高效的sgRNA及其在基因编辑中的应用 | |
CN112778405B (zh) | 一种与植物开花期相关的蛋白及其编码基因与应用 | |
CN113121662B (zh) | 棉花GhBZR3蛋白及其编码基因在调节植物生长发育中的应用 | |
CN103205458A (zh) | 一种适合单子叶植物转化的中间表达载体及其构建方法 | |
CN110408646B (zh) | 一种植物遗传转化筛选载体及其应用 | |
CN111304242A (zh) | 一种基于SaKKHn-pBE系统制备单突变体的方法 | |
CN110923263B (zh) | 水稻β-淀粉酶BA1及其编码基因与应用 | |
CN110747186B (zh) | 在植物中高效生成不携带转基因元件的突变体的CRISPR/Cas9系统和方法 | |
CN111154797A (zh) | 一种基因枪介导的玉米骨干自交系的遗传转化方法 | |
CN109321594B (zh) | 一种以黄花蒿悬浮细胞系为受体的转iaaM基因提高黄花蒿中青蒿素含量的方法 | |
CN111269298B (zh) | 蛋白质GhCCoAOMT7在调控植物耐热性中的应用 | |
CN111187787A (zh) | 一种多功能植物表达载体及其构建方法和应用 | |
CN111154796B (zh) | 一种农杆菌介导的玉米骨干自交系的遗传转化方法 | |
CN109694402B (zh) | 植物木质素合成相关蛋白及其编码基因和应用 | |
CN112458111A (zh) | 一种利用水稻内源序列抑制HSL1基因表达的RNAi植物表达载体及其应用 | |
CN112575028A (zh) | 一种抑制HIS1基因表达的RNAi植物表达载体及其应用 | |
CN109694877B (zh) | 培育具有不同木质素含量的转基因植物的方法 | |
CN112458113A (zh) | 一种植物转基因显性抑制载体及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200619 |
|
RJ01 | Rejection of invention patent application after publication |