CN117511942A - Method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system
- Publication number
- CN117511942A (application number CN202311463065.5A)
- Authority
- CN
- China
- Prior art keywords
- rfxcas13d
- lys
- crrna
- circrna
- vector
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
- 239000004055 small Interfering RNA Substances 0.000 description 2
- 108010001055 thymocartin Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 108700026220 vif Genes Proteins 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 1
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- MWXBCJKQRQFVOO-DCAQKATOSA-N His-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N MWXBCJKQRQFVOO-DCAQKATOSA-N 0.000 description 1
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
Abstract
The invention discloses a method for specifically knocking down plant circRNA based on the CRISPR/RfxCas13d system, comprising the following steps: S1, designing and synthesizing a plant-codon-optimized target gene sequence encoding RfxCas13d, and constructing the pCAMBIA1300-RfxCas13d recombinant vector; S2, synthesizing an AtU6-crRNA gene fragment containing two BsaI restriction sites, and constructing the pENTR4:gRNA4-AtU6-crRNA recombinant vector; S3, synthesizing and constructing a pCAMBIA1300-RfxCas13d-BSJ-crRNA vector that specifically targets the BSJ site of a plant circRNA; S4, transferring the constructed pCAMBIA1300-RfxCas13d-BSJ-crRNA expression vector into plant callus, and obtaining, through tissue culture, transgenic plants in which the plant circRNA is specifically knocked down. The invention greatly accelerates functional research on plant circRNAs, thereby enabling the development of circRNA gain-of-function mutants and new germplasm in plant breeding.
Description
Technical Field
The invention relates to the technical field of biology, in particular to a method for specifically knocking down plant circRNA based on a CRISPR/RfxCas13d system.
Background
Circular RNAs (circRNAs) are a recently recognized class of non-coding RNA that, unlike most linear RNAs, form covalently closed circular structures. Research shows that circRNAs are structurally stable, diverse, conserved in sequence, and expressed in a cell- and tissue-specific manner. They have many potential functions, play important roles in regulating gene expression and various biological processes, and can serve as biosensors, regulators, and potential therapeutic tools, offering new strategies and tools for future disease-resistance breeding of crops.
However, a circRNA shares its entire sequence with its cognate linear mRNA except at the back-splice junction (BSJ). This overlap makes it difficult to separate the function of a circRNA from that of its cognate mRNA, so knocking down a circRNA without affecting the expression of its parent gene remains a challenge. In humans and animals, circRNA knockdown by RNA interference (siRNA/shRNA) is a common method. In plant circRNA research, however, a circRNA can currently only be knocked out with the CRISPR-Cas9 system, either by directly deleting the exons that form the circular RNA or by suppressing circRNA biogenesis through deletion of the reverse-complementary sequences in its flanking regions. Because the sequence of a circRNA is almost identical to that of its parent gene, the CRISPR-Cas9 system generally cannot knock out only the circRNA without affecting parent gene expression. Thus, there is currently no viable approach for the specific knockdown of plant circRNA.
Cas13d enzymes such as RfxCas13d average about 930 amino acids in length, roughly 20% smaller than other Cas13 enzymes and 33% smaller than Cas9. Guided by a guide RNA, RfxCas13d can knock down RNA without changing the genome. These properties make RfxCas13d an important mRNA editing tool that can be developed into a practical means of altering gene expression without permanently changing the genomic sequence. Studies have shown that RfxCas13d-BSJ-gRNA-mediated knockdown of circRNA in animal cells has an average efficiency of 65 percent, compared with 35 percent for shRNA, which indicates that CRISPR/RfxCas13d can be widely applied to the study of circRNA function.
Therefore, there is an urgent need to overcome the difficulties of the prior art and achieve a breakthrough in circRNA editing in plants: to construct a CRISPR/RfxCas13d vector suitable for knocking down plant circRNA that targets the sequence at the BSJ site of the plant circRNA, effectively and specifically distinguishes the circular RNA from its mRNA, and specifically knocks down the circRNA. This would greatly accelerate functional research on plant circRNAs and enable the development of circRNA gain-of-function mutants and new germplasm in plant breeding.
Disclosure of Invention
The invention aims to provide a method for specifically knocking down plant circRNA based on the CRISPR/RfxCas13d system. The method addresses the prior-art problem that a target circRNA cannot be specifically edited in plants, greatly accelerates functional research on plant circRNAs, and thereby enables the development of circRNA gain-of-function mutants and new germplasm in plant breeding.
The invention provides a method for specifically knocking down plant circRNA based on a CRISPR/RfxCas13d system, which comprises the following steps:
S1, designing and synthesizing a target gene sequence encoding RfxCas13d with plant codon preference, and constructing a pCAMBIA1300-RfxCas13d recombinant vector;
S2, synthesizing an AtU6-crRNA gene fragment comprising two BsaI restriction sites, and constructing a pENTR4:gRNA4-AtU6-crRNA recombinant vector;
S3, synthesizing and constructing a pCAMBIA1300-RfxCas13d-BSJ-crRNA vector that specifically targets the BSJ site of a plant circRNA;
S4, transferring the constructed pCAMBIA1300-RfxCas13d-BSJ-crRNA expression vector into plant callus, and obtaining, through tissue culture, a transgenic plant in which the plant circRNA is specifically knocked down.
Preferably, in step S1, the pCAMBIA1300-RfxCas13d recombinant vector comprises a CaMV 35S promoter, a hygromycin resistance gene, a CaMV poly(A) signal terminator, pVS1 RepA, a pVS1 replication origin, a rice ubiquitin promoter, SV40 NLS-RfxCas13d-SV40 NLS and a NOS terminator.
Preferably, step S1 comprises the following steps: the target gene sequence encoding RfxCas13d is optimized for plant codon preference, synthesized, and cloned directly into a linearized pUC57 vector, which is named pUC57:RfxCas13d. Starting from a CRISPR/spCas9 vector, the plasmid pUC57:RfxCas13d and the CRISPR/spCas9 vector are each double-digested with BamHI and NcoI; the 3003 bp RfxCas13d gene fragment and the approximately 12.6 kb CRISPR/spCas9 vector backbone are recovered; and the 3003 bp RfxCas13d fragment is ligated into the CRISPR/spCas9 vector backbone using T4 DNA ligase. The resulting plasmid is named pCAMBIA1300-RfxCas13d. The plasmid is sequenced and stored, and the transformed Escherichia coli is preserved after successful verification.
Preferably, the base sequence of the SV40 NLS-RfxCas13d-SV40 NLS gene is SEQ ID No.1.
SEQ ID No.1(SV40 NLS-RfxCas13d-SV40 NLS)
atgtctccgaaaaaaaaaaggaaggtcgaggcgagcattgagaaaaagaagagctttgctaaaggtatgggggttaagagtacattggtgtccggttccaaagtgtatatgactacatttgcggaaggcagcgacgcaaggctggagaaaatagttgaaggcgattcaataagatcggtgaatgaaggagaagccttctccgctgaaatggccgacaaaaatgccggttataagattggtaacgctaagttctcgcacccaaagggttatgcagtcgtggcgaacaatccactgtacacaggtcctgtgcagcaggatatgcttggtttgaaggaaacgttggaaaagcggtacttcggtgagtccgctgacggcaatgataatatatgtattcaggtgatacacaacatcctcgatatagaaaaaatcctcgctgagtacattacaaatgccgcctacgcggtcaataacatatccggtttggacaaagacatcataggatttggaaaattctctacagtgtacacgtacgatgaatttaaagacccggagcatcacagagcagctttcaacaacaatgataagctcataaatgcaataaaagcgcaatatgatgaatttgacaactttcttgataacccgaggctgggctattttggtcaggctttcttcagcaaagaagggcgcaactatattataaattacggcaatgagtgctatgatattttggcattgttgtccggactgcggcattgggttgttcataataacgaggaagagagtaggatcagtcgcacatggctgtataaccttgataagaatctcgataatgagtatatttcgactttgaactacttgtatgatcgc
attactaacgaactcaccaacagcttttcgaagaactcggcagccaacgtgaattatatagcggaaaccttggggatta
acccagcagaattcgcagaacagtactttcgcttcagcataatgaaggaacaaaagaacctcggttttaacatcacaaa
actcagagaggttatgctggatagaaaagacatgtcagaaattcggaaaaatcataaagtgttcgattccatcaggacg
aaggtctacacaatgatggatttcgtcatctacagatattatattgaggaggatgcaaaagtcgcagcggccaacaaaa
gcctccctgacaatgaaaagtcgctctctgaaaaggacatctttgtcataaaccttcggggcagttttaacgatgaccaa
aaggacgccttgtactacgatgaagcaaatcgcatctggaggaaacttgagaacataatgcataacataaaggagttt
cgggggaacaagacgagagaatacaaaaagaaggacgcgccaagacttcctagaattctcccagcggggcgcga
cgtctcagcgttctccaagctcatgtacgcgcttaccatgttcctcgatggaaaagagataaatgatcttttgactacgct
cattaacaagttcgacaacattcaatctttcctgaaagtgatgcctctcataggggtcaacgcaaagttcgttgaagaata
cgcctttttcaaggactctgcgaagatagccgatgaactccgcctcataaagagctttgcgcggatgggtgaacctatt
gctgacgcccggagggcaatgtatattgacgcgatcaggattcttggaactaatctctcctacgacgaacttaaggctc
ttgctgataccttttctcttgatgaaaacgggaacaaactcaagaagggaaaacacggtatgcggaatttcatcataaat
aacgttatttcaaacaagagattccattacctgataagatacggagatccagcccatctgcacgaaatcgcgaaaaacg
aggctgttgttaaattcgttttggggagaatcgctgacatacaaaaaaagcaggggcaaaacgggaagaaccagatc
gaccggtactacgaaacctgtatcggtaaggacaaagggaagagtgtgtccgaaaaggttgacgcccttacaaaaat
catcaccggtatgaactatgaccaattcgacaagaaaagaagtgttatagaagatacgggaagagagaatgcggagc
gggaaaaatttaaaaagatcatatcgctctatctgaccgttatctatcatatccttaaaaacatagtcaacatcaacgcac
ggtacgtgataggcttccattgtgtggaacgggacgcccagttgtacaaagagaaaggatacgacataaacctcaag
aagctcgaagagaagggttttagctctgttacgaaactttgtgcgggtatcgatgaaaccgcgcctgacaaacggaaa
gacgttgaaaaggagatggcagaacgcgctaaagagtctatagacagtcttgagtcagcaaatcccaagctctacgc
gaactacataaaatattctgacgaaaaaaaagctgaagaatttaccagacaaataaacagagagaaggctaagactg
cgttgaatgcctatctgcggaacactaaatggaatgtcataattcgggaagaccttctgcggatcgacaataaaacctg
caccctctttagaaataaggctgtccacctggaagttgctcgctatgtgcatgcgtatattaacgacattgctgaggttaa
cagctactttcagctgtatcattacatcatgcagaggattattatgaacgagcgctacgagaagtcctccgggaaggttt
cagagtattttgacgcagtcaacgatgagaaaaagtacaacgatcggctgctgaaactcctgtgtgtcccattcgggta
ttgtataccgcgcttcaagaacctctcaatagaggcgctctttgaccgcaacgaggccgcaaagtttgataaagaaaaa
aagaaagttagtgggaatagtggctcaggaccaaagaagaaaagaaaggttgcagcggcatatccgtacgatgtgc
cggattatgcgtga
Preferably, the amino acid sequence encoded by RfxCas13d is SEQ ID No.2.
SEQ ID No.2
Met Ser Pro Lys Lys Lys Arg Lys Val Glu Ala Ser Ile Glu Lys Lys Lys SerPhe Ala Lys Gly Met Gly Val Lys Ser Thr Leu Val Ser Gly Ser Lys Val Tyr MetThr Thr Phe Ala Glu Gly Ser Asp Ala Arg Leu Glu Lys Ile Val Glu Gly Asp SerIle Arg Ser Val Asn Glu Gly Glu Ala Phe Ser Ala Glu Met Ala Asp Lys Asn AlaGly Tyr Lys Ile Gly Asn Ala Lys Phe Ser His Pro Lys Gly Tyr Ala Val Val AlaAsn Asn Pro Leu Tyr Thr Gly Pro Val Gln Gln Asp Met Leu Gly Leu Lys Glu ThrLeu Glu Lys Arg Tyr Phe Gly Glu Ser Ala Asp Gly Asn Asp Asn Ile Cys Ile GlnVal Ile His Asn Ile Leu Asp Ile Glu Lys Ile Leu Ala Glu Tyr Ile Thr Asn Ala AlaTyr Ala Val Asn Asn Ile Ser Gly Leu Asp Lys Asp Ile Ile Gly Phe Gly Lys PheSer Thr Val Tyr Thr Tyr Asp Glu Phe Lys Asp Pro Glu His His Arg Ala Ala PheAsn Asn Asn Asp Lys Leu Ile Asn Ala Ile Lys Ala Gln Tyr Asp Glu Phe Asp AsnPhe Leu Asp Asn Pro Arg Leu Gly Tyr Phe Gly Gln Ala Phe Phe Ser Lys Glu GlyArg Asn Tyr Ile Ile Asn Tyr Gly Asn Glu Cys Tyr Asp Ile Leu Ala Leu Leu SerGly Leu Arg His Trp Val Val His Asn Asn Glu Glu Glu Ser Arg Ile Ser Arg ThrTrp Leu Tyr Asn Leu Asp Lys Asn Leu Asp Asn Glu Tyr Ile Ser Thr Leu Asn TyrLeu Tyr Asp Arg Ile Thr Asn Glu Leu Thr Asn Ser Phe Ser Lys Asn Ser Ala AlaAsn Val Asn Tyr Ile Ala Glu Thr Leu Gly Ile Asn Pro Ala Glu Phe Ala Glu GlnTyr Phe Arg Phe Ser Ile Met Lys Glu Gln Lys Asn Leu Gly Phe Asn Ile Thr LysLeu Arg Glu Val Met Leu Asp Arg Lys Asp Met Ser Glu Ile Arg Lys Asn His LysVal Phe Asp Ser Ile Arg Thr Lys Val Tyr Thr Met Met Asp Phe Val Ile Tyr ArgTyr Tyr Ile Glu Glu Asp Ala Lys Val Ala Ala Ala Asn Lys Ser Leu Pro Asp AsnGlu Lys Ser Leu Ser Glu Lys Asp Ile Phe Val Ile Asn Leu Arg Gly Ser Phe AsnAsp Asp Gln Lys Asp Ala Leu Tyr Tyr Asp Glu Ala Asn Arg Ile Trp Arg Lys LeuGlu Asn Ile Met His Asn Ile Lys Glu Phe Arg Gly Asn Lys Thr Arg Glu Tyr LysLys Lys Asp Ala Pro Arg Leu Pro Arg Ile Leu Pro Ala Gly Arg Asp Val Ser AlaPhe Ser Lys Leu Met Tyr Ala Leu Thr Met Phe Leu Asp Gly Lys Glu Ile Asn AspLeu Leu Thr Thr Leu Ile Asn Lys Phe Asp Asn Ile 
Gln Ser Phe Leu Lys Val MetPro Leu Ile Gly Val Asn Ala Lys Phe Val Glu Glu Tyr Ala Phe Phe Lys Asp SerAla Lys Ile Ala Asp Glu Leu Arg Leu Ile Lys Ser Phe Ala Arg Met Gly Glu ProIle Ala Asp Ala Arg Arg Ala Met Tyr Ile Asp Ala Ile Arg Ile Leu Gly Thr AsnLeu Ser Tyr Asp Glu Leu Lys Ala Leu Ala Asp Thr Phe Ser Leu Asp Glu Asn GlyAsn Lys Leu Lys Lys Gly Lys His Gly Met Arg Asn Phe Ile Ile Asn Asn Val IleSer Asn Lys Arg Phe His Tyr Leu Ile Arg Tyr Gly Asp Pro Ala His Leu His GluIle Ala Lys Asn Glu Ala Val Val Lys Phe Val Leu Gly Arg Ile Ala Asp Ile GlnLys Lys Gln Gly Gln Asn Gly Lys Asn Gln Ile Asp Arg Tyr Tyr Glu Thr Cys IleGly Lys Asp Lys Gly Lys Ser Val Ser Glu Lys Val Asp Ala Leu Thr Lys Ile IleThr Gly Met Asn Tyr Asp Gln Phe Asp Lys Lys Arg Ser Val Ile Glu Asp Thr GlyArg Glu Asn Ala Glu Arg Glu Lys Phe Lys Lys Ile Ile Ser Leu Tyr Leu Thr ValIle Tyr His Ile Leu Lys Asn Ile Val Asn Ile Asn Ala Arg Tyr Val Ile Gly Phe HisCys Val Glu Arg Asp Ala Gln Leu Tyr Lys Glu Lys Gly Tyr Asp Ile Asn Leu LysLys Leu Glu Glu Lys Gly Phe Ser Ser Val Thr Lys Leu Cys Ala Gly Ile Asp GluThr Ala Pro Asp Lys Arg Lys Asp Val Glu Lys Glu Met Ala Glu Arg Ala Lys GluSer Ile Asp Ser Leu Glu Ser Ala Asn Pro Lys Leu Tyr Ala Asn Tyr Ile Lys TyrSer Asp Glu Lys Lys Ala Glu Glu Phe Thr Arg Gln Ile Asn Arg Glu Lys Ala LysThr Ala Leu Asn Ala Tyr Leu Arg Asn Thr Lys Trp Asn Val Ile Ile Arg Glu AspLeu Leu Arg Ile Asp Asn Lys Thr Cys Thr Leu Phe Arg Asn Lys Ala Val His LeuGlu Val Ala Arg Tyr Val His Ala Tyr Ile Asn Asp Ile Ala Glu Val Asn Ser TyrPhe Gln Leu Tyr His Tyr Ile Met Gln Arg Ile Ile Met Asn Glu Arg Tyr Glu LysSer Ser Gly Lys Val Ser Glu Tyr Phe Asp Ala Val Asn Asp Glu Lys Lys Tyr AsnAsp Arg Leu Leu Lys Leu Leu Cys Val Pro Phe Gly Tyr Cys Ile Pro Arg Phe LysAsn Leu Ser Ile Glu Ala Leu Phe Asp Arg Asn Glu Ala Ala Lys Phe Asp Lys GluLys Lys Lys Val Ser Gly Asn Ser Gly Ser Gly Pro Lys Lys Lys Arg Lys Val AlaAla Ala Tyr Pro Tyr Asp Val Pro Asp Tyr Ala*
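As an illustrative consistency check, which is not part of the disclosed method, the relationship between SEQ ID No.1 and SEQ ID No.2 can be sketched in Python. Translating the first ten codons of SEQ ID No.1 with the standard genetic code should give the first ten residues of SEQ ID No.2 (Met Ser Pro Lys Lys Lys Arg Lys Val Glu). The helper name `translate` and the way the codon table is built are assumptions introduced here for illustration only.

```python
from itertools import product

# Standard genetic code with codons enumerated in t/c/a/g order.
# This helper is illustrative and is not part of the patent.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence into one-letter amino acids ('*' = stop)."""
    dna = dna.lower()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First ten codons of SEQ ID No.1, written codon by codon.
first_codons = "atg" "tct" "ccg" "aaa" "aaa" "aaa" "agg" "aag" "gtc" "gag"
print(translate(first_codons))

# A 3003 bp reading frame corresponds to 1001 codons (residues plus the stop codon).
print(3003 // 3)
```

Running the same function over the full SEQ ID No.1 and comparing the result with SEQ ID No.2 would extend this check to the whole coding sequence.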
Preferably, in step S2, the pENTR4:gRNA4-AtU6-crRNA recombinant vector comprises an AtU6 promoter, a crRNA repeat leader sequence, and a spacer nucleotide sequence containing two BsaI restriction sites.
Preferably, the base sequence of the AtU6-crRNA gene fragment is SEQ ID No.3.
SEQ ID No.3(AtU6-crRNA)
aagcttcattcggagtttttgtatcttgtttcatagtttgtcccaggattagaatgattaggcatcgaaccttcaagaatttgattgaataaaacatcttcattcttaagatatgaagataatcttcaaaaggcccctgggaatctgaaagaagagaagcaggcccatttatatgggaaagaacaatagtatttcttatataggcccatttaagttgaaaacaatcttcaaaagtcccacatcgcttagataagaaaacgaagctgagtttatatacagctagagtcgaagtagtgattgaacccctaccaactggtcggggtttgaaacagagaccagaattcggtctcatttttttttggatccactagtcctgcagg
Preferably, step S2 comprises: linearizing the pENTR4:gRNA4 vector with HindIII and recovering the linearized vector backbone; artificially synthesizing an AtU6-crRNA gene fragment containing two BsaI restriction sites; and cloning the fragment into the pENTR4:gRNA4 vector at the HindIII site to obtain the pENTR4:gRNA4-AtU6-crRNA vector. The plasmid is sequenced and stored, and the transformed Escherichia coli is preserved after successful verification.
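The two BsaI restriction sites mentioned for the AtU6-crRNA fragment can be located with a short script. The following is a minimal sketch, assuming the BsaI recognition sequence GGTCTC and searching both strands; the function names `reverse_complement` and `find_sites` are hypothetical helpers, and the input is a short excerpt copied from the 3' region of SEQ ID No.3.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, in lower case."""
    pairs = {"a": "t", "t": "a", "c": "g", "g": "c"}
    return "".join(pairs[base] for base in reversed(seq.lower()))


def find_sites(seq: str, site: str) -> list[tuple[int, str]]:
    """List (0-based position, strand) for each occurrence of `site` on either strand."""
    seq, site = seq.lower(), site.lower()
    motifs = [("+", site)]
    rc = reverse_complement(site)
    if rc != site:  # palindromic sites read the same on both strands
        motifs.append(("-", rc))
    hits = []
    for strand, motif in motifs:
        start = seq.find(motif)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(motif, start + 1)
    return sorted(hits)


# Excerpt from the 3' region of SEQ ID No.3 (the spacer cloning region).
excerpt = "aaacagagaccagaattcggtctcattttt"
print(find_sites(excerpt, "ggtctc"))  # BsaI recognition sequence
```

In this excerpt the two sites lie on opposite strands, which is the usual arrangement when a placeholder is to be cut out and replaced by an annealed oligonucleotide pair.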
Preferably, step S3 comprises: designing a gRNA targeting sequence against the BSJ site of the circRNA, synthesizing RfxCas13d-BSJ-gRNA target-sequence primers, and ligating the annealed primers into the BsaI-linearized pENTR4:gRNA4-AtU6-crRNA vector; then, using the gRNA4-AtU6-BSJ-crRNA vector constructed in the previous step as a template, designing primers carrying HindIII and SbfI restriction sites, amplifying the AtU6-BSJ-crRNA fragment flanked by HindIII and SbfI sites, double-digesting the pCAMBIA1300-RfxCas13d recombinant vector with HindIII and SbfI, and finally obtaining the pCAMBIA1300-RfxCas13d-BSJ-crRNA vector by homologous recombination. Further preferably, the construct is verified by PCR and restriction digestion, the plasmid is sequenced and stored, and the transformed E. coli is preserved after successful verification. The result is a CRISPR/RfxCas13d knockdown vector that specifically knocks down the circRNA without affecting the expression of its parent gene.
Preferably, the plants include rice, tomato and Arabidopsis thaliana.
In summary, the method of the present invention has the following features and advantages. Plant-codon-optimized RfxCas13d does not affect the normal physiological activity of plants. The invention provides a technique for knocking down plant circRNA with CRISPR/RfxCas13d, in which the CRISPR/RfxCas13d editing system, directed by a crRNA designed against the BSJ site, directly cleaves and knocks down the target circRNA in the plant, thereby achieving precise targeting. The technique removes the limitation of the CRISPR-Cas9 approach, in which the parent mRNA is knocked out together with the plant circRNA. The invention thus overcomes the difficulty of the prior art, achieves a breakthrough in circRNA editing technology in plants, and turns the CRISPR/RfxCas13d system into a practical tool that is effective in plants. It can be used for functional research on plant circRNA and for the development of circRNA gain-of-function mutants and new germplasm in plant breeding, and it greatly accelerates the development of biological editing technology in bioengineering and agronomic germplasm innovation.
Compared with the prior art, the invention has the beneficial effects that:
1. the CRISPR/Cas13d technology is applied for the first time to experimental studies of plant circRNA knockdown;
2. the whole procedure of the method is simple and effective;
3. the invention removes the limitation on gene targeting sites, does not reduce the expression of the parent mRNA when targeting the BSJ site of the plant circRNA, and achieves a higher plant circRNA targeting efficiency;
4. the technology of the invention can be used for functional research on plant circRNA, thereby enabling the development of circRNA gain-of-function mutants and new germplasm in plant breeding.
Drawings
FIG. 1 is a schematic vector map of pCAMBIA1300-RfxCas13d.
FIG. 2 is a schematic vector map of pENTR4:gRNA4-AtU6-crRNA.
FIG. 3 shows the circRNA expression levels of three rice circRNA knockdown plants and wild-type plants in the examples of the present invention.
FIG. 4 shows the expression levels of the parent mRNAs of the circRNAs in three rice circRNA knockdown plants and wild-type plants in the examples of the present invention.
Detailed Description
The embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The embodiments described are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those skilled in the art on the basis of these embodiments without inventive effort fall within the scope of the invention.
EXAMPLE 1 Construction of pCAMBIA1300-RfxCas13d and pENTR4:gRNA4-AtU6-crRNA recombinant vectors
1. Construction of pCAMBIA1300-RfxCas13d recombinant vector
The target gene sequence of CRISPR type VI-D (Cas13d) was retrieved from the NCBI website, and the original Cas13d sequence obtained was optimized for plant codon preference without changing the encoded amino acids. The optimized 3003 bp RfxCas13d gene fragment was artificially synthesized and cloned into pUC57, and the construct was named pUC57:RfxCas13d; the chemical synthesis was performed by Biological Engineering (Shanghai) Co., Ltd.
The plasmid pUC57:RfxCas13d and the CRISPR/spCas9 vector (a vector preserved in the laboratory) were each double-digested with BamHI and NcoI. The 3003 bp RfxCas13d gene fragment and the approximately 12.3 kb CRISPR/spCas9 vector backbone were recovered, and the 3003 bp RfxCas13d fragment was ligated into the CRISPR/spCas9 vector backbone using T4 DNA ligase. The resulting plasmid was named pCAMBIA1300-RfxCas13d. The plasmid was sequenced and stored, and the transformed Escherichia coli was preserved after successful verification.
As shown in FIG. 1, the plasmid pCAMBIA1300-RfxCas13d comprises: a CaMV 35S promoter, a hygromycin resistance gene, a CaMV poly(A) signal terminator, pVS1 RepA, a pVS1 replication origin, a rice ubiquitin promoter, SV40 NLS-RfxCas13d-SV40 NLS (shown in SEQ ID No.1) and a NOS terminator. The amino acid sequence encoded by RfxCas13d is shown in SEQ ID No.2.
2. Construction of pENTR4:gRNA4-AtU6-crRNA recombinant vector
A 380 bp AtU6-crRNA gene fragment (shown in SEQ ID No.3) was artificially synthesized and cloned into the pENTR4:gRNA4 vector, and the construct was named pENTR4:gRNA4-AtU6-crRNA; the chemical synthesis was performed by Biological Engineering (Shanghai) Co., Ltd. The specific construction method was as follows: the pENTR4:gRNA4 vector was linearized with HindIII, and the linearized vector backbone of about 2.2 kb was recovered; the artificially synthesized 380 bp AtU6-crRNA gene fragment containing two BsaI restriction sites was then cloned into the pENTR4:gRNA4 vector at the HindIII site to obtain the pENTR4:gRNA4-AtU6-crRNA vector. The plasmid was sequenced and stored, and the transformed E. coli was preserved after successful verification.
As shown in FIG. 2, the plasmid pENTR4:gRNA4-AtU6-crRNA comprises: an AtU6 promoter, a crRNA repeat leader sequence, and a spacer nucleotide sequence containing two BsaI restriction sites.
EXAMPLE 2 Construction of the rice pCAMBIA1300-RfxCas13d-BSJ-crRNA vector
This example demonstrates the practical application in plants of the BSJ-site-based, crRNA-mediated CRISPR/RfxCas13d editing technique for knocking down plant circRNA. The experimental plant used is the model plant rice, but the method is not limited to rice and can be adapted to other plant species, such as tomato and Arabidopsis.
Three rice circRNAs (chr03_30006457_30006842, chr08_27737960_27738788 and chr07_176529_176845) were taken as examples. For the BSJ site of each circRNA, a 30 bp gRNA targeting sequence (15 bp on each side of the junction) was designed, as shown in SEQ ID No.4-6. The RfxCas13d-BSJ-gRNA target-sequence primers, shown in SEQ ID No.7-12, were synthesized by Biological Engineering (Shanghai) Co., Ltd.; the two oligonucleotide primers of each pair were each diluted to 50 μM and annealed according to the reaction system in Table 1. The vector pENTR4:gRNA4-AtU6-crRNA was linearized by BsaI digestion at 37 °C for 1 h, using the reaction system in Table 2. The double-stranded oligonucleotide product was ligated into the linearized pENTR4:gRNA4-AtU6-crRNA vector, using the ligation system in Table 3. With the gRNA4-AtU6-BSJ-crRNA vector constructed in the above steps as a template, primers carrying HindIII and SbfI restriction sites were designed, and the AtU6-BSJ-crRNA fragment flanked by HindIII and SbfI sites was amplified. The pCAMBIA1300-RfxCas13d recombinant vector was double-digested with HindIII and SbfI using the digestion system in Table 4, and the pCAMBIA1300-RfxCas13d-BSJ-crRNA vector was then obtained by homologous recombination using the reaction system in Table 5. Finally, the construct was verified by PCR and restriction digestion, the plasmid was sequenced and stored, and the transformed E. coli was preserved after successful verification. This yielded pCAMBIA1300-RfxCas13d-BSJ-crRNA vectors that specifically target the BSJ sites of the rice circRNAs.
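The junction-spanning design described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the primer design actually used for SEQ ID No.4-12: the circularized sequence is a hypothetical 40 nt placeholder, the functions `bsj_target` and `reverse_complement` are hypothetical helpers, and the guide is assumed to be the reverse complement of the target, written in the DNA alphabet. Vector-specific BsaI overhangs are deliberately left out because the source does not state them.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, in upper case."""
    pairs = {"A": "T", "T": "A", "C": "G", "G": "C"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def bsj_target(circ_seq: str, flank: int = 15) -> str:
    """Junction-spanning target: the last `flank` nt joined to the first `flank` nt."""
    circ_seq = circ_seq.upper()
    if len(circ_seq) < 2 * flank:
        raise ValueError("circularized sequence is shorter than the junction window")
    return circ_seq[-flank:] + circ_seq[:flank]


# Hypothetical 40 nt circularized exon sequence (not taken from the patent).
exon = "ACGTTGCAAGGCTTACCGATGGTCAATCGGATCCTTAGCA"

target = bsj_target(exon)            # 30 nt spanning the back-splice junction
spacer = reverse_complement(target)  # assumed guide: complementary to the target

print(target, len(target))
print(spacer)
print(target in exon)  # does the junction sequence also occur in the linear form?
```

The final check reflects the reasoning behind BSJ targeting: the joined 3'-to-5' sequence exists only in the circular molecule, so a guide against it should not match the linear parent transcript.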
Table 1 Annealing reaction system for the oligonucleotide primers
The annealing program was as follows: 95 °C for 2 min, then cooling by 0.1 °C every 8 s down to 25 °C, which takes about 90 min.
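The stated duration of about 90 min can be checked from the ramp parameters. The short calculation below is only a worked arithmetic sketch; the function name `ramp_duration_s` is a hypothetical helper.

```python
def ramp_duration_s(start_c: float, end_c: float, step_c: float, step_s: float) -> float:
    """Seconds needed to cool from start_c to end_c in decrements of step_c every step_s."""
    n_steps = round((start_c - end_c) / step_c)
    return n_steps * step_s


hold_s = 2 * 60                                 # 95 °C for 2 min
ramp_s = ramp_duration_s(95.0, 25.0, 0.1, 8.0)  # 0.1 °C every 8 s down to 25 °C

print(f"ramp: {ramp_s / 60:.1f} min, total: {(hold_s + ramp_s) / 60:.1f} min")
```

The ramp amounts to 700 steps of 8 s, which is consistent with the approximate figure given in the text.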
Table 2 BsaI digestion system for pENTR4:gRNA4-AtU6-crRNA
After the reaction at 37 °C was complete, the enzyme was heat-inactivated at 80 °C for 2 min.
Table 3 T4 DNA ligase ligation system
After reaction at 22 °C for 1 h, the ligation product was transformed directly into an E. coli strain, and positive strains were confirmed by sequencing.
Table 4 Restriction digestion system
Reagent | Volume (μL) |
pCAMBIA1300-RfxCas13d | 1 μg |
HindIII | 1 |
SbfI | 1 |
10× Buffer T | 4 |
Sterile deionized water | Up to 50 |
Total volume | 50 |
After reaction at 37 °C for 1 h, the products were run on a gel and the linearized pCAMBIA1300-RfxCas13d DNA fragment was recovered.
Table 5 Homologous recombinase ligation system
After reaction at 50 °C for 1 h, the product was transformed directly into an E. coli strain, and strains verified as correct by sequencing were preserved.
The sequences used in this example are listed in Table 6 below.
Table 6 sequence listing
Example 3 specific knock-down of Rice circRNA Using Rice pCAMBIA1300-RfxCas13d-BSJ-crRNA
1. Obtaining transgenic rice plants
The japonica rice variety Nipponbare was used as the test material. The constructed pCAMBIA1300-RfxCas13d-BSJ-crRNA expression vector was introduced into rice callus by Agrobacterium-mediated transformation, and transgenic rice plants were obtained through tissue culture.
2. Real-time fluorescent quantitative PCR detection of the expression of circRNA and its corresponding parent mRNA
Seedling-stage wild-type and transgenic rice plants grown for about 1 month were sampled, RNA was extracted, and the expression of each circRNA and its corresponding parent mRNA was detected by real-time fluorescent quantitative PCR. The circRNA detection primers are shown in SEQ ID No.13-18, the detection primers for the corresponding parent mRNAs are shown in SEQ ID No.19-24, and the sequences of the corresponding parent mRNAs are shown in SEQ ID No.25-27.
As shown in FIG. 3, the expression levels of the 3 circRNAs in rice were significantly reduced compared with the control, whereas the expression levels of the corresponding parent mRNAs were not knocked down (FIG. 4), indicating that the CRISPR/RfxCas13d vector system can specifically knock down the expression of circRNA in rice.
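The source does not state how the quantitative PCR data were converted into expression levels. As one common possibility, the sketch below assumes the 2^-ΔΔCt method with a single reference gene; the function name `relative_expression` and all Ct values are hypothetical and are not data from this example.

```python
def relative_expression(ct_target_sample: float, ct_ref_sample: float,
                        ct_target_control: float, ct_ref_control: float) -> float:
    """Fold change of a target in a sample versus a control by the 2^-ddCt method."""
    d_ct_sample = ct_target_sample - ct_ref_sample      # normalize sample to reference gene
    d_ct_control = ct_target_control - ct_ref_control   # normalize control to reference gene
    return 2.0 ** -(d_ct_sample - d_ct_control)


# Hypothetical Ct values for illustration only (not measurements from the patent).
circ_fold = relative_expression(26.0, 18.0, 24.0, 18.0)  # circRNA, knockdown vs wild type
mrna_fold = relative_expression(22.0, 18.0, 22.0, 18.0)  # parent mRNA, knockdown vs wild type

print(circ_fold, mrna_fold)
```

Under this assumption, a specific knockdown would appear as a fold change well below 1 for the circRNA together with a fold change close to 1 for the parent mRNA.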
Table 7 sequence listing
SEQ ID No.25 (parent mRNA sequence 1: >Os03t0732100-01)
atgggaatagcggcgccaccgtgtcagccagggcagaccaccacgttcgtcataagccaacccaccagcagcaagagctccccggcggcgatcgtcgtccggccgccggccacggcgagttcttcttctatgtcccactcccagggtttccaccagcccagcagcggcgccgtcttcggcttctcctccgatggcttcgacggccgccccggctccggccaagaccaccagcagcacgagcagcagcagcagcagcacgtggcgcagcagagccgccgcgacaagctgagggtgcaaggcttcgacccggcggcggcggcggcggggcacgggttgctccccatcgagggcgacgagcatggcgcggagcccggcgccatgtacgaccacgcggaggcggcggcggccggggcgtcgaacatgctctccgagatgttcaacttcccgtcgcagccgccgacgggcccctccgccaccgagctgctcgccagccagatgaacgccaactaccggttcgggttccggcaggcggcgggtttggccggcggcgaaggcggttggttcggcggcggcggcgcggccggccggactgggttggtgctcggtggggccagcttgggttcgttaggtgagacgtcgtcgccgaagcagcaagcgagcggcatggcgggcctcgccgctgacccggccgcggcgatgcacctgttcctgatgaacccgcagcagcagcagcagcagcagtcacggtcgtcgacgtcgccgccgccgtcggacgcgcagtcggcaatccaccagcaccacgaagcgttccaggcgttcggcggcgccggtgcagcagcgttcggcggcggcgcggcggccggcgtggtggaaggccaggggctctccctctcgctgtcgccgtcgctgcagcagctggagatggcgaagcaggcggaggagctgcgcgtccgcgacggcgtgctctacttcaaccggcaacagcagcagcagcaggcggcggcggcggcggcgtcggtgcagcagcagctcccgatggcattgcacgggcaggtcggcgtgctggggcagcagctgcacggcggcgggtacggcggcccggcgggcgtcgccggcgtcctccgcaactccaagtacacccgcgcggcgcaggagctcctcgaggagttctgcagcgtcggccgcggccagatcaagggcggcggcggccgcggctccgcccccaataaccctaactccagcaaggccgccgcctccagctccggcgccgcccagtccccctcgtcggcgtccaaagaaccccctcaactctcccccgccgaccgcttcgagcaccagcgcaagaaggccaagctcatctccatgctcgacgaggtggacaggaggtacaaccactactgcgaccagatgcagatggtggtgaacttcttcgactcggtgatggggttcggggcggcgacgccgtacacggcgctggcgcagaaggccatgtcgaggcacttccggtgcctcaaggacgcgatcgcggcgcagctgagggggacgtgcgaggcgctgggggagaaggacgccggcacggggtcggggctgaccaagggggagacgccgcggctgcgggccatcgaccagagcctccggcagcagcgcgccttccaccacatggggatcatggagcaggaggcgtggcgcccccagcgcggcctccccgagcgctccgtcaacatcctccgctcctggctcttcgaacacttcctccacccgtacccgagcgatgcagataagcacctgttggcgaggcagaccgggctgtccaggaaccaggtctcgaattggttcatcaacgcccgtgtccggctatggaagcccatgatcgaggagatgtaccagcaagaatgcaaggagctcgagggctcctccggcgccggcgacgacccctccggcgccgacgacacgcactcgccgacgaccaccgccgccgcgcatcatcagcacaggcacgg
ccagctaatggtagagcacggcggcgcgtccagcggcggcggcgccgccatgtcgtcgcacaagcacgagcccggcgtcgtcgcggggccctcctcgtcgtcggcggcggcggtggcggacgccgccttcgtcggcatcgacccggtggagctcctcggcggcgacggcgcggcggccgacgacctgtacgggcggttcgacccggccggcgccgtcagggtgcggtacgggccggcgggcgccgccgccggcgccgccgcggcggcggccggcgacgtgtcgctgacgctgggcctgcagcacgccggcgccggcaacgccgggcccgacggcagcggccgcttctccttgcgagactacagtggttgctga
SEQ ID No.26 (parent mRNA sequence 2: >Os08t0554400-01)
atggcagccgcctcgtctgcacactacttcggtctcggcgagccgcagatgcagcagcagcagcagcagcctcccctgcagaacaacgcggctgctccggtcgcggccacgccgccgcccaagaagaagaggaatcagcccgggaatccaaatcctgatgcggaggtggtggcgctgtcgccgcacacgctgctggcgacgaacaggttcgtgtgcgaggtgtgcaacaaggggttccagagggagcagaacctgcagctgcaccggcgagggcacaacctgccatggaagctgaagcagaagaaccccaaggagacgcggcggcgggtgtacctgtgcccggagccgtcctgcgtccaccacgacccgtcgcgagccctcggcgacctcaccggcatcaagaagcactactcccgcaagcacggcgagaagaagtggaagtgcgacaagtgcaacaagcgctacgccgtccagtccgactggaaggctcactccaagacctgcggcacccgcgagtaccgctgcgactgcggcaccctcttctccaggagggacagcttcatcacccaccgcgccttctgcgacgcgctggcgcaggagagcggcaggattatgccacccatgggcgccgccctgtacgccgccgccggcgccggcatggccatcggcggcctcaccggcatggccgcctcccaccagctccaaccgttccaggaccactcctccgccatcaccaccgccgccaacgccgccgcgcagttcgaccacctcatggccacgtcgtccgccgcggccggctcccctgcgttccgcgccgcgcagccgacgtcgtcgtcctcctcgccgttctacctcggcggcggcggcgacgacggacaggcccacacctcgctgctccacggcaagccggcgttccacggcctaatgcagctgccggagcagcaggggagcaatggcggcggcctccttaacctgagctacttctccggcgggaacggcggccaccaccaccaccaccaggagggacgactcgtcttcccggatcagttcaacggcgtcgccgccgggaacggcgcccgcgccggcagcggcgagcacggcaacagcggtaacaacgccgactcggggagcatcttctccgggaacatgatgggcggcggcggcggcttctcgtccttgtacagctcgtccgaccagaccgtcccgccgccgcagatgtcggcgacggcgcttctccagaaggccgcacagatgggcgcgacgacgagcagcggcggcgccggcagcgtgaactcgctactcagaggtctcggaagcggcggcggcggcggcgctctgaacggtaagcccgccggagcagccgggttcatcatgtccggggagagctcgtcgaggagcactgcttctcagacggcggagaacgagagccagctccgggagctgatgatgaacacgctctcggcgaccgggggcggcaccggcgccggaacggtgttcgtcggcggcggcttccccggcgtggacgacggcaagctgagcacgagggacttcctcggcgtgagcggcggcgcgccggggcttcaactgcggcacggcggcgccgccgggatgggcatggccggctcactggaccaagagatgaagtaa
SEQ ID No.27 (parent mRNA sequence 3: >Os07t0482566-01)
cggcgtttctcttctctgcctccacaccgcgccccattcccatgtgaaggagctcgtctgcttctcttgctccacaccttgaccacgagccattccaccgcattctctaattaagctgggagggagaaggacaccggcgggtgcccatcgtcgagcacctggtgtgcactcgcgctcggctccgcatctgctccgcttcaccactgcctcgacagcgaccggtcgatctcctccgacaacccaaagcaaggcaagcctagcggaaaggccagagtcagtatcttcgtccaatcgctagtcgctagatggccgatatagccgctaacgaagctgaccaacaggctgctggacctgcaagcgatcccgtacggccacaagtggatccccagctgctgatggctgctcgccgcggcgacagcgatctgctcaaggagctgctggggctcaacatggatccacttgtgacatcagagggagtcgtcgtcgtcgtggacgtcgtcccgcctccccggcgtactcctcctgctgcggctgcggctgctggtgatgttgatcgaaccggtcacctcctcactcccggcggcggcgccaccgccacttgatggggtgacagccgagggggctccctgctgcacgtggtcgcggagtgcggggacagccacaagttccgcgactgcaccaggctgatctactacagggagaagcacctcctggacgcgcccaacggcaatggtgacacgcccctgcactgcgccgcggccgccgggaacgccgagatgatctccttcctcatccacctcgcggccgccggcgacgacggaaatacggaggcggagaaggcggagaaggaggtggcgtacctgagaatgcacaacaaccgtggggagaccgcgttgcaccatgcggtccgtgcagctgctgctgctgccgacaacgaggacgataagcagctggccctggactgcatcgaccagctgatggccgccgatccacagctggccgccatccaacacccgaatgagaaggctgcttccccattgtacctggccatctcactgggtgaaattggcattgcaaagcatctgttcgacaaaactgaaggcgagctgtcctactccggaccagatggacggaacgtctggcatgcagctgtttcttttcctcaagcgctagatatggttttggaatggttcaacgataagcaattgatggcggccgacatgcggcaacaagaaggtggggatcaccggcatgtggcagctgatgatgaactcctcgcgcggctcagtagccagagggacaatgacaacgggagcacacctcttcacttggccgcatcaattaacgggttaccgtcggcactatatattcccatatgttctccgcgagttttggcgccgttgcggcggccaaagccggtggagctgctgctgaaagccaacgaattcgcggcgtaccagccggacaaccaagggatgtacccaatacacgccgccgcgtcggctggcagcctggagaccgtgaagatcctgcttgagaaacacccagactgcgccactctgcgagatgcgagaggaagaactttcctccacgctgctgttgagaagaagaggtttggagtggttaagtatgtatgccactggaagaggaaggaaaggtttgcattgatcctgaatgcacaggacaataacggggacaccgcgctatccatgctggagatcttggagtcttccgacgccttatttggtgtcataaggaacgcttggacgtaccgaattaaataagaagggcatgagacctattgatgtttcatggagcatgatgcctctaaaatgttattatgcatgggatctaagagtccacatacgaacattgctattgaaattgggagctccatatggtgaaagtcagagccacatcttcaacaaaaaacgacacgccatcatcgtcgaccccaaatttaaaggaggcgaggaaaagatgttggagaatgtaacagc
tgcaacgcaagtcttggcccttttctccgtcctcatcacaaccgtgacatttgcgtcggttttcacgttgcccggaggctaccggtctgccggcgatggtggcagtgccgccggcacgccgttgcttgctgggcgtgggtgctacgctttcgatgcgtttatcctctctgacgctctagcattcgtttgctatttcatggctacgtccgtcctgctctatgccggggtgcctgcttataaactcgaggtccgcctccgccacattaattttgcatatagcctgatgatgaattctggaagaagcttggcggccgctgtagctttgggactctacgtggtgctgcttccaccggttggccgcaccattgcgatcgcgatcgctgccgcgatggtcatgcttgcacttctgctcagcaaggcatcagaaggcatcgagtctctcttcggcattgctattgcaagaaaccggaagctgtcaattcgagactctgtggtaggcttcactatatatgtgggggaacgctattggtccttcatcctaatcttcggcctcccagcaattcgtaagtgggcaagggcagggtaagatattaagcacgtacatggacgaccttaattcacattcggacatcatcacgtacggccggaacagttaatgcatgtgtgtgtcgttgtttttatgatcatttgtattactatacctgtctcacaatctcccatatatataattatcgcatacgtcgtccaattgtaataatgtaataaactaacctcatgtacgtcgttgcctgttgttcattattcagcttattaaaataaacaaggggcaactatgtgtcgtgcgttcttgccagcccctatatattttttca
In summary, the invention provides a method that knocks down plant circRNA by using the CRISPR-Cas13d system to target the back-splice junction (BSJ) site specifically. It differs clearly from the traditional approach, which uses the CRISPR-Cas9 system to directly target the exons or flanking introns that form the circRNA; that approach can also knock down or reduce expression of the parent mRNA and is therefore non-specific. The invention removes the restriction on gene targeting sites, does not reduce expression of the parent mRNA while targeting the plant circRNA BSJ site, and can achieve higher plant circRNA targeting efficiency.
Although embodiments of the present invention have been disclosed above, the invention is not limited to the applications set out in the description and embodiments. It can be applied in any field to which it is suited, and further modifications will be readily apparent to those of ordinary skill in the art. Therefore, without departing from the general concepts defined by the claims and their equivalents, the present invention is not limited to the specific details and illustrations shown and described herein.
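The specificity argument above rests on one observation: the sequence spanning the BSJ exists only in the circular transcript, because back-splicing joins the 3' end of the circularized exon region to its own 5' end. The following Python sketch illustrates that idea under stated assumptions. The function names, the 22-nt default spacer length, and the choice to centre the spacer on the junction are all illustrative and are not taken from the patent, which does not state its spacer design rules in this section.

```python
def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence (a/c/g/t only)."""
    comp = {"a": "t", "c": "g", "g": "c", "t": "a"}
    return "".join(comp[base] for base in reversed(seq.lower()))


def bsj_spacer(circ_exons: str, spacer_len: int = 22) -> tuple[str, str]:
    """Return (junction_window, spacer) for a circRNA.

    circ_exons is the circularized exon region written 5'->3' as DNA.
    Back-splicing joins its last base to its first base, so the junction
    window is the tail of the sequence followed by its head.
    spacer_len=22 is an assumed value, not one given in the patent.
    """
    seq = circ_exons.lower()
    if spacer_len < 2 or len(seq) < spacer_len:
        raise ValueError("spacer_len must be >= 2 and <= sequence length")
    left = spacer_len // 2           # bases taken from the 3' end
    right = spacer_len - left        # bases taken from the 5' end
    window = seq[-left:] + seq[:right]
    # The crRNA spacer must be complementary to the target RNA.
    return window, revcomp(window)


def is_bsj_specific(window: str, parent_mrna: str) -> bool:
    """True if the junction window does not occur in the linear parent mRNA."""
    return window.lower() not in parent_mrna.lower()
```

In use, one would pass the circularized exon sequence to `bsj_spacer` and then check the returned window against the parent mRNA (for example SEQ ID No.26 or No.27) with `is_bsj_specific`; a window that also occurs in the linear transcript would not distinguish circRNA from mRNA.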
Claims (10)
1. A method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system, comprising the steps of:
S1, designing and synthesizing a target gene sequence encoding RfxCas13d with plant codon preference, and constructing a pCAMBIA1300-RfxCas13d recombinant vector;
S2, synthesizing an AtU6-crRNA gene fragment comprising two BsaI restriction sites, and constructing a pENTR4:gRNA4-AtU6-crRNA recombinant vector;
S3, synthesizing and constructing a pCAMBIA1300-RfxCas13d-BSJ-crRNA vector that specifically targets the BSJ site of a plant circRNA;
S4, transferring the constructed pCAMBIA1300-RfxCas13d-BSJ-crRNA expression vector into plant callus, and obtaining, through tissue culture, a transgenic plant in which the plant circRNA is specifically knocked down.
2. The method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system according to claim 1, wherein in step S1, the pCAMBIA1300-RfxCas13d recombinant vector comprises a CaMV 35S promoter, a hygromycin resistance gene, a CaMV poly(A) signal terminator, pVS1 RepA, a pVS1 replication origin, a rice ubiquitin promoter, SV40 NLS-RfxCas13d-SV40 NLS, and a NOS terminator.
3. The method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system according to claim 2, wherein step S1 comprises the steps of: the target gene sequence encoding RfxCas13d is optimized for plant codon preference, synthesized, and cloned directly into a linearized pUC57 vector, which is named pUC57:RfxCas13d. Starting from a CRISPR/spCas9 vector, plasmid pUC57:RfxCas13d and the CRISPR/spCas9 vector are each double-digested with BamHI and NcoI, and the 3003 bp RfxCas13d gene fragment and the approximately 12.6 kb CRISPR/spCas9 vector backbone are recovered respectively; the 3003 bp RfxCas13d fragment is ligated into the CRISPR/spCas9 vector backbone using T4 DNA ligase, and the resulting plasmid is named pCAMBIA1300-RfxCas13d; the plasmid is sequenced and stored, and, once verified, is transformed into Escherichia coli for preservation.
4. The method for specifically knocking down plant circRNA based on a CRISPR/RfxCas13d system according to claim 2, wherein the nucleotide sequence of the SV40 NLS-RfxCas13d-SV40 NLS gene is SEQ ID No.1.
5. The method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system according to claim 2, wherein the amino acid sequence encoded by RfxCas13d is SEQ ID No.2.
6. The method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system according to claim 1, wherein in step S2, the pENTR4:gRNA4-AtU6-crRNA recombinant vector comprises an AtU6 promoter, a crRNA repeat leader sequence, and a spacer nucleotide sequence containing two BsaI restriction sites.
7. The method for specifically knocking down plant circRNA based on the CRISPR/RfxCas13d system according to claim 6, wherein the base sequence of the AtU6-crRNA gene fragment is SEQ ID No.3.
8. The method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system according to claim 6, wherein step S2 comprises: linearizing the pENTR4:gRNA4 vector with HindIII and recovering the linearized vector backbone; artificially synthesizing an AtU6-crRNA gene fragment containing two BsaI restriction sites and cloning it directly into the pENTR4:gRNA4 vector at the HindIII restriction site to obtain the pENTR4:gRNA4-AtU6-crRNA vector; and sequencing and storing the plasmid, which, once verified, is transformed into Escherichia coli for preservation.
9. The method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system according to claim 1, wherein step S3 comprises: designing a gRNA targeting sequence for the BSJ site of the circRNA, synthesizing RfxCas13d-BSJ-gRNA target sequence primers, and ligating the annealed primers into the BsaI-linearized pENTR4:gRNA4-AtU6-crRNA vector; using the gRNA4-AtU6-BSJ-crRNA vector constructed in the previous step as a template, designing primers carrying HindIII and SbfI restriction sites and amplifying the AtU6-BSJ-crRNA fragment carrying HindIII and SbfI restriction sites; double-digesting the pCAMBIA1300-RfxCas13d recombinant vector with HindIII and SbfI; and finally obtaining the pCAMBIA1300-RfxCas13d-BSJ-crRNA vector by homologous recombination.
10. The method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system according to claim 1, wherein the plant comprises rice, tomato, or Arabidopsis.
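Claims 1, 6 and 8 require the AtU6-crRNA fragment to carry exactly two BsaI restriction sites, and claim 9 inserts the annealed BSJ primers into the BsaI-linearized vector. A quick in-silico check of both conditions might look like the sketch below. It relies only on the known BsaI recognition sequence GGTCTC (reverse complement GAGACC); the function names are hypothetical, and the requirement that the spacer itself contain no BsaI site is a common cloning precaution assumed here, not a statement from the patent.

```python
BSAI_SITE = "ggtctc"      # BsaI recognition sequence, top strand
BSAI_SITE_RC = "gagacc"   # the same site read on the bottom strand


def count_bsai_sites(seq: str) -> int:
    """Count BsaI recognition sites on both strands of a DNA sequence.

    The site cannot overlap itself, so str.count is sufficient.
    """
    s = seq.lower()
    return s.count(BSAI_SITE) + s.count(BSAI_SITE_RC)


def has_two_bsai_sites(cassette: str) -> bool:
    """True if the cassette carries exactly two BsaI sites, as the claims require."""
    return count_bsai_sites(cassette) == 2


def spacer_is_bsai_free(spacer: str) -> bool:
    """True if a candidate spacer introduces no extra BsaI site (assumed precaution)."""
    return count_bsai_sites(spacer) == 0
```

A synthesized fragment such as SEQ ID No.3 could be passed to `has_two_bsai_sites` before ordering, and each candidate BSJ spacer to `spacer_is_bsai_free` before primer synthesis.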
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311463065.5A CN117511942A (en) | 2023-11-06 | 2023-11-06 | Method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311463065.5A CN117511942A (en) | 2023-11-06 | 2023-11-06 | Method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117511942A (en) | 2024-02-06 |
Family
ID=89757768
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311463065.5A Pending CN117511942A (en) | 2023-11-06 | 2023-11-06 | Method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117511942A (en) |
- 2023-11-06: CN application CN202311463065.5A, published as CN117511942A (en), status: active, Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107012164B (en) | CRISPR/Cpf1 plant genome directed modification functional unit, vector containing functional unit and application of functional unit | |
CN110157726B (en) | Method for site-directed substitution of plant genome | |
CN109705198B (en) | Application of OsCKX7 protein and coding gene thereof in regulation and control of resistance to plant sheath blight | |
WO2018098935A1 (en) | Vector for plant genome site-directed base substitution | |
JP2022511508A (en) | Gene silencing by genome editing | |
IL271283B1 (en) | Methods and compositions for targeted genomic insertion | |
WO2022142472A1 (en) | Application of mirna 408 in regulation and control of cadmium accumulation in crops | |
CN113994007B (en) | Method for expressing nucleic acid | |
CN113846075A (en) | MAD7-NLS fusion protein, nucleic acid construct for site-directed editing of plant genome and application thereof | |
CN112126652B (en) | Application of rice OsAUX3 gene in regulation of rice seed grain length | |
WO2023227137A1 (en) | Use of tapdil4-1b gene in fusarium head blight resistance of plant and method for constructing tapdil4-1b transgenic plant | |
CN110951743B (en) | Method for improving plant gene replacement efficiency | |
CN114540406A (en) | Genome editing expression box, vector and application thereof | |
CN116179589B (en) | SlPRMT5 gene and application of protein thereof in regulation and control of tomato fruit yield | |
CN109371037B (en) | Tobacco AKT1 gene and application thereof | |
EP3709792B1 (en) | Plant promoter for transgene expression | |
WO2023216415A1 (en) | Base editing system based on bimolecular deaminase complementation, and use thereof | |
CN117511942A (en) | Method for specifically knocking down plant circRNA based on CRISPR/RfxCas13d system | |
CN112375765B (en) | Disease-susceptible gene OsHXK5 of rice bacterial leaf streak and application thereof | |
CN109207485A (en) | Application of the OsAPS1 gene in improvement Rice Resistance characteristic of disease | |
US20220403396A1 (en) | Methods and compositions for dna base editing | |
CN114438056A (en) | CasF2 protein, CRISPR/Cas gene editing system and application thereof in plant gene editing | |
JP2022522823A (en) | Suppression of target gene expression by genome editing of natural miRNA | |
CN113429467B (en) | Application of NPF7.6 protein in regulation and control of nitrogen tolerance of leguminous plant root nodule | |
CN109553668B (en) | Tobacco KUP1 gene and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||