CN115216483B - poly(A)尾巴非A修饰在促进mRNA翻译中的应用 - Google Patents
poly(A)尾巴非A修饰在促进mRNA翻译中的应用 Download PDFInfo
- Publication number
- CN115216483B CN115216483B CN202210410067.7A CN202210410067A CN115216483B CN 115216483 B CN115216483 B CN 115216483B CN 202210410067 A CN202210410067 A CN 202210410067A CN 115216483 B CN115216483 B CN 115216483B
- Authority
- CN
- China
- Prior art keywords
- poly
- tail
- mrna
- modification
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108091036407 Polyadenylation Proteins 0.000 title claims abstract description 136
- 108020004999 messenger RNA Proteins 0.000 title claims abstract description 95
- 238000012986 modification Methods 0.000 title claims abstract description 94
- 230000004048 modification Effects 0.000 title claims abstract description 93
- 238000013519 translation Methods 0.000 title abstract description 58
- 241000124008 Mammalia Species 0.000 claims description 7
- 108090000623 proteins and genes Proteins 0.000 abstract description 85
- 102000004169 proteins and genes Human genes 0.000 abstract description 33
- 101100228929 Caenorhabditis elegans gld-4 gene Proteins 0.000 abstract description 31
- 241000244206 Nematoda Species 0.000 abstract description 24
- 101710124239 Poly(A) polymerase Proteins 0.000 abstract description 8
- 241000282414 Homo sapiens Species 0.000 abstract description 6
- 238000002474 experimental method Methods 0.000 abstract description 6
- 230000001737 promoting effect Effects 0.000 abstract description 3
- 238000012252 genetic analysis Methods 0.000 abstract description 2
- 210000004027 cell Anatomy 0.000 description 38
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 26
- 108091034057 RNA (poly(A)) Proteins 0.000 description 22
- 238000000034 method Methods 0.000 description 20
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 13
- 230000014509 gene expression Effects 0.000 description 13
- 125000003729 nucleotide group Chemical group 0.000 description 13
- 238000001890 transfection Methods 0.000 description 13
- 241000196324 Embryophyta Species 0.000 description 12
- 150000007523 nucleic acids Chemical class 0.000 description 12
- 239000002773 nucleotide Substances 0.000 description 12
- 210000004602 germ cell Anatomy 0.000 description 11
- 108020004707 nucleic acids Proteins 0.000 description 11
- 102000039446 nucleic acids Human genes 0.000 description 11
- 210000001082 somatic cell Anatomy 0.000 description 11
- 241000699666 Mus <mouse, genus> Species 0.000 description 10
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 10
- 244000005700 microbiome Species 0.000 description 10
- 241000206602 Eukaryota Species 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 239000000126 substance Substances 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 230000000875 corresponding effect Effects 0.000 description 7
- 238000009826 distribution Methods 0.000 description 7
- 241000894007 species Species 0.000 description 7
- 102000053602 DNA Human genes 0.000 description 6
- 241000252212 Danio rerio Species 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 241000283973 Oryctolagus cuniculus Species 0.000 description 6
- 241000209094 Oryza Species 0.000 description 6
- 235000007164 Oryza sativa Nutrition 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 230000001186 cumulative effect Effects 0.000 description 6
- 210000004681 ovum Anatomy 0.000 description 6
- 235000009566 rice Nutrition 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 101100014712 Caenorhabditis elegans gld-2 gene Proteins 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- 102000002322 Egg Proteins Human genes 0.000 description 5
- 108010000912 Egg Proteins Proteins 0.000 description 5
- 101000735431 Homo sapiens Terminal nucleotidyltransferase 4A Proteins 0.000 description 5
- 241001460678 Napo <wasp> Species 0.000 description 5
- 102100034939 Terminal nucleotidyltransferase 4A Human genes 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 230000002596 correlated effect Effects 0.000 description 5
- 235000013601 eggs Nutrition 0.000 description 5
- 238000002560 therapeutic procedure Methods 0.000 description 5
- 239000013598 vector Substances 0.000 description 5
- 241000219195 Arabidopsis thaliana Species 0.000 description 4
- 238000011529 RT qPCR Methods 0.000 description 4
- 241000282898 Sus scrofa Species 0.000 description 4
- 150000001413 amino acids Chemical group 0.000 description 4
- 238000004445 quantitative analysis Methods 0.000 description 4
- 101000735429 Homo sapiens Terminal nucleotidyltransferase 4B Proteins 0.000 description 3
- 108700008625 Reporter Genes Proteins 0.000 description 3
- 102100034938 Terminal nucleotidyltransferase 4B Human genes 0.000 description 3
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000003119 immunoblot Methods 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- 241000244203 Caenorhabditis elegans Species 0.000 description 2
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- 108010026552 Proteome Proteins 0.000 description 2
- 238000003559 RNA-seq method Methods 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 239000012620 biological material Substances 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000799 fluorescence microscopy Methods 0.000 description 2
- 101150090208 gld-4 gene Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 229920001519 homopolymer Polymers 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 108700021021 mRNA Vaccine Proteins 0.000 description 2
- 229940126582 mRNA vaccine Drugs 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 238000000528 statistical test Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000007671 third-generation sequencing Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 229960005486 vaccine Drugs 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000167854 Bourreria succulenta Species 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- ZFHXNNXMNLWKJH-HJPIBITLSA-N Cys-Tyr-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZFHXNNXMNLWKJH-HJPIBITLSA-N 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- KFHASAPTUOASQN-JYJNAYRXSA-N Gln-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KFHASAPTUOASQN-JYJNAYRXSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- GUXQAPACZVVOKX-AVGNSLFASA-N His-Lys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GUXQAPACZVVOKX-AVGNSLFASA-N 0.000 description 1
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 1
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000642191 Homo sapiens Terminal uridylyltransferase 4 Proteins 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 238000001276 Kolmogorov–Smirnov test Methods 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 1
- TWEWRDAAIYBJTO-ULQDDVLXSA-N Met-Tyr-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N TWEWRDAAIYBJTO-ULQDDVLXSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- 101710132455 Protein A2 Proteins 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 102100033225 Terminal uridylyltransferase 4 Human genes 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000011712 cell development Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000010485 coping Effects 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000006911 nucleation Effects 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 238000012910 preclinical development Methods 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 238000012353 t test Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/67—General methods for enhancing the expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/65—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2840/00—Vectors comprising a special translation-regulating system
- C12N2840/10—Vectors comprising a special translation-regulating system regulates levels of translation
- C12N2840/105—Vectors comprising a special translation-regulating system regulates levels of translation enhancing translation
Landscapes
- Genetics & Genomics (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明公开了poly(A)尾巴非A修饰在促进mRNA翻译中的应用。本发明通过分析人、小鼠细胞的poly(A)尾巴,以及mRNA的翻译,发现poly(A)尾巴非A修饰与翻译效率呈正相关。遗传学分析显示,线虫中的一种非典型poly(A)聚合酶GLD‑4可通过建立poly(A)尾巴非A修饰来调控翻译。并进一步通过实验证明poly(A)尾巴非A修饰可以有效地促进mRNA的翻译,poly(A)尾巴中的G、C、U的添加可以促进mRNA的翻译并提高相应蛋白质的含量。从而提出poly(A)尾巴非A修饰可以作为有效促进mRNA翻译的全新技术手段。
Description
技术领域
本发明涉及生物技术领域中,poly(A)尾巴非A修饰在促进mRNA翻译中的应用。
背景技术
Poly(A)尾巴影响RNA的稳定性、出核和翻译效率,因而对于RNA的代谢至关重要。然而,由于二代测序分析平台读取核酸同聚物序列的固有缺陷,大多数的转录组分析均丢失了 poly(A)尾巴信息。因此,poly(A)尾巴对于RNA的命运和功能调控是亟待解析的。研究人员使用TAIL-seq和PAL-seq(poly(A)-tail length profiling by sequencing)等方法在 Illumina平台估算了poly(A)尾巴的长度(Chang et al.,2014;Subtelny et al.,2014)。不同基因的翻译效率大不相同(Ingolia et al.,2009)。果蝇、非洲爪蟾和小鼠等多个物种的卵细胞RNA poly(A)尾巴研究显示,poly(A)尾巴长度和mRNA翻译效率之间具有明显的相关性(Eichhorn et al.,2016;Lim et al.,2016;Luong et al.,2020;Subtelny etal., 2014;Yang et al.,2020)。卵细胞中较长的poly(A)尾巴可以导致更为有效的mRNA翻译,但体细胞中却不存在相同的调控机制(Luong et al.,2020;Park et al.,2016;Subtelny et al.,2014)。出人意料的是,即使很短的poly(A)尾巴(最短20nt)也可以支持体细胞里 mRNA的翻译(Park et al.,2016)。同时也有研究观点认为体细胞中poly(A)尾巴长度只要超过20nt即可,更长的poly(A)尾巴也不会有额外的mRNA翻译调控功能(Park etal.,2016)。
TAIL-seq分析结果显示,RNA的3’末端可通过末端延伸机制掺入其他核苷酸,这种修饰目前认为可调控RNA的稳定性,从而成为了mRNA稳定性调控的另一层次(Chang etal., 2014;Chang et al.,2018;Lim et al.,2014;Lim et al.,2018;Morgan et al.,2019;Morgan et al.,2017)。去腺苷酸化的短poly(A)尾巴可被TUT4/7加上末端U修饰,这种修饰可将靶RNA分子标记并降解(Lim et al.,2014)。相反,TENT4A/B引入的G修饰可以防止 poly(A)尾巴被CCR4-NOT复合体去腺苷酸化,从而提高mRNA的稳定性(Lim et al.,2018)。
Poly(A)尾巴内部的核苷酸组成曾是一个巨大的未知,近期基于三代测序平台的PAIso-seq和FLAM-seq方法被开发出来,实现了对于poly(A)尾巴这类长核酸同聚物的读取(Legnini et al.,2019;Liu et al.,2019)。这两种方法均检测到了一种在小鼠卵子、线虫和人细胞系的poly(A)尾巴内部广泛分布的新RNA修饰:非A修饰(poly(A)Tail Internalnon-A Modifications,pATIM)(Legnini et al.,2019;Liu et al.,2019),然而其功能未知。不同基因含有pATIM的转录本比例也大不相同,这也引出了一个重要的科学问题:pATIM有什么样的功能(Arango et al.,2018;Wang et al.,2015)。pATIM功能的研究具有重要的意义。
发明内容
本发明所要解决的技术问题是如何提高mRNA翻译效率。
为解决上述技术问题,本发明首先提供了下述任一应用:
1、通过在poly(A)尾巴中掺入非A修饰在mRNA翻译或提高mRNA的翻译效率中的应用;
2、促进poly(A)尾巴非A修饰的物质在提高mRNA翻译效率中的应用;
3、促进poly(A)尾巴非A修饰的物质在制备提高mRNA翻译效率产品中的应用。
在poly(A)尾巴中掺入非A修饰可通过直接化学合成掺入非A修饰完成,也可通过生物化学方法完成,也可通过在mRNA末端连接含有非A修饰的poly(A)尾巴完成,也可通过将包括含有非A修饰的poly(A)尾巴模板的mRNA转录模板进行转录完成。
进一步,在poly(A)尾巴中掺入非A修饰可通过生物化学方法在细胞外完成,也可向细胞中导入具有促进poly(A)尾巴非A修饰功能蛋白质的编码基因完成,也即,可在细胞外完成,也可在细胞内完成。
所述促进poly(A)尾巴非A修饰的物质可为通过化学方法在poly(A)尾巴中掺入非A修饰的物质,也可为可在不含poly(A)尾巴的RNA末端连接含有非A修饰的poly(A)尾巴的物质。
所述通过生物化学方法在poly(A)尾巴中掺入非A修饰的物质可为具有在poly(A)尾巴中掺入非A修饰功能的蛋白质、基因等。所述可在不含poly(A)尾巴的RNA末端连接含有非 A修饰的poly(A)尾巴的物质可为具有在RNA末端连接含有非A修饰的poly(A)尾巴功能的蛋白质、基因等,也可为具有在RNA末端连接含有非A修饰的poly(A)尾巴功能的蛋白质、基因等与含有非A修饰poly(A)尾巴的组合物。
上述应用中,所述转录模板可为PCR双链DNA产物、酶切双链DNA产物、质粒、黏粒、噬菌体或病毒载体。
上述应用中,所述poly(A)尾巴非A修饰可为poly(A)尾巴中G、C和/或U的插入。即,含有非A修饰的poly(A)尾巴为非纯A序列。
上述应用中,所述促进poly(A)尾巴非A修饰的物质可为蛋白质或生物材料,所述蛋白质为如下A1)-A5)中的任一种:
A1)非经典的PAP(ncPAP);
A2)GLD-4、TENT4A或TENT4B蛋白质;
A3)氨基酸序列是SEQ ID No.2的GLD-4蛋白质;
A4)将序列表中SEQ ID No.2所示的氨基酸序列经过一个或几个氨基酸残基的取代和/ 或缺失和/或添加且具有相同功能的蛋白质;
A5)在A1)或A2)或A3)或A4)的N端或/和C端连接标签得到的融合蛋白质;
所述生物材料为下述B1)至B5)中的任一种:
B1)编码所述蛋白质的核酸分子;
B2)含有B1)所述核酸分子的表达盒;
B3)含有B1)所述核酸分子的重组载体、或含有B2)所述表达盒的重组载体;
B4)含有B1)所述核酸分子的重组微生物、或含有B2)所述表达盒的重组微生物、或含有B3)所述重组载体的重组微生物;
B5)含有B1)所述核酸分子的细胞系、或含有B2)所述表达盒的细胞系。
上述应用中,A2)所述蛋白质可为与GLD-4蛋白质具有75%或75%以上同一性且具有相同功能的蛋白质。所述具有75%或75%以上同一性为具有75%、具有80%、具有85%、具有90%、具有95%、具有96%、具有97%、具有98%或具有99%的同一性。
上述应用中,B1)所述核酸分子可为如下b11)或b12)或b13)或b14):
b11)编码序列是序列表中SEQ ID No.1的cDNA分子或DNA分子;
b12)序列表中SEQ ID No.1所示的DNA分子;
b13)与b11)或b12)限定的核苷酸序列具有75%或75%以上同一性,且编码所述蛋白质的cDNA分子或DNA分子;
b14)在严格条件下与b11)或b12)或b13)限定的核苷酸序列杂交,且编码所述蛋白质的cDNA分子或DNA分子。
其中,所述核酸分子可以是DNA,如cDNA、基因组DNA或重组DNA;所述核酸分子也可以是RNA,如mRNA或hnRNA等。
本领域普通技术人员可以很容易地采用已知的方法,例如定向进化和点突变的方法,对本发明的编码所述蛋白质的核苷酸序列进行突变。那些经过人工修饰的,具有与本发明的所述蛋白质的核苷酸序列75%或者更高同一性的核苷酸,只要编码所述蛋白质且具有所述蛋白质功能,均是衍生于本发明的核苷酸序列并且等同于本发明的序列。
这里使用的术语“同一性”指与天然核酸序列的序列相似性。“同一性”包括与本发明的编码SEQ ID No.2所示的氨基酸序列组成的蛋白质的核苷酸序列具有75%或更高,或85%或更高,或90%或更高,或95%或更高同一性的核苷酸序列。同一性可以用肉眼或计算机软件进行评价。使用计算机软件,两个或多个序列之间的同一性可以用百分比(%)表示,其可以用来评价相关序列之间的同一性。
上述应用中,所述严格条件可为如下:50℃,在7%十二烷基硫酸钠(SDS)、0.5MNaPO4和1mM EDTA的混合溶液中杂交,在50℃,2×SSC,0.1%SDS中漂洗;还可为:50℃,在7%SDS、0.5M NaPO4和1mM EDTA的混合溶液中杂交,在50℃,1×SSC,0.1%SDS 中漂洗;还可为:50℃,在7%SDS、0.5M NaPO4和1mM EDTA的混合溶液中杂交,在 50℃,0.5×SSC,0.1%SDS中漂洗;还可为:50℃,在7%SDS、0.5M NaPO4和1mM EDTA 的混合溶液中杂交,在50℃,0.1×SSC,0.1%SDS中漂洗;还可为:50℃,在7%SDS、 0.5M NaPO4和1mM EDTA的混合溶液中杂交,在65℃,0.1×SSC,0.1%SDS中漂洗;也可为:在6×SSC,0.5%SDS的溶液中,在65℃下杂交,然后用2×SSC,0.1%SDS和1×SSC,0.1% SDS各洗膜一次;也可为:2×SSC,0.1%SDS的溶液中,在68℃下杂交并洗膜2次,每次 5min,又于0.5×SSC,0.1%SDS的溶液中,在68℃下杂交并洗膜2次,每次15min;也可为:0.1×SSPE(或0.1×SSC)、0.1%SDS的溶液中,65℃条件下杂交并洗膜。
上述75%或75%以上同一性,可为80%、85%、90%或95%以上的同一性。
上述应用中,B2)所述的含有编码所述蛋白质的核酸分子的表达盒(基因表达盒),是指能够在宿主细胞中表达所述蛋白质的DNA,该DNA不但可包括启动所述蛋白质编码基因转录的启动子,还可包括终止所述蛋白质编码基因基因转录的终止子。进一步,所述表达盒还可包括增强子序列。
可用现有的表达载体构建含有所述蛋白质编码基因表达盒的重组载体。
上述应用中,所述载体可为质粒、黏粒、噬菌体或病毒载体。
上述应用中,所述微生物可为酵母、细菌、藻或真菌。
上述应用中,所述细胞系可包括也可不包括繁殖材料。
上述应用中,所述mRNA翻译可为真核生物中的mRNA翻译。
上述应用中,所述mRNA翻译可为所述真核生物生殖细胞或体细胞中的mRNA翻译。
上述应用中,所述真核生物可为动物或植物或真核微生物。
所述动物可为哺乳动物(如人、小鼠、兔或猪)或非哺乳动物(如斑马鱼、果蝇或线虫)。
所述植物可为双子叶植物(如拟南芥)或单子叶植物(如水稻)。
所述真核微生物可为酵母。
本发明还提供了提高mRNA翻译效率的方法,所述方法包括:对目的mRNA的poly(A)尾巴进行非A修饰,实现目的mRNA翻译效率的提高。
上述方法中,对目的mRNA的poly(A)尾巴进行非A修饰可通过直接化学合成掺入非A 修饰完成,也可通过生物化学方法在mRNA末端生成含有非A修饰的poly(A)尾巴完成,也可通过在mRNA末端连接含有非A修饰的poly(A)尾巴完成,也可通过将包括含有非A修饰的poly(A)尾巴模板的mRNA转录模板进行转录完成。
上述方法中,所述非A修饰通过施加所述促进poly(A)尾巴非A修饰的物质实现。
上述方法中,所述mRNA翻译可为真核生物中的mRNA翻译。
上述方法中,所述mRNA翻译可为所述真核生物生殖细胞或体细胞中的mRNA翻译。
上述方法中,所述真核生物可为动物或植物或真核微生物。
所述动物可为哺乳动物(如人、小鼠、兔或猪)或非哺乳动物(如斑马鱼、果蝇或线虫)。
所述植物可为双子叶植物(如拟南芥)或单子叶植物(如水稻)。
所述真核微生物可为酵母。
本发明还提供了提高mRNA翻译效率的产品,所述产品为所述促进poly(A)尾巴非A修饰的物质。
上述产品中,所述mRNA翻译可为真核生物中的mRNA翻译。
上述产品中,所述mRNA翻译可为所述真核生物生殖细胞或体细胞中的mRNA翻译。
上述产品中,所述真核生物可为动物或植物或真核微生物。
所述动物可为哺乳动物(如人、小鼠、兔或猪)或非哺乳动物(如斑马鱼、果蝇或线虫)。
所述植物可为双子叶植物(如拟南芥)或单子叶植物(如水稻)。
所述真核微生物可为酵母。
本发明中,poly(A)尾巴的非A修饰是指poly(A)尾巴中含有G、C和/或U。poly(A)尾巴的非A修饰可通过将poly(A)尾巴中任意一个或多个A替换为G、C和/或U,或在poly(A) 尾巴中任意位置添加一个或多个G、C和/或U实现。“多个”可为两个或三个或四个或五个或六个或七个或八个或九个或十个等。
本发明中poly(A)尾巴中非A修饰的G、C和/或U的位置与个数没有要求,只要可以提高翻译效率,均在本发明的保护范围内。
在本发明的一个实施例中,含有非A修饰的poly(A)尾巴的序列为SEQ ID No.4、5、6。
本发明通过分析人、小鼠、兔、猪、斑马鱼、果蝇、线虫、拟南芥、水稻和酵母的poly(A) 尾巴,发现pATIM在真核生物中是RNA poly(A)尾巴的一种保守特性,其分布没有物种和组织特异性。通过分析mRNA的翻译,发现pATIM可以提高翻译效率。遗传学分析显示,线虫中的一种非典型poly(A)聚合酶GLD-4可通过建立pATIM来调控翻译。并通过在体外转录合成的mRNA的poly(A)尾巴中添加非A修饰,与相应的含有纯A的同样长度poly(A)尾巴的mRNA进行比较,进一步发现poly(A)尾巴中非A修饰可以有效地促进mRNA的翻译,poly(A)尾巴中的G、C、U的添加可以促进mRNA的翻译并提高相应蛋白质的含量。说明,poly(A)尾巴中非A修饰在真核生物中扮演的潜在普遍性翻译调控角色,更证实了poly(A)尾巴中非A修饰是提高mRNA效率的有效工具。
近些年来,医药工程领域基于mRNA的相关治疗方法蓬勃发展(Sahin et al.,2014)。由于mRNA更易设计和制备,与其他基于蛋白的疗法相比,mRNA疗法的成本优势十分突出。 mRNA疫苗的临床前开发可以十分迅速,对于应对突发性新发传染病往往具有巨大潜力 (Deweerdt,2019;Zhong et al.,2018)。第一种mRNA疫苗从开始研发到进入1/2期临床试验只花了1个月(Pardi et al.,2018;Thanh Le et al.,2020;Wang et al.,2020)。在mRNA疗法和疫苗研发过程中,mRNA翻译效率的提高是一大主要技术挑战(Sahin et al.,2014;Zhang et al.,2019)。poly(A)尾巴中非A修饰促进翻译效率新机制的发现,可以使得在mRNA的poly(A)尾巴中掺入非A修饰作为一种全新的技术进一步推动 mRNA疗法、疫苗等多种mRNA应用。
附图说明
图1为poly(A)尾巴序列分析流程图。
图2为poly(A)尾巴非A修饰与翻译效率正相关。
(A-D)不同样品中全基因组范围内poly(A)尾巴具有/不具有poly(A)尾巴非A修饰的基因翻译效率(mRNA-normalized TE)累积分布频率(CDF)。作图数据的来源样品分别为:(A)GV 期小鼠卵子(基因数:有G,n=4104;无G,n=1363;有C,n=4109;无C,n=1358;有U,n=4116;无U,n=1351),(B)线虫生殖细胞(基因数:有G,n=1390;无G,n=1585;有C,n=2233;无C,n=742;有U,n=2170;无U,n=805),(C)线虫体细胞(基因数:有G,n=539;无G,n=934;有C,n=909;无C,n=564;有U,n=842;无U,n=631),(D)HeLa细胞(基因数:有G,n=2075;无G,n=810;有C,n=2323;无C,n=562;有U,n=1884;无U,n=1001)。 poly(A)尾巴非A修饰和TE之间的相关性用Kolmogorov-Smirnov(KS)检验计算p值。
(E-F)具有/不具有poly(A)尾巴非A修饰的基因质谱定量蛋白丰度累积分布频率图。数据分别来自于GV期卵子(E)(基因数:有G,n=2307;无G,n=768;有C,n=2308;无C, n=767;有U,n=2306;无U,n=769),和HeLa细胞(F)(基因数:有G,n=2398;无G,n=1029;有C,n=2694;无C,n=733;有U,n=2166;无U,n=1261)。p值由KS检验求得。
有G:pATIM-G,即poly(A)尾巴中含有G;
无G:poly(A)尾巴中不含G;
有C:pATIM-C,即poly(A)尾巴中含有C;
无C:poly(A)尾巴中不含C;
有U:pATIM-U,即poly(A)尾巴中含有U;
无U:poly(A)尾巴中不含U。
TE:翻译效率(Translational efficiency)。
P值是统计两条累计分布曲线的KS统计检验p值。
图3为10个物种的GLD-4的同源基因所编码蛋白质的序列分析。下划线表示对于催化活性和NTP结合重要的氨基酸。
图4为GLD-4是线虫里poly(A)尾巴非A修饰的建立和翻译效率的调控所必需的。
(A)箱型图分别展示了WT和gld-4(ef15)突变体背景下,每个基因含有pATIM-G(左), pATIM-C(中)和pATIM-U(右)修饰的转录本占比。依据转录本含有poly(A)尾巴非A修饰比例的低(在WT中一个基因包含的转录本含有G,C,U修饰的转录本数量占总的有poly(A)尾巴转录本数比例小于5%)或者高(在WT中一个基因包含的转录本含有G,C,U修饰的转录本数量占总的有poly(A)尾巴转录本数比例大于等于5%),WT中的基因被分为2组(基因数:低G,n=2758;高G,n=169;低C,n=2374;高C,n=553;低U,n=2392;高U,n=535)。 p值由双尾t检验求得。
(B)gld-4 RNAi(gld-4突变体)和WT线虫中mRNA校正的核糖体结合(TE)变化累积频率分布图。数据分别展示了具有及不具有poly(A)尾巴非A修饰基因的数据(基因数:有G,n=970;无G,n=2869;有C,n=1873;无C,n=1966;有U,n=1816;无U,n=2023)。P值是统计两条累计分布曲线的KS统计检验p值。
图5为poly(A)尾巴非A修饰增强了翻译效率。
(A)尾巴长度一致但具有不同poly(A)尾巴非A修饰的报告mRNA示意图。
(B)基于不同poly(A)尾巴的转录本翻译效率报告系统示意图。
(C)HeLa细胞转染不同poly(A)尾巴报告mRNA18小时后的图像。标尺长度:100μm。DIC表示微分干涉对比明场成像。
(D)HeLa细胞转染报告mRNA42小时后表达的FACS定量结果。上图为荧光信号散点图,下图为荧光信号分布直方图。
(E)HeLa细胞转染报告mRNA42h后的qRT-PCR(左,mRNA水平)和免疫印迹(右,蛋白水平,使用抗Flag抗体检测所有报告基因融合表达的Flag标签)定量结果。
图6为不同时间点报告mRNA的表达定量分析。
(A)FACS(流式细胞荧光分选技术)检测到的转染后不同时间点HeLa细胞中报告EGFPmRNA表达水平定量分析。对照组为未转染mRNA的细胞。A98和A98pATIMs-2为分别转染了具有A98和A98pATIMs-2尾巴报告mRNA的细胞。
(B)FACS检测到的转染后不同时间点HeLa细胞中EGFP的相对荧光强度的定量分析。参与分析的细胞与上述(A)中的细胞一致。
P2表示较低严谨度的阈值门,包括荧光较弱的细胞;P3表示较高严谨度的阈值门,只包括荧光强的细胞。
具体实施方式
下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。以下提供的实施例可作为本技术领域普通技术人员进行进一步改进的指南,并不以任何方式构成对本发明的限制。
下述实施例中的实验方法,如无特殊说明,均为常规方法,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。下述实施例中所用的材料、试剂、仪器等,如无特殊说明,均可从商业途径得到。以下实施例中的定量实验,均设置三次重复实验,结果取平均值。下述实施例中,如无特殊说明,序列表中各核苷酸序列的第1位均为相应 DNA/RNA的5′末端核苷酸,末位均为相应DNA/RNA的3′末端核苷酸。
下述实施例中的gld-4(ef15)突变体(Mark Schmid,BeateKüchler,Christian R.Eckmann,Two conserved regulatory cytoplasmic poly(A)polymerases,GLD-4and GLD-2,regulate meiotic progression in C.elegans,Genes&Dev.2009.23:824-836),公众可从申请人处获得该生物材料,该生物材料只为重复本发明的相关实验所用,不可作为其它用途使用。gld-4(ef15)突变体与背景个体的区别仅在于GLD-4基因发生了突变。 GLD-4基因的序列为序列表中SEQ ID No.1,编码SEQ ID No.2所示的GLD-4蛋白质。
实施例1、poly(A)尾巴非A修饰是真核生物RNA poly(A)尾巴的一个保守特征
本实施例通过分析人、小鼠、兔、猪、斑马鱼、果蝇、线虫、拟南芥、水稻和酵母的poly(A)尾巴,发现poly(A)尾巴非A修饰(简称为pATIM)在真核生物中是RNA poly(A) 尾巴的一种保守特性,其分布没有物种和组织特异性。
待测样本:人Hela细胞,小鼠肝脏,兔肝脏,猪肝脏,斑马鱼个体,果蝇个体,线虫个体,拟南芥,水稻叶片,酵母。
提取各样本的总RNA,而后制备PAIso-seq文库,然后进行第三代测序并对测序结果进行分析,将PAIso-seq测序序列比对到基因组,对于RNA末端非基因组模板转录的不能比对到基因组的序列即为poly(A)尾巴,详细分析方法见文献 (10.1038/s41467-019-13228-9),步骤如图1所示,各物种的poly(A)尾巴见表1。
表1、10个物种的poly(A)尾巴非A修饰
实施例2、poly(A)尾巴非A修饰与翻译效率正相关
本实施例发明人利用已有的小鼠GV期卵子(GV卵)(GSE135525)、线虫生殖细胞(GSE62859)与体细胞(GSE58918)以及HeLa细胞(GSE79664)的核糖体结合数据来分析翻译效率与poly(A)尾巴非A修饰之间的关系,比较含有和不含有poly(A)尾巴非A修饰两组基因的翻译效率累计曲线。所分析基因为数据中至少检测到10条poly(A)尾巴的基因。在小鼠GV期卵子数据中一个基因“有G”定义为该基因的数据中大于等于25%的poly(A) 尾巴中有G,“无G”定义为该基因的数据中小于25%的poly(A)尾巴中无G,“有C”定义为该基因的数据中大于等于25%的poly(A)尾巴中有C,“无C”定义为该基因的数据中小于25%的poly(A)尾巴中无C,“有U”定义为该基因的数据中大于等于25%的poly(A)尾巴中有U,“无U”定义为该基因的数据中小于25%的poly(A)尾巴中无U。在线虫细胞和HeLa 细胞数据中,由于数据测序深度较低且非A碱基含量较GV卵中更低,一个基因“有G”定义为该基因的数据中至少有一条poly(A)尾巴中有G,“无G”定义为该基因的数据中所有的poly(A)尾巴中无G,“有C”定义为该基因的数据中至少有一条poly(A)尾巴中有C,“无C”定义为该基因的数据中所有的poly(A)尾巴中无C,“有U”定义为该基因的数据中至少有一条poly(A)尾巴中有U,“无U”定义为该基因的数据中所有的poly(A)尾巴中无U。
发现pATIM-G(即poly(A)尾巴中含有G)和pATIM-C(即poly(A)尾巴中含有C)修饰与翻译效率呈正相关。其中,翻译效率(Translational Efficiency,TE)是指样本中某个基因的总mRNA分子中与核糖体结合并进行翻译mRNA分子占总mRNA分子的比例,利用基因的Ribo-seq和RNA-seq的数据,就可以直接计算这个数值,具体计算公式为:TE =(FPKM inRibo-seq)/(FPKM in RNA-seq)。
结果发现,无论在生殖细胞还是体细胞中,pATIM-G和pATIM-C修饰都有显著更高的翻译效率(图2中A-D)。而pATIM-U(即poly(A)尾巴中含有U)修饰也可以提高翻译效率。
考虑到mRNA翻译的最终结果是相应蛋白的积累,发明人利用已发表的质谱蛋白组数据在GV卵(即小鼠GV期卵子)和HeLa细胞里分析了poly(A)尾巴非A修饰与蛋白丰度之间的关系,比较含有和不含有poly(A)尾巴非A修饰两组基因的翻译效率累计曲线。已发表的质谱蛋白组数据来自于:Bekker-Jensen et al.,2017;Wang et al.,2010 (10.1016/j.cels.2017.05.009;10.1073/pnas.1013185107)。
结果显示,无论是在GV卵里还是在HeLa细胞里,含有poly(A)尾巴非A修饰的基因均有着较高的表达丰度,这一结果也与核糖体结合数据相吻合(图2中E,F)。
上述系列结果显示,无论是在生殖细胞还是在体细胞中,poly(A)尾巴中的poly(A)尾巴非A修饰都可能对mRNA起到一种普遍的翻译调控作用,进一步表明poly(A)尾巴并非仅仅是mRNA的一个简单结构组成,更可以在生殖细胞和体细胞里作为mRNA功能的关键调控因子发挥作用。
实施例3、线虫中GLD-4通过建立poly(A)尾巴非A修饰增强翻译效率
Poly(A)聚合酶(PAP)催化产生了RNA poly(A)尾巴。经典PAP催化了转录偶联的mRNA 多聚腺苷酸化,而非经典的PAP(ncPAP)催化了转录后的poly(A)尾巴重新多聚腺苷酸化(Lim et al.,2014;Lim et al.,2018)。线虫里的两种ncPAP——全身性表达的GLD-2和主要表达于生殖细胞里的GLD-4对于生殖细胞发育十分重要(Schmid et al.,2009;Wanget al.,2002)。gld-2敲除可到导致poly(A)尾巴长度发生全局性降低;而gld-4敲除后,转录组水平的poly(A)尾巴长度仅有微弱下降(Nousch et al.,2014)。gld-4突变体中的总体翻译效率发生下降,而类似的下降并未在gld-2突变体中观察到(Millonigg et al.,2014;Nousch et al.,2014),暗示GLD-2和GLD-4可能具有不同的功能机制。
实施例1中所提及的10个物种均含有1-2个GLD-4的同源基因,这些基因编码的蛋白都具有保守的核心核苷酸转移酶结构域(图3)。哺乳动物基因组中的GLD-4同源基因为TENT4A/B。纯化的TENT4A和TENT4B蛋白可以催化产生主要由A组成的、同时掺入G、U和C 的poly(A)尾巴,这一结果也与在HeLa细胞中所观察到的混合尾巴相一致(Limetal.,2018,10.1126/science.aam5794)。
本实施例发现,线虫中的一种非典型poly(A)聚合酶GLD-4可通过建立poly(A)尾巴非A 修饰来调控翻译,GLD-4/TENT4A/TENT4B催化产生了RNA poly(A)尾巴非A修饰。
发明人首先以gld-4(ef15)突变体的完整个体为待测样本,提取其总RNA,而后按照实施例1的方法制备PAIso-seq文库,然后进行第三代测序并对RNA poly(A)尾巴序列进行了分析,实验包含2个生物学重复,并以野生型线虫(WT)为对照。线虫中Gld-4的cDNA 序列为序列表中SEQ ID No.1,编码序列表中SEQ ID No.2所示的Gld-4蛋白质。
结果发现,野生型(WT)转录本中带有poly(A)尾巴非A修饰的基因,在gld-4突变体中有显著的pATIM-G、pATIM-C和pATIM-U修饰的丢失(图4中A)。这一结果显示,GLD-4可以催化产生带有poly(A)尾巴非A修饰的RNA尾巴。
考虑到线虫生殖细胞基因中具有poly(A)尾巴非A修饰的转录本比例与翻译效率呈正相关,发明人还比较了gld-4敲除前后,具有和不具有poly(A)尾巴非A修饰的基因,它们的翻译效率有没有发生变化。
符合预期的是,gld-4突变体中,相较于不具有poly(A)尾巴非A修饰的基因,具有poly(A) 尾巴非A修饰的基因有显著更高的翻译效率(图4中B),这一结果再次证实GLD-4可通过催化poly(A)尾巴非A修饰来提高翻译效率。
综上,GLD-4/TENT4A/TENT4B这类ncPAP能够催化产生poly(A)尾巴中的这种进化上保守的、可促进mRNA翻译的poly(A)尾巴非A修饰,提高mRNA的翻译效率。
实施例4、人工合成的poly(A)尾巴非A修饰可促进翻译
由于poly(A)尾巴非A修饰与翻译效率正相关,而线虫中GLD-4的丢失伴随着poly(A) 尾巴非A修饰的减少与翻译效率的降低,因而发明人进一步利用体外合成的mRNA报告系统来研究poly(A)尾巴非A修饰在翻译效率方面的作用。发明人合成了poly(A)尾巴长度相同,但poly(A)尾巴非A修饰数量不同的荧光报告基因(EGFP与mCherry)mRNA(图5中A),并将这些mRNA转染HeLa细胞,检测对应的报告mRNA及其编码蛋白的动态变化(图5中B)。步骤如下:
所合成mRNA为四种EGFP的mRNA与四种mCherry的mRNA,EGFP的四种mRNA分别是EGFP 的CDS序列后分别跟随A98、A98pATIMs-1、A98pATIMs-2或A98pATIMs-3序列,mCherry的四种mRNA分别是mCherry的CDS序列后分别跟随A98、A98pATIMs-1、A98pATIMs-2或A98pATIMs-3序列。A98、A98pATIMs-1、A98pATIMs-2、A98pATIMs-3与CDS序列间均含有编码FLAG标签的RNA序列(GAUUACAAGGACGACGAUGACAAG)。
A98、A98pATIMs-1、A98pATIMs-2与A98pATIMs-3分别为四种poly(A)尾巴,其序列分别为序列表中SEQ ID No.3-6(即图5A中TGA之后的序列),A98不含有poly(A)尾巴非A 修饰。EGFP的CDS序列为序列表中SEQ ID No.7(该序列为图5A中自5’端至TGA的序列),mCherry的CDS序列为序列表中SEQ ID No.8(该序列为图5A中自5’端至TGA的序列)。
用RNase-free水溶解各mRNA,使用前储存于-80℃。
HeLa细胞培养于37℃、5%CO2的潮湿培养箱中,培养基为含有10%FBS(Gibco)的DMEM 培养基(Invitrogen)。HeLa细胞的mRNA转染使用的是Lipofectamine MessengerMAX转染试剂(Invitrogen)。每种mRNA的转染量均为2500ng/(250,000–1,000,000细胞)。
转染之后的特定时间点,拍照、进行FACS分析或者收取细胞进行下一步实验。
在转染18小时,荧光显微镜检测转染不同EGFPmRNA的细胞的荧光信号,结果显示,与纯A尾巴(A98)对照相比,具有poly(A)尾巴非A修饰尾巴的mRNA有显著更高的EGFP荧光蛋白表达水平(图5中C)。
在转染42小时,用荧光激活细胞分选技术(Fluorescent-activated CellSorting,FACS) 来比较A98pATIMs-2修饰的poly(A)尾巴和纯A尾巴之间的翻译效率差异。相较之下, A98pATIMs-2报告基因的EGFP荧光有所增强;而在mCherry中也观察到了类似的荧光增强现象(图5中D)。这些结果显示poly(A)尾巴非A修饰对于翻译的增强效果并不局限于某种特定的mRNA。利用未转染mRNA的HeLa细胞作为对照组(Control)。
在转染42小时,分别通过免疫印迹和qPCR定量分析这些细胞中的蛋白和mRNA水平。利用未转染mRNA的HeLa细胞作为对照组(Control)。
qPCR定量分析所用引物如表2所示。
表2、引物序列
免疫印迹所用一抗:鼠抗FLAG(Sigma,F1804;1:1000稀释)、鼠抗Tubulin(Sigma,T9026;1:1000稀释)。二抗:荧光抗体(LI-COR,925-32210,1:10000稀释)。Tubulin 为内参。
与荧光显微镜镜检观察和FACS分析结果一致的是,转染了带有poly(A)尾巴非A修饰 mRNA的细胞,其Cherry和EGFP蛋白产物均有更高的积累(图5中E)。qPCR结果显示,相应的mRNA水平并无增加,显示蛋白水平的增加是由更高的翻译效率而非更强的mRNA稳定性所导致的(图5中E)。
细胞转染后分时间点收样检测荧光,结果显示,mRNA的这种翻译增强效果在转染6-120h 内是持续的(图6)。利用未转染mRNA的HeLa细胞作为对照组(Control)。
上述实验结果清楚地揭示,poly(A)尾巴非A修饰可以极大地促进mRNA翻译。
以上对本发明进行了详述。对于本领域技术人员来说,在不脱离本发明的宗旨和范围,以及无需进行不必要的实验情况下,可在等同参数、浓度和条件下,在较宽范围内实施本发明。虽然本发明给出了特殊的实施例,应该理解为,可以对本发明作进一步的改进。总之,按本发明的原理,本申请欲包括任何变更、用途或对本发明的改进,包括脱离了本申请中已公开范围,而用本领域已知的常规技术进行的改变。按以下附带的权利要求的范围,可以进行一些基本特征的应用。
<110>中国科学院遗传与发育生物学研究所
<120>poly(A)尾巴非A修饰在促进mRNA翻译中的应用
<160> 8
<170>PatentIn version 3.5
<210> 1
<211> 2538
<212> DNA
<213> 线虫(Caenorhabditis Elegans)
<400> 1
atgaatgaagacagcagattatcatcatcacaacaaccgtcaacgtcgacacctcgttca 60
tcgattccttcgactatgaattcggatgaaccaaatacgtgtcgccgtcttagtcagtcc 120
caggaacagccatcaacgtccagaacttgcaaaagtgaaactccagagtttggatatagt 180
gattcattgccgttcgccccatggcgaaggaaacggtacggactgaatattcaaggactg 240
cacgaggaaattgtcgatatgtatcattggataaaaccgaacgagatagagtcgcgactt 300
cgaacgaaggtctttgagaaggtccgcgattctgtgttacggcgatggaaacagaaaaca 360
ataaaaatctcaatgtttggaagtcttagaacgaatctcttcttacctacttcggatatt 420
gacgttctcgttgaatgcgatgattgggttggaacacctggagattggcttgctgaaaca 480
gcacgtggacttgaagcggacaatattgcggagtcggtgatggtctatggaggagcattt 540
gtgcctattgtgaaaatggtggatagagatactcgattgagcattgatatttcattcaat 600
acagttcaaggagtacgagcagcttcatacattgcaaaggtgaaagaagagtttccatta 660
atagaaccacttgttctcctgctcaaacaattccttcattaccgtaatctcaatcaaaca 720
ttcaccggaggtctatcctcctatggactcgtccttttgctcgttaactttttccagctt 780
tacgccctaaacatgcgaagtcgaacaatttacgaccgaggagtcaatctaggacactta 840
ttgcttcgtttcctcgagctgtattctttagagttcaactttgaagaaatgggaatatca 900
cctggacaatgttgctatattccaaagtcggctagcggagcaagatacggtcacaagcag 960
gctcaaccagggaatctagcattggaagatccattacttactgcaaatgatgtcggtaga 1020
tctacatacaacttttcttcaatagccaatgcctttgggcaagcttttcaaatattactt 1080
gttgccgtcacattacgcgagcgaaaagggaaaaatcatgtggctatgagagcttacaaa 1140
ggatctcttctccatctcattatgccattcacatcaaaagagctcacatatcgtaactgg 1200
ttaatgagtggtgtgttatcaatgcctggccaagaagcaccggcatcgtatgacttaaat 1260
caattgcataacactttggtttctccgatggtcgatctttcacgatacgcttggcttaga 1320
aaagcaccagcgaaggcagaaaagcgggattctcgacctctgacgattgtaaatccagca 1380
gatgatcgtcaaactcttgcccaacagttgaagaagcagattctggaacaaactgaagca 1440
aagaaatcgttagaaaaaatgccggcttgtgatgacaacaaaaaagaagaggaattggtt 1500
gcaacaagagaaactgatgttgaattggaagcagaagatactgaatcagaaggtcatcat 1560
aatggtgaaaatgatttgattcttactggaccacctctaccaacgtctacacaatccgtc 1620
aatacatctgccacagtatctactgcagcgagtatttcggagcgagaagacactgattca 1680
ccaggactgtcttcgtcaatgggaaatcaatcttcagaagaagatgaggataacggaatt 1740
aataatcgaaacaattctgctgttcctgttcaattcaagaaacccttcaatgaagttgtt 1800
gcccagccagcaagagaatcgaagagaacgcagacaaccagtgaagacaagatgcaagat 1860
caattccatttcaatggatactcatatcctccaccctcacgatatgcagcaggtaccgcg 1920
gcaccatcacacaaacatcgtaatgcgcacccgcaacggcagcgtccttcaattcgtaat 1980
ctttcacaaggctctgatgggtcagatgaatacaatgttgaatcatggaacaacaatata 2040
agacaaggaagaagagcctcgtcgaattcaccaagtccatccagacagcagacgaatact 2100
agaaattgtggacccactaataatattccctacgattcatttcgctcacaaaacaagaat 2160
tcgacgttagatggttcgaataattcatcagaagagccaataacaatgtatgccgatgtt 2220
gttaaaaagaagtcttcgattacaacatcaaccaatacttccactgcagatgtcaacgtc 2280
accaatggaaatccaatacccgcaaatggaataattccacaatctatggccgtggtcaac 2340
gttggaagaggatcctatagaaatgctcttacaacttcccccatgacgcctccatcagca 2400
catacttcaatgcagaagcaacatcatttgcgtaaagataacgaatgtggtttcgacaac 2460
aactcagccacttcttctactgatttaagccatcatcaaccacaattggttccaccggtg 2520
aatcgtctgcagcgatag 2538
<210> 2
<211> 845
<212> PRT
<213> 线虫(Caenorhabditis Elegans)
<400> 2
Met Asn Glu Asp Ser Arg Leu Ser Ser Ser Gln Gln Pro Ser Thr Ser
1 5 10 15
Thr Pro Arg Ser Ser Ile Pro Ser Thr Met Asn Ser Asp Glu Pro Asn
20 25 30
Thr Cys Arg Arg Leu Ser Gln Ser Gln Glu Gln Pro Ser Thr Ser Arg
35 40 45
Thr Cys Lys Ser Glu Thr Pro Glu Phe Gly Tyr Ser Asp Ser Leu Pro
50 55 60
Phe Ala Pro Trp Arg Arg Lys Arg Tyr Gly Leu Asn Ile Gln Gly Leu
65 70 75 80
His Glu Glu Ile Val Asp Met Tyr His Trp Ile Lys Pro Asn Glu Ile
85 90 95
Glu Ser Arg Leu Arg Thr Lys Val Phe Glu Lys Val Arg Asp Ser Val
100 105 110
Leu Arg Arg Trp Lys Gln Lys Thr Ile Lys Ile Ser Met Phe Gly Ser
115 120 125
Leu Arg Thr Asn Leu Phe Leu Pro Thr Ser Asp Ile Asp Val Leu Val
130 135 140
Glu Cys Asp Asp Trp Val Gly Thr Pro Gly Asp Trp Leu Ala Glu Thr
145 150 155 160
Ala Arg Gly Leu Glu Ala Asp Asn Ile Ala Glu Ser Val Met Val Tyr
165 170 175
Gly Gly Ala Phe Val Pro Ile Val Lys Met Val Asp Arg Asp Thr Arg
180 185 190
Leu Ser Ile Asp Ile Ser Phe Asn Thr Val Gln Gly Val Arg Ala Ala
195 200 205
Ser Tyr Ile Ala Lys Val Lys Glu Glu Phe Pro Leu Ile Glu Pro Leu
210 215 220
Val Leu Leu Leu Lys Gln Phe Leu His Tyr Arg Asn Leu Asn Gln Thr
225 230 235 240
Phe Thr Gly Gly Leu Ser Ser Tyr Gly Leu Val Leu Leu Leu Val Asn
245 250 255
Phe Phe Gln Leu Tyr Ala Leu Asn Met Arg Ser Arg Thr Ile Tyr Asp
260 265 270
Arg Gly Val Asn Leu Gly His Leu Leu Leu Arg Phe Leu Glu Leu Tyr
275 280 285
Ser Leu Glu Phe Asn Phe Glu Glu Met Gly Ile Ser Pro Gly Gln Cys
290 295 300
Cys Tyr Ile Pro Lys Ser Ala Ser Gly Ala Arg Tyr Gly His Lys Gln
305 310 315 320
Ala Gln Pro Gly Asn Leu Ala Leu Glu Asp Pro Leu Leu Thr Ala Asn
325 330 335
Asp Val Gly Arg Ser Thr Tyr Asn Phe Ser Ser Ile Ala Asn Ala Phe
340 345 350
Gly Gln Ala Phe Gln Ile Leu Leu Val Ala Val Thr Leu Arg Glu Arg
355 360 365
Lys Gly Lys Asn His Val Ala Met Arg Ala Tyr Lys Gly Ser Leu Leu
370 375 380
His Leu Ile Met Pro Phe Thr Ser Lys Glu Leu Thr Tyr Arg Asn Trp
385 390 395 400
Leu Met Ser Gly Val Leu Ser Met Pro Gly Gln Glu Ala Pro Ala Ser
405 410 415
Tyr Asp Leu Asn Gln Leu His Asn Thr Leu Val Ser Pro Met Val Asp
420 425 430
Leu Ser Arg Tyr Ala Trp Leu Arg Lys Ala Pro Ala Lys Ala Glu Lys
435 440 445
Arg Asp Ser Arg Pro Leu Thr Ile Val Asn Pro Ala Asp Asp Arg Gln
450 455 460
Thr Leu Ala Gln Gln Leu Lys Lys Gln Ile Leu Glu Gln Thr Glu Ala
465 470 475 480
Lys Lys Ser Leu Glu Lys Met Pro Ala Cys Asp Asp Asn Lys Lys Glu
485 490 495
Glu Glu Leu Val Ala Thr Arg Glu Thr Asp Val Glu Leu Glu Ala Glu
500 505 510
Asp Thr Glu Ser Glu Gly His His Asn Gly Glu Asn Asp Leu Ile Leu
515 520 525
Thr Gly Pro Pro Leu Pro Thr Ser Thr Gln Ser Val Asn Thr Ser Ala
530 535 540
Thr Val Ser Thr Ala Ala Ser Ile Ser Glu Arg Glu Asp Thr Asp Ser
545 550 555 560
Pro Gly Leu Ser Ser Ser Met Gly Asn Gln Ser Ser Glu Glu Asp Glu
565 570 575
Asp Asn Gly Ile Asn Asn Arg Asn Asn Ser Ala Val Pro Val Gln Phe
580 585 590
Lys Lys Pro Phe Asn Glu Val Val Ala Gln Pro Ala Arg Glu Ser Lys
595 600 605
Arg Thr Gln Thr Thr Ser Glu Asp Lys Met Gln Asp Gln Phe His Phe
610 615 620
Asn Gly Tyr Ser Tyr Pro Pro Pro Ser Arg Tyr Ala Ala Gly Thr Ala
625 630 635 640
Ala Pro Ser His Lys His Arg Asn Ala His Pro Gln Arg Gln Arg Pro
645 650 655
Ser Ile Arg Asn Leu Ser Gln Gly Ser Asp Gly Ser Asp Glu Tyr Asn
660 665 670
Val Glu Ser Trp Asn Asn Asn Ile Arg Gln Gly Arg Arg Ala Ser Ser
675 680 685
Asn Ser Pro Ser Pro Ser Arg Gln Gln Thr Asn Thr Arg Asn Cys Gly
690 695 700
Pro Thr Asn Asn Ile Pro Tyr Asp Ser Phe Arg Ser Gln Asn Lys Asn
705 710 715 720
Ser Thr Leu Asp Gly Ser Asn Asn Ser Ser Glu Glu Pro Ile Thr Met
725 730 735
Tyr Ala Asp Val Val Lys Lys Lys Ser Ser Ile Thr Thr Ser Thr Asn
740 745 750
Thr Ser Thr Ala Asp Val Asn Val Thr Asn Gly Asn Pro Ile Pro Ala
755 760 765
Asn Gly Ile Ile Pro Gln Ser Met Ala Val Val Asn Val Gly Arg Gly
770 775 780
Ser Tyr Arg Asn Ala Leu Thr Thr Ser Pro Met Thr Pro Pro Ser Ala
785 790 795 800
His Thr Ser Met Gln Lys Gln His His Leu Arg Lys Asp Asn Glu Cys
805 810 815
Gly Phe Asp Asn Asn Ser Ala Thr Ser Ser Thr Asp Leu Ser His His
820 825 830
Gln Pro Gln Leu Val Pro Pro Val Asn Arg Leu Gln Arg
835 840 845
<210> 3
<211> 98
<212> RNA
<213> 人工序列(Artificial sequence)
<400> 3
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 60
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 98
<210> 4
<211> 98
<212> RNA
<213> 人工序列(Artificial sequence)
<400> 4
aaaaaaaaaaaaaaaaaaaacaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaauaaaaaaa 60
aaaaaaaacaaaaaaaaaaaaaaaaaaaaaaagaaaaa 98
<210> 5
<211> 98
<212> RNA
<213> 人工序列(Artificial sequence)
<400> 5
aaaaaaaaaaaacaaaaaaaaaaaaaaaaaaaagaaaaaaaaaaaaaaaaaaaaaaaaac 60
ccaaaaaaaaaaaaaaaaaaaaaaaaaaaaggaaaaaa 98
<210> 6
<211> 98
<212> RNA
<213> 人工序列(Artificial sequence)
<400> 6
aaaaaaaaaaaaacaaaaaaaaaaaaaaaaaaaggaaaaaaaaaaaaaaaaaaaaaaaaa 60
aaaagaaaaaaaaaaaaaaaaaaaaaaaaaccaaaaaa 98
<210> 7
<211> 717
<212> RNA
<213> 人工序列(Artificial sequence)
<400> 7
auggugagcaagggcgaggagcuguucaccgggguggugcccauccuggucgagcuggac 60
ggcgacguaaacggccacaaguucagcguguccggcgagggcgagggcgaugccaccuac 120
ggcaagcugacccugaaguucaucugcaccaccggcaagcugcccgugcccuggcccacc 180
cucgugaccacccugaccuacggcgugcagugcuucagccgcuaccccgaccacaugaag 240
cagcacgacuucuucaaguccgccaugcccgaaggcuacguccaggagcgcaccaucuuc 300
uucaaggacgacggcaacuacaagacccgcgccgaggugaaguucgagggcgacacccug 360
gugaaccgcaucgagcugaagggcaucgacuucaaggaggacggcaacauccuggggcac 420
aagcuggaguacaacuacaacagccacaacgucuauaucauggccgacaagcagaagaac 480
ggcaucaaggugaacuucaagauccgccacaacaucgaggacggcagcgugcagcucgcc 540
gaccacuaccagcagaacacccccaucggcgacggccccgugcugcugcccgacaaccac 600
uaccugagcacccaguccgcccugagcaaagaccccaacgagaagcgcgaucacaugguc 660
cugcuggaguucgugaccgccgccgggaucacucucggcauggacgagcuguacaag 717
<210> 8
<211> 708
<212> RNA
<213> 人工序列(Artificial sequence)
<400> 8
auggugagcaagggcgaggaggauaacauggccaucaucaaggaguucaugcgcuucaag 60
gugcacauggagggcuccgugaacggccacgaguucgagaucgagggcgagggcgagggc 120
cgccccuacgagggcacccagaccgccaagcugaaggugaccaaggguggcccccugccc 180
uucgccugggacauccuguccccucaguucauguacggcuccaaggccuacgugaagcac 240
cccgccgacauccccgacuacuugaagcuguccuuccccgagggcuucaagugggagcgc 300
gugaugaacuucgaggacggcggcguggugaccgugacccaggacuccucccugcaggac 360
ggcgaguucaucuacaaggugaagcugcgcggcaccaacuuccccuccgacggccccgua 420
augcagaagaagaccaugggcugggaggccuccuccgagcggauguaccccgaggacggc 480
gcccugaagggcgagaucaagcagaggcugaagcugaaggacggcggccacuacgacgcu 540
gaggucaagaccaccuacaaggccaagaagcccgugcagcugcccggcgccuacaacguc 600
aacaucaaguuggacaucaccucccacaacgaggacuacaccaucguggaacaguacgaa 660
cgcgccgagggccgccacuccaccggcggcauggacgagcuguacaag 708
Claims (1)
1.通过在poly(A)尾巴中掺入四处非A修饰在提高哺乳动物中mRNA的翻译效率中的应用;所述非A修饰为poly(A)尾巴中含有G、C和/或U;
其中,所述非A修饰的poly(A)尾巴的序列选自SEQ ID NO: 4、5和6。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111002112 | 2021-08-30 | ||
CN2021110021127 | 2021-08-30 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115216483A CN115216483A (zh) | 2022-10-21 |
CN115216483B true CN115216483B (zh) | 2024-01-23 |
Family
ID=83606371
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210410067.7A Active CN115216483B (zh) | 2021-08-30 | 2022-04-19 | poly(A)尾巴非A修饰在促进mRNA翻译中的应用 |
Country Status (3)
Country | Link |
---|---|
CN (1) | CN115216483B (zh) |
AU (1) | AU2022335966A1 (zh) |
WO (1) | WO2023029600A1 (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6054296A (en) * | 1988-10-24 | 2000-04-25 | Symbicom Ab | 66 kDa antigen from Borrelia |
AU2011253592A1 (en) * | 2005-09-28 | 2011-12-15 | Biontech Ag | Modification of RNA, producing an increased transcript stability and translation efficiency |
CN108823225A (zh) * | 2018-05-31 | 2018-11-16 | 中国科学院理化技术研究所杭州研究院 | 蛋白质翻译过程中直接实现脂肪酸修饰的表达系统及应用 |
CN110218753A (zh) * | 2019-05-27 | 2019-09-10 | 湖北爱济莱斯生物科技有限公司 | 一种提高体外合成mRNA稳定性的方法 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE102005046490A1 (de) * | 2005-09-28 | 2007-03-29 | Johannes-Gutenberg-Universität Mainz | Modifikationen von RNA, die zu einer erhöhten Transkriptstabilität und Translationseffizienz führen |
US11926819B2 (en) * | 2019-05-28 | 2024-03-12 | The Regents Of The University Of California | Methods of adding polymers to ribonucleic acids |
-
2022
- 2022-04-19 CN CN202210410067.7A patent/CN115216483B/zh active Active
- 2022-05-19 WO PCT/CN2022/093735 patent/WO2023029600A1/zh active Application Filing
- 2022-05-19 AU AU2022335966A patent/AU2022335966A1/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6054296A (en) * | 1988-10-24 | 2000-04-25 | Symbicom Ab | 66 kDa antigen from Borrelia |
AU2011253592A1 (en) * | 2005-09-28 | 2011-12-15 | Biontech Ag | Modification of RNA, producing an increased transcript stability and translation efficiency |
CN108823225A (zh) * | 2018-05-31 | 2018-11-16 | 中国科学院理化技术研究所杭州研究院 | 蛋白质翻译过程中直接实现脂肪酸修饰的表达系统及应用 |
CN110218753A (zh) * | 2019-05-27 | 2019-09-10 | 湖北爱济莱斯生物科技有限公司 | 一种提高体外合成mRNA稳定性的方法 |
Non-Patent Citations (5)
Title |
---|
Jaechul Lim等.Mixed tailing by TENT4A and TENT4B shields mRNA from rapid deadenylation.Science.2018,第361卷摘要,图1-图2. * |
Poly(A) inclusive RNA isoform sequencing (PAIso−seq) reveals wide-spread non-adenosine residues within RNA poly(A) tails;Yusheng Liu等;《Nature Communications》;20191122;第10卷(第1期);第6页左栏最后1段-右栏第1段 * |
Poly(A) RNA polymerase gld-4 [Caenorhabditis elegans], NCBI Reference Sequence: NP_492446.3;Sulson,J.E.等;《Genbank数据库》;20210809;CDS、ORIGIN * |
Vladyslava Liudkovska等.Functions and mechanisms of RNA tailing by metazoan terminal nucleotidyltransferases.Wiley Interdisciplinary reviews RNA.2021,第12卷(第2期),第2页最后1段,第3页图1,第6页表1,第10页第2段,第12页第2-3段. * |
Željka Trepotec.Minimalistic 5’-UTRs and segmented poly(A) tails - a step towards increased potency of transcript therapies.Dissertation zum Erwerb des Doktorgrades der Naturwissenschaften an der Medizinischen Fakultät der Ludwig-Maximilians-Universität zu München.2019,摘要. * |
Also Published As
Publication number | Publication date |
---|---|
AU2022335966A1 (en) | 2024-04-11 |
WO2023029600A1 (zh) | 2023-03-09 |
CN115216483A (zh) | 2022-10-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Metodiev et al. | Methylation of 12S rRNA is necessary for in vivo stability of the small subunit of the mammalian mitochondrial ribosome | |
Kadyrova et al. | Translational control of maternal Cyclin B mRNA by Nanos in the Drosophila germline | |
Ladomery et al. | Xp54, the Xenopus homologue of human RNA helicase p54, is an integral component of stored mRNP particles in oocytes | |
Du et al. | DNA binding of centromere protein C (CENPC) is stabilized by single-stranded RNA | |
Xie et al. | Differential gene expression in fully-grown oocytes between gynogenetic and gonochoristic crucian carps | |
Zhu et al. | U1 snRNP-dependent function of TIAR in the regulation of alternative RNA processing of the human calcitonin/CGRP pre-mRNA | |
EP3487998B1 (en) | Compositions and methods for identifying rna binding polypeptide targets | |
Albani et al. | DcE2F, a functional plant E2F-like transcriptional activator from Daucus carota | |
US20190390205A1 (en) | Incorporation of internal polya-encoded poly-lysine sequence tags and their variations for the tunable control of protein synthesis in bacterial and eukaryotic cells | |
US20060105341A1 (en) | Trap-tagging: a novel method for the identification and purification of rna-protein complexes | |
Beckers et al. | Active genes in junk DNA? Characterization of DUX genes embedded within 3.3 kb repeated elements | |
US6410233B2 (en) | Isolation and identification of control sequences and genes modulated by transcription factors | |
CN115216483B (zh) | poly(A)尾巴非A修饰在促进mRNA翻译中的应用 | |
Alterman et al. | Regulated expression of a chimeric histone gene introduced into mouse fibroblasts | |
Matsuo et al. | Expression of Asn-d-Trp-Phe-NH2 in the brain of the terrestrial slug Limax valentianus | |
JPH10512447A (ja) | ナンセンス媒介によるmRNAの崩壊機能非存在下における異種ポリペプチドの産生 | |
Clayton | Mitochondrial DNA replication | |
Chiefari et al. | Methods to study protein-binding to pseudogene transcripts | |
Andoh et al. | Visual screening for localized RNAs in yeast revealed novel RNAs at the bud-tip | |
US11013221B2 (en) | Anti-aging transgenic Caenorhabditis elegans | |
CN106749560B (zh) | 脂滴或脂肪体通过mldsr蛋白参与的转录调控的发现及应用 | |
XU | cDNA cloning of a mouse factor that activates transcription from a metal response element of the mouse metallothionein-I gene in yeast | |
Wang et al. | Abnormal overexpression of SoxD enhances melanin synthesis in the Ursa mutant of Bombyx mori | |
US10160968B2 (en) | RNA tagging | |
Cooper et al. | The DUX gene family and FSHD |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |