CN115197327A - RNA modified chimeric protein and application thereof - Google Patents
RNA modified chimeric protein and application thereof Download PDFInfo
- Publication number
- CN115197327A CN115197327A CN202110379093.3A CN202110379093A CN115197327A CN 115197327 A CN115197327 A CN 115197327A CN 202110379093 A CN202110379093 A CN 202110379093A CN 115197327 A CN115197327 A CN 115197327A
- Authority
- CN
- China
- Prior art keywords
- rna
- subunit
- chimeric protein
- functional fragment
- methyltransferase
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 144
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 144
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 279
- 102000004190 Enzymes Human genes 0.000 claims abstract description 154
- 108090000790 Enzymes Proteins 0.000 claims abstract description 154
- 239000012634 fragment Substances 0.000 claims abstract description 107
- 101800001779 2'-O-methyltransferase Proteins 0.000 claims abstract description 51
- 230000000694 effects Effects 0.000 claims abstract description 38
- 108020005161 RNA Caps Proteins 0.000 claims abstract description 32
- 210000004027 cell Anatomy 0.000 claims description 60
- 238000000034 method Methods 0.000 claims description 55
- 108090000623 proteins and genes Proteins 0.000 claims description 54
- 102000004169 proteins and genes Human genes 0.000 claims description 46
- 108060004795 Methyltransferase Proteins 0.000 claims description 42
- 102000016397 Methyltransferase Human genes 0.000 claims description 42
- 230000014509 gene expression Effects 0.000 claims description 42
- 150000007523 nucleic acids Chemical class 0.000 claims description 37
- 150000001413 amino acids Chemical class 0.000 claims description 33
- 108091026890 Coding region Proteins 0.000 claims description 32
- 102000039446 nucleic acids Human genes 0.000 claims description 30
- 108020004707 nucleic acids Proteins 0.000 claims description 30
- 102000004357 Transferases Human genes 0.000 claims description 29
- 108090000992 Transferases Proteins 0.000 claims description 29
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 claims description 28
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 claims description 28
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Natural products C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 claims description 28
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 28
- 229940029575 guanosine Drugs 0.000 claims description 28
- 241000700618 Vaccinia virus Species 0.000 claims description 16
- 230000035772 mutation Effects 0.000 claims description 16
- 108010074725 Alpha,alpha-trehalose phosphorylase Proteins 0.000 claims description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 12
- 239000013604 expression vector Substances 0.000 claims description 11
- 239000000872 buffer Substances 0.000 claims description 6
- 238000011534 incubation Methods 0.000 claims description 6
- 238000002156 mixing Methods 0.000 claims description 6
- 230000000875 corresponding effect Effects 0.000 claims description 5
- 230000003248 secreting effect Effects 0.000 claims description 4
- 102220470192 Aryl hydrocarbon receptor repressor_K62A_mutation Human genes 0.000 claims description 3
- 102220610330 Musculin_N42A_mutation Human genes 0.000 claims description 3
- 102220482066 Nuclear cap-binding protein subunit 2_Y43A_mutation Human genes 0.000 claims description 3
- 102220551679 Putative Dresden prostate carcinoma protein 2_L61A_mutation Human genes 0.000 claims description 3
- 210000004899 c-terminal region Anatomy 0.000 claims description 3
- 239000013599 cloning vector Substances 0.000 claims description 3
- 230000000295 complement effect Effects 0.000 claims description 3
- 239000003795 chemical substances by application Substances 0.000 claims description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 2
- 239000004973 liquid crystal related substance Substances 0.000 claims 4
- 239000002184 metal Substances 0.000 claims 3
- 229910052751 metal Inorganic materials 0.000 claims 3
- 108020004999 messenger RNA Proteins 0.000 description 46
- 235000018102 proteins Nutrition 0.000 description 42
- 235000001014 amino acid Nutrition 0.000 description 26
- 229940024606 amino acid Drugs 0.000 description 26
- 101150077194 CAP1 gene Proteins 0.000 description 25
- 102000040430 polynucleotide Human genes 0.000 description 24
- 108091033319 polynucleotide Proteins 0.000 description 24
- 239000002157 polynucleotide Substances 0.000 description 24
- 108090000765 processed proteins & peptides Proteins 0.000 description 24
- 238000013518 transcription Methods 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 23
- 101100438378 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) fac-1 gene Proteins 0.000 description 23
- 229920001184 polypeptide Polymers 0.000 description 23
- 102000004196 processed proteins & peptides Human genes 0.000 description 23
- 108010027510 vaccinia virus capping enzyme Proteins 0.000 description 22
- 230000035897 transcription Effects 0.000 description 21
- 238000006243 chemical reaction Methods 0.000 description 19
- 239000013598 vector Substances 0.000 description 19
- 239000000758 substrate Substances 0.000 description 17
- 108010001267 Protein Subunits Proteins 0.000 description 15
- 102000002067 Protein Subunits Human genes 0.000 description 15
- 230000015572 biosynthetic process Effects 0.000 description 14
- 238000003786 synthesis reaction Methods 0.000 description 12
- 230000004048 modification Effects 0.000 description 11
- 238000012986 modification Methods 0.000 description 11
- 238000004519 manufacturing process Methods 0.000 description 10
- 239000001226 triphosphate Substances 0.000 description 10
- 235000011178 triphosphate Nutrition 0.000 description 10
- 238000000338 in vitro Methods 0.000 description 9
- 239000002773 nucleotide Substances 0.000 description 9
- 125000003729 nucleotide group Chemical group 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 9
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 8
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 8
- 230000003197 catalytic effect Effects 0.000 description 8
- 230000002255 enzymatic effect Effects 0.000 description 8
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 8
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- MEFKEPWMEQBLKI-AIRLBKTGSA-N S-adenosyl-L-methioninate Chemical compound O[C@@H]1[C@H](O)[C@@H](C[S+](CC[C@H](N)C([O-])=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 MEFKEPWMEQBLKI-AIRLBKTGSA-N 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 239000011324 bead Substances 0.000 description 6
- 108020001778 catalytic domains Proteins 0.000 description 6
- 230000009144 enzymatic modification Effects 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- 230000006819 RNA synthesis Effects 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 230000008827 biological function Effects 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000000977 initiatory effect Effects 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 3
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 3
- 102000009617 Inorganic Pyrophosphatase Human genes 0.000 description 3
- 108010009595 Inorganic Pyrophosphatase Proteins 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- 108700040121 Protein Methyltransferases Proteins 0.000 description 3
- 102000055027 Protein Methyltransferases Human genes 0.000 description 3
- 108700008625 Reporter Genes Proteins 0.000 description 3
- 102000006382 Ribonucleases Human genes 0.000 description 3
- 108010083644 Ribonucleases Proteins 0.000 description 3
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 3
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 3
- 239000004473 Threonine Substances 0.000 description 3
- 206010046865 Vaccinia virus infection Diseases 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 238000006911 enzymatic reaction Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000002349 favourable effect Effects 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000009465 prokaryotic expression Effects 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 208000007089 vaccinia Diseases 0.000 description 3
- 239000013603 viral vector Substances 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 2
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 2
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- 101100245221 Mus musculus Prss8 gene Proteins 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 2
- 230000026279 RNA modification Effects 0.000 description 2
- 108010012974 RNA triphosphatase Proteins 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 101710137500 T7 RNA polymerase Proteins 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 2
- 235000013928 guanylic acid Nutrition 0.000 description 2
- 239000000833 heterodimer Substances 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- -1 i.e. Proteins 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 238000004255 ion exchange chromatography Methods 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 239000000693 micelle Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 239000003161 ribonuclease inhibitor Substances 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 210000004895 subcellular structure Anatomy 0.000 description 2
- 230000002195 synergetic effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- ZQJHYRVSKHGGJY-YPKJBDGSSA-N (2s,3r)-2-[[(2s)-2-[[(2s)-1-[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]-3-phenylpropanoyl]amino]-3-hydroxybutanoic acid Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 ZQJHYRVSKHGGJY-YPKJBDGSSA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- ALOVURZCXKYKJC-NAKRPEOUSA-N Arg-Asp-Gln-Ser Chemical compound N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O ALOVURZCXKYKJC-NAKRPEOUSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- PSZDKHGMNWAHGO-UHFFFAOYSA-N Asn Asn Pro Val Chemical compound CC(C)C(C(O)=O)NC(=O)C1CCCN1C(=O)C(CC(N)=O)NC(=O)C(N)CC(N)=O PSZDKHGMNWAHGO-UHFFFAOYSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- LBFYTUPYYZENIR-GHCJXIJMSA-N Asp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N LBFYTUPYYZENIR-GHCJXIJMSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- UEFODXNXUAVPTC-VEVYYDQMSA-N Asp-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UEFODXNXUAVPTC-VEVYYDQMSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- LLRJPYJQNBMOOO-QEJZJMRPSA-N Asp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N LLRJPYJQNBMOOO-QEJZJMRPSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 101710158575 Cap-specific mRNA (nucleoside-2'-O-)-methyltransferase Proteins 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 1
- MTHDIEPOBSRDIV-ULQDDVLXSA-N His-Met-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MTHDIEPOBSRDIV-ULQDDVLXSA-N 0.000 description 1
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- PUFNQIPSRXVLQJ-IHRRRGAJSA-N His-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N PUFNQIPSRXVLQJ-IHRRRGAJSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 description 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 239000012124 Opti-MEM Substances 0.000 description 1
- 206010034016 Paronychia Diseases 0.000 description 1
- 102000005877 Peptide Initiation Factors Human genes 0.000 description 1
- 108010044843 Peptide Initiation Factors Proteins 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- 108010010677 Phosphodiesterase I Proteins 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010065868 RNA polymerase SP6 Proteins 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 108091028733 RNTP Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- VGNKUXWYFFDWDH-BEMMVCDISA-N Thr-Trp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N)O VGNKUXWYFFDWDH-BEMMVCDISA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 101000956368 Trittame loki CRISP/Allergen/PR-1 Proteins 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- XBWKCYFGRXKWGO-SRVKXCTJSA-N Tyr-Cys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XBWKCYFGRXKWGO-SRVKXCTJSA-N 0.000 description 1
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 1
- BODHJXJNRVRKFA-BZSNNMDCSA-N Tyr-Cys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BODHJXJNRVRKFA-BZSNNMDCSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010068991 arginyl-threonyl-prolyl-prolyl-prolyl-seryl-glycine Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 108010028263 bacteriophage T3 RNA polymerase Proteins 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000013622 capto Q Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000002659 cell therapy Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 238000001246 colloidal dispersion Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 108010081447 cytochrophin-4 Proteins 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 150000002333 glycines Chemical class 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- 239000004226 guanylic acid Substances 0.000 description 1
- 108010064833 guanylyltransferase Proteins 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000015788 innate immune response Effects 0.000 description 1
- 210000005007 innate immune system Anatomy 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 229960004452 methionine Drugs 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 230000001035 methylating effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000002088 nanocapsule Substances 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 238000000053 physical method Methods 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 108010094020 polyglycine Proteins 0.000 description 1
- 229920000232 polyglycine polymer Polymers 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 230000003381 solubilizing effect Effects 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1003—Transferases (2.) transferring one-carbon groups (2.1)
- C12N9/1007—Methyltransferases (general) (2.1.1.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1085—Transferases (2.) transferring alkyl or aryl groups other than methyl groups (2.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/14—Preparation of compounds containing saccharide radicals produced by the action of a carbohydrase (EC 3.2.x), e.g. by alpha-amylase, e.g. by cellulase, hemicellulase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/34—Polynucleotides, e.g. nucleic acids, oligoribonucleotides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y201/00—Transferases transferring one-carbon groups (2.1)
- C12Y201/01—Methyltransferases (2.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y306/00—Hydrolases acting on acid anhydrides (3.6)
- C12Y306/01—Hydrolases acting on acid anhydrides (3.6) in phosphorus-containing anhydrides (3.6.1)
- C12Y306/01025—Triphosphatase (3.6.1.25)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Abstract
The present invention provides a chimeric protein subunit comprising, linked to each other, (a) a D12 subunit of RNA capping enzyme or a functional fragment thereof, or a variant having at least 90% sequence identity thereto and having D12 subunit activity, and (b) an RNA cap structure 2 '-O-methyltransferase or a functional fragment thereof, or a variant having at least 90% sequence identity thereto and having RNA cap structure 2' -O-methyltransferase activity. The invention also provides chimeric proteins comprising the chimeric protein subunit and an RNA capping enzyme D1 subunit or a functional fragment thereof.
Description
Technical Field
The invention relates to the field of RNA synthesis, in particular to a chimeric protein for modifying RNA and application thereof.
Background
In Vitro Transcription (IVT) of synthetic mRNA has a significant advantage in biosafety over DNA-based therapeutics due to the difficulty of integrating RNA into the genome, the short time window for RNA to express translated proteins, and the better control of the initiation time and amount of expression of the protein of interest without resorting to intracellular transcription steps.
Therapeutic mRNAs mimic primarily the natural structure of eukaryotic mRNAs and are characterized primarily by the presence of a cap at the 5-terminus, which consists of the first transcribed nucleotide of the RNA linked to inverted 7-methylguanosine (m 7G) and linked to the first nucleotide by a triphosphate bridge to form a structure called cap0 (cap 0, shown in FIG. 10, C). Cap0 is sufficient to recruit translation initiation factors and prevent mRNA degradation. Then, 2' -hydroxymethyl of the first cap proximal nucleotide forms cap1 (cap 1, shown in FIG. 10, C). The Cap0 structure has the main functions of: modulation of splicing; outputting a core; stabilizing the mRNA structure by protecting the mRNA from 5' -exonuclease; assisting innate immunity in recognizing "self-RNA" serves as an anchor for the recruitment of promoters, initiating protein synthesis and 5 'to 3' circularization of mRNA during translation. The cap1 structure is closely related to the recognition of the innate immune system, and the host immune protein can recognize abnormal non-capped RNA.
Because IVT does not directly produce eukaryotic capped mRNA with biological functions, but only 5' -triphosphorylated RNA, the RNA produced by it requires different strategies to effect capping of the RNA. At present, two production strategies for preparing mRNA with a cap mainly exist, one is a cap analogue co-transcription capping mode, and the other is a post-transcription capping enzyme modified RNA capping mode.
In the cap analog co-transcription capping mode, where the cap analog is added directly to the IVT, an RNA polymerase with relaxed substrate specificity (e.g., T3, T7 or SP6 RNA polymerase) can incorporate the cap analog at the 5 'end of the RNA at the start of transcription, directly producing the corresponding 5' cap mRNA. Limitations of co-transcriptional capping include: competition for GTP with cap analogs results in only partial capping of mRNA obtained from IVT, and capped/uncapped mRNA is often difficult to isolate and ratioed. Another problem with m7GpppG as a co-transcribed cap analog is that it results in RNA extension in the "wrong" direction, i.e., at the 3' -OH of m7G, resulting in a misdirected mRNA cap.
Post-transcriptional capping enzyme-modified RNA capping means that the RNA product generated by IVT is subjected to enzymatic capping reaction with a specific capping enzyme and/or mRNA modifying enzyme in a suitable reaction system. The enzymatic reaction of cap0 comprises three successive reaction steps: first, 5' -triphosphatase (TPase) hydrolyzes the gamma-phosphate of RNA; next, guanosine transferase (GTase) couples the resulting 5' -diphosphate terminated β -phosphate with GMP using GTP as a substrate to form 5' -5' -linked Gppp-RNA; finally, RNA (guanine-N7) methyltransferases (N7-MTases) catalyze the methylation of the cap structure at the N7 position using S-adenosyl-L-methionine (AdoMet) as a substrate. Finally, m7G cap-specific 2' -O-methyltransferase (2-O-MTase) modifies the ribose on the first nucleotide to generate the cap1 structure. As shown in figure 10, a.
Although many capping enzymes and mRNA modifying enzymes derived from eukaryotes or viruses have been reported in the literature as having a cap modifying function, there are few enzymes that can be applied to mass production of cap-structured mRNA. There remains a need in the art for capping enzymes that combine the properties of high efficiency and low cost.
Disclosure of Invention
The present invention aims to provide a protein or protein subunit for capping modification of RNA and a method for modifying RNA using the same. The technical scheme of the invention simplifies the production process of the mRNA production related modified enzyme and reduces the mRNA production added raw materials.
In a first aspect, the invention provides a chimeric protein subunit comprising, linked to each other, (a) the D12 subunit of an RNA capping enzyme or a functional fragment thereof, or a variant having at least 90% sequence identity thereto and having D12 subunit activity, and (b) an RNA cap structure 2 '-O-methyltransferase or a functional fragment thereof, or a variant having at least 90% sequence identity thereto and having RNA cap structure 2' -O-methyltransferase activity, and optionally (c) a linker between (a) and (b).
In one or more embodiments, the RNA capping enzyme is a vaccinia virus RNA capping enzyme.
In one or more embodiments, the RNA cap structure 2 '-O-methyltransferase is a vaccinia virus 2' -O-methyltransferase.
In one or more embodiments, the carboxy terminus of (a) is linked to the amino terminus of (b).
In one or more embodiments, (b) further has a His-tag and/or a MBP-tag at its N-or C-terminus. In one or more embodiments, the His tag is set forth in SEQ ID NO 8. In one or more embodiments, the MBP tag is set forth in SEQ ID NO 9.
In one or more embodiments, the amino acid sequence of the D12 subunit of the RNA capping enzyme is shown in SEQ ID NO 1, items 1-287.
In one or more embodiments, the variant of the D12 subunit of the RNA capping enzyme, or a functional fragment thereof, has a mutation selected from one or more of: N42A, Y43A, L61A, K62A, F245A, L246A, K111A, R112A, N120A, N121A, N126A, N127A, F141A, R142A, K223A, D224A, H260A, S261A, E275A, N276A, R280A, R281A.
In one or more embodiments, the amino acid sequence of the RNA cap structure 2' -O-methyltransferase is shown in SEQ ID NO. 1, items 303-635.
In one or more embodiments, the variant of the RNA cap structure 2' -O-methyltransferase or a functional fragment thereof has the following characteristics:
(1) A mutation selected from one or more of: K41D, C178S, a201R, a201K, C272S; and/or
(2) Wherein one or more amino acids selected from R, K, H, Y, C, D or E are mutated to A.
In one or more embodiments, a variant of the RNA cap structure 2' -O-methyltransferase or a functional fragment thereof is set forth in SEQ ID No. 1, items 303-635, and has a mutation selected from one or more of: K41D, C178S, A201R, A201K, C272S, one or more amino acids selected from R, K, H, Y, C, D, E are mutated to A.
In one or more embodiments, the amino acid sequence of the linker is as set forth in amino acids 288-302 of SEQ ID NO 1.
In one or more embodiments, the amino acid sequence of a subunit of the chimeric protein is set forth in SEQ ID NO 1.
In a second aspect, the present invention provides a chimeric protein comprising: (1) A chimeric protein subunit according to any one of the embodiments of the first aspect herein, and (2) an RNA capping enzyme D1 subunit or a functional fragment thereof, or a variant thereof having at least 90% sequence identity thereto and corresponding activity.
In one or more embodiments, the protein is a heterodimeric protein of (1) and (2).
In one or more embodiments, the RNA capping enzyme is a vaccinia virus RNA capping enzyme.
In one or more embodiments, the functional fragment of a D1 subunit of an RNA capping enzyme comprises:
(1) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof;
(2) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof and a5' -triphosphatase (TPase) domain or a functional fragment thereof;
(3) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof and a guanosine transferase (GTase) domain or a functional fragment thereof; or
(4) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof, a5' -triphosphatase (TPase) domain or a functional fragment thereof, and a guanosine transferase (GTase) domain or a functional fragment thereof.
In one or more embodiments, the RNA capping enzyme D1 subunit is set forth in SEQ ID NO 3.
In one or more embodiments, a functional fragment of the D1 subunit of the RNA capping enzyme comprises amino acids 498-844 or 540-844 of SEQ ID NO 3.
In one or more embodiments, the 5' -triphosphatase (TPase) domain is represented by amino acids 1-225 of SEQ ID NO 3.
In one or more embodiments, the guanosine transferase (GTase) domain is shown as amino acids 226-530 of SEQ ID NO: 3.
In one or more embodiments, the N7 methyltransferase (N7-MTase) domain is depicted as amino acids 531-844 of SEQ ID NO: 3.
In one or more embodiments, the variant of the N7-methyltransferase domain, or functional fragment thereof, has a mutation selected from one or more of: D545A, R548A, N550D, Y555F, R560K, R794A, R808A, Y683S, Y684A, Y684F, D598A, G600A, G602A, I681A, S684A, F685A, T571A, L575A, L576A, M579A, F585A, L586A, D587A, D784A, N785A, R794A, F798A, M805A, E806A.
The invention also provides a nucleic acid molecule comprising a sequence selected from:
(1) A coding sequence for a subunit of a chimeric protein as described in the first aspect herein,
(2) A variant having at least 80% sequence identity to (1),
(3) The complementary sequence of (1) or (2).
In one or more embodiments, the nucleic acid molecule has the sequence shown in SEQ ID No. 2, or a variant thereof having at least 80% sequence identity thereto, or a degenerate variant thereof encoding the same amino acid sequence.
The invention also provides a nucleic acid molecule comprising a sequence selected from
(1) The coding sequence for a subunit of a chimeric protein as described in the first aspect herein, and the coding sequence for a subunit of RNA capping enzyme D1, or a functional fragment or variant thereof,
(2) A variant having at least 80% sequence identity to (1),
(3) The complementary sequence of (1) or (2).
In one or more embodiments, the RNA capping enzyme is a vaccinia virus RNA capping enzyme.
In one or more embodiments, the functional fragment of the D1 subunit of the RNA capping enzyme comprises:
(1) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof;
(2) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof and a5' -triphosphatase (TPase) domain or a functional fragment thereof;
(3) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof and a guanosine transferase (GTase) domain or a functional fragment thereof; or
(4) An N7 methyltransferase (N7-MTase) domain or a functional fragment thereof, a5' -triphosphatase (TPase) domain or a functional fragment thereof, and a guanosine transferase (GTase) domain or a functional fragment thereof.
In one or more embodiments, the RNA capping enzyme D1 subunit is set forth in SEQ ID NO 3.
In one or more embodiments, a functional fragment of the D1 subunit of the RNA capping enzyme comprises amino acids 498-844 or 540-844 of SEQ ID NO 3.
In one or more embodiments, the 5' -triphosphatase (TPase) domain is shown as amino acids 1-225 of SEQ ID NO 3.
In one or more embodiments, the guanosine transferase (GTase) domain is as set forth in SEQ ID NO 3 amino acids 226-530.
In one or more embodiments, the N7 methyltransferase (N7-MTase) domain is set forth in amino acids 531-844 of SEQ ID NO: 3.
In one or more embodiments, the coding sequence for the D1 subunit of an RNA capping enzyme, or a functional fragment thereof, is as set forth in SEQ ID NO. 4, or a variant having at least 80% sequence identity thereto, or a degenerate variant encoding the same amino acid sequence therewith.
In one or more embodiments, the coding sequences for the subunits of the chimeric protein are set forth in SEQ ID NO 2.
In another aspect, the invention provides a nucleic acid construct comprising:
(1) Expressing a subunit of the chimeric protein of the first aspect, or the chimeric protein of the second aspect,
(2) Comprising a nucleic acid molecule as described herein.
In one or more embodiments, the nucleic acid construct comprises an expression cassette for the chimeric protein subunit and an expression cassette for a D1 subunit of RNA capping enzyme, or a functional fragment thereof; or the nucleic acid construct is an expression cassette, wherein the coding sequence for the chimeric protein subunit and the coding sequence for the RNA capping enzyme D1 subunit or a functional fragment thereof are in the expression cassette.
In one or more embodiments, the nucleic acid construct is a cloning vector or an expression vector.
In another aspect, the invention provides a host cell comprising, expressing and/or secreting a chimeric protein subunit or a chimeric protein as described herein.
In one or more embodiments, the host cell comprises a nucleic acid molecule, nucleic acid construct, as described herein.
The present invention also provides a method of modifying a target RNA to a capped RNA or a method of preparing a capped target RNA, comprising: contacting a target RNA with a chimeric protein described herein, or contacting a target RNA with a cell expressing a chimeric protein described herein, under conditions that allow the RNA to be catalytically capped.
In one or more embodiments, the conditions that allow RNA capping are incubation at 37 ℃ for at least 20 minutes.
In one or more embodiments, contacting the RNA of interest with the chimeric proteins described herein comprises expressing the RNA of interest in cells expressing the chimeric proteins described herein, or mixing the RNA of interest with the chimeric proteins described herein.
In one or more embodiments, the method comprises the step of introducing a DNA sequence expressing a target RNA into the host cell.
In one or more embodiments, the method comprises the steps of:
optionally (1) denaturing the target RNA, e.g.by incubation at 50-70 ℃ for 1-60 minutes, preferably at 65 ℃ for 5-20 minutes,
optionally (2) renaturing the target RNA of the product of (1), e.g., incubating at 0 ℃ for 2-10 minutes
(3) Incubating the RNA with the chimeric protein described herein for at least 20 minutes, e.g., 30-90 minutes, at 37 deg.C to obtain capped RNA,
optionally (4) purifying the capped RNA.
In one or more embodiments, the incubated mixture further comprises one or more agents selected from the group consisting of: GTP, SAM and buffer solution.
In one or more embodiments, the target RNA is selected from the group consisting of 5' -triphosphorylated RNA, 5' -diphosphonated RNA, and RNA having a5' -Gppp cap structure.
In one or more embodiments, the method modifies a target RNA selected from the group consisting of 5' -m7 Gppp-structured capped RNA, 5' -diphosphonated RNA, and RNA having a5' -Gppp cap structure into a capped RNA having a5' -m7Gppp structure, and the chimeric protein comprises an N7 methyltransferase domain of an RNA capping enzyme D1 subunit or a functional fragment thereof, a5' -triphosphatase domain or a functional fragment thereof, a guanosine transferase domain or a functional fragment thereof, and the chimeric protein subunit. Preferably, the chimeric protein comprises an RNA capping enzyme D1 subunit shown as SEQ ID NO. 3 and a chimeric protein subunit shown as SEQ ID NO. 1.
In one or more embodiments, the methods modify a target RNA selected from the group consisting of 5' -m7 gppnmp-structured capped RNA, 5' -diphosphonated RNA, and RNA having a5' -Gppp cap structure into a capped RNA having a5' -m7 gppnmp structure, the chimeric protein comprising an N7 methyltransferase (N7-MTase) domain of a D1 subunit of RNA capping enzyme or a functional fragment thereof, a5' -triphosphatase (TPase) domain or a functional fragment thereof, and a guanosine transferase (GTase) domain or a functional fragment thereof, and the chimeric protein subunit. Preferably, the chimeric protein comprises an RNA capping enzyme D1 subunit shown as SEQ ID NO. 3 and a chimeric protein subunit shown as SEQ ID NO. 1.
In one or more embodiments, the method is a method of capping a target RNA selected from the group consisting of 5 '-diphosphonated RNA and RNA having a5' -Gppp structure, the chimeric protein comprises an N7 methyltransferase (N7-MTase) domain of the D1 subunit of RNA capping enzyme or a functional fragment thereof and a guanosine transferase (GTase) domain or a functional fragment thereof, and the chimeric protein subunits. Preferably, the subunit of the chimeric protein is shown in SEQ ID NO 1.
In one or more embodiments, the method is a method of capping a target RNA that is an RNA having a5' -Gppp structure, the chimeric protein comprises an N7 methyltransferase (N7-MTase) domain of an RNA capping enzyme D1 subunit, or a functional fragment thereof, and the chimeric protein subunit. Preferably, the subunit of the chimeric protein is shown in SEQ ID NO 1.
In one or more embodiments, the 5' -triphosphatase (TPase) domain is represented by amino acids 1-225 of SEQ ID NO 3.
In one or more embodiments, the guanosine transferase (GTase) domain is as set forth in SEQ ID NO 3 amino acids 226-530.
In one or more embodiments, the N7 methyltransferase (N7-MTase) domain is depicted as amino acids 531-844 of SEQ ID NO: 3.
The invention also provides a method for methylating RNA with a5' -m7Gppp structure, which comprises the following steps: contacting the RNA with a chimeric protein subunit described herein, or contacting the RNA with a cell expressing a chimeric protein subunit described herein.
In one or more embodiments, contacting the RNA with a chimeric protein subunit described herein comprises expressing the RNA in a cell described herein that expresses the chimeric protein subunit described herein.
In a third aspect, the invention provides a fusion protein comprising
(a) An RNA cap structure 2 '-O-methyltransferase or a functional fragment thereof or a variant having at least 90% sequence identity thereto and having RNA cap structure 2' -O-methyltransferase activity, and
(b) His tag and/or MBP tag at (a) N-or C-terminus.
In one or more embodiments, there is a linker between (a) and (b).
In one or more embodiments, the RNA cap structure 2 '-O-methyltransferase is a vaccinia virus 2' -O-methyltransferase.
In one or more embodiments, the amino acid sequence of the RNA cap structure 2' -O-methyltransferase is shown in SEQ ID NO. 1, items 303-635.
In one or more embodiments, the variant of the RNA cap structure 2' -O-methyltransferase or a functional fragment thereof has the following characteristics:
(1) A mutation selected from one or more of: K41D, C178S, a201R, a201K, C272S; and/or
(2) Wherein one or more amino acids selected from R, K, H, Y, C, D or E are mutated to A.
In one or more embodiments, the variant of the RNA cap structure 2' -O-methyltransferase, or a functional fragment thereof, is set forth in SEQ ID No. 1, items 303-635, and has a mutation selected from one or more of: K41D, C178S, A201R, A201K, C272S, one or more amino acids selected from R, K, H, Y, C, D, E are mutated to A.
In one or more embodiments, the His tag is set forth in SEQ ID NO 8.
In one or more embodiments, the MBP tag is set forth in SEQ ID NO 9.
The invention also provides a nucleic acid sequence encoding a fusion protein as described in the third aspect herein, a nucleic acid construct or a host cell comprising said nucleic acid sequence.
Drawings
FIG. 1 shows vector diagrams of 2-O-MTase, MBP- (2-O-MTase), vaccinia capping enzyme D1: D12, and chimerical enzyme D1: D12- ((2-O-MTase)).
FIG. 2 shows SDS-PAGE identification of cell lysates in which MBP and/or His tag was added to the N-terminus of 2-O-methyltransferase.
FIG. 3 is a SDS-PAGE identification of cell lysates of vaccinia virus capping enzyme and chimeric enzyme.
FIG. 4, SDS-PAGE, a map identifying wild-type vaccinia virus capping enzyme, 2-O-methyltransferase, and chimeric enzyme proteins.
FIG. 5, HPLC-MS analysis of vaccinia virus capping enzyme VVCE modified RNA triphosphate activity of 30 units.
FIG. 6, HPLC-MS analysis of the activity of 30 units of 2-O-methyltransferase-modified cap0 RNA.
FIG. 7 HPLC-MS analysis of vaccinia virus capping enzyme and 2-O-methyltransferase synergistically modify RNA triphosphate activity by 30 units.
FIG. 8, HPLC-MS analysis of chimeric enzyme-modified RNA triphosphate 30 units activity.
FIG. 9 shows a comparison of the biological functions of cells modified with wild-type and chimeric enzymes, respectively, to prepare eGFP mRNA. Wherein, 1, the vaccinia virus capping enzyme and the vaccinia virus 2-O-methyltransferase cooperate with enzyme modification to prepare eGFP mRNA;2, preparing eGFP mRNA by modifying chimeric enzyme with single enzyme; 3, capless eGFP mRNA unmodified control group.
FIG. 10, A, schematic representation of the RNA capping modification enzymatic reaction; b, schematic diagram of the exemplary embodiment of the invention; c, cap0 and cap1 are schematic structural diagrams.
Detailed Description
The inventors propose a novel RNA-modified chimeric enzyme having both the activities of a capping enzyme RNA triphosphatase, guanosine transferase, guanine methyltransferase and the methyltransferase of an mRNA cap structure 2' -O-methyltransferase, and realizing a simplified process when applied to mRNA production.
In the process of researching prokaryotic expression of mRNA cap structure 2 '-O-methyltransferase, the inventor finds that when the natural protein of the 2' -O-methyltransferase is expressed independently, the expression is unstable, and the target protein can not be obtained or can be obtained only in a relatively small amount in the fermentation process. If the MBP solubilizing promotion label is fused at the N end of the 2 '-O-methyltransferase, the 2' -O-methyltransferase fusion protein can be stably expressed in high yield in the fermentation process. In addition, the inventor also finds that when the vaccinia virus capping enzyme is expressed by using a double-promoter vector, the expression amounts of the D1 protein subunit and the D12 protein subunit cannot be evenly expressed to reach 1:1, which is detrimental to the formation of more D1: D12 complexes, wherein the expression level of the D12 protein subunit is much greater than that of the D1 protein subunit, which greatly reduces the expression yield of the complete capping enzyme D1: D12 complex structure.
In view of the above findings, applicants have linked 2 '-O-methyltransferases to the C-terminus of the D12 protein subunit (FIG. 10, B), in order to balance the reduction in the expression level of D12, and in order to stabilize the 2' -O-methyltransferase. Surprisingly, the chimeric enzymes designed above completely retained the native activities of 5' -triphosphatase, guanosine transferase, N7 methyltransferase, 2-O-methyltransferase.
Accordingly, in a first aspect the present invention provides a chimeric protein subunit comprising, linked to each other, (a) the D12 subunit of RNA capping enzyme or a functional fragment thereof, or a variant thereof having at least 90% sequence identity thereto and having D12 subunit activity, and (b) an RNA cap structure 2 '-O-methyltransferase or a functional fragment thereof, or a variant thereof having at least 90% sequence identity thereto and having RNA cap structure 2' -O-methyltransferase activity, and optionally (c) a linker between (a) and (b). The invention also provides chimeric proteins comprising the chimeric protein subunit and an RNA capping enzyme D1 subunit or a functional fragment thereof. The chimeric protein is a heterodimer.
"chimeric enzyme" refers to a non-natural enzyme that does not occur in nature, and a chimeric enzyme may comprise catalytic domains derived from different sources (e.g., from different enzymes), or catalytic domains derived from the same source (e.g., from the same enzyme) but arranged in a manner different than that found in nature. Chimeric enzymes can be proteins in which one (i.e., a single subunit) or more (i.e., multiple subunits) catalytic domains or proteins are linked, covalently or non-covalently. "catalytic domain" refers to a protein domain that is necessary and sufficient (especially in terms of its three-dimensional structure) to ensure enzyme function. "oligomeric enzyme" refers to a multi-subunit enzyme consisting of at least two polypeptide chains linked together, either covalently or non-covalently. "Oligomers" include homo-and hetero-oligomerases, a multi-subunit enzyme consisting of only one type of monomer (subunit), and hetero-oligomerases consisting of different types of monomer (subunit), e.g., heterodimers.
In some embodiments, the RNA capping enzyme described herein is vaccinia virus RNA capping enzyme (VVCE). Vaccinia virus capping enzyme is a heterodimer of two viral proteins, D1 (844 aa) and D12 (287 aa) (D1: D12).
The chimeric proteins herein may comprise an RNA capping enzyme D1 subunit or variant thereof. The D1 subunit has three catalytic domains and therefore contains three biological enzyme activities, and all three steps can be performed in m7 gpppprna synthesis. The three catalytic domains are bound together in the 97kDa D1 protein, with the catalytic domains of RNA5' -triphosphatase (TPase) and guanosine transferase (GTase) being located in the N-terminal portion, and the catalytic domain of N7 methyltransferase (N7-MTase) being located in the C-terminal portion of the D1 protein. Illustratively, the D1 subunit of vaccinia virus RNA capping enzyme is shown in SEQ ID NO. 3, the 5' -triphosphatase domain is shown in amino acids 1-225 of SEQ ID NO. 3, the guanylyl transferase domain is shown in amino acids 226-530 of SEQ ID NO. 3, and the N7 methyltransferase domain is shown in amino acids 531-844 of SEQ ID NO. 3.
The chimeric proteins herein may also include truncated forms of the D1 subunit of RNA capping enzyme or variants thereof, preferably comprising the N7-MTase domain. Truncated vaccinia capping enzymes comprising an N7-MTase domain and having Biological enzymatic activity are known in the art, for example, as described by Shuman Z S et al (Shuman Z S, RNA,2008, higman M A et al, journal of Biological Chemistry, 1994), including but not limited to the 498-844 amino acid fragment of the D1 protein or the 540-844 amino acid fragment of the D1 protein. Accordingly, the chimeric proteins herein also include functional fragments of the D1 subunit of RNA capping enzyme comprising: (1) an N7 methyltransferase domain or a functional fragment thereof; (2) An N7 methyltransferase domain or a functional fragment thereof and a5' -triphosphatase domain or a functional fragment thereof; (3) An N7 methyltransferase domain or functional fragment thereof and a guanosine transferase domain or functional fragment thereof; or (4) an N7 methyltransferase domain or a functional fragment thereof, a5' -triphosphatase domain or a functional fragment thereof, and a guanosine transferase domain or a functional fragment thereof. In one or more embodiments, a functional fragment of the D1 subunit of the RNA capping enzyme comprises amino acids 498-844 or 540-844 of SEQ ID NO 3.
The D12 subunit (33 kDa) has no methyltransferase catalytic activity per se, but it can activate the methyltransferase activity of the enhanced D1 protein. The amino acid sequence of the D12 subunit of vaccinia virus RNA capping enzyme is shown in SEQ ID NO. 1, 1-287.
In some embodiments, the RNA cap structure 2 '-O-methyltransferase described herein is a vaccinia virus 2' -O-methyltransferase (2-O-MTase). The 39kDa 2 '-O-methyltransferase, also known as the VP39 protein, effects cap-specific mRNA (nucleoside 2' -O-) -methyl transfer, converting the cap-0 structure to the cap-1 structure. The amino acid sequence of the vaccinia virus RNA cap structure 2' -O-methyltransferase is shown in SEQ ID NO. 1, 303-635.
In some aspects, the present invention also provides a MBP tag fused 2' -O-methyltransferase protein for improving protein expression stability, which has comparable or better biological enzyme functional activity than native vaccinia virus 2-O-methyltransferase. The fusion protein comprises (a) an RNA cap structure 2 '-O-methyltransferase or a functional fragment thereof or a variant thereof having at least 90% sequence identity thereto and having RNA cap structure 2' -O-methyltransferase activity, and (b) a His-tag and/or an MBP-tag located (a) N-or C-terminally, and optionally a linker between (a) and (b). The RNA cap structure 2' -O-methyltransferase was as previously described. The His-tag described herein is a short peptide containing one or more consecutive histidine residues. The MBP tags described herein have the conventional meaning in the art.
In the present invention, a polypeptide or protein (e.g., vaccinia virus capping enzyme or subunit or domain thereof, RNA cap 2 '-O-methyltransferase) also includes a mutant having at least 70% sequence identity thereto and retaining the activity (e.g., 5' -triphosphorylation activity, guanosine transferase activity, N7 methyl transferase activity, 2-O-methyl transferase activity) of the polypeptide or protein. The mutant comprises: an amino acid sequence having at least 70%, at least 80%, preferably at least 85%, preferably at least 90%, preferably at least 95%, preferably at least 97% sequence identity to a reference sequence and retaining the biological activity of the reference sequence. Sequence identity between two aligned sequences can be calculated using, for example, BLASTp from NCBI. Mutants also include amino acid sequences having one or several mutations (insertions, deletions or substitutions) in the amino acid sequence while still retaining the biological activity of the reference sequence. The number of mutations usually means within 1-50, such as 1-20, 1-10, 1-8, 1-5 or 1-3. The substitution is preferably a conservative substitution. For scFv, the mutations may occur in the CDR regions (including those described above) or in the FR regions, provided that the biological activity of the reference sequence is retained after the mutation. For example, conservative substitutions with amino acids that are similar or analogous in performance are not typically used in the art to alter the function (e.g., enzymatic activity) of a protein or polypeptide. "amino acids with similar or analogous properties" include, for example, families of amino acid residues with analogous side chains, including amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, substitution of one or more sites with another amino acid residue from the same side chain species in a polypeptide of the invention will not substantially affect its activity.
For example, it is known in the art that variants of the D12 subunit of RNA capping enzymes may have mutations selected from one or more of the following and retain the activity of the D12 subunit: N42A, Y43A, L61A, K62A, F245A, L246A, K111A, R112A, N120A, N121A, N126A, N127A, F141A, R142A, K223A, D224A, H260A, S261A, E275A, N276A, R280A, R281A. As another example, schnierle BS et al in the paper disclose a series of effective mutated amino acid sequences of mRNA cap structure 2' -O-methyltransferase, which still retains higher enzymatic catalytic activity. These mutants include, but are not limited to, the C178S, C272S, K41D, a201R, a210K mutants of 2' -O-methyltransferase; the charged amino acids R, K, H, Y, C, D, E of 2' -O-methyltransferase were randomly replaced by A. Zheng S, colin PY and Mao X respectively disclose effective mutation amino acid sequences of a series of vaccinia virus capping enzyme N7-methyltransferase structural domains in the literature, and the mutants also retain higher enzyme catalytic activity. These mutants include, but are not limited to, D545A, R548A, N550D, Y555F, R560K, R794A, R808A, Y683S, Y684A, Y684F, D598A, G600A, G602A, I681A, S684A, F685A, T571A, L575A, L576A, M579A, F585A, L586A, D587A, D784A, N785A, R794A, F798A, M805A, E806A. (Schnierle BS et al, J Biol chem.1994; zheng S et al, RNA.2008; mao X et al, biochemistry.1996; kyieleis OJ et al, structure.2014; colin PY. Sci Rep.2020; nayanendu Saha et al, virology,2001, nayanendu Saha et al, J.VIROL, 2003) the foregoing documents are incorporated herein by reference in their entirety.
In some embodiments, the polypeptides or proteins described herein further comprise a signal peptide capable of directing it to a subcellular structure. The signal peptide may be located at the N-terminus or C-terminus of the polypeptide. Such subcellular structures include, but are not limited to, the golgi or endoplasmic reticulum, proteasomes, cell membranes, or lysosomes.
Herein, a Linker (Linker) is a polypeptide segment that connects different proteins or polypeptides, with the purpose of maintaining the connected proteins or polypeptides in their respective spatial conformations to maintain the function or activity of the proteins or polypeptides. Exemplary linkers include linkers containing G and/or S. Typically, the linker contains one or more motifs which repeat back and forth. Preferably, the motifs are adjacent in the linker sequence with no intervening amino acid residues between the repeats. The linker sequence may comprise 1,2, 3, 4 or 5 repeat motifs. The length of the linker may be 3-25 amino acid residues, for example 3-15, 5-15, 10-20 amino acid residues. In certain embodiments, the linker sequence is a polyglycine linker sequence. The number of glycines in the linker sequence is not particularly limited, and is typically 2-20, e.g., 2-15, 2-10, 2-8. In addition to glycine and serine, other known amino acid residues may be contained in the linker, such as alanine (a), leucine (L), threonine (T), glutamic acid (E), phenylalanine (F), arginine (R), glutamine (Q), and the like. In certain embodiments, the different proteins or polypeptides of the invention are linked by (GGGGS) n, where n is an integer from 1 to 5. In one or more embodiments, the amino acid sequence of the linker is as set forth in amino acids 288-302 of SEQ ID NO 1.
Within the scope of the present invention, polypeptide linkers useful herein that do not affect the folding of the enzymatic domain and that can achieve the effect of the chimeric enzymes of the invention are well known to those skilled in the art and include, but are not limited to, GGGGIAPISMVGGGGS (Turner, ritter et al 1997), SPNGASNSGSAPDTSSAPGSQ (Hennecke, krebber et al 1998), EGKSSGSESKSTE (Bird, hardman et al 1988), EGKSSGSGSESKEF (Newton, xue et al 1996), GGGSGGGSGGGTGGGSGGGGG (Robinson and Sauer 1998), GSGSGGSGSEGKSKG (Bedzyk, weidner et al 1990), YPRSIYIRIIRRRHPSPSLTT (Tang, jiang et al 1996), GSGKPGSGEGS (Ting, kain et al 2001), SSADDAKKDAAKKDDKDDAKKDA (Pantoliano, bird et al 1991), GSADDAXXDAAXKDDAKKDDAKKDGS (Gregoire, lin et al 1996), LSADAKKDAAKKDDKDDAKKDL (Pavlikova, bersford et al 1999), AEAAAKEAAAKEAAAKA (Wickham, carrion et al 1995), GSTSGSGKPGSGSGGAGTGAGGGAGSTSTSGGKPSGEG (Ting, kain et al 2001), LSLEVAEEIARLEAEV (Ting, kain et al 2001), GTPTPTPTPTPTPTGEF (Gustavsson, lehtio et al 2001), GSGSGSGGSGSGGTKG (Whitlow, bell et al 1993) and GSHSGGK GK (Ting, kain et al 42334, or the universal linker described in US 20142334. The above documents are incorporated herein by reference in their entirety.
The invention includes polynucleotides encoding the chimeric protein subunits or chimeric proteins of the invention. The polynucleotide of the present invention may be in the form of DNA or RNA. The form of DNA includes cDNA, genomic DNA or artificially synthesized DNA. The DNA may be single-stranded or double-stranded. The DNA may be the coding strand or the non-coding strand. The invention also includes degenerate variants of polynucleotides that encode a polypeptide or protein, i.e., polynucleotides that encode the same amino acid sequence but differ in nucleotide sequence.
Polynucleotides described herein include sequences that are altered by codon optimization, so long as the amino acid sequence encoded by the polynucleotide is not altered. Codon-optimized sequences may exhibit more favorable expression for a particular species. Methods for codon optimizing polynucleotide sequences are well known in the art.
The polynucleotide of the present invention may be the coding sequence of the chimeric protein subunit and the coding sequence of the RNA capping enzyme D1 subunit or a functional fragment thereof, or an expression cassette of the chimeric protein subunit and an expression cassette of the RNA capping enzyme D1 subunit or a functional fragment thereof. Herein, a coding sequence refers to a portion of a nucleic acid sequence that directly determines the amino acid sequence of its protein product (e.g., a chimeric protein subunit, a polypeptide of the RNA capping enzyme D1 subunit or a functional fragment thereof, etc.). The boundaries of the coding sequence are generally determined by a ribosome binding site (for prokaryotic cells) immediately upstream of the 5 'open reading frame of the mRNA and a transcription termination sequence immediately downstream of the 3' open reading frame of the mRNA. A coding sequence can include, but is not limited to, DNA, cDNA, and recombinant nucleic acid sequences. Herein, an expression cassette refers to the complete elements required for expression of a gene of interest, including the promoter, gene coding sequence, and PolyA tailing signal sequence. The polynucleotides described herein may be two separate nucleic acid molecules, each comprising a coding sequence for a chimeric protein subunit and a coding sequence for a D1 subunit of RNA capping enzyme or a functional fragment thereof, e.g., an expression cassette for a chimeric protein subunit and an expression cassette for a D1 subunit of RNA capping enzyme or a functional fragment thereof, respectively; alternatively, the coding sequence of the chimeric protein subunit and the coding sequence of the RNA capping enzyme D1 subunit or a functional fragment thereof may be linked as one nucleic acid molecule via a linker, e.g. the coding sequence of the chimeric protein subunit and the coding sequence of the RNA capping enzyme D1 subunit or a functional fragment thereof are in the same expression cassette, or the two expression cassettes are linked as the same nucleic acid molecule via a suitable linker. In certain embodiments, the polynucleotide of the invention is a nucleic acid molecule comprising a promoter, a coding sequence encoding the chimeric protein subunit and the RNA capping enzyme D1 subunit or a functional fragment thereof, and a PolyA tailing signal, wherein the coding sequence of the chimeric protein subunit and the coding sequence of the RNA capping enzyme D1 subunit or a functional fragment thereof are in one expression cassette. In one or more embodiments, the polynucleotide further comprises an optional signal peptide. In one or more embodiments, the polynucleotide comprises SEQ ID NO 2 or SEQ ID NO 2 and 4.
In certain embodiments, the coding sequence or expression cassette is integrated into the genome of the cell. Thus, in these embodiments, the cell described herein has stably integrated into its genome an expression cassette comprising a subunit encoding a chimeric protein described herein and a subunit of RNA capping enzyme D1, or a functional fragment thereof.
The invention also relates to nucleic acid constructs comprising a polynucleotide as described herein, and one or more control sequences operably linked to the sequences. The polynucleotides of the present invention may be manipulated in a variety of ways to ensure expression of the chimeric protein subunits or chimeric proteins. The nucleic acid construct may be manipulated prior to insertion into the vector, depending on the type of expression vector or requirements. Techniques for altering polynucleotide sequences using recombinant DNA methods are known in the art.
The control sequence may be an appropriate promoter sequence. The promoter sequence is typically operably linked to the coding sequence of the protein to be expressed. The promoter may be any nucleotide sequence which shows transcriptional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell. The control sequence may also be a suitable transcription terminator sequence, a sequence recognized by a host cell to terminate transcription. The terminator sequence is operably linked to the 3' terminus of the nucleotide sequence encoding the polypeptide. Any terminator which is functional in the host cell of choice may be used in the present invention. The control sequence may also be a suitable leader sequence, a nontranslated region of an mRNA which is important for translation by the host cell. The leader sequence is operably linked to the 5' terminus of the nucleotide sequence encoding the polypeptide. Any terminator which is functional in the host cell of choice may be used in the present invention.
In certain embodiments, the nucleic acid construct is a vector. The vector may be a cloning vector, an expression vector, or a homologous recombinant vector. The polynucleotides of the present invention can be cloned into many types of vectors, for example, plasmids, phagemids, phage derivatives, animal viruses, and cosmids. Cloning vectors may be used to provide coding sequences for therapeutic proteins and polypeptides of the invention, such as a nucleic acid molecule comprising coding sequences for therapeutic proteins and polypeptides. The expression vector may be provided to the cell in the form of a viral vector. Expression of a polynucleotide of the invention is typically achieved by operably linking the polynucleotide of the invention to a promoter and incorporating the construct into an expression vector. The vector may be suitable for replication and integration into eukaryotic cells. Typical cloning vectors contain transcriptional and translational terminators, initiation sequences, and promoters useful for regulating the expression of the desired nucleic acid sequence. Homologous recombinant vectors are used to integrate the expression cassettes described herein into the host genome.
Generally, suitable vectors comprise an origin of replication functional in at least one organism, a promoter sequence, a convenient restriction enzyme site and one or more selectable markers. For example, in certain embodiments, the invention uses a lentiviral vector comprising a replication initiation site, a 3'LTR,5' LTR, a polynucleotide as described herein, and optionally a selectable marker.
An example of a suitable promoter is the immediate early Cytomegalovirus (CMV) promoter sequence. The promoter sequence is a strong constitutive promoter sequence capable of driving high level expression of any polynucleotide sequence operably linked thereto. Another example of a suitable promoter is elongation growth factor-1 α (EF-1 α). However, other constitutive promoter sequences known in the art to be suitable for prokaryotic or eukaryotic cells may also be used.
To assess the expression of a therapeutic protein, polypeptide, or portion thereof, the expression vector introduced into the cells may also contain either or both of a selectable marker gene or a reporter gene to facilitate identification and selection of expressing cells from the population of cells sought to be transfected or infected by the viral vector. In other aspects, the selectable marker may be carried on a separate piece of DNA and used in a co-transfection procedure. Both the selectable marker and the reporter gene may be flanked by appropriate regulatory sequences to enable expression in a host cell. Useful selectable markers include, for example, antibiotic resistance genes and the like. Suitable reporter genes may include genes encoding luciferase, β -galactosidase, chloramphenicol acetyltransferase, secreted alkaline phosphatase, or green fluorescent protein. Suitable expression systems are well known and can be prepared using known techniques or obtained commercially.
The polynucleotides described herein can generally be obtained by PCR amplification. Specifically, primers can be designed based on the nucleotide sequences disclosed herein, particularly open reading frame sequences, and the relevant sequences can be amplified using commercially available cDNA libraries or cDNA libraries prepared by conventional methods known to those skilled in the art as templates. When the sequence is long, it is often necessary to perform two or more PCR amplifications, and then splice together the amplified fragments in the correct order. Alternatively, the nucleic acid molecules described herein can also be synthesized directly.
Methods for introducing and expressing genes into cells are known in the art. The vector may be readily introduced into a host cell by any method known in the art, e.g., mammalian, bacterial, yeast or insect cells. For example, the expression vector may be transferred into a host cell by physical, chemical or biological means.
Physical methods for introducing polynucleotides into host cells include calcium phosphate precipitation, lipofection, particle bombardment, microinjection, electroporation, and the like. Biological methods for introducing a polynucleotide of interest into a host cell include the use of DNA vectors, RNA vectors, or viral vectors, such as lentiviral vectors. Chemical means of introducing polynucleotides into host cells include colloidal dispersion systems such as macromolecular complexes, nanocapsules, microspheres, beads; and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes.
Herein, a host cell contains, expresses and/or secretes a protein subunit or protein described herein. Herein, when a cell is referred to as containing or comprising, expressing, secreting a molecule, such as a polypeptide, "containing" means that the molecule is contained within or on the surface of the cell; "expressing" means that the cell produces the molecule; "secreted" means that the cell secretes the expressed molecule outside the cell. Host cells include both cells that are ultimately used to secrete the chimeric protein or a subunit thereof, and cells that produce RNA, as well as various cells used in the production of such cells, such as e.coli cells, for use, e.g., in providing a coding sequence for a protein of the invention or in providing a vector as described herein. After the host cell expresses a protein subunit or protein described herein, the protein subunit or protein can be purified by methods conventional in the art for purifying proteins (e.g., chromatography, including affinity chromatography, ion exchange chromatography, and the like).
Chemical modification of one or more steps in the RNA capping process can be performed using the chimeric protein subunits or chimeric proteins of the invention. Accordingly, the present invention provides a method of modifying a target RNA into a capped RNA or a method of preparing a capped target RNA, comprising: contacting a target RNA with a chimeric protein described herein, or contacting a target RNA with a cell expressing a chimeric protein described herein.
Herein, a "target RNA" is an RNA of interest obtained in vivo or in vitro. Typically, the target RNA is uncapped or incompletely capped RNA. The target RNA may be from any source, e.g., in Vitro Transcription (IVT), chemical synthesis, extraction from the body, etc. In vitro transcription refers to the enzymatic synthesis of RNA using a DNA transcription template with T7, T3, SP6 promoters and a corresponding DNA-dependent RNA polymerase in a suitable buffer system including, but not limited to, NTP, mg 2+ And components such as Ribonuclease Inhibitors (RI) and inorganic pyrophosphatase (iPPase). Compared with solid phase synthesis of RNA, in vitro transcription synthesis can prepare a large amount of long-chain RNA with high quality, and is suitable for industrial production of mRNA drugs. In one or more embodiments, the RNA of interest described herein is an in vitro transcribed cap-free single-stranded RNA (ssRNA) that has all the features of an mRNA other than a cap, including but not limited to a 5'-UTR, a 3' -UTR, a protein or polypeptide translation coding region.
Methods for in vitro transcription of RNA substrates are well known in the art, and exemplary methods for transcription of polymerases (e.g., T7, T3, SP 6) include mixed incubation of a polymerase, rNTP, and a DNA transcription template containing a promoter recognized by the polymerase. The reaction system also comprises one or more reagents selected from the following: mgCl 2 Buffer, iPPase, inhibitor, nuclease-free water, and the like. The DNA for transcription of the template can be obtained by methods conventional in the art, such as synthesis, hybridization, PCR, etc., and the reagents required for these methods are also well known in the art.
Without wishing to be bound by theory, target RNAs herein include 5' -triphosphorylated RNA, 5' -diphosphonated RNA, and RNA having a5' -Gppp structure. As used herein, "capped RNA" includes capped RNA having a structure of 5'-m7Gppp (cap 0) or capped RNA having a structure of 5' -m7 GppNmp (cap 1). In each of the above structures, "5'" indicates that the group is located at the 5' end of the RNA, "m" indicates methylated, "7" indicates the methylation position, "G" is guanylic acid, "p" is a phosphate group, and "N" is any nucleotide located at the 5' end of the target RNA. Optionally, the target RNA may be purified and/or denatured and renatured RNA. Reagents and procedures for purifying and denaturing, renaturing RNA are known in the art, for example, by incubating an RNA sample at 50-70 ℃ for 1-60 minutes (e.g., 65 ℃ for 5-20 minutes), followed by 0 ℃ for 2-10 minutes. However, in the method of the present invention, the capping of RNA can be achieved without purification and denaturation and renaturation.
In order to contact the RNA of interest with the chimeric proteins described herein, the RNA of interest can be expressed in a cell expressing the chimeric proteins described herein (e.g., by introducing a DNA sequence expressing the RNA of interest into a host cell described herein). The target RNA can be expressed in the cell by any suitable method known in the art, for example, by introducing into the cell DNA encoding the RNA which is transcribed in the cell to form the target RNA, which is capped catalyzed by the chimeric protein or chimeric protein subunit expressed by the cell.
Alternatively, the RNA of interest (e.g., RNA transcribed in vitro) can be mixed in solution with the chimeric proteins described herein to effect contact. The RNA may be purified and/or renatured prior to mixing. Procedures and conditions for RNA capping in solution are known in the art, for example, mixing and incubating target RNA in nuclease-free water with capping buffer, GTP, SAM, and the chimeric proteins herein. An incubation step such as incubation at 37 ℃ for 30-90 minutes. The modified capped RNA product can be purified, for example, by magnetic beads.
In the capping method described herein, the molar ratio of the target RNA to the chimeric protein is less than 865 1, preferably 86.5-173. The inventors have found that different capping RNAs can be obtained at different molar ratios. For example, if the molar ratio of the target RNA to the chimeric protein is less than 865 1, the method modifies the target RNA to a capped RNA having a5' -m7Gppp structure (cap 0). If the number of moles of the target RNA and the chimeric protein is less than 86.5-173, the method modifies the target RNA into a capped RNA (cap 1) having a5' -m7 GppNmp structure.
Furthermore, the chimeric proteins may comprise different domains for different target RNAs. If the target RNA is a5' -triphosphorylated RNA, it is desirable that the chimeric protein has the activity of 5' -triphosphatase, guanosine transferase, N7 methyltransferase, 2-O-methyltransferase, i.e.the chimeric protein comprises the N7 methyltransferase (N7-MTase) domain or functional fragment thereof, the 5' -triphosphatase (TPase) domain or functional fragment thereof and the guanosine transferase (GTase) domain or functional fragment thereof of the RNA capping enzyme D1 subunit, and the chimeric protein subunits.
If the target RNA is a5' -diphosphonated RNA, it is desirable that the chimeric protein has the activity of a guanosine transferase, an N7 methyltransferase, a 2-O-methyltransferase, i.e.the chimeric protein comprises the N7 methyltransferase (N7-MTase) domain or a functional fragment thereof and the guanosine transferase (GTase) domain or a functional fragment thereof of the D1 subunit of the RNA capping enzyme, and the subunits of the chimeric protein.
If the target RNA is an RNA having a5' -Gppp structure, it is desirable that the chimeric protein has the activity of N7 methyltransferase (N7-MTase) and 2-O-methyltransferase, i.e., the chimeric protein comprises the N7 methyltransferase (N7-MTase) domain of the D1 subunit of RNA capping enzyme or a functional fragment thereof and the chimeric protein subunit.
In another aspect, the present invention can also modify a capped RNA having a 5'-m7Gppp structure (cap 0) into a capped RNA having a5' -m7 gppnmp structure (cap 1), comprising the steps of: contacting the RNA with a chimeric protein subunit described herein (e.g., expressing the RNA in a cell expressing a chimeric protein subunit described herein), or contacting the RNA with a cell expressing a chimeric protein subunit described herein.
The invention has the advantages that:
1. the chimeric enzyme with the biological activities of the capping enzyme and the 2' -O-methyltransferase can reduce the production process and the production cost;
2. the cap0 and cap1 are continuous reactions, and the chimeric enzyme covalently couples the two enzymes together, so that the two enzymes are more favorable for continuous contact with an RNA substrate, and the modification efficiency is improved.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications and patents specifically mentioned herein are incorporated herein by reference in their entirety for all purposes including describing and disclosing the chemicals, equipment, statistical analyses and methods reported in the publications that could be used in connection with the invention. All references cited in this specification are to be considered as indicative of the level of skill in the art. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.
The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. The experimental procedures, for which specific conditions are not noted in the following examples, are generally performed according to conventional conditions such as those described in J. SammBruk et al, molecular cloning protocols, third edition, scientific Press, 2002, or according to the manufacturer's recommendations.
Examples
Example 1 expression and purification of native vaccinia virus capping enzyme, 2-O-methyltransferase, chimeric enzyme
In order to be able to overexpress the vaccinia virus capping enzyme D1: d12, 2-O-methyltransferase 2-O-MTase and D1: D12- (2-O-MTase) chimeric enzyme, and DNA sequences were optimally designed according to the amino acid sequences corresponding to the above proteins and cloned into commercial prokaryotic expression vectors pET28a (Novagen), pMAL-c5X (NEB) and pRSF-Duet1 (Novagen). The amino acid and DNA sequences of the enzyme are shown in SEQ ID NO 1-4, and the prokaryotic expression vector diagram is shown in figure 1.
To ensure high level expression of the enzyme and to allow direct reproducible purification of the vaccinia virus capping enzyme or chimeric enzyme complex, a single His tag was added to the N-terminus of the D1 enzyme to facilitate purification of the complete D1: D12 or D1: D12- (2-O-MTase) protein complex. In order to make the 2-O-methyltransferase more stable and facilitate purification, a His tag and/or an MBP tag is added to the N-terminus of the 2-O-methyltransferase. The plasmids were transformed into E.coli BL21 (DE 3) expression bacteria according to the instructions of the commercial expression vectors, single colonies were picked up and cultured in LB at 37 ℃ to the logarithmic phase, and then added with 1mM IPTG inducer at the final concentration and cultured for further induction at 25 ℃ for 16 hours. After collection of the induced bacteria, all enzyme proteins were purified using two standard purification steps, ni affinity chromatography, capto SP ImpRes or Capto Q ImpRes (Cytiva corporation) ion exchange chromatography to isolate the final enzyme protein. The lysate before and after induction and the finally purified enzyme protein are respectively subjected to SDS-PAGE (polyacrylamide gel electrophoresis), wherein FIG. 2 is a SDS-PAGE identification picture of the lysate with His and/or MBP labels added at the N end of 2-O-methyltransferase, FIG. 3 is a SDS-PAGE identification picture of the lysate of vaccinia virus capping enzyme and chimeric enzyme, and FIG. 4 is a SDS-PAGE identification picture of purified MBP- (2-O-MTase), vaccinia virus capping enzyme D1: D12 and chimeric enzyme D1: D12- ((2-O-MTase).
As shown in FIG. 2, when the 2-O-methyltransferase does not have the MBP tag, protein expression is unstable and the yield is low, so that the 2-O-methyltransferases used in the present invention are all 2-O-methyltransferases in the form of MBP tags, i.e., MBP- (2-O-MTase). As shown in FIG. 3, when the natural vaccinia virus capping enzyme is expressed, the expression level of the D12 protein subunit is much higher than that of the D1 protein subunit, which is not favorable for the formation of stable 1:1, D1: D12 complex in stable relationship; when 2-O-MTase and D12 are expressed in a fusion mode, the expression level of the fusion protein subunit is closer to that of D1, and the formation of a protein complex D1: D12- (2-O-MTase) is facilitated. As shown in FIG. 4, the three expression schemes described above all resulted in a protein or protein subunit of the desired size after purification, wherein the vaccinia virus capping enzyme comprises a 33kD protein subunit D12 and a 100kD protein subunit D1; the chimeric enzyme comprises 73kD D12- (2-O-MTase) fusion protein subunit and 100kD D1 protein subunit, and meets the expected protein size; the size of the MBP- (2-O-MTase) fusion protein is 83kD, and the sum of the his-MBP protein tag of 44kD and the 2-O-MTase protein of 39kD also meets the expected protein size.
Example 2 in vitro transcription and enzymatic modification of RNA substrates to synthesize Cap0 mRNA, cap1 mRNA
Primers containing the T7 promoter sequence were designed and synthesized as follows, RNA30-T7F:5' galatatacgcactcactataGGGAAGGAGAGGAAGAAGGAAAGGGAAGAAGGAAGAAGAAGAAGAAGAA-; RNA30-R: 5- 'TTCTTTCTTCCCTTCTTCCTCTCCTTCCttagtgagtgtctattatc-3' (SEQ ID NO: 6). Mixing the primers according to the proportion in the table 1 to carry out annealing reaction, wherein the annealing procedure is as follows: pre-denaturation at 95 ℃ for 10min, gradient cooling to 20 ℃ within 2 hours, keeping the temperature at 20 ℃ for one hour, and taking out. The magnetic beads are purified to obtain the 30nt RNA (RNA 30) transcription DNA template.
TABLE 1 primer annealing reaction System
Reagent | Volume of |
10 × annealing buffer | 5μL |
RNA30-T7F(100μM) | 20μL |
RNA30-R(100μM) | 20μL |
Water without ribonuclease | To 50 μ L |
Transcription synthesis of RNA: the reaction system for preparing the T7 RNA polymerase at room temperature is added with reaction components according to the sequence shown in the table below, and the reaction system can be expanded or reduced according to equal proportion. The reaction was carried out at 37 ℃ for 6 to 16 hours. In this example, the transcribed RNA was 30nt RNA. And 2h, after the reaction is finished, adding 2U DNase, digesting for 15min at the temperature of 1 ℃, and then carrying out magnetic bead purification for the next RNA capping modification enzymatic reaction.
TABLE 2 RNA transcription Synthesis System
Capping modified RNA: taking 10ug of the purified RNA to a 1.5ml centrifuge tube, and diluting the RNA to 14ul by using water without nuclease; heating at 65 ℃ for 10 minutes, taking out the centrifuge tube, and placing on ice for 5 minutes; the following components were added in sequence according to tables 3-6 for different purposes of RNA modification and incubated at 37 ℃ for 30-90 minutes. The step is suitable for capping reaction of 10ug RNA, and the volume of the reaction substrate can be amplified according to the experiment requirement. After the reaction is finished, a final modified RNA product can be obtained by adopting a magnetic bead purification mode.
TABLE 3 enzymatic modification of uncapped RNA capped as Cap0RNA Synthesis System
TABLE 4 enzymatic modification of cap0RNA as cap1 RNA Synthesis System
TABLE 5 enzymatic modification of uncapped RNA capped with Cap1 RNA Synthesis System (Natural enzyme)
Components | Volume of | |
Denatured cap- | 10μg | |
10 + capping buffer | 2.0μl | |
GTP(10mM) | 1.0μl | |
SAM(32mM) | 1.0μl | |
Vaccinia virus capping enzyme D1: D12 | X μ g (as shown in FIG. 7) | |
MBP-(2-O-MTase) | Y μ g (as shown in FIG. 7) | |
Water without ribonuclease | To 20. Mu.l |
TABLE 6 enzymatic modification of uncapped RNA capped with Cap1 RNA Synthesis System (chimeric enzyme)
Components | Volume of | |
Denatured cap- | 10μg | |
10 + capping buffer | 2.0μl | |
GTP(10mM) | 1.0μl | |
SAM(32mM) | 1.0μl | |
Chimeric enzyme D1: d12- (2-O-MTase) | X μ g (as shown in FIG. 8) | |
Water without ribonuclease | To 20. Mu.l |
Example 3 detection of the efficiency of 5' capping of RNA catalyzed by different RNA-modifying enzymes by HPLC-MS
As shown in Table 7, the expected molecular weights of the transcribed or enzymatically modified intermediates or final products in example 2 were calculated and these sample distributions were subjected to HPLC-MS (liquid chromatography-Mass Spectrometry) detection. The proportion of target products generated after the RNA substrates are subjected to enzymatic catalytic reaction can be deduced by comparing the detected approximate molecular weights, and then the enzyme catalytic activities of different RNA modification enzymes are deduced. The results are shown in FIGS. 5 to 8.
TABLE 7 predicted molecular weight Scale for different cap structures RNA30
* Note: the actual mass spectrometric detection molecular weight and the predicted molecular weight sometimes have a deviation of less than 5Da, but do not affect the judgment of the target product
The results in FIG. 5 show that: the vaccinia virus capping enzyme (about 133 kDa) is capable of converting the modification of the cap-free triphosphate RNA30 with the expected molecular weight of 10623Da into the cap0RNA30 with the expected molecular weight of 10903Da under the condition that the amount of the capping enzyme is more than 0.1. Mu.g, and when the enzyme dosage is less than 0.05. Mu.g, the modification of the cap-free triphosphate RNA30 is incomplete, only part of the substrate is converted into the cap0RNA30, and a small amount of the substrate is converted into the G-cap RNA30. Namely, the substrate RNA: when the molar ratio of the capping enzyme is less than 1330, cap0 can be completely capped. (Note: RNA approximately equal to 10kDa, facilitating rapid calculation)
The results in FIG. 6 show that: 2-O-methyltransferase (about 81 kDa), at greater than 0.5. Mu.g, was able to convert the modification of cap0RNA30 with the expected molecular weight of 10903Da to cap1 RNA30 with the expected molecular weight of 10918Da, and at enzyme doses less than 0.25. Mu.g, the modification of cap0RNA30 was incomplete and only a portion of the substrate was converted to cap1 RNA30. Namely, the substrate RNA: when the molar ratio of the 2-O-methyltransferase is less than 162, the cap0 can be completely modified into the cap1 cap.
FIG. 7 shows HPLC-MS analysis of vaccinia virus capping enzyme and 2-O-methyltransferase synergistically modifying RNA triphosphate activity of 30 units. Based on the detection result of the capping enzyme modified RNA triphosphate 30 as cap0RNA30, 0.2. Mu.g of vaccinia capping enzyme was added to each group in this experimental group to ensure that the RNA triphosphate 30 was completely converted into cap0RNA30, and the 2-O-methyltransferase was added in a gradient of 0.25. Mu.g, 0.5. Mu.g, 1. Mu.g, and 2. Mu.g. The results in FIG. 7 show that: when the addition amount of the 2-O-methyltransferase is 0.25ug, only half of the substrate is converted into cap1 RNA30; when the addition amount of the 2-O-methyltransferase is 0.5ug, a small amount of substrate is still not converted into cap1 RNA30; when the addition amount of the 2-O-methyltransferase is more than 1ug, the RNA30 can be modified cooperatively to be completely converted into cap1 RNA. This indicates that more 2-O methyltransferase protein is required to modify RNA to cap1 in one step than to modify RNA to cap1 structure in steps. In the synergistic capping system, the substrate RNA: capping enzyme molar ratio less than 665:1, cap0 can be completely capped, RNA: the molar ratio of the 2-O-methyltransferase is between 81 and 162:1, a complete cap0 modification to cap1 can be achieved, indicating that the enzyme usage in the synergistic capping reaction system and the stepwise capping reaction system is similar.
The results in FIG. 8 show that: the chimeric enzyme (173kDa, 133kDa + 39kDa) can completely convert the uncapped triphosphate RNA30 with the expected molecular weight of 10623Da into cap0RNA30 with the expected molecular weight of 10903Da under the condition of 0.1-0.2 mu g; of these, about 90% of RNA30 was completely capped at 0.1. Mu.g. This indicates that the chimeric enzyme retains the activity of the three enzymatic domains of 5' -triphosphatase, guanosine transferase and N7-guanine methyltransferase in the wild-type vaccinia virus capping enzyme in the same dose as the wild-type enzyme alone. When the chimeric enzyme is used in a dose of 2. Mu.g, the cap0RNA30 converted to 10903Da can be further completely modified to convert it to a cap1 RNA30 of the expected molecular weight of 10918 Da. This shows that the chimeric enzyme not only completely retains the activity of the catalytic domain of 2-O-methyltransferase, but also can more efficiently and thoroughly catalyze to obtain cap1 RNA30. I.e., chimeric enzyme capping system, substrate RNA: when the mole ratio of the chimeric enzyme is less than 865, cap0 can be completely capped; RNA: when the mosaic molar ratio is between 86.5 and 173, the cap0 can be completely modified into a cap1 cap. The indexes are all equivalent to the mole ratio index 665, 81-162 of the mixed enzyme system, and show that the chimeric enzyme can realize the same function of the mixed enzyme under the same molar concentration, namely the chimeric enzyme is proved not to change the catalytic activity of each functional structural domain, and the simplified production and use of the enzyme are favorably realized (compared with the use of two enzymes, only one enzyme is required to be produced, the step cost is saved, and only one enzyme is required to be added during the reaction).
Example 4 comparison of cellular biological function of eGFP mRNA prepared by separately modifying wild-type and chimeric enzymes
A template DNA (SEQ ID NO: 7) for gene synthesis eGFPmRNA transcription was subcloned into a commercial vector such as pUC57 and the like. The plasmid is transformed into Top10 colibacillus, the plasmid is extracted and then linearized by restriction endonuclease, and a DNA linear template is recovered by magnetic beads and used for RNA transcription synthesis experiment. The transcription synthesis system and procedure of eGFP mRNA are described in Table 2 of example 2, and the reaction system and procedure for enzymatically modifying the prepared cap1 eGFP mRNA after transcription synthesis are described in tables 5 and 6 of example 2. Respectively taking and identifying mRNA samples which are completely modified by capping for carrying out transfection tests, numbering a cap1 eGFP mRNA prepared by combining natural vaccinia virus capping enzyme and 2-O-methyltransferase modified RNA as a No. 1 sample, numbering a cap1 eGFP mRNA prepared by chimeric enzyme modified RNA as a No. 2 sample, and numbering a non-capped eGFP mRNA which is not modified by enzyme as a No. 3 sample. The above samples were transfected into Chinese Hamster Ovary (CHO) cells following the following cell transfection procedure to verify the cellular biological function of the modified cap1 eGFP mRNA. Transfection procedure: (1) The day before transfection, CHO cells were digested, plated in 24-well plates at a density of 1 × 105/well. (2) When in transfection, 2 EP tubes with the volume of 1.5ml are respectively taken, 50ul of opti-MEM culture solution is respectively added into each tube, 2.25ul of PEI transfection reagent is added into one tube, 1.5ug of mRNA is added into the other tube, after standing for 5min, PEI solution is added into mRNA solution, and after uniform mixing, the mixture is left standing for 20min. (3) Adding the prepared PEI-mRNA solution into CHO cells containing serum-free culture solution, placing the CHO cells in a 37-degree incubator, culturing for 4 hours, and then replacing the culture medium with a complete culture medium for continuous culture. (4) After 24 hours of culture, fluorescence photographing is carried out, and the EGFP positive rate is detected by detecting the flow type. Duplicate well replicates were made for all three samples.
As shown in FIG. 9 and Table 8, the green fluorescence positive rate and fluorescence intensity of the eGFP mRNA samples No. 1 and No. 2 transfected CHO cells were at the same level, and there was no significant difference. This demonstrates that the same effect of natural vaccinia virus capping enzyme in combination with 2-O-methyltransferase modification of RNA can be achieved with a single modification using chimeric enzyme.
TABLE 8 EGFP positivity and fluorescence intensity of cells transfected with different mRNA samples
Sequence listing
<110> Shanghai cell therapy group Co., ltd
<120> RNA modified chimeric protein and application thereof
<130> 210138
<160> 9
<170> SIPOSequenceListing 1.0
<210> 1
<211> 635
<212> PRT
<213> Artificial Sequence
<400> 1
Met Asp Glu Ile Val Lys Asn Ile Arg Glu Gly Thr His Val Leu Leu
1 5 10 15
Pro Phe Tyr Glu Thr Leu Pro Glu Leu Asn Leu Ser Leu Gly Lys Ser
20 25 30
Pro Leu Pro Ser Leu Glu Tyr Gly Ala Asn Tyr Phe Leu Gln Ile Ser
35 40 45
Arg Val Asn Asp Leu Asn Arg Met Pro Thr Asp Met Leu Lys Leu Phe
50 55 60
Thr His Asp Ile Met Leu Pro Glu Ser Asp Leu Asp Lys Val Tyr Glu
65 70 75 80
Ile Leu Lys Ile Asn Ser Val Lys Tyr Tyr Gly Arg Ser Thr Lys Ala
85 90 95
Asp Ala Val Val Ala Asp Leu Ser Ala Arg Asn Lys Leu Phe Lys Arg
100 105 110
Glu Arg Asp Ala Ile Lys Ser Asn Asn His Leu Thr Glu Asn Asn Leu
115 120 125
Tyr Ile Ser Asp Tyr Lys Met Leu Thr Phe Asp Val Phe Arg Pro Leu
130 135 140
Phe Asp Phe Val Asn Glu Lys Tyr Cys Ile Ile Lys Leu Pro Thr Leu
145 150 155 160
Phe Gly Arg Gly Val Ile Asp Thr Met Arg Ile Tyr Cys Ser Leu Phe
165 170 175
Lys Asn Val Arg Leu Leu Lys Cys Val Ser Asp Ser Trp Leu Lys Asp
180 185 190
Ser Ala Ile Met Val Ala Ser Asp Val Cys Lys Lys Asn Leu Asp Leu
195 200 205
Phe Met Ser His Val Lys Ser Val Thr Lys Ser Ser Ser Trp Lys Asp
210 215 220
Val Asn Ser Val Gln Phe Ser Ile Leu Asn Asn Pro Val Asp Thr Glu
225 230 235 240
Phe Ile Asn Lys Phe Leu Glu Phe Ser Asn Arg Val Tyr Glu Ala Leu
245 250 255
Tyr Tyr Val His Ser Leu Leu Tyr Ser Ser Met Thr Ser Asp Ser Lys
260 265 270
Ser Ile Glu Asn Lys His Gln Arg Arg Leu Val Lys Leu Leu Leu Gly
275 280 285
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Met Asp
290 295 300
Val Val Ser Leu Asp Lys Pro Phe Met Tyr Phe Glu Glu Ile Asp Asn
305 310 315 320
Glu Leu Asp Tyr Glu Pro Glu Ser Ala Asn Glu Val Ala Lys Lys Leu
325 330 335
Pro Tyr Gln Gly Gln Leu Lys Leu Leu Leu Gly Glu Leu Phe Phe Leu
340 345 350
Ser Lys Leu Gln Arg His Gly Ile Leu Asp Gly Ala Thr Val Val Tyr
355 360 365
Ile Gly Ser Ala Pro Gly Thr His Ile Arg Tyr Leu Arg Asp His Phe
370 375 380
Tyr Asn Leu Gly Val Ile Ile Lys Trp Met Leu Ile Asp Gly Arg His
385 390 395 400
His Asp Pro Ile Leu Asn Gly Leu Arg Asp Val Thr Leu Val Thr Arg
405 410 415
Phe Val Asp Glu Glu Tyr Leu Arg Ser Ile Lys Lys Gln Leu His Pro
420 425 430
Ser Lys Ile Ile Leu Ile Ser Asp Val Arg Ser Lys Arg Gly Gly Asn
435 440 445
Glu Pro Ser Thr Ala Asp Leu Leu Ser Asn Tyr Ala Leu Gln Asn Val
450 455 460
Met Ile Ser Ile Leu Asn Pro Val Ala Ser Ser Leu Lys Trp Arg Cys
465 470 475 480
Pro Phe Pro Asp Gln Trp Ile Lys Asp Phe Tyr Ile Pro His Gly Asn
485 490 495
Lys Met Leu Gln Pro Phe Ala Pro Ser Tyr Ser Ala Glu Met Arg Leu
500 505 510
Leu Ser Ile Tyr Thr Gly Glu Asn Met Arg Leu Thr Arg Val Thr Lys
515 520 525
Ser Asp Ala Val Asn Tyr Glu Lys Lys Met Tyr Tyr Leu Asn Lys Ile
530 535 540
Val Arg Asn Lys Val Val Val Asn Phe Asp Tyr Pro Asn Gln Glu Tyr
545 550 555 560
Asp Tyr Phe His Met Tyr Phe Met Leu Arg Thr Val Tyr Cys Asn Lys
565 570 575
Thr Phe Pro Thr Thr Lys Ala Lys Val Leu Phe Leu Gln Gln Ser Ile
580 585 590
Phe Arg Phe Leu Asn Ile Pro Thr Thr Ser Thr Glu Lys Val Ser His
595 600 605
Glu Pro Ile Gln Arg Lys Ile Ser Ser Lys Asn Ser Met Ser Lys Asn
610 615 620
Arg Asn Ser Lys Arg Ser Val Arg Ser Asn Lys
625 630 635
<210> 2
<211> 1908
<212> DNA
<213> Artificial Sequence
<400> 2
atggacgaaa tcgttaaaaa catccgtgaa ggtacccacg ttctgctgcc gttctacgaa 60
accctgccgg aactgaacct gtctctgggt aaatctccgc tgccgtctct ggaatacggt 120
gctaactact tcctgcagat ctctcgtgtt aacgacctga accgtatgcc gaccgacatg 180
ctgaaactgt tcacccacga catcatgctg ccggaatctg acctggacaa agtttacgaa 240
atcctgaaaa tcaactctgt taaatactac ggtcgttcta ccaaagctga cgctgttgtt 300
gctgacctgt ctgctcgtaa caaactgttc aaacgtgaac gtgacgctat caaatctaac 360
aaccacctga ccgaaaacaa cctgtacatc tctgactaca aaatgctgac cttcgacgtt 420
ttccgtccgc tgttcgactt cgttaacgaa aaatactgca tcatcaaact gccgaccctg 480
ttcggtcgtg gtgttatcga caccatgcgt atctactgct ctctgttcaa aaacgttcgt 540
ctgctgaaat gcgtttctga ctcttggctg aaagactctg ctatcatggt tgcttctgac 600
gtttgcaaaa aaaacctgga cctgttcatg tctcacgtta aatctgttac caaatcttct 660
tcttggaaag acgttaactc tgttcagttc tctatcctga acaacccggt tgacaccgaa 720
ttcatcaaca aattcctgga attctctaac cgtgtttacg aagctctgta ctacgttcac 780
tctctgctgt actcttctat gacctctgac tctaaatcta tcgaaaacaa acaccagcgt 840
cgtctggtta aactgctgct gggtggtggt ggttctggtg gtggtggttc tggtggtggt 900
ggttctatgg acgttgtttc tctggacaaa ccgttcatgt acttcgaaga aatcgacaac 960
gaactggact acgaaccgga atctgctaac gaagttgcta aaaaactgcc gtaccagggt 1020
cagctgaaac tgctgctggg tgaactgttc ttcctgtcta aactgcagcg tcacggtatc 1080
ctggacggtg ctaccgttgt ttacatcggt tctgctccgg gtacccacat ccgttacctg 1140
cgtgaccact tctacaacct gggtgttatc atcaaatgga tgctgatcga cggtcgtcac 1200
cacgacccga tcctgaacgg tctgcgtgac gttaccctgg ttacccgttt cgttgacgaa 1260
gaatacctgc gttctatcaa aaaacagctg cacccgtcta aaatcatcct gatctctgac 1320
gttcgttcta aacgtggtgg taacgaaccg tctaccgctg acctgctgtc taactacgct 1380
ctgcagaacg ttatgatctc tatcctgaac ccggttgctt cttctctgaa atggcgttgc 1440
ccgttcccgg accagtggat caaagacttc tacatcccgc acggtaacaa aatgctgcag 1500
ccgttcgctc cgtcttactc tgctgaaatg cgtctgctgt ctatctacac cggtgaaaac 1560
atgcgtctga cccgtgttac caaatctgac gctgttaact acgaaaaaaa aatgtactac 1620
ctgaacaaaa tcgttcgtaa caaagttgtt gttaacttcg actacccgaa ccaggaatac 1680
gactacttcc acatgtactt catgctgcgt accgtttact gcaacaaaac cttcccgacc 1740
accaaagcta aagttctgtt cctgcagcag tctatcttcc gtttcctgaa catcccgacc 1800
acctctaccg aaaaagtttc tcacgaaccg atccagcgta aaatctcttc taaaaactct 1860
atgtctaaaa accgtaactc taaacgttct gttcgttcta acaaataa 1908
<210> 3
<211> 844
<212> PRT
<213> Artificial Sequence
<400> 3
Met Asp Ala Asn Val Val Ser Ser Ser Thr Ile Ala Thr Tyr Ile Asp
1 5 10 15
Ala Leu Ala Lys Asn Ala Ser Glu Leu Glu Gln Arg Ser Thr Ala Tyr
20 25 30
Glu Ile Asn Asn Glu Leu Glu Leu Val Phe Ile Lys Pro Pro Leu Ile
35 40 45
Thr Leu Thr Asn Val Val Asn Ile Ser Thr Ile Gln Glu Ser Phe Ile
50 55 60
Arg Phe Thr Val Thr Asn Lys Glu Gly Val Lys Ile Arg Thr Lys Ile
65 70 75 80
Pro Leu Ser Lys Val His Gly Leu Asp Val Lys Asn Val Gln Leu Val
85 90 95
Asp Ala Ile Asp Asn Ile Val Trp Glu Lys Lys Ser Leu Val Thr Glu
100 105 110
Asn Arg Leu His Lys Glu Cys Leu Leu Arg Leu Ser Thr Glu Glu Arg
115 120 125
His Ile Phe Leu Asp Tyr Lys Lys Tyr Gly Ser Ser Ile Arg Leu Glu
130 135 140
Leu Val Asn Leu Ile Gln Ala Lys Thr Lys Asn Phe Thr Ile Asp Phe
145 150 155 160
Lys Leu Lys Tyr Phe Leu Gly Ser Gly Ala Gln Ser Lys Ser Ser Leu
165 170 175
Leu His Ala Ile Asn His Pro Lys Ser Arg Pro Asn Thr Ser Leu Glu
180 185 190
Ile Glu Phe Thr Pro Arg Asp Asn Glu Thr Val Pro Tyr Asp Glu Leu
195 200 205
Ile Lys Glu Leu Thr Thr Leu Ser Arg His Ile Phe Met Ala Ser Pro
210 215 220
Glu Asn Val Ile Leu Ser Pro Pro Ile Asn Ala Pro Ile Lys Thr Phe
225 230 235 240
Met Leu Pro Lys Gln Asp Ile Val Gly Leu Asp Leu Glu Asn Leu Tyr
245 250 255
Ala Val Thr Lys Thr Asp Gly Ile Pro Ile Thr Ile Arg Val Thr Ser
260 265 270
Asn Gly Leu Tyr Cys Tyr Phe Thr His Leu Gly Tyr Ile Ile Arg Tyr
275 280 285
Pro Val Lys Arg Ile Ile Asp Ser Glu Val Val Val Phe Gly Glu Ala
290 295 300
Val Lys Asp Lys Asn Trp Thr Val Tyr Leu Ile Lys Leu Ile Glu Pro
305 310 315 320
Val Asn Ala Ile Asn Asp Arg Leu Glu Glu Ser Lys Tyr Val Glu Ser
325 330 335
Lys Leu Val Asp Ile Cys Asp Arg Ile Val Phe Lys Ser Lys Lys Tyr
340 345 350
Glu Gly Pro Phe Thr Thr Thr Ser Glu Val Val Asp Met Leu Ser Thr
355 360 365
Tyr Leu Pro Lys Gln Pro Glu Gly Val Ile Leu Phe Tyr Ser Lys Gly
370 375 380
Pro Lys Ser Asn Ile Asp Phe Lys Ile Lys Lys Glu Asn Thr Ile Asp
385 390 395 400
Gln Thr Ala Asn Val Val Phe Arg Tyr Met Ser Ser Glu Pro Ile Ile
405 410 415
Phe Gly Glu Ser Ser Ile Phe Val Glu Tyr Lys Lys Phe Ser Asn Asp
420 425 430
Lys Gly Phe Pro Lys Glu Tyr Gly Ser Gly Lys Ile Val Leu Tyr Asn
435 440 445
Gly Val Asn Tyr Leu Asn Asn Ile Tyr Cys Leu Glu Tyr Ile Asn Thr
450 455 460
His Asn Glu Val Gly Ile Lys Ser Val Val Val Pro Ile Lys Phe Ile
465 470 475 480
Ala Glu Phe Leu Val Asn Gly Glu Ile Leu Lys Pro Arg Ile Asp Lys
485 490 495
Thr Met Lys Tyr Ile Asn Ser Glu Asp Tyr Tyr Gly Asn Gln His Asn
500 505 510
Ile Ile Val Glu His Leu Arg Asp Gln Ser Ile Lys Ile Gly Asp Ile
515 520 525
Phe Asn Glu Asp Lys Leu Ser Asp Val Gly His Gln Tyr Ala Asn Asn
530 535 540
Asp Lys Phe Arg Leu Asn Pro Glu Val Ser Tyr Phe Thr Asn Lys Arg
545 550 555 560
Thr Arg Gly Pro Leu Gly Ile Leu Ser Asn Tyr Val Lys Thr Leu Leu
565 570 575
Ile Ser Met Tyr Cys Ser Lys Thr Phe Leu Asp Asp Ser Asn Lys Arg
580 585 590
Lys Val Leu Ala Ile Asp Phe Gly Asn Gly Ala Asp Leu Glu Lys Tyr
595 600 605
Phe Tyr Gly Glu Ile Ala Leu Leu Val Ala Thr Asp Pro Asp Ala Asp
610 615 620
Ala Ile Ala Arg Gly Asn Glu Arg Tyr Asn Lys Leu Asn Ser Gly Ile
625 630 635 640
Lys Thr Lys Tyr Tyr Lys Phe Asp Tyr Ile Gln Glu Thr Ile Arg Ser
645 650 655
Asp Thr Phe Val Ser Ser Val Arg Glu Val Phe Tyr Phe Gly Lys Phe
660 665 670
Asn Ile Ile Asp Trp Gln Phe Ala Ile His Tyr Ser Phe His Pro Arg
675 680 685
His Tyr Ala Thr Val Met Asn Asn Leu Ser Glu Leu Thr Ala Ser Gly
690 695 700
Gly Lys Val Leu Ile Thr Thr Met Asp Gly Asp Lys Leu Ser Lys Leu
705 710 715 720
Thr Asp Lys Lys Thr Phe Ile Ile His Lys Asn Leu Pro Ser Ser Glu
725 730 735
Asn Tyr Met Ser Val Glu Lys Ile Ala Asp Asp Arg Ile Val Val Tyr
740 745 750
Asn Pro Ser Thr Met Ser Thr Pro Met Thr Glu Tyr Ile Ile Lys Lys
755 760 765
Asn Asp Ile Val Arg Val Phe Asn Glu Tyr Gly Phe Val Leu Val Asp
770 775 780
Asn Val Asp Phe Ala Thr Ile Ile Glu Arg Ser Lys Lys Phe Ile Asn
785 790 795 800
Gly Ala Ser Thr Met Glu Asp Arg Pro Ser Thr Arg Asn Phe Phe Glu
805 810 815
Leu Asn Arg Gly Ala Ile Lys Cys Glu Gly Leu Asp Val Glu Asp Leu
820 825 830
Leu Ser Tyr Tyr Val Val Tyr Val Phe Ser Lys Arg
835 840
<210> 4
<211> 2535
<212> DNA
<213> Artificial Sequence
<400> 4
atggacgcta acgttgtttc ttcttctacc atcgctacct acatcgacgc tctggctaaa 60
aacgcttctg aactggaaca gcgttctacc gcttacgaaa tcaacaacga actggaactg 120
gttttcatca aaccgccgct gatcaccctg accaacgttg ttaacatctc taccatccag 180
gaatctttca tccgtttcac cgttaccaac aaagaaggtg ttaaaatccg taccaaaatc 240
ccgctgtcta aagttcacgg tctggacgtt aaaaacgttc agctggttga cgctatcgac 300
aacatcgttt gggaaaaaaa atctctggtt accgaaaacc gtctgcacaa agaatgcctg 360
ctgcgtctgt ctaccgaaga acgtcacatc ttcctggact acaaaaaata cggttcttct 420
atccgtctgg aactggttaa cctgatccag gctaaaacca aaaacttcac catcgacttc 480
aaactgaaat acttcctggg ttctggtgct cagtctaaat cttctctgct gcacgctatc 540
aaccacccga aatctcgtcc gaacacctct ctggaaatcg aattcacccc gcgtgacaac 600
gaaaccgttc cgtacgacga actgatcaaa gaactgacca ccctgtctcg tcacatcttc 660
atggcttctc cggaaaacgt tatcctgtct ccgccgatca acgctccgat caaaaccttc 720
atgctgccga aacaggacat cgttggtctg gacctggaaa acctgtacgc tgttaccaaa 780
accgacggta tcccgatcac catccgtgtt acctctaacg gtctgtactg ctacttcacc 840
cacctgggtt acatcatccg ttacccggtt aaacgtatca tcgactctga agttgttgtt 900
ttcggtgaag ctgttaaaga caaaaactgg accgtttacc tgatcaaact gatcgaaccg 960
gttaacgcta tcaacgaccg tctggaagaa tctaaatacg ttgaatctaa actggttgac 1020
atctgcgacc gtatcgtttt caaatctaaa aaatacgaag gtccgttcac caccacctct 1080
gaagttgttg acatgctgtc tacctacctg ccgaaacagc cggaaggtgt tatcctgttc 1140
tactctaaag gtccgaaatc taacatcgac ttcaaaatca aaaaagaaaa caccatcgac 1200
cagaccgcta acgttgtttt ccgttacatg tcttctgaac cgatcatctt cggtgaatct 1260
tctatcttcg ttgaatacaa aaaattctct aacgacaaag gtttcccgaa agaatacggt 1320
tctggtaaaa tcgttctgta caacggtgtt aactacctga acaacatcta ctgcctggaa 1380
tacatcaaca cccacaacga agttggtatc aaatctgttg ttgttccgat caaattcatc 1440
gctgaattcc tggttaacgg tgaaatcctg aaaccgcgta tcgacaaaac catgaaatac 1500
atcaactctg aagactacta cggtaaccag cacaacatca tcgttgaaca cctgcgtgac 1560
cagtctatca aaatcggtga catcttcaac gaagacaaac tgtctgacgt tggtcaccag 1620
tacgctaaca acgacaaatt ccgtctgaac ccggaagttt cttacttcac caacaaacgt 1680
acccgtggtc cgctgggtat cctgtctaac tacgttaaaa ccctgctgat ctctatgtac 1740
tgctctaaaa ccttcctgga cgactctaac aaacgtaaag ttctggctat cgacttcggt 1800
aacggtgctg acctggaaaa atacttctac ggtgaaatcg ctctgctggt tgctaccgac 1860
ccggacgctg acgctatcgc tcgtggtaac gaacgttaca acaaactgaa ctctggtatc 1920
aaaaccaaat actacaaatt cgactacatc caggaaacca tccgttctga caccttcgtt 1980
tcttctgttc gtgaagtttt ctacttcggt aaattcaaca tcatcgactg gcagttcgct 2040
atccactact ctttccaccc gcgtcactac gctaccgtta tgaacaacct gtctgaactg 2100
accgcttctg gtggtaaagt tctgatcacc accatggacg gtgacaaact gtctaaactg 2160
accgacaaaa aaaccttcat catccacaaa aacctgccgt cttctgaaaa ctacatgtct 2220
gttgaaaaaa tcgctgacga ccgtatcgtt gtttacaacc cgtctaccat gtctaccccg 2280
atgaccgaat acatcatcaa aaaaaacgac atcgttcgtg ttttcaacga atacggtttc 2340
gttctggttg acaacgttga cttcgctacc atcatcgaac gttctaaaaa attcatcaac 2400
ggtgcttcta ccatggaaga ccgtccgtct acccgtaact tcttcgaact gaaccgtggt 2460
gctatcaaat gcgaaggtct ggacgttgaa gacctgctgt cttactacgt tgtttacgtt 2520
ttctctaaac gttaa 2535
<210> 5
<211> 50
<212> DNA
<213> Artificial Sequence
<400> 5
gataatacga ctcactatag ggaaggagag gaaggaaagg gaagaaagaa 50
<210> 6
<211> 50
<212> DNA
<213> Artificial Sequence
<400> 6
ttctttcttc cctttccttc ctctccttcc ctatagtgag tcgtattatc 50
<210> 7
<211> 1173
<212> DNA
<213> Artificial Sequence
<400> 7
tctagataat acgactcact atagggagaa ttcgccacca tggtgagcaa gggcgaggag 60
ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa cggccacaag 120
ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac cctgaagttc 180
atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac cctgacctac 240
ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt cttcaagtcc 300
gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga cggcaactac 360
aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat cgagctgaag 420
ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta caactacaac 480
agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt gaacttcaag 540
atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca gcagaacacc 600
cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac ccagtccgcc 660
ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc 720
gccgggatca ctctcggcat ggacgagctg tacaagtaag gatcctgcac tagtgctgtc 780
gacgctcgct ttcttgctgt ccaatttcta ttaaaggttc ctttgttccc taagtccaac 840
tactaaactg ggggatatta tgaagggcct tgagcatctg gattctgcct aataaaaaac 900
atttattttc attgcgctcg ctttcttgct gtccaatttc tattaaaggt tcctttgttc 960
cctaagtcca actactaaac tgggggatat tatgaagggc cttgagcatc tggattctgc 1020
ctaataaaaa acatttattt tcattgcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1080
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1140
aaaaaaaaaa aaaaaaaaaa aaaaaaaggt acc 1173
<210> 8
<211> 6
<212> PRT
<213> Artificial Sequence
<400> 8
His His His His His His
1 5
<210> 9
<211> 370
<212> PRT
<213> Artificial Sequence
<400> 9
Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys Gly
1 5 10 15
Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr Gly
20 25 30
Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe Pro
35 40 45
Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala His
50 55 60
Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile Thr
65 70 75 80
Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp Ala
85 90 95
Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu Ala
100 105 110
Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys Thr
115 120 125
Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly Lys
130 135 140
Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro Leu
145 150 155 160
Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys Tyr
165 170 175
Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly Leu
180 185 190
Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp Thr
195 200 205
Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala Met
210 215 220
Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys Val
225 230 235 240
Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser Lys
245 250 255
Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro Asn
260 265 270
Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp Glu
275 280 285
Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala Leu
290 295 300
Lys Ser Tyr Glu Glu Glu Leu Val Lys Asp Pro Arg Ile Ala Ala Thr
305 310 315 320
Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln Met
325 330 335
Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala Ser
340 345 350
Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr Gly Ala
355 360 365
Ser Thr
370
Claims (12)
1. A chimeric protein subunit comprising, fused to (a) the D12 subunit of an RNA capping enzyme or a functional fragment thereof, or a variant thereof having at least 90% sequence identity thereto and having D12 subunit activity, and (b) an RNA cap structure 2 '-O-methyltransferase or a functional fragment thereof, or a variant thereof having at least 90% sequence identity thereto and having RNA cap structure 2' -O-methyltransferase activity, and optionally (c) a linker between (a) and (b),
preferably, the first and second electrodes are formed of a metal,
the RNA capping enzyme is a vaccinia virus RNA capping enzyme,
the RNA cap structure 2 '-O-methyltransferase is vaccinia virus 2' -O-methyltransferase,
(a) The carboxy terminus of (b) is linked to the amino terminus of (b).
2. The chimeric protein subunit of claim 1,
the amino acid sequence of the D12 subunit of the RNA capping enzyme is shown in SEQ ID NO 1, 1-287, and/or
The amino acid sequence of the RNA cap structure 2' -O-methyltransferase is shown as SEQ ID NO. 1, 303-635, and/or
A variant of the D12 subunit of the RNA capping enzyme or a functional fragment thereof has a mutation selected from one or more of: N42A, Y43A, L61A, K62A, F245A, L246A, K111A, R112A, N120A, N121A, N126A, N127A, F141A, R142A, K223A, D224A, H260A, S261A, E275A, N276A, R280A, R281A, and/or
A variant of RNA cap structure 2' -O-methyltransferase or a functional fragment thereof has the following characteristics: (1) a mutation selected from one or more of: K41D, C178S, a201R, a201K, C272S; and/or (2) one or more amino acids selected from R, K, H, Y, C, D or E are mutated to A, and/or
(b) Further has a His tag and/or MBP tag at its N-or C-terminus,
the amino acid sequence of the linker is shown as the 288 th-302 th amino acid of SEQ ID NO. 1,
preferably, the amino acid sequence of the subunit of the chimeric protein is shown in SEQ ID NO 1.
3. A chimeric protein comprising: (1) The chimeric protein subunit of claim 1 or 2, and (2) an RNA capping enzyme D1 subunit or a functional fragment thereof, or a variant thereof having at least 90% sequence identity thereto and corresponding activity,
preferably, the first and second liquid crystal display panels are,
the chimeric protein is a heterodimeric protein, and/or
The RNA capping enzyme is a vaccinia virus RNA capping enzyme, and/or
Functional fragments of the D1 subunit of RNA capping enzyme comprise:
(1) An N7 methyltransferase domain or a functional fragment thereof;
(2) An N7 methyltransferase domain or a functional fragment thereof and a5' -triphosphatase domain or a functional fragment thereof;
(3) An N7 methyltransferase domain or functional fragment thereof and a guanosine transferase domain or functional fragment thereof; or
(4) An N7 methyltransferase domain or a functional fragment thereof, a5' -triphosphatase domain or a functional fragment thereof, and a guanosine transferase domain or a functional fragment thereof.
4. The chimeric protein of claim 3,
the 5' -triphosphatase domain is shown as amino acids 1-225 of SEQ ID NO. 3, and/or
The guanosine transferase domain is shown as SEQ ID NO:3 amino acids 226-530, and/or
The structure domain of the N7 methyltransferase is shown as amino acids 531-844 of SEQ ID NO. 3, and/or
The subunit of the RNA capping enzyme D1 is shown as SEQ ID NO 3, and/or
The functional fragment of the D1 subunit of the RNA capping enzyme comprises amino acids 498-844 or 540-844 of SEQ ID NO. 3, and/or
A variant of the N7-methyltransferase domain or a functional fragment thereof has a mutation selected from one or more of: D545A, R548A, N550D, Y555F, R560K, R794A, R808A, Y683S, Y684A, Y684F, D598A, G600A, G602A, I681A, S684A, F685A, T571A, L575A, L576A, M579A, F585A, L586A, D587A, D784A, N785A, R794A, F798A, M805A, E806A.
5. A nucleic acid molecule comprising a sequence selected from:
(1) The coding sequence of a subunit of the chimeric protein of claim 1 or 2, or the coding sequence of the chimeric protein of claim 3 or 4,
(2) A variant having at least 80% sequence identity to (1),
(3) The complementary sequence of (1) or (2),
preferably, the first and second liquid crystal display panels are,
the coding sequence of the chimeric protein subunit is shown as SEQ ID NO. 2, or a variant having at least 80% sequence identity with the same, or a degenerate variant encoding the same amino acid sequence with the same, and/or
The coding sequence of the chimeric protein, the coding sequence of the RNA capping enzyme D1 subunit or a functional fragment thereof is shown as SEQ ID NO. 4, or a variant having at least 80% sequence identity therewith, or a degenerate variant encoding the same amino acid sequence therewith.
6. A nucleic acid construct, said nucleic acid construct:
(1) Expressing the chimeric protein subunit of claim 1 or 2, or the chimeric protein of claim 3 or 4,
(2) Comprising the nucleic acid molecule of claim 5,
preferably, the first and second liquid crystal display panels are,
the nucleic acid construct comprises an expression cassette for the chimeric protein subunit and an expression cassette for the RNA capping enzyme D1 subunit or a functional fragment thereof; or the nucleic acid construct is an expression cassette, wherein the coding sequence for the chimeric protein subunit and the coding sequence for the D1 subunit of RNA capping enzyme or a functional fragment thereof are in the expression cassette, and/or
The nucleic acid construct is a cloning vector or an expression vector.
7. A host cell comprising, expressing and/or secreting a chimeric protein subunit of claim 1 or 2, or a chimeric protein of claim 3 or 4,
preferably, the host cell comprises the nucleic acid molecule of claim 5 and/or the nucleic acid construct of claim 6.
8. A method of modifying a target RNA to a capped RNA or a method of preparing a capped target RNA, comprising: contacting a target RNA with the chimeric protein subunit of claim 1 or 2 or the chimeric protein of claim 3 or 4, or contacting a target RNA with a cell comprising, expressing and/or secreting the chimeric protein subunit of claim 1 or 2 or the chimeric protein of claim 3 or 4, under conditions that allow for RNA capping,
preferably, the first and second electrodes are formed of a metal,
contacting the target RNA with the chimeric protein subunit or chimeric protein comprises expressing the target RNA in a cell expressing the chimeric protein subunit or chimeric protein, or mixing the target RNA with the chimeric protein subunit or chimeric protein, and/or
The method comprises the step of introducing into the cell a DNA sequence expressing an RNA of interest.
9. The method of claim 8, wherein the method comprises the steps of:
optionally (1) denaturing the target RNA,
optionally (2) renaturing the target RNA,
(3) Incubating RNA with a chimeric protein subunit of claim 1 or 2 or a chimeric protein of claim 3 or 4 under conditions that allow for RNA capping to obtain capped RNA,
optionally (4) purifying the capped RNA,
preferably, the first and second liquid crystal display panels are,
the conditions that allow RNA capping are incubation at 37 ℃ for at least 20 minutes, and/or
The conditions that allow RNA capping further comprise the presence of one or more agents selected from the group consisting of: GTP, SAM, buffer, and/or
The target RNA is selected from the group consisting of 5' -triphosphorylated RNA, 5' -diphosphonated RNA, and RNA having a5' -Gppp cap structure.
10. The method of claim 8 or 9,
the method modifies a target RNA selected from the group consisting of 5'-m7 Gppp-structured capped RNA, 5' -triphosphorylated RNA, 5 '-diphosphonated RNA, and RNA having a5' -Gppp cap structure into a capped RNA having a 5'-m7Gppp structure, the chimeric protein comprises an N7 methyltransferase domain of a D1 subunit of an RNA capping enzyme or a functional fragment thereof, a5' -triphosphatase domain or a functional fragment thereof, a guanosine transferase domain or a functional fragment thereof, and the chimeric protein subunit; preferably, the chimeric protein comprises an RNA capping enzyme D1 subunit shown as SEQ ID NO. 3 and a chimeric protein subunit shown as SEQ ID NO. 1,
the method modifies a target RNA selected from the group consisting of 5'-m7 gppnp-structured capped RNA, 5' -triphosphorylated RNA, 5 '-diphosphonated RNA, and RNA having a5' -Gppp cap structure into a capped RNA having a 5'-m7 gppnp structure, the chimeric protein comprising an N7 methyltransferase (N7-MTase) domain of RNA-capping enzyme D1 subunit or a functional fragment thereof, a5' -triphosphatase (TPase) domain or a functional fragment thereof, and a guanosine transferase (GTase) domain or a functional fragment thereof, and the chimeric protein subunit; preferably, the chimeric protein comprises an RNA capping enzyme D1 subunit shown as SEQ ID NO. 3 and a chimeric protein subunit shown as SEQ ID NO. 1,
the method is a method of capping a target RNA selected from the group consisting of 5 '-diphosphonated RNA and RNA having a5' -Gppp structure, the chimeric protein comprising an N7 methyltransferase (N7-MTase) domain of the D1 subunit of RNA capping enzyme or a functional fragment thereof and a guanosine transferase (GTase) domain or a functional fragment thereof, and the chimeric protein subunit; preferably, the subunit of the chimeric protein is represented by SEQ ID NO 1, or
The method is a method of capping a target RNA that is an RNA having a5' -Gppp structure, the chimeric protein comprises an N7 methyltransferase (N7-MTase) domain of the D1 subunit of the RNA capping enzyme, or a functional fragment thereof, and the chimeric protein subunit; preferably, the subunit of the chimeric protein is shown as SEQ ID NO: 1.
11. A fusion protein comprising
(a) An RNA cap structure 2 '-O-methyltransferase or a functional fragment thereof or a variant thereof having at least 90% sequence identity and having RNA cap structure 2' -O-methyltransferase activity, and
(b) A His-tag and/or MBP-tag at (a) the N-or C-terminus,
preferably, the first and second electrodes are formed of a metal,
(a) And (b) there is a joint between them,
the RNA cap structure 2 '-O-methyltransferase is vaccinia virus 2' -O-methyltransferase,
the amino acid sequence of the RNA cap structure 2' -O-methyltransferase is shown as SEQ ID NO. 1, 303-635,
the variant of said RNA cap structure 2' -O-methyltransferase or a functional fragment thereof has the following characteristics: (1) a mutation selected from one or more of: K41D, C178S, a201R, a201K, C272S; and/or (2) one or more amino acids selected from R, K, H, Y, C, D or E are mutated to A, and/or
His tag is shown as SEQ ID NO:8, and/or
The MBP tag is shown as SEQ ID NO 9.
12. A nucleic acid sequence encoding the fusion protein of claim 11, a nucleic acid construct or a host cell comprising the nucleic acid sequence.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110379093.3A CN115197327A (en) | 2021-04-08 | 2021-04-08 | RNA modified chimeric protein and application thereof |
PCT/CN2022/085760 WO2022214065A1 (en) | 2021-04-08 | 2022-04-08 | Rna-modifying chimeric protein and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110379093.3A CN115197327A (en) | 2021-04-08 | 2021-04-08 | RNA modified chimeric protein and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115197327A true CN115197327A (en) | 2022-10-18 |
Family
ID=83545070
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110379093.3A Pending CN115197327A (en) | 2021-04-08 | 2021-04-08 | RNA modified chimeric protein and application thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115197327A (en) |
WO (1) | WO2022214065A1 (en) |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2377938A1 (en) * | 2010-04-16 | 2011-10-19 | Eukarys | Capping-prone RNA polymerase enzymes and their applications |
EP3303575B1 (en) * | 2015-05-29 | 2022-03-16 | CureVac AG | Method for adding cap structures to rna using immobilized enzymes |
WO2019020811A1 (en) * | 2017-07-27 | 2019-01-31 | Eukarys | New chimeric enzymes and their applications |
WO2019175356A1 (en) * | 2018-03-15 | 2019-09-19 | Biontech Rna Pharmaceuticals Gmbh | 5'-cap-trinucleotide- or higher oligonucleotide compounds and their uses in stabilizing rna, expressing proteins and in therapy |
CN111893128A (en) * | 2020-06-24 | 2020-11-06 | 苏州市泽悦生物技术有限公司 | Method for preparing recombinant eukaryotic mRNA by using prokaryotic transcription system and application thereof |
-
2021
- 2021-04-08 CN CN202110379093.3A patent/CN115197327A/en active Pending
-
2022
- 2022-04-08 WO PCT/CN2022/085760 patent/WO2022214065A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2022214065A1 (en) | 2022-10-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8003346B2 (en) | Mutant PCNA | |
KR20110044753A (en) | Rna polymerase mutant with improved functions | |
US20180100006A1 (en) | Method for the expression of polypeptides using modified nucleic acids | |
CN114423871A (en) | Template-free enzymatic synthesis of polynucleotides using poly (A) and poly (U) polymerases | |
MXPA04005717A (en) | Expression system. | |
WO2018004014A1 (en) | Recombinant protein having transglutaminase activity | |
CN115197327A (en) | RNA modified chimeric protein and application thereof | |
US20240101983A1 (en) | Programmable rna editing platform | |
CN115703842A (en) | Base editor for efficient and highly accurate cytosine C to guanine G conversion | |
CN114107303A (en) | sgRNA, plasmid, IRF7 function-deficient cell line and construction method and application thereof | |
CN114645033A (en) | Nucleoside triphosphate hydrolase and purification method and application thereof | |
CN107460177A (en) | Using the RNA polymerase mutant of chemically modified nucleoside acid | |
KR100441201B1 (en) | Reporter gene-containing plasmid which is convertible to T-vector and the preparation method thereof | |
EP2109671B1 (en) | Expression cassette, use of the expression cassette, vector, host cell, a method for producing a polypeptide | |
CN114207140A (en) | Terminal deoxynucleotidyl transferase variants and uses thereof | |
CN115261363B (en) | Method for measuring RNA deaminase activity of APOBEC3A and RNA high-activity APOBEC3A variant | |
US20040209323A1 (en) | Protein expression by codon harmonization and translational attenuation | |
US20230279464A1 (en) | Biosensors for selectively identifying azide ions | |
US20230287457A1 (en) | Type i-c crispr system from neisseria lactamica and methods of use | |
RU2740715C2 (en) | Modified groel protein of microorganism thermus thermophilus | |
KR20220145324A (en) | Use of CAS9 protein from bacterium Pasteurella pneumotropica | |
WO2023039434A1 (en) | Systems and methods for transposing cargo nucleotide sequences | |
KR20240051994A (en) | Systems, compositions, and methods comprising retrotransposons and functional fragments thereof | |
CN116410952A (en) | Mutant Taq DNA polymerase, coding gene, recombinant expression vector, recombinant bacterium and application thereof | |
CN113073094A (en) | Single-base mutation system based on cytidine deaminase LjCDA1L1_4a and mutants thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Country or region after: China Address after: 201805 No. 1585 Garden National Road, Anting Town, Jiading District, Shanghai Applicant after: Shanghai Cell Therapy Group Co.,Ltd. Address before: 201805 No. 1585 Garden National Road, Anting Town, Jiading District, Shanghai Applicant before: SHANGHAI CELL THERAPY GROUP Co.,Ltd. Country or region before: China |