CN113652430A - 一种光控rna代谢调控系统 - Google Patents
一种光控rna代谢调控系统 Download PDFInfo
- Publication number
- CN113652430A CN113652430A CN202111024451.5A CN202111024451A CN113652430A CN 113652430 A CN113652430 A CN 113652430A CN 202111024451 A CN202111024451 A CN 202111024451A CN 113652430 A CN113652430 A CN 113652430A
- Authority
- CN
- China
- Prior art keywords
- rna
- gly
- light
- val
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000004060 metabolic process Effects 0.000 title claims abstract description 82
- 230000033228 biological regulation Effects 0.000 title claims abstract description 67
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 146
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 141
- 229920001184 polypeptide Polymers 0.000 claims abstract description 140
- 239000012636 effector Substances 0.000 claims abstract description 89
- 230000001105 regulatory effect Effects 0.000 claims abstract description 85
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 70
- 239000013604 expression vector Substances 0.000 claims abstract description 44
- 238000000034 method Methods 0.000 claims abstract description 37
- 108091027981 Response element Proteins 0.000 claims abstract description 33
- 230000001276 controlling effect Effects 0.000 claims abstract description 17
- 230000004570 RNA-binding Effects 0.000 claims abstract description 10
- 108090000623 proteins and genes Proteins 0.000 claims description 162
- 210000004027 cell Anatomy 0.000 claims description 136
- 102000004169 proteins and genes Human genes 0.000 claims description 89
- 230000027455 binding Effects 0.000 claims description 87
- 230000000694 effects Effects 0.000 claims description 49
- 238000005286 illumination Methods 0.000 claims description 40
- 101710163270 Nuclease Proteins 0.000 claims description 38
- 239000002773 nucleotide Substances 0.000 claims description 37
- 125000003729 nucleotide group Chemical group 0.000 claims description 37
- 238000013519 translation Methods 0.000 claims description 30
- 238000013518 transcription Methods 0.000 claims description 27
- 102000044126 RNA-Binding Proteins Human genes 0.000 claims description 26
- 239000013612 plasmid Substances 0.000 claims description 25
- 101710159080 Aconitate hydratase A Proteins 0.000 claims description 12
- 101710159078 Aconitate hydratase B Proteins 0.000 claims description 12
- 101710105008 RNA-binding protein Proteins 0.000 claims description 12
- 230000004807 localization Effects 0.000 claims description 12
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 11
- 102100032157 Adenylate cyclase type 10 Human genes 0.000 claims description 7
- 101000775498 Homo sapiens Adenylate cyclase type 10 Proteins 0.000 claims description 7
- 241000588724 Escherichia coli Species 0.000 claims description 6
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 6
- 230000001580 bacterial effect Effects 0.000 claims description 6
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 6
- 241000221961 Neurospora crassa Species 0.000 claims description 5
- 241000589776 Pseudomonas putida Species 0.000 claims description 4
- 108090000944 RNA Helicases Proteins 0.000 claims description 4
- 102000004409 RNA Helicases Human genes 0.000 claims description 4
- 210000003463 organelle Anatomy 0.000 claims description 4
- 230000002165 photosensitisation Effects 0.000 claims description 4
- 239000003504 photosensitizing agent Substances 0.000 claims description 4
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 4
- 102000028701 rRNA binding proteins Human genes 0.000 claims description 4
- 108091009356 rRNA binding proteins Proteins 0.000 claims description 4
- 102000021501 regulatory RNA binding proteins Human genes 0.000 claims description 4
- 108091011116 regulatory RNA binding proteins Proteins 0.000 claims description 4
- 108091092562 ribozyme Proteins 0.000 claims description 4
- 102000019694 tRNA binding proteins Human genes 0.000 claims description 4
- 108091016288 tRNA binding proteins Proteins 0.000 claims description 4
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 3
- 244000063299 Bacillus subtilis Species 0.000 claims description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 3
- 108700043340 Bacillus subtilis GlcT Proteins 0.000 claims description 3
- 241000191965 Staphylococcus carnosus Species 0.000 claims description 3
- 108700006850 Bacillus subtilis SacY Proteins 0.000 claims description 2
- 102000004190 Enzymes Human genes 0.000 claims description 2
- 108090000790 Enzymes Proteins 0.000 claims description 2
- 230000025545 Golgi localization Effects 0.000 claims description 2
- 241000186660 Lactobacillus Species 0.000 claims description 2
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 claims description 2
- 230000007711 cytoplasmic localization Effects 0.000 claims description 2
- 230000030583 endoplasmic reticulum localization Effects 0.000 claims description 2
- 229940039696 lactobacillus Drugs 0.000 claims description 2
- 230000025608 mitochondrion localization Effects 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 108700013958 Lactobacillus casei LacT Proteins 0.000 claims 1
- 241000228347 Monascus <ascomycete fungus> Species 0.000 claims 1
- 238000006384 oligomerization reaction Methods 0.000 claims 1
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 29
- 230000002503 metabolic effect Effects 0.000 abstract description 21
- 230000008901 benefit Effects 0.000 abstract description 2
- 108020004566 Transfer RNA Proteins 0.000 description 432
- 239000012634 fragment Substances 0.000 description 85
- 235000018102 proteins Nutrition 0.000 description 67
- 239000013598 vector Substances 0.000 description 53
- 238000011144 upstream manufacturing Methods 0.000 description 51
- 108020004999 messenger RNA Proteins 0.000 description 34
- 230000015556 catabolic process Effects 0.000 description 32
- 238000006243 chemical reaction Methods 0.000 description 32
- 238000006731 degradation reaction Methods 0.000 description 32
- 108020004414 DNA Proteins 0.000 description 25
- 230000014509 gene expression Effects 0.000 description 22
- 230000035897 transcription Effects 0.000 description 22
- 230000014621 translational initiation Effects 0.000 description 20
- 108010044843 Peptide Initiation Factors Proteins 0.000 description 19
- 102000005877 Peptide Initiation Factors Human genes 0.000 description 19
- 238000010367 cloning Methods 0.000 description 19
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 19
- 108020001507 fusion proteins Proteins 0.000 description 19
- 102000037865 fusion proteins Human genes 0.000 description 19
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 18
- 108010012058 leucyltyrosine Proteins 0.000 description 18
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 17
- 150000001413 amino acids Chemical class 0.000 description 16
- 238000001514 detection method Methods 0.000 description 16
- 108010050848 glycylleucine Proteins 0.000 description 15
- 238000003199 nucleic acid amplification method Methods 0.000 description 15
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 14
- 108010047857 aspartylglycine Proteins 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 13
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 13
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 13
- 230000003321 amplification Effects 0.000 description 13
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 12
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 12
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 12
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 12
- 239000000047 product Substances 0.000 description 12
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 11
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 11
- 239000013613 expression plasmid Substances 0.000 description 11
- 108010049041 glutamylalanine Proteins 0.000 description 11
- 108010053725 prolylvaline Proteins 0.000 description 11
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 10
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 10
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 10
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 10
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 10
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 10
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 10
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 10
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 10
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 10
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 10
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 10
- 235000001014 amino acid Nutrition 0.000 description 10
- 229940024606 amino acid Drugs 0.000 description 10
- 230000006870 function Effects 0.000 description 10
- 108010005942 methionylglycine Proteins 0.000 description 10
- 230000002829 reductive effect Effects 0.000 description 10
- 102200120948 rs1057520695 Human genes 0.000 description 10
- 238000001890 transfection Methods 0.000 description 10
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 9
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 9
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 9
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 9
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 9
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 9
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 9
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 9
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 9
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 9
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 9
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 9
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 9
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 9
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 9
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 9
- ZFVWWUILVLLVFA-AVGNSLFASA-N Phe-Gln-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N ZFVWWUILVLLVFA-AVGNSLFASA-N 0.000 description 9
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 9
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 9
- 230000014632 RNA localization Effects 0.000 description 9
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 9
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 9
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 9
- AVFGBGGRZOKSFS-KJEVXHAQSA-N Tyr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O AVFGBGGRZOKSFS-KJEVXHAQSA-N 0.000 description 9
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 9
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 9
- FVTCRASFADXXNN-SCRDCRAPSA-N flavin mononucleotide Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-SCRDCRAPSA-N 0.000 description 9
- 229940013640 flavin mononucleotide Drugs 0.000 description 9
- 239000011768 flavin mononucleotide Substances 0.000 description 9
- FVTCRASFADXXNN-UHFFFAOYSA-N flavin mononucleotide Natural products OP(=O)(O)OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-UHFFFAOYSA-N 0.000 description 9
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 238000011160 research Methods 0.000 description 9
- 235000019231 riboflavin-5'-phosphate Nutrition 0.000 description 9
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 8
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 8
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 8
- 241000894006 Bacteria Species 0.000 description 8
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 8
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 8
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 8
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 8
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 8
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 8
- 102000015097 RNA Splicing Factors Human genes 0.000 description 8
- 108010039259 RNA Splicing Factors Proteins 0.000 description 8
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 8
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 8
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 8
- 108010068265 aspartyltyrosine Proteins 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 7
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 7
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 7
- 239000006002 Pepper Substances 0.000 description 7
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 7
- 238000006471 dimerization reaction Methods 0.000 description 7
- 230000004049 epigenetic modification Effects 0.000 description 7
- 108010020532 tyrosyl-proline Proteins 0.000 description 7
- 235000002566 Capsicum Nutrition 0.000 description 6
- 241000196324 Embryophyta Species 0.000 description 6
- 241000206602 Eukaryota Species 0.000 description 6
- 241000722363 Piper Species 0.000 description 6
- 235000016761 Piper aduncum Nutrition 0.000 description 6
- 235000017804 Piper guineense Nutrition 0.000 description 6
- 235000008184 Piper nigrum Nutrition 0.000 description 6
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 6
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 238000011161 development Methods 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- 239000005090 green fluorescent protein Substances 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 6
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 5
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 5
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 5
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 5
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 5
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 5
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 5
- 108060001084 Luciferase Proteins 0.000 description 5
- 239000005089 Luciferase Substances 0.000 description 5
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 5
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 5
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 5
- 108091023040 Transcription factor Proteins 0.000 description 5
- 102000040945 Transcription factor Human genes 0.000 description 5
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 5
- 108010041407 alanylaspartic acid Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 238000004113 cell culture Methods 0.000 description 5
- VWWQXMAJTJZDQX-UYBVJOGSSA-N flavin adenine dinucleotide Chemical compound C1=NC2=C(N)N=CN=C2N1[C@@H]([C@H](O)[C@@H]1O)O[C@@H]1CO[P@](O)(=O)O[P@@](O)(=O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C2=NC(=O)NC(=O)C2=NC2=C1C=C(C)C(C)=C2 VWWQXMAJTJZDQX-UYBVJOGSSA-N 0.000 description 5
- 235000019162 flavin adenine dinucleotide Nutrition 0.000 description 5
- 239000011714 flavin adenine dinucleotide Substances 0.000 description 5
- 229940093632 flavin-adenine dinucleotide Drugs 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 238000003384 imaging method Methods 0.000 description 5
- 150000007523 nucleic acids Chemical class 0.000 description 5
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 4
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 4
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 4
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 4
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 4
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- WSWAUVHXQREQQG-JYJNAYRXSA-N His-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O WSWAUVHXQREQQG-JYJNAYRXSA-N 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 4
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 4
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 4
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 4
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 4
- 238000000692 Student's t-test Methods 0.000 description 4
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 4
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 4
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 4
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 4
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 4
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 4
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 4
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 230000003190 augmentative effect Effects 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 238000010353 genetic engineering Methods 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108091005949 mKalama1 Proteins 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 108010054624 red fluorescent protein Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 4
- 230000032258 transport Effects 0.000 description 4
- 210000005253 yeast cell Anatomy 0.000 description 4
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 3
- 101001082110 Acanthamoeba polyphaga mimivirus Eukaryotic translation initiation factor 4E homolog Proteins 0.000 description 3
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 3
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 3
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 3
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 3
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 3
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 3
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 3
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 3
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 3
- 244000075850 Avena orientalis Species 0.000 description 3
- 235000007319 Avena orientalis Nutrition 0.000 description 3
- 108010016529 Bacillus amyloliquefaciens ribonuclease Proteins 0.000 description 3
- 108090000994 Catalytic RNA Proteins 0.000 description 3
- 102000053642 Catalytic RNA Human genes 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 101001082109 Danio rerio Eukaryotic translation initiation factor 4E-1B Proteins 0.000 description 3
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 3
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 3
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 3
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 3
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 3
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 3
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 3
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 3
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 3
- 101710161044 Phytochrome 1 Proteins 0.000 description 3
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 3
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 3
- 108020003224 Small Nucleolar RNA Proteins 0.000 description 3
- 102000042773 Small Nucleolar RNA Human genes 0.000 description 3
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 3
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 3
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 3
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 3
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 3
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 3
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 3
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 3
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 210000004102 animal cell Anatomy 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 230000008827 biological function Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 210000004020 intracellular membrane Anatomy 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 108091027963 non-coding RNA Proteins 0.000 description 3
- 102000042567 non-coding RNA Human genes 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000007115 recruitment Effects 0.000 description 3
- 238000007363 ring formation reaction Methods 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 230000037423 splicing regulation Effects 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 2
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 2
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 2
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 2
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 2
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 2
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 2
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 2
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108010037139 Cryptochromes Proteins 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- 241001534811 Erythrobacter litoralis Species 0.000 description 2
- 108060002716 Exonuclease Proteins 0.000 description 2
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 2
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 2
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 2
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 2
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 2
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 2
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 2
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 102000017013 Heterogeneous Nuclear Ribonucleoprotein A1 Human genes 0.000 description 2
- 108010014594 Heterogeneous Nuclear Ribonucleoprotein A1 Proteins 0.000 description 2
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 2
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 2
- YYOCMTFVGKDNQP-IHRRRGAJSA-N His-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YYOCMTFVGKDNQP-IHRRRGAJSA-N 0.000 description 2
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 2
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 2
- 101001063514 Homo sapiens Telomerase-binding protein EST1A Proteins 0.000 description 2
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 2
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- 244000199866 Lactobacillus casei Species 0.000 description 2
- 235000013958 Lactobacillus casei Nutrition 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 2
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 2
- 108700011259 MicroRNAs Proteins 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 101150056369 PIN gene Proteins 0.000 description 2
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- LYCOGHUNJCETDK-JYJNAYRXSA-N Phe-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N LYCOGHUNJCETDK-JYJNAYRXSA-N 0.000 description 2
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 2
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 2
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 2
- 102000039471 Small Nuclear RNA Human genes 0.000 description 2
- 108700026226 TATA Box Proteins 0.000 description 2
- 102100031022 Telomerase-binding protein EST1A Human genes 0.000 description 2
- 108091046869 Telomeric non-coding RNA Proteins 0.000 description 2
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 2
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 2
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 2
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- 238000007259 addition reaction Methods 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 238000010170 biological method Methods 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000009087 cell motility Effects 0.000 description 2
- RTIXKCRFFJGDFG-UHFFFAOYSA-N chrysin Chemical compound C=1C(O)=CC(O)=C(C(C=2)=O)C=1OC=2C1=CC=CC=C1 RTIXKCRFFJGDFG-UHFFFAOYSA-N 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 101150114135 eIF4E gene Proteins 0.000 description 2
- 230000014789 establishment of RNA localization Effects 0.000 description 2
- 102000013165 exonuclease Human genes 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 238000001917 fluorescence detection Methods 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000003301 hydrolyzing effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 238000007852 inverse PCR Methods 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 229940017800 lactobacillus casei Drugs 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108091029842 small nuclear ribonucleic acid Proteins 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000002195 synergetic effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010066587 tRNA Methyltransferases Proteins 0.000 description 2
- 102000018477 tRNA Methyltransferases Human genes 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- 102220497337 14-3-3 protein zeta/delta_R59A_mutation Human genes 0.000 description 1
- HXUVTXPOZRFMOY-NSHDSACASA-N 2-[[(2s)-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 HXUVTXPOZRFMOY-NSHDSACASA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- NOIIUHRQUVNIDD-UHFFFAOYSA-N 3-[[oxo(pyridin-4-yl)methyl]hydrazo]-N-(phenylmethyl)propanamide Chemical compound C=1C=CC=CC=1CNC(=O)CCNNC(=O)C1=CC=NC=C1 NOIIUHRQUVNIDD-UHFFFAOYSA-N 0.000 description 1
- RIGJCKIIBUZTCX-OVCLIPMQSA-N 4-[(Z)-1-cyano-2-[5-[2-hydroxyethyl(methyl)amino]thieno[3,2-b]thiophen-2-yl]ethenyl]benzonitrile Chemical compound CN(CCO)C1=CC2=C(S1)C=C(S2)/C=C(\C#N)/C3=CC=C(C=C3)C#N RIGJCKIIBUZTCX-OVCLIPMQSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- NYCXYKOXLNBYID-UHFFFAOYSA-N 5,7-Dihydroxychromone Natural products O1C=CC(=O)C=2C1=CC(O)=CC=2O NYCXYKOXLNBYID-UHFFFAOYSA-N 0.000 description 1
- DQRZBOYXYFUFGA-UHFFFAOYSA-N 7-benzyl-4-[(4-hydroxyphenyl)methyl]-2,5,8-triazatetracyclo[7.7.0.02,6.010,15]hexadeca-1(9),3,5,7,10(15),11,13-heptaene-3,13-diol Chemical compound Oc1c(Cc2ccc(O)cc2)nc2c(Cc3ccccc3)nc3-c4ccc(O)cc4Cc3n12 DQRZBOYXYFUFGA-UHFFFAOYSA-N 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- PPSSXNJBIKWOLE-JNHSIYEJSA-N Ala-Phe-Trp-Asn Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(N)=O)C(O)=O)C1=CC=CC=C1 PPSSXNJBIKWOLE-JNHSIYEJSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- 241000223600 Alternaria Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- APXVSPGLYXFFSD-AJNGGQMLSA-N Asp-Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)N)CC1=CC=CC=C1 APXVSPGLYXFFSD-AJNGGQMLSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 1
- 102100032985 CCR4-NOT transcription complex subunit 7 Human genes 0.000 description 1
- 108050006912 CCR4-NOT transcription complex subunit 7 Proteins 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- XRTISHJEPHMBJG-SRVKXCTJSA-N Cys-Asp-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XRTISHJEPHMBJG-SRVKXCTJSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- VXLXATVURDNDCG-CIUDSAMLSA-N Cys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N VXLXATVURDNDCG-CIUDSAMLSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 229930182832 D-phenylalanine Natural products 0.000 description 1
- COLNVLDHVKWLRT-MRVPVSSYSA-N D-phenylalanine Chemical compound OC(=O)[C@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-MRVPVSSYSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 108010046331 Deoxyribodipyrimidine photo-lyase Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 101100232687 Drosophila melanogaster eIF4A gene Proteins 0.000 description 1
- 101150056653 EndoA gene Proteins 0.000 description 1
- 241000588698 Erwinia Species 0.000 description 1
- 101710091919 Eukaryotic translation initiation factor 4G Proteins 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZMVCLTGPGWJAEE-JYJNAYRXSA-N Glu-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)O ZMVCLTGPGWJAEE-JYJNAYRXSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 1
- 101000833167 Homo sapiens Poly(A) RNA polymerase GLD2 Proteins 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- LFXSPAIBSZSTEM-PMVMPFDFSA-N Leu-Trp-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LFXSPAIBSZSTEM-PMVMPFDFSA-N 0.000 description 1
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000006830 Luminescent Proteins Human genes 0.000 description 1
- 108010047357 Luminescent Proteins Proteins 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- PJWDQHNOJIBMRY-JYJNAYRXSA-N Met-Arg-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PJWDQHNOJIBMRY-JYJNAYRXSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- DBOMZJOESVYERT-GUBZILKMSA-N Met-Asn-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N DBOMZJOESVYERT-GUBZILKMSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 101100046352 Mus musculus Tjap1 gene Proteins 0.000 description 1
- 108010074084 Muscle Proteins Proteins 0.000 description 1
- 102000008934 Muscle Proteins Human genes 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- NJONQBYLTANINY-IHPCNDPISA-N Phe-Trp-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(N)=O)C(O)=O NJONQBYLTANINY-IHPCNDPISA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- 102000009097 Phosphorylases Human genes 0.000 description 1
- 108010073135 Phosphorylases Proteins 0.000 description 1
- 102100024380 Poly(A) RNA polymerase GLD2 Human genes 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 108091008103 RNA aptamers Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108020004688 Small Nuclear RNA Proteins 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 235000014897 Streptococcus lactis Nutrition 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 102220537421 Testin_K27A_mutation Human genes 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- XIHGJKFSIDTDKV-LYARXQMPSA-N Thr-Phe-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIHGJKFSIDTDKV-LYARXQMPSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 241000723873 Tobacco mosaic virus Species 0.000 description 1
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 241001482199 Vaucheria frigida Species 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 108091005948 blue fluorescent proteins Proteins 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000019522 cellular metabolic process Effects 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 229940043370 chrysin Drugs 0.000 description 1
- 235000015838 chrysin Nutrition 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 150000002211 flavins Chemical class 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 239000000710 homodimer Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 229910017053 inorganic salt Inorganic materials 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 101150055499 lov-1 gene Proteins 0.000 description 1
- 238000000504 luminescence detection Methods 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 108091064355 mitochondrial RNA Proteins 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000002232 neuromuscular Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 108010091617 pentalysine Proteins 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 238000007539 photo-oxidation reaction Methods 0.000 description 1
- 230000005622 photoelectricity Effects 0.000 description 1
- 230000029553 photosynthesis Effects 0.000 description 1
- 238000010672 photosynthesis Methods 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 230000004845 protein aggregation Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 230000007026 protein scission Effects 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 230000037425 regulation of transcription Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 230000033764 rhythmic process Effects 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 102220005154 rs33988732 Human genes 0.000 description 1
- 102220045254 rs374739820 Human genes 0.000 description 1
- 102220312959 rs770692934 Human genes 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 235000010378 sodium ascorbate Nutrition 0.000 description 1
- PPASLZSBLFJQEF-RKJRWTFHSA-M sodium ascorbate Substances [Na+].OC[C@@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RKJRWTFHSA-M 0.000 description 1
- 229960005055 sodium ascorbate Drugs 0.000 description 1
- 229940054269 sodium pyruvate Drugs 0.000 description 1
- PPASLZSBLFJQEF-RXSVEWSESA-M sodium-L-ascorbate Chemical compound [Na+].OC[C@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RXSVEWSESA-M 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- RWRDLPDLKQPQOW-UHFFFAOYSA-N tetrahydropyrrole Natural products C1CCNC1 RWRDLPDLKQPQOW-UHFFFAOYSA-N 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
Landscapes
- Genetics & Genomics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Chemical & Material Sciences (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明提供一种光控RNA代谢调控系统,包括:a)重组光控RNA效应因子,所述重组光控RNA效应因子包括作为RNA结合结构域的第一多肽,作为光敏结构域的第二多肽和作为RNA效应结构域的第三多肽;b)靶调控单元,包括被所述第一多肽识别/结合的至少一个反应元件和被第三多肽调节的靶标RNA序列。本发明还提供包含这种光控RNA代谢调控系统的表达载体,及用这种光控RNA代谢调控系统在宿主细胞中调控RNA代谢的方法。本发明还提供装有所述光控RNA代谢调控系统各组分的试剂盒。本发明的光控RNA代谢调控系统具有高效率、高时空分辨率等优点,可用于时空精确调控活细胞中RNA的多种代谢活动。
Description
技术领域
本发明涉及遗传工程、光遗传学、合成生物学等多学科交叉领域,具体地涉及RNA代谢调控领域,更具体地涉及一种光控RNA代谢调控系统和采用该系统调控宿主细胞中RNA代谢的方法。
背景技术
随着基因组测序合成的技术创新、系统生物学和各类“组学”的涌现及发展,诞生了一个全新的的交叉学科“合成生物学”。合成生物学的核心在于将遗传信息按照工程原理进行设计,关键在于对各基因元器件功能的调节和控制。合成生物学可以允许人们按照生命系统运行规则的认识,以最优化的方式对其进行重新编程,或者创造自然界原本不存在的人造法则。合成生物学可用于构建具有全新功能的生物元件、模块,甚至是“人工合成生物体系”。这不仅可以推动人们对生命本质的理解,还可以革新生物技术发展模式,培育新型生物产业。
近年来,人们通过使用光和遗传编码的光敏蛋白技术来控制细胞行为这一领域取得了激动人心的突破,大量的研究成果在国际顶级杂志发表,由此也产生了一门新兴交叉学科“光遗传学(Optogenetics)”。光遗传学是基因工程和光学技术的一种结合,它可以允许人们在时间和空间上精确调控细胞的生命活动,其时间精度可以达毫秒级而空间精度可以精确到特定单一细胞,因此相对于传统的生物研究方法具有非常大的优势。到目前为止,科学家们从细菌、植物甚至动物细胞内陆续发现各种各样的光敏蛋白,这些光敏蛋白倍用于光合作用,视觉,生物节律等各种生命过程。人们还发现在细菌与植物中,光敏蛋白还被用于控制基因的转录与表达(He,Q.et al.,Science,2002,297:840-843;Malzahn,E.etal.,Cell,2010,142:762-772.)。随着越来越多的光敏蛋白的结构被解析,科学家们对光在生命活动调控中作用的认识更加深刻。目前,科学家已经找到或者人工改造、合成了一些可用于控制真核细胞运动、蛋白质相互作用于信号转导,神经肌肉动作,肌肉蛋白聚集与细胞运动,蛋白质剪切,蛋白质降解,蛋白质表达等光敏蛋白(Arrenberg,A.B.et al.,Science,2010,330:971-974;Deng,W.et al.,American journal of physiology Regulatory,2014,307:R1292-1302;Duebel,J.et al.,Current opinion in ophthalmology,2015,26:226-232),并发展了各种光传导与操纵技术与设备。这使得人类对生命现象的控制程度达到了前所未有的水平。
近年来,人们对RNA分子的认识,从最初的DNA与蛋白质间遗传信息传递的中间体,到RNA剪接和编辑。基因转录与翻译调控、分子感应和应答催化等,RNA所行使的多种生物功能及作用机制不断获得深入解析。在真核细胞中,新合成的RNA会经历多个步骤才能成为信使RNA(mRNA),在mRNA的生命周期中,RNA结合蛋白在调控不同时期RNA的代谢中发挥着至关重要的作用,如加帽、剪接、多腺苷化、编辑、转运、翻译、降解等(Arrenberg,A.B.et al.,Science,2010,330:971-974.)。除了mRNA外,还有很多非编码RNA,如转运RNA(tRNA)、核小RNA(snRNA)、核仁小RNA(snoRNA)、microRNA以及长链非编码RNA(lncRNA)等,这些非编码RNA的功能常常依赖于特定的RNA结合蛋白。因此,RNA结合蛋白广泛参了RNA的各类细胞代谢,如RNA的剪接、运输、定位、降解等。开发可以调控这些代谢活动的RNA结合蛋白将会为细胞功能研究提供独特的生物工具。
RNA结合蛋白通常由RNA结合结构域和功能结构域构成,前者可以特异性识别RNA序列并结合,后者发挥相应的功能进行RNA代谢调控。人们基于这样的原理,将不同的功能结构域和识别不同特异性RNA序列的RNA结合结构域结合起来,合成了一些具有特定功能的RNA结合蛋白。其中研究比较广泛的是将Pumilio/FBF重复序列(PUF)与不同的功能结构域融合,合成得到可以调控RNA稳定性、定位、剪接、翻译等的代谢活动的工程化RNA结合蛋白。2007年,Ozawa等人将绿色荧光蛋白GFP的N端结构域和C端结构域分别与两个PUF RNA结合结构域融合,用于观测哺乳动物细胞中线粒体RNA(Ozawa,T.et al.,Nature methods,20074:413-419.)。人们后来基于类似的技术追踪不容动物细胞b-激动蛋白的RNA和植物细胞中烟草花叶病毒的RNA(Tilsner,J.et al.,The Plant journal,2009,57:758-770;Yamada,T.et al.,Analytical chemistry,2011,83:5708-5714)。PUF结构域本身也可以作为翻译调控因子,结合到mRNA的5’端,通过空间位阻效应来影响翻译的起始(Cao,J.et al.,Nucleic acids research,2015,43:4353-4362;Cao,J.et al.,Angewandte Chemie,2014,53:4900-4904.)。Zefeng Wang课题组将PUF结构域与SMG6蛋白的PIN RNA核酸酶结构域融合,得到位点特异性的RNA核酸酶,通过控制mRNA的稳定性来调控细菌和哺乳动物细胞中的基因表达(Choudhury,R.et al.,Nature communications,2012,3:1147)。Amy Cooke等人则是将GLD2和CAF1结构域与PUF结构域融合,得到的RNA结合蛋白可以特异性控制mRNA 3’端的多聚腺苷化,通过调控mRNA的稳定性来影响基因表达(Cooke,A.et at.,Proceedingsof the National Academy of Sciences of the United States of America,2011,108:15870-15875.)。最近,人们甚至将RNA结合蛋白与火热的CRISPR/Cas9技术结合起来,开发了可以高水平激活内源基因表达的系统(Zalatan,J.G.,et al.,Cell,2015,160:339-350)。
然而,无论是自然存在或者是人工合成的RNA结合蛋白,它们的活性很难被调控。只要细胞表达这些RNA结合蛋白,它们就会发挥作用。然而,大多数RNA是在特定的时间和特定的空间里发挥响应的功能(Wang,Y.et al.,The FEBS journal,2013280:3755-3767.),开发可以在时间和空间上精确调控RNA代谢的技术对于研究RNA功能、调控基因表达等生命科学基础研究至关重要。本申请人认为可以通过合成生物学的方法,合成光可控的RNA结合蛋白,利用光来调控其与RNA的结合,并将其与不同功能结构域结合,获得具有不同功能的RNA结合蛋白。这些RNA结合蛋白将克服现有RNA结合蛋白不可控的缺点,可以让人们在时间和空间上调控RNA的转录、翻译、稳定性等代谢活动,可被广泛应用于生物科学的基础研究以及生物技术特别是合成生物技术中。经过潜心研究,本申请人发明了一种光控RNA代谢调控系统,它由重组光控RNA效应因子及其对应的靶调控单元两部分组成,具有良好的RNA代谢行为调控能力,可以在时间和空间上精确调控RNA的各类代谢活动。
因此,本发明的第一个目的是提供一种新型光控RNA代谢调控系统。
本发明的第二个目的是提供含有所述光控RNA代谢调控系统的原核或真核表达载体。
本发明的第三个目的是提供用所述光控RNA代谢调控系统调节宿主细胞中靶标RNA代谢的方法。
本发明的第四个目的是提供装有所光控RNA代谢调控系统各组分的试剂盒。
发明概述
本发明涉及一种光控RNA代谢调控系统,包括两个部分:a)重组光控RNA效应因子,所述重组光控RNA效应因子包括作为RNA结合结构域的第一多肽,作为光敏结构域的第二多肽和作为RNA效应结构域的第三多肽;b)靶调控单元:包括被所述第一多肽识别/结合的至少一个反应元件、被第三多肽调节的靶标RNA序列。
按照本发明的光控RNA代谢调控系统,第一部分中重组光控RNA效应因子的第一多肽为RNA结合结构域,它能够特异性地识别反应元件。第一多肽可选自抗转录终止因子蛋白的RNA识别结合结构域、RNA衰减子RNA识别结合结构域、RNA干扰酶RNA识别结合结构域、小调节RNA结合蛋白RNA识别结合结构域、RNA解旋酶RNA识别结合结构域、核酶RNA识别结合结构域、tRNA结合蛋白RNA识别结合结构域、rRNA结合蛋白RNA识别结合结构域。第二多肽为光敏结构域,通常来自以黄素类为生色团的光敏蛋白;第三多肽为RNA效应结构域,包括RNA剪接调控结构域、RNA翻译调控因子结构域、RNA核酸酶结构域、RNA表观遗传学修饰酶结构域。
第一多肽、第二多肽和第三多肽之间可以直接连接,也可操作性地通过接头肽连接。接头肽的氨基酸个数是可变的(如0,1,2,3,4,5,6,7,8,9,10个或更多)。
第一多肽、第二多肽可构成一种光控RNA结合蛋白融合蛋白(简称为光控RNA结合蛋白),可以用于体外研究重组光控RNA效应因子的RNA结合特点。本发明的光控RNA代谢调控系统中,第二部分靶调控单元中的反应元件和靶标RNA序列之间也可直接连接,或操作性连接。
按照本发明的光控RNA代谢调控系统,第一部分中的重组光控RNA效应因子还可进一步包含附加的多肽,如促进重组光控RNA效应因子融合蛋白向不同细胞器运输的第四多肽(如细胞核定位信号肽)。第四多肽与第一、第二、第三多肽可直接连接或通过接头肽连接。
本发明还涉及含有本发明的光控RNA代谢调控系统的原核或真核表达载体。所述表达载体可以是单独含有重组光控RNA效应因子编码基因的载体,也可以是单独含有靶调控单元编码序列的原核或真核表达载体,所述靶调控单元中含有反应元件但待调控的靶核酸序列空缺。或者,也可以是同时含有重组光控RNA效应因子编码基因和靶调控单元中的反应元件但待调控的RNA编码序列空缺的原核或真核表达载体。
本发明也涉及用本发明的光控RNA代谢调控系统在宿主细胞中调控RNA代谢的方法,包括以下步骤:
a)将所光控RNA代谢调控系统构建在原核或真核质粒表达载体中;
b)将重组质粒引入宿主细胞;
c)光照诱导所述宿主细胞,调控宿主细胞中靶标RNA的代谢。
本发明的在宿主细胞中调控靶标RNA代谢的方法,所涉及的光照方法,包括光源的选择和光源的控制。光源非限制地包括LED灯、白炽灯、荧光灯、激光;光照方法包括光照量、光照时间、光照强度及光照频率的选定。用扫描、投影、光模具等方法在空间上控制靶标RNA的代谢也包含在本发明范围中。
本发明进一步涉及一种试剂盒,该试剂盒装有含本发明光控RNA代谢调控系统的原核或真核表达载体或/和引入了所述光控RNA效应因子原核或真核表达载体的宿主细胞,及相应的说明书。本发明的试剂盒还可装有含反应元件但待调控的靶核酸序列空缺的靶调控单元的原核或真核表达载体。
发明详述
本发明提供一种基于光敏多肽的光控RNA代谢调控系统,用于在时间和空间上调节在核或真核生物宿主细胞中目的RNA的各类代谢活动。本发明的光控RNA代谢调控系统涉及至少两个部分:第一部分是能够在宿主细胞中表达的重组光控RNA效应因子融合蛋白的编码核苷酸序列,该融合蛋白由三个或四个多肽组成,其中第一多肽是其RNA结合结构域,第二多肽为光敏结构域,第三多肽为RNA效应结构域,第四多肽为细胞器定位信号片段;第二部分是由反应元件-待调控的靶核酸序列组成的靶调控单元核苷酸序列,其中的反应元件为上述重组光控RNA效应因子融合蛋白第一多肽所识别/结合的RNA核苷酸基序。第一部分的三个或四个多肽优选采用有关蛋白的截短的功能活性片段(即结构域)。可通过基因工程技术将本发明的光控RNA代谢调控系统的第一部分和第二部分构建在一个原核或真核表达载体中或分别构建在二个原核或真核表达载体中。针对特定的宿主细胞类型采用不同的常规方法将其导入宿主细胞,使之表达本发明的重组光控RNA效应因子融合蛋白,用适当波长的光照射可导致其第二光敏多肽二聚化能力改变,进而使重组光控RNA效应因子的二聚化能力发生变化,二聚化的重组光控RNA效应因子可结合于本发明第二部分靶调控单元核苷酸序列中的反应元件,并通过重组光控RNA效应因子的第三多肽的RNA效应结构域调控目的RNA的剪接、修饰、运输、翻译、降解等代谢活动。
本发明提供的这种光控RNA代谢调控系统,可利用几乎不会损伤细胞或机体的光照射,在时间上和空间上调节原核或真核宿主细胞中目的RNA的代谢。
本文所用术语的定义和解释
“光控”“光可控”,“光敏”和“光诱导”的蛋白在本文中含义相同,可互换使用,指对光照敏感的、可用相应波长的光以不同强度或不同频率照射,调节该蛋白的构象或构型从而影响其活性,包括激活、增强或阻遏其活性。
“宿主”指原核生物和真核生物,原核生物包括各类细菌,真核生物包括单细胞真核生物如酵母菌,和多细胞真核生物,如植物和动物,尤其是哺乳动物,包括人。
“宿主细胞”在本专利中指所有的原核与真核细胞,原核细胞包括但不限于大肠杆菌、枯草芽孢杆菌、乳酸菌、放线菌等,真核细胞包括但不限于酵母细胞、真菌细胞、植物细胞、线虫细胞、果蝇细胞、昆虫细胞、斑马鱼细胞、动物细胞和哺乳动物细胞,其中哺乳动物细胞可以是原始的未经改造的哺乳动物细胞,比如HEK293,Hela,H1299细胞等,也可以是在细胞株上再进行基因组改造的得到的哺乳动物细胞株,也可以是其他和本发明的光控RNA代谢调控系统相容的宿主细胞均可以。
“目的RNA”、“靶标RNA”也可称为“感兴趣RNA”指任何有功能的RNA,包括编码RNA和非编码RNA,其中非编码RNA包括rRNA,tRNA,snRNA,snoRNA和microRNA等多种已知功能的RNA,还包括未知功能的RNA。这些RNA的共同特点是都能从基因组上转录而来,但是不翻译成蛋白,在RNA水平上就能行使各自的生物学功能。
“报告RNA”为目的RNA的一种,指其表达容易被检测的有用RNA。为便于检测本发明基于光敏多肽的光控RNA代谢调控系统的效果,可以选择以下已知的的报告RNA:Pepper荧光RNA,它是基于Pepper RNA适配体特异性识别结合HBC系列染料并显著激活其荧光的原理、高斯荧光素酶(Gluc)mRNA、绿色荧光蛋白(GFP)mRNA,红色荧光蛋白(mCherry)mRNA。但本发明的光控RNA代谢调控系统不限于调控报告RNA的代谢,而可用于调控任何有功能RNA的代谢。
“报告蛋白”为报告RNA翻译产生的蛋白质,一般指活性容易被检测的蛋白质。为便于检测本发明基于光敏多肽的光控RNA代谢调控系统的效果,可以选择报告RNA编码的以下广泛应用的报告蛋白:高斯荧光素酶(Gluc)、绿色荧光蛋白(GFP),红色荧光蛋白(mCherry)等。
“转录”在本文中专指原核或真核生物宿主细胞中通过RNA聚合酶将DNA序列转录产生相对应RNA序列的过程。真核生物基因的转录比原核生物复杂得多,真核生物的三类RNA聚合酶I、II和III分别转录三类真核基因DNA,产生三类RNA(rRNA、mRNA、tRNA)及反义RNA。文中的转录因子调节的转录过程为RNA聚合酶II启动的转录,即DNA转录为mRNA。“转录调节”本文指真核基因转录的调节,包括启动或阻遏转录,增强或抑制转录,上调或下调转录。
“RNA代谢”、“RNA代谢活动””、“RNA代谢行为”在本文中含义相同可互换使用,指的是RNA在转录生成之后经历的一系列代谢过程,包括但不限于剪接、表观遗传学修饰、运输、定位、翻译、降解等,其中表观遗传学修饰包括但不限于甲基化修饰、假尿嘧啶修饰。
“调控效果”、“RNA代谢调控效果”在本文中指重组光控RNA效应因子在蓝光光照和黑暗条件下调控靶标RNA代谢的差异,可以是直接的,也可以是间接的,例如mRNA的水平既可以通过检测mRNA的含量来直接反应,也可以通过其翻译产生的蛋白质的水平来间接反应。一般情况下,光照与黑暗条件下靶标RNA的代谢差异越大,说明重组光控RNA效应因子的调控效果越好。在实际应用中,只要光照和黑暗条件下靶标RNA的代谢存在统计学意义上的差异,就可以认为该重组光控RNA效应因子具有调控靶标RNA代谢的能力。在本发明的一具体实施列中,重组光控RNA核酸酶因子在光照条件下对靶标RNA的降解是黑暗条件下的7.4倍;在本发明的另一具体实施方式中,重组光控RNA核酸酶因子在光照条件下对靶标RNA的降解体现在其编码的蛋白质水平为黑暗条件下的13.1倍。在本发明的另一具体实施列中,重组光控RNA核酸酶因子在光照条件下对靶标RNA的降解体现在其编码的蛋白质水平为黑暗条件下的65%。在本发明的另一具体实施列中,重组光控RNA翻译起始因子在光照条件下激活靶标RNA翻译的水平是黑暗条件下的8.5倍;在本发明的另一具体实施列中,重组光控RNA剪接因子在光照条件下促进靶标RNA外显子包含的剪接是黑暗条件下的2.6倍。
“表达”、“目的蛋白基因表达”、“基因表达”在本文中含义相同可互换使用,指目的基因的DNA序列转录产生携带该基因信息的RNA(mRNA或反义RNA)和该RNA携带的信息在核糖体中被翻译产生目的蛋白二者,即转录产生信息RNA和翻译产生目的蛋白都叫做表达。本文包括这两种含义,主要指产生目的蛋白。
“RNA效应因子”、“RNA效应因子融合蛋白”、“RNA效应蛋白”“RNA效应结构域”、“RNA代谢调控因子”、“RNA代谢调控因子结构域”在本文中含义相同可互换使用,指原核或真核生物中可以调控RNA代谢的蛋白质,它可以是一个蛋白质,也可以是多个相互作用的蛋白或多肽的统称,可以是天然的或人工改造的或人为融合的,包括但不限于RNA剪接调控结构域、RNA翻译起始因子结构域、RNA翻译抑制因子结构域、RNA核酸酶结构域、RNA外切酶结构域、RNA表观遗传学修饰结构域,其中RNA表观遗传学修饰结构域包括但不限于RNA甲基化酶结构域、RNA去甲基化酶结构域、RNA假尿嘧啶合成酶结构域。RNA的效应蛋白质可以单独或募集其它RNA效应多肽一起,通过与靶调控单元中反应元件的结合和相互作用,从而调控靶标RNA的各类代谢活动。
“光控RNA结合蛋白”在本发明中指由第一多肽和第二多肽构成的一种融合蛋白,在适当波长的光照射下,融合蛋白的二聚化能力发生改变,使得其与反应元件的结合能力发生改变。在本发明的一具体实施方式中,第一多肽LicTCAT与第二多肽VIVID(N56K+C71V+I85V)操作性连接获得重组光控RNA结合蛋白LicV,其氨基酸序列为序列SEQ ID NO:1。
“重组光控RNA定位因子”在本发明中指将本发明所述光控RNA结合蛋白与第四多肽直接相连或操作性(即可隔开若干个氨基酸)相连获得。所述光控RNA定位因子可调控活细胞中靶标RNA的定位。在本发明的一具体实施方式中,光控RNA结合蛋白与细胞内膜定位信号操作性连接;在本发明的另一具体实施方式中,光控RNA结合蛋白与细胞核定位信号操作性连接。
“靶调控单元”指人造的由反应元件和靶标RNA序列组成的RNA序列(不是蛋白质),其中反应元件可以位于靶标RNA序列的5’端,可以是靶标RNA序列的3’端,也可以位于靶标RNA序列的中间,二者可直接相连或操作性(即可隔开若干个核苷酸)相连。
“靶调控单元编码核苷酸序列”在本专利中指可以转录产生本专利所述靶调控单元的DNA序列。
“待调控RNA编码核苷酸序列”、“靶标RNA编码核苷酸序列”、“目的RNA编码核苷酸序列”在本文中含义相同可互换使用,指可以转录产生本专利所述待调控RNA或目的RNA或靶标RNA的DNA序列。这种DNA序列可包含在宿主细胞的染色体DNA序列中或包含在人工构建的表达载体中。
“反应元件”指RNA效应因子特异性识别/结合的一个或多个RNA基序,不同的RNA效应因子有与其相对应的不同的反应元件,RNA效应因子包含能与这种RNA基序结合的结合结构域。当RNA效应因子与其相应的反应元件特异性结合后,RNA效应因子自身或其招募的辅因子协同作用于目的RNA序列,调控RNA的代谢活动。在本发明中,反应元件指能够与重组光控RNA效应因子的第一多肽特异性识别/结合的RNA基序,例如LicT的反应元件为长25bp的RNA基序(序列2)。
“启动子”指启动和导致其下游基因转录产生RNA的DNA序列。启动子包括原核启动子和真核启动子,可以是天然基因的启动子或人工修饰的启动子。不同的启动子可指导基因在不同发育阶段的不同类型的组织或细胞中转录,或对不同环境或生理条件反应时的基因表达。启动子通常可分为“组成型启动子”、“可诱导启动子”或“可调控启动子”;按组织和细胞划分可分为“细胞特异性启动子”、“组织特异性启动子”、“发育特异性启动子”或“细胞分化特异性启动子”。可表达的天然细胞结构蛋白基因上游都有与其相配的启动子,不同的基因DNA片段可以有相同的启动子。可用于表达本发明重组光敏转录因子的常用组成型启动子的非限制性例子有:来源于多瘤病毒、腺病毒2、巨细胞病毒CMV和猿病毒40(SV40)的启动子。多数真核基因转录起始点上游大约-25于-30位核苷酸处有富含AT的区域称为TATA盒,本文称为“最小启动子”,它确定了目的基因的转录起始位点,但本身不足以有效地启动基因转录。在TATA盒上游还有其它转录必须的核苷酸基序,即本文所述的转录因子其特异性识别/结合的反应元件,该反应元件被其相应的转录因子结合后向最小启动子传达反应性,并在转录因子募集的辅因子协同作用下激活最小启动子引起下游基因转录产生相应的目标RNA。
“载体”、“表达载体”、“基因表达载体”、“重组基因表达载体”或“质粒”在本文中含义相同可互换使用,指能在原核或真核细胞中表达重组蛋白或靶标RNA的载体,这种表达载体可以是人工构建的质粒或重组病毒载体
“转染”指宿主细胞经物理或化学方法,如电穿孔、磷酸钙共沉淀、脂质转染胺或DEAE-葡聚糖介导的转染、DNA粒子轰击和显微注射等处理,使细胞摄入外源加入的携带基因的表达载体,或通过生物学媒介,如逆转录病毒载体、腺病毒载体、受体介导的DNA摄取等将携带基因的表达载体递送入宿主细胞中。这些载体进入宿主细胞后可作为游离体形式存在于胞质中,或整合入细胞染色体中,在适当条件下该细胞可瞬时表达或长期表达载体所携带的基因编码的蛋白质或功能性RNA。这种宿主细胞就叫做被载体转染的细胞。用表达载体转染宿主细胞的方法可参见Sambrooka等人(分子克隆实验手册,第二版,冷泉港出版社(1989)),和其它有关教材。
本发明基于光敏多肽的光控RNA代谢调控系统第一部分的重组光控RNA效应因子是由三种或四种功能性多肽片段直接通过肽键串联连接或通过接头肽串联连接形成的融合蛋白。在适当波长的光照射下,该融合蛋白能结合于本发明第二部分靶调控单元中的反应元件,通过其自身或招募宿主细胞本身的其他辅因子,一起协同作用于靶调控单元中的靶标RNA序列,从而调控靶标RNA的代谢。
本文中,“重组RNA效应因子融合蛋白”与“重组RNA效应因子”含义相同,可互换使用。
本发明的重组光控RNA效应因子含有第一多肽,该多肽能特异性识别结合所述靶调控单元中的反应元件RNA序列;第一多肽选自:抗转录终止因子蛋白的RNA识别结合结构域、RNA衰减子RNA识别结合结构域、RNA干扰酶RNA识别结合结构域、小调节RNA结合蛋白RNA识别结合结构域、RNA解旋酶RNA识别结合结构域、核酶RNA识别结合结构域、tRNA结合蛋白RNA识别结合结构域、rRNA结合蛋白RNA识别结合结构域。分析相关文献,可用作本发明的第一多肽优选包括但不限于:粗糙芽孢杆菌LicT蛋白的RNA识别结合结构域、大肠杆菌BglG蛋白的RNA识别结合结构域、枯草芽胞杆菌SacY蛋白的RNA识别结合结构域、枯草芽胞杆菌GlcT蛋白的RNA识别结合结构域、PyrR蛋白的RNA识别结合结构域、RapZ蛋白的RNA识别结合结构域、EndoA蛋白的RNA识别结合结构域、蛋白的RNA识别结合结构域,更优选LicT蛋白的RNA识别结合结构域、BglG蛋白的RNA识别结合结构域、SacY蛋白的RNA识别结合结构域和GlcT蛋白的RNA识别结合结构域。在本发明一优选实施方式中,第一多肽为LicT蛋白RNA识别结合结构域。在本发明另一优选实施方式中,第一多肽为BglG蛋白RNA识别结合结构域。在本发明另一优选实施方式中,第一多肽为SacY蛋白的RNA识别结合结构域。在本发明另一优选实施方式中,第一多肽为GlcT蛋白的RNA识别结合结构域。
除了本发明优选实施方式中使用的第一多肽,其中所述第一多肽还可选自其他抗转录终止因子蛋白的RNA识别结合结构域,包括但不限于枯草芽胞杆菌SacT蛋白的RNA识别结合结构域、菊欧氏杆菌(Erwinia chrysanthemi)Arbg蛋白的RNA识别结合结构域、乳酸菌(Lactococcus lactis)BglR蛋白的RNA识别结合结构域、干酪乳杆菌(Lactobacilluscasei)LacT蛋白的RNA识别结合结构域、肉葡萄球菌(Staphylococcus carnosus)GlcT蛋白的RNA识别结合结构域。除了抗转录终止因子蛋白的RNA识别结合结构域以外,其中所述第一多肽还可选自RNA衰减子RNA识别结合结构域、RNA干扰酶RNA识别结合结构域、小调节RNA结合蛋白RNA识别结合结构域、RNA解旋酶RNA识别结合结构域、核酶RNA识别结合结构域、tRNA结合蛋白RNA识别结合结构域、rRNA结合蛋白RNA识别结合结构域。
本发明重组光控RNA效应因子融合蛋白中的第二多肽是光敏多肽,该多肽来自以黄素类(FMN或FAD)为生色团的光敏结构域。如含有光-氧-电压(LOV)结构域的光敏蛋白;类似光裂解酶的隐花色素(photolyase-like cryptochromes);利用FAD的蓝光蛋白(bluelight using FAD,BLUF)。优选含LOV结构域的光敏蛋白,经适当波长光照射后,第二多肽的二聚化能力发生改变,使重组光控RNA效应因子的二聚化能力发生变化,二聚化的重组光控RNA效应因子结合于相应的反应元件,从而调节目的RNA的代谢活动。本发明包括但不限于以下所述的优选光敏蛋白或其功能活性截短体:粗糙链孢霉菌的VIVID LOV结构域、细菌Erythrobacter litoralis的EL222 LOV结构域、燕麦光敏色素1基因的LOV结构域AsLOV2、无隔藻金色素蛋白1的LOV结构域AuLOV、恶臭假单胞菌的PpLOV LOV结构域。
本发明第一个优选的第二多肽是粗糙链孢霉菌的VIVID蛋白的光敏结构域及其突变体。VIVID是存在于粗糙链孢霉菌(Neurospora crassa)细胞内参与蓝光调控细胞信号传导通路的一种光敏蛋白质。在蓝光照射下它能与黄素腺嘌呤二核苷酸(FAD)发生蛋白分子间反应形成二聚体。全长VIVID蛋白含有186个氨基酸,只含一个对光敏感的LOV结构域。研究表明VIVID蛋白缺失了N端36个氨基酸的截短体蛋白(VIVID-36)稳定性比全长蛋白更好,而蓝光照射后形成的VIVID-36二聚体在黑暗条件下恢复单体形式,含点突变N56K和C71V的VIVID-36二聚化能力更强。在本发明一优选实施方式中,第二多肽是含两个点突变的删除前1-36个氨基酸的VIVID(N56K+C71V)。
本发明第二个优选的第二多肽是细菌Erythrobacter litoralis EL222蛋白的LOV结构域。EL222蛋白的LOV结构域位于其N端的1-182个氨基酸,在蓝光照射下能与黄素单核苷酸(FMN)结合生成一种加成产物,进而形成同源二聚体。本发明将EL222蛋白的LOV结构域连接于第一多肽,成功导致可用光照调节重组光控RNA效应因子的第一多肽与对相应反应元件的结合能力。本发明含有EL222蛋白的LOV结构域为第二多肽的光控RNA效应因子LicEB,其在光照条件下可以结合其对应的反应元件,促进靶标RNA的降解。
本发明第三个优选的第二多肽是燕麦(Avena sativa)光敏色素1基因的LOV2结构域(AsLOV2)。燕麦细胞光敏色素1的N端为LOV1和LOV2光氧电压(LOV)结构域,在蓝光照射下均能与黄素单核苷酸(FMN)结合生成一种加成产物。本发明将燕麦光敏色素1的LOV2结构域连接于第一多肽,成功导致可用光照调节重组光控RNA效应因子的第一多肽与对相应反应元件的结合能力。本发明含有AsLOV2为第二多肽的光控RNA效应因子LicAsB,其在光照条件下可以结合其对应的反应元件,促进靶标RNA的降解。
本发明第四个优选的第二多肽是无隔藻(Stramenopile algae Vaucheriafrigida)金色素1(aureochrome1)蛋白C端的LOV结构域(简写为AuLOV)。该LOV结构域在蓝光照射下能与黄素单核苷酸(FMN)结合生成一种加成产物。本发明将无隔藻金色素1的LOV2结构域连接于第一多肽,成功导致可用光照调节重组光控RNA效应因子的第一多肽与对相应反应元件的结合能力。本发明含有AuLOV为第二多肽的光控RNA效应因子LicAuB,其在光照条件下可以结合其对应的反应元件,促进靶标RNA的降解。
本发明第五个优选的第二多肽是恶臭假单胞菌(Pseudomonas putida)的LOV结构域(简称为PpLOV)。该LOV结构域在蓝光照射下能与黄素单核苷酸(FMN)结合生成一种加成产物。本发明将PpLOV结构域连接于第一多肽,成功导致可用光照调节重组光控RNA效应因子的第一多肽与对相应反应元件的结合能力。本发明含有PpLOV为第二多肽的光控RNA效应因子LicPB,其在光照条件下可以结合其对应的反应元件,促进靶标RNA的降解。
本发明的重组光控RNA效应因子含有第三多肽,该多肽是一种RNA效应结构域,它可以是调控RNA任一代谢活动的蛋白质,包括RNA的剪接、表观遗传学修饰、运输、定位、翻译、降解等代谢活动。所述第三多肽包括但不限于RNA剪接调控结构域、RNA翻译起始因子结构域、RNA翻译抑制因子结构域、RNA核酸酶结构域、RNA外切酶结构域、RNA表观遗传学修饰结构域,其中RNA表观遗传学修饰结构域包括但不限于RNA甲基化酶结构域、RNA去甲基化酶结构域、RNA假尿嘧啶合成酶结构域。RNA的效应结构域可以单独或募集其它RNA效应多肽一起,通过与靶调控单元中反应元件的结合和相互作用,从而调控靶标RNA的各类代谢活动。在本发明的实施方案中,第三多肽为hnRNP A1 RNA剪接调控结构域,它可以调控RNA的可变剪接过程;在本发明的另一实施方案中,第三多肽为丝氨酸-精氨酸(SR)蛋白的富含精氨酸-苏氨酸(RS)RNA剪接调控结构域,它可以调控RNA的可变剪接过程;在本发明的另一实施方案中,第三多肽为eIF4E RNA翻译起始因子结构域,它可以特异性地识别mRNA的5'端的帽子结构,通过招募eIF4A和eIF4G共同组成eIF4F复合物参与翻译起始;在本发明的另一实施方案中,第三多肽为解淀粉芽孢杆菌中barnase RNA核酸酶结构域,它可以催化水解RNA,导致其降解失去功能;在本发明的另一实施方案中,第三多肽为SMG6蛋白的PIN(PilT aminoterminus)RNA核酸酶结构域,它可以催化水解RNA,导致其降解失去功能。
本发明的重组光控RNA效应因子融合蛋白还可包括第四多肽,该多肽是定位信号肽用以促进融合蛋白向不同细胞器运输。所述第四多肽与第一、第二、第三多肽直接或通过接头肽相连。所述第四多肽可选自细胞核定位信号肽、线粒体定位信号肽、高尔基体定位信号肽、内质网定位信号肽、细胞质定位信号肽、线粒体外膜定位信号肽、细胞膜内膜定位信号。在本发明的一具体实施方案中,第四多肽为细胞核定位信号肽,它可以介导本发明所述重组光控RNA效应因子定位到细胞核。其中,第四多肽可以是一个或多个,如果需要或多个效果更好的话优选多个。
如上所述,本发明的重组光控RNA效应因子所含的三种或四种多肽各自可以有多种选择,将三种或四种多肽连接成融合蛋白又可以有多种组合选择,本发明优选具有良好活性的各多肽的功能结构域片段制备重组光控RNA效应因子融合蛋白,通过在宿主细胞中表达优选的RNA代谢调控能力强的,即诱导和非诱导时导致靶标RNA代谢差异大的该重组光控RNA效应因子,用于调节靶标RNA的代谢,但不论何种选择与组合,只要能实现本发明所设想的利用光调控宿主细胞中靶标RNA代谢的重组光控RNA效应因子各种组合都属于本发明的范围。
本发明基于重组光控RNA效应因子的光控RNA代谢调控系统的第二部分是由重组光控RNA效应因子特异性识别/结合的反应元件-待调控RNA序列组成的靶调控单元(核苷酸序列),具体来说,其中反应元件的核苷酸基序视本发明不同实施方式所选的重组光控RNA效应因子融合蛋白的第一多肽不同而不同。换句语说,反应元件是第一多肽的特异性反应元件,必须根据所选择的第一多肽来选择与其相应的反应元件。例如,第一多肽为LicT、BglG、SacY、GlcT蛋白的RNA识别/结合结构域时,其相应的反应元件应为“序列2、3、4、5”基序。靶转录单元中的反应元件至少为一个或可以有多个,在具体的实施方式中,反应元件为1、2、3、4或5个,如果需要或多个效果更好的话优选多个。
与反应元件操作性相连的是待调控的靶标RNA序列,可以是具有功能的任意RNA核苷酸序列。为了验证本发明系统的效果和便于检测,在本发明的实施例中,采用了示范性的报告RNA:Pepper荧光RNA、高斯荧光素酶(Gluc)mRNA、绿色荧光蛋白(GFP)mRNA,红色荧光蛋白(mCherry)mRNA,但本发明的靶标RNA不限于这些报告RNA。在本发明的本领域技术人员知道,所谓“操作性相连指反应元件与靶标RNA序列之间或多个反应元件之间不是直接相连而可以隔开若干个核苷酸,只要仍能协同作用即可。
可用标准重组DNA技术将本发明光控RNA代谢调控系统的第一部分和第二部分构建在一个原核或真核表达载体中或分别构建在二个原核或真核表达载体中。可用标准技术将这种表达载体引入各种宿主细胞群中调控靶标RNA的各类代谢活动,进一步选择产生有用的转基因生物,如转基因小鼠。本发明的光控RNA代谢调控系统可用于在宿主细胞中内源RNA或外源RNA代谢的调控。
本领域众所周知,氨基酸的密码子核苷酸有简并性(即某些氨基酸可有二个、或三个、或四个密码子,它们称为该氨基酸的简并密码子),本发明所述的各种重组光控RNA效应因子的编码核酸,本发明包括它们各自的所有简并核苷酸序列。本发明所述的各种重组光控RNA效应因子的氨基酸序列,本发明包括它们各自的所有含保守性缺失、添加、置换修饰但仍保留了其原有功能活性的氨基酸序列类似物。
本发明提供含待调控的靶标RNA编码核苷酸序列空缺的靶调控单元的原核或真核表达载体,待调控的靶标RNA编码核苷酸序列空缺是让用户可以自行选择所需的待调控的靶标RNA编码核苷酸序列,例如目的RNA的编码核苷酸序列,用标准的重组DNA技术将其插入本发明的这种表达载体中,通过上面所述重组光控RNA效应因子来调节转录出来的靶标RNA的代谢活动。
本发明也提供分别转化了各种重组光控RNA效应因子基因的宿主细胞表达载体或者基因组上整合了各种重组光控RNA效应因子表达框的菌株或细胞株,同时提供含有反应元件-待调控靶标RNA编码核苷酸序列空缺的表达载体。用户可用标准重组DNA技术将自行选择的待调控靶标RNA编码核苷酸序列插入该表达载体中,然后将该重新构建的载体引入已转化了各种重组光控RNA效应因子的宿主细胞或者基因组上已整合了各种重组光控RNA效应因子表达框的宿主菌株或细胞株,并培养这种菌株或细胞株表达重组光控RNA效应因子和反应元件-靶标RNA,利用光调控靶标RNA的代谢活动,探究它们的生物学功能。
本发明还提供装有各种表达载体或已转化了这种载体或者基因组上整合了各种重组光控RNA效应因子表达框的宿主细胞的试剂盒。在一个实施方式中,该试剂盒中一些容器分别装有含一种或多种重组光控RNA效应因子基因的表达载体。在另一个实施方式中,该试剂盒中的一些容器分别装有含一种或多种重组光控RNA效应因子基因的表达载体,另一些容器分别装有含靶调控单元(其中反应元件-待调控靶标RNA编码核苷酸序列空缺)的表达载体。在还有一个实施方式中,该试剂盒中一些容器装有已转化了含重组光控RNA效应因子基因的表达载体或者基因组上已整合了各种重组光控RNA效应因子表达框的宿主细胞,另一些容器装有反应元件-待调控靶标RNA编码核苷酸序列空缺的宿主细胞表达载体。本发明的试剂盒还可以包含相应的光照控制设备,例如LED灯及其调控装置。所有试剂盒都装有相应的说明书,以说明盒中的各成分、使用目的和使用方法,并提供有关的参考文献目录。
本发明还包括光控RNA代谢调控系统在宿主细胞中调控靶标RNA代谢的方法,包括步骤:
a)将所光控RNA代谢调控系统构建在原核或真核质粒表达载体中;
b)将重组质粒引入宿主细胞;
c)光照诱导所述宿主细胞,调控宿主细胞中靶标RNA的代谢。
光照诱导所述宿主细胞的方法包括光源的选择和光源的使用。光源非限制地包括LED灯、白炽灯、荧光灯、激光。在本发明的一个实施方式中,光源选用蓝色LED(460-470nm)。光照方法包括光照量、光照强度、光照时间、光照频率以及用扫描、投影、光模具等方法在空间上控制靶标RNA的代谢也包含在本发明范围中。在本发明的一个实施方式中,光照强度为0-1.8W/m2不等;在另一个实施方式中,用打印的投影片作为光模具,在空间上调节不同位置的细胞的靶标RNA的代谢;在另一个实施方式中,用中性灰度片作为光模具,在空间上调节不同位置的细胞的靶标RNA的代谢水平。
附图简要说明
图1含不同第一多肽的重组光控RNA效应因子质粒示意图。
图2含不同第二多肽的重组光控RNA效应因子质粒示意图。
图3含不同第三多肽的重组光控RNA效应因子质粒示意图。
图4含不同第一多肽的重组光控RNA核酸酶因子对活细胞中靶标Gluc mRNA的降解效果。检测结果采用学生t检验进行光照与黑暗条件下的差异分析,*p<0.05,**p<0.01,***p<0.001,N.S.,无显著差异。
图5含不同第二多肽的重组光控RNA核酸酶因子对活细胞中靶标Gluc mRNA的降解效果。检测结果采用学生t检验进行光照与黑暗条件下的差异分析,*p<0.05,**p<0.01,***p<0.001,N.S.,无显著差异。
图6重组光控RNA核酸酶因子LicVB对活细胞中靶标mCherry mRNA的降解效果。标尺,100μm。
图7重组光控RNA核酸酶因子LicVB对活细胞中Pepper靶标RNA的降解效果。标尺,50μm。
图8重组光控RNA核酸酶因子LicVPIN对活细胞中Pepper靶标RNA的降解效果。标尺,50μm。
图9重组光控RNA核酸酶因子LicVB对大肠杆菌和酵母细胞中靶标mCherry mRNA的降解效果。检测结果采用学生t检验进行光照与黑暗条件下的差异分析,*p<0.05,**p<0.01,***p<0.001,N.S.,无显著差异。
图10重组光控RNA剪接因子LicVA1和LicVRS对活细胞中靶标RNA剪接的调控效果。(A)剪接产物逆转录并扩增后跑胶结果;(B)对(A)电泳胶条带的统计扽西结果。检测结果采用学生t检验进行光照与黑暗条件下的差异分析,*p<0.05,**p<0.01,***p<0.001,N.S.,无显著差异。
图11重组光控RNA翻译起始因子LicV4E对活细胞中靶标RNA翻译的调控效果。标尺,200μm。
图12不同光照强度下重组光控RNA翻译起始因子LicV4E对活细胞中靶标RNA翻译的调控效果。标尺,1mm。
图13重组光控RNA翻译起始因子LicV4E在空间上精确调控靶标RNA的翻译效果。标尺,3mm。
图14重组光控RNA定位因子对靶标RNA定位的调控效果。(A)LicV-mKalama1-CAAX对Pepper-RATLicT靶标RNA定位调控的成像效果;(B)对(A)中靶标RNA成像的统计结果;(C)LicV-mKalama1-3xNLS对Pepper-RATLicT靶标RNA定位调控的成像效果;(D)对(A)中靶标RNA成像的统计结果。标尺,25μm。
具体实施方式
以下用实施例对本发明作进一步阐述。这些实施例仅仅用于举例说明,而不对本发明的范围构成任何限制。实施例中主要采用常规的基因工程分子生物学克隆方法,这些方法是本领域普通技术人员所熟知的,例如:简·罗斯凯姆斯等的《分子生物学实验参考手册》和J.萨姆布鲁克,D.W.拉塞尔著,黄培堂等译:《分子克隆实验指南》(第三版,2002年8月,科学出版社出版,北京)中的有关章节。本领域普通技术人员按照以下实施例,不难根据具体情况略作修改和变换而成功实施本发明,这些修改和变换均落在本申请权利要求的范围内。
实施例中所有用于PCR的引物均由上海杰瑞生物工程技术有限公司合成、纯化和经质谱法鉴定正确。实施例中构建的表达质粒都经过序列测定,序列测定由杰李测序公司完成。各实施例所用的Taq DNA聚合酶购自东盛生物,pfu DNA聚合酶购自天根生化科技(北京)有限公司,PrimeSTAR DNA聚合酶购自TaKaRa公司,三种聚合酶购买时都附带赠送对应聚合酶缓冲液和dNTP。T4连接酶、T4磷酸化酶(T4 PNK)购自Fermentas公司,购买时附带有相对应的缓冲液等。实施例中所用的一步法快速克隆试剂盒(含同源重组酶)购自翊圣生物科技有限公司。除非特别声明,无机盐类化学试剂均购自国药集团上海化学试剂公司。卡那霉素(Kanamycin)购自Ameresco公司;氨苄青霉素(Amp)购自Ameresco公司;链霉素购自Ameresco公司;384孔发光检测白板、384孔荧光检测黑板购自Grenier公司。
实施例中所用的胶回收试剂盒购自生工公司,普通质粒小抽试剂盒购自天根生化科技(北京)有限公司。克隆菌株MachI购自北京全式金公司,BL21(DE3)菌株购自北京全式金公司。细胞系HEK293T和HEK293购自中国科学院典型培养物保藏委员会细胞库。
实施例中用到的主要仪器:Biotek Synergy 2多功能酶标仪(美国Bio-Tek公司),X-15R高速冷冻离心机(美国Beckman公司),Microfuge22R台式高速冷冻离心机(美国Beckman公司),PCR扩增仪(德国Biometra公司),共聚焦显微镜(LEICA TCS SP8 MP)(德国Leica),光度计(日本和光公司),核酸电泳仪(申能博彩公司)、LED蓝灯(深圳虹升光电公司定制)。
缩写词意义如下:“h”指小时,“min”指分钟,“s”指秒,“d”指天,“uL”指微升,“mL”指毫升,“L“指升,“bp”指碱基对,“mM”指毫摩尔,“μM”指微摩尔。
20种氨基酸及简称
中文名称 | 三字母缩写 | 单字母符号 | 中文名称 | 三字母缩写 | 单字母符号 |
甘氨酸 | Gly | G | 苏氨酸 | Thr | T |
丙氨酸 | Ala | A | 半胱氨酸 | Cys | C |
缬氨酸 | Val | V | 蛋氨酸 | Met | M |
亮氨酸 | Leu | L | 天冬酰胺 | Asn | N |
异亮氨酸 | Ile | I | 谷氨酰胺 | Gln | Q |
脯氨酸 | Pro | P | 天冬氨酸 | Asp | D |
苯丙氨酸 | Phe | F | 谷氨酸 | Glu | E |
酪氨酸 | Tyr | Y | 赖氨酸 | Lys | K |
色氨酸 | Trp | W | 精氨酸 | Arg | R |
丝氨酸 | Ser | S | 组氨酸 | His | H |
实施例中用到的常规分子生物学方法
(一)DNA片段5’端磷酸化反应然后自身环化反应:
从微生物中抽提出的质粒或者基因组末端都含有磷酸基团,而PCR产物没有,故需对PCR产物的5’端碱基进行磷酸基团加成反应,只有末端含有磷酸基团DNA分子才能发生连接反应。自身环化连接反应指线性化载体的3’端和5’端连接反应。
T4 PNK为T4多聚核苷酸激酶的简写,用于对DNA分子的5’端磷酸基团的加成反应。使5’端磷酸化的DNA片段产物自身环化的反应体系:
(二)重叠PCR
重叠PCR是常用的将两段不同或相同的基因连接起来的方法。例如图1,将基因AD和基因BC连接起来,首先设计两对引物A、D和C、B,分别用来扩增基因AD和BC,在引物D和引物C的5’端含有一定长度的互补序列。第一轮PCR得到的扩增产物AD和BC经过回收后作为第二轮PCR的模板。
第二轮按常规PCR流程扩增10轮,PCR体系为:
第二轮PCR后加入引物A和引物B,再继续扩增30轮即可得到AD和BC连接了的序列。
(三)反向PCR
反向PCR是下述实施例中用于定点突变、截短突变、插入突变所用到的一种技术。其基本原理参考Takara公司的MutaBEST试剂盒的实验流程。在对应的变异部位设计反向PCR引物,其中一条引物的5’端包含变异的核苷酸序列。扩增后的产物经过胶回收纯化后,进行5’端磷酸化反应然后自身环化反应,转化入感受态细胞。
(四)一步法同源重组片段
用于同源重组的PCR目的片段的5’和3’末端序列分别和线性化载体的末端序列完全一致,使用一步克隆试剂盒进行同源重组反应,反应体系为:
(五)荧光素酶(Gluc)活性的检测
检测报告基因荧光素酶Gluc的相对表达量的实验根据NEB公司试剂盒提供的使用说明进行。细胞培养到了合适的时间,从细胞培养液中吸取10μl加到Grenier公司的384孔白板中,现场配置新鲜的Gluc检测液(1μM腔肠素,0.1M Tris-HCl缓冲液,0.3M抗坏血酸钠,pH 7.4),用Eppendolf的12通道电动分液器吸取检测液10μl/通道加入到384孔白板中的细胞培养液中,立即使用多功能酶标仪读取相对发光强度(RLU)。
(六)细胞培养、转染与荧光检测
在适当的温度和气体混合物(通常为37℃,5%CO2)环境下用DMEM(HyClone,目录号:SH302431)培养HEK293与HEK293T等细胞。DMEM中含有4mM谷氨酰胺,4.5g/l葡萄糖,10%胎牛血清(FBS),丙酮酸钠和终浓度为100U/ml的青霉素以及100μg/ml的链霉素。除非另有说明,用Hieff TransTM脂质体转染试剂(Yeasen)、Lipo3000或Lipo2000转染试剂(Thermo)根据生产商的说明书进行细胞转染。在细胞成像实验中,利用Leica SP8共聚焦显微镜使用561nm的激光器对mCherry荧光进行成像,使用488nm的激光器对EGFP荧光进行成像,使用458nm的激光器对Pepper485和mKalama1的荧光进行成像。
实施例1分别构建含不同重组光控RNA效应因子编码基因的表达载体。
为构建含有以LicT、BglG、SacY或GlcT蛋白的RNA识别结合结构域为第一多肽的重组光控RNA效应因子,全基因合成LicT、BglG、SacY和GlcT蛋白的RNA识别结合结构域编码DNA片段,分别以P1和P2、P3和P4、P5和P6、P7和P8扩增上述DNA片段获得LicTCAT、BglGCAT、SacYCAT和GlcTCAT基因片段。全基因合成第三多肽barnase(K27A+N58D+R59A+E73A)(简称barnaseM4)编码DNA片段,利用P9和P10扩增该片段获得barnaseM4基因片段。利用引物P11和P12以pGAVPO质粒(Wang et al.,Nature Methods,2012:266-269)为模板扩增第二多肽VVD(N56K+C71V)基因片段,利用重叠PCR分别将VVD(N56K+C71V)、barnaseM4和LicTCAT、BglGCAT、SacYCAT或GlcTCAT基因片段进行连接,分别获得LicTCAT-VVD(N56K+C71V)-barnaseM4(简称LicVB,其氨基序列为序列SEQ ID NO:6)、BglGCAT-VVD(N56K+C71V)-barnaseM4(简称BglVB,其氨基序列为序列SEQ ID NO:7)、SacYCAT-VVD(N56K+C71V)-barnaseM4(简称SacVB,其氨基序列为序列SEQ ID NO:8)和GlcTCAT-VVD(N56K+C71V)-barnaseM4(简称GlcVB,其氨基序列为序列SEQ ID NO:9)基因片段。利用P13和P14扩增pEGFP-N1-FLAG载体(Addgene:60360)使其线性化,利用一步法克隆试剂盒将上述重叠PCR获得的基因片段插入到线性化的pEGFP-N1-FLAG载体中,得到的质粒命名为pCMV-LicVB、pCMV-BglVB、pCMV-SacVB和pCMV-GlcVB(图1),分别编码重组光控核酸酶因子LicVB、BglVB、SacVB和GlcVB。
扩增LicTCAT基因片段的引物为:
上游引物(P1):5’-agatccgctagcgctatgaaaattgcgaaggtgat-3’
下游引物(P2):5’-agcgtagagcgtatgtcctgcggctttgctaattcttgctgatacatccttgttatcga-3’
扩增BglGCAT基因片段的引物为:
上游引物(P3):
5’-agatccgctagcgctatgaacatgcaaatcaccaaaattc-3’
下游引物(P4):
5’-agcgtagagcgtatgtacgtcccaggtaccgttcagttcatgactgctcaag-3’
扩增SacYCAT基因片段的引物为:
上游引物(P5):
5’-agatccgctagcgctatgaaaattaaaagaatcttaaatc-3’
下游引物(P6):
5’-agcgtagagcgtatgagacccaccgccatagtcaggtgtatcttttctg-3’
扩增GlcTCAT基因片段的引物为:
上游引物(P7):
5’-agatccgctagcgctatgaatgggtccttcacagtg-3’
下游引物(P8):
5’-agcgtagagcgtatgcaggtcacaagtacctcgacattgttccttctcgtcttttaa-3’
扩增barnaseM4基因片段的引物为:
上游引物(P9):
5’-gttccagattacgctgaattcatggcacaggttatcaacacgtttg-3’
下游引物(P10):
5’-gatctagagtgtacattatctgatttttgtaaaggtttga-3’
扩增VVD(N56K/C71V)基因片段的引物为:
上游引物(P11):5’-catacgctctacgctcccggcggt-3’
下游引物(P12):5’-agcgtaatctggaacatcgtatgggtactgcagttccgtttcgcactggaaac-3’
扩增pEGFP-N1-FLAG载体使其线性化的引物为:
上游引物(P13):
5’-tgtacactctagatcataatcagc-3’
下游引物(P14):
5’-agcgctagcggatctgacggttcac-3’
为构建含有以EL222、AsLOV、AuLOV或PpLOV蛋白的LOV结构域为第二多肽的重组光控RNA效应因子,全基因合成EL222、AsLOV、AuLOV和PpLOV蛋白LOV结构域编码DNA片段,分别以P15和P16、P17和P18、P19和P20、P21和P22扩增上述DNA片段获得EL222、AsLOV、AuLOV和PpLOV基因片段。利用P23和P24扩增本实施例构建的pCMV-LicVB载体,去除VVD(N56K+C71V)基因序列并使其线性化,利用一步法克隆试剂盒将上述基因片段插入到线性化的pCMV-LicVB载体中,得到的质粒命名为pCMV-LicEB、pCMV-LicAsB、pCMV-LicAuB和pCMV-LicPB,分别编码重组光控核酸酶因子LicEB、LicAsB、LicAuB和LicPB(图2),其氨基酸序列分别为序列SEQ ID NO:10、11、12和13。
扩增EL222 LOV结构域基因片段的引物为:
上游引物(P15):
5’-aacaaggatgtatcagctgatacgatactgggtagtccgagcatgctggatatgggacaaga-3’
下游引物(P16):
5’-gtatgggtactgcagtttgagcatctcggcggctc-3’
扩增AsLOV基因片段的引物为:
上游引物(P17):
5’-aacaaggatgtatcagtcggtagtcagcagaattttgtgataactgatgcaagc-3’
下游引物(P18):
5’-gtatgggtactgcagcactagcaacttggcgtaatc-3’
扩增AuLOV基因片段的引物为:
上游引物(P19):
5’-aacaaggatgtatcaatcctagtcggtacacagaattttgtgataactgatg-3’
下游引物(P20):
5’-gtatgggtactgcagcactagcaacttggcgtaatc-3’
扩增PpLOV基因片段的引物为:
上游引物(P21):
5’-aacaaggatgtatcatactcacgtatgattaatgcccaactcctgcagagc-3’
下游引物(P22):
5’-gtatgggtactgcagagcccgttcgtctggttttggtcttg-3’
扩增pCMV-LicVB载体使其线性化的引物为:
上游引物(P23):
5’-ctgcagtacccatacgatgttccag-3’
下游引物(P24):
5’-tgatacatccttgttatcgagcg-3’
为构建含有以hnRNP A1 RNA剪接调控结构域、RS RNA剪接调控结构域、eIF4E RNA翻译起始因子结构域或PIN RNA核酸酶结构域为第三多肽的重组光控RNA效应因子,全基因合成上述结构域的编码DNA片段,分别以P25和P26、P27和P28、P29和P30、P31和P32扩增上述DNA片段获得含第四多肽的A1-NLS和RS-NLS以及eIF4E和PIN基因片段。利用P33和P34扩增本实施例构建的pCMV-LicVB载体,去除barnaseM4基因序列并使其线性化,利用一步法克隆试剂盒将上述基因片段插入到线性化的pCMV-LicVB载体中,得到的质粒命名为pCMV-LicVA1、pCMV-LicVRS、pCMV-LicV4E和pCMV-LicVPIN,分别编码重组光控RNA剪接调控因子LicVA1-NLS和LicVRS-NLS、重组光控RNA翻译起始因子LicV4E和重组光控核酸酶因子LicVPIN(图3),其氨基酸序列分别为序列SEQ ID NO:14、15、16和17。
扩增A1-NLS结构域基因片段的引物为:
上游引物(P25):
5’-cagtgcgaaacggaaggtggcggtggctcgggcggagggggttcgggaggtatgggtcgaagtggttctggaa-3’
下游引物(P26):
5’-gatctagagtgtacattataccttcctctttttcttgggggggaggatcccaaatcttctgccact-3’
扩增RS-NLS基因片段的引物为:
上游引物(P27):
5’-tacgctgaattcatgcgttacagccggcgaagaagaagc-3’
下游引物(P28):
5’-gatctagagtgtacattataccttcctctttttcttgggggggaggatcccgtccattctttcaggacttg-3’
扩增eIF4E基因片段的引物为:
上游引物(P29):
5’-tacgctgaattcatgatggcgactgtcgaaccggaaac-3’
下游引物(P30):
5’-gatctagagtgtacactaaacaacaaacctatttttag-3’
扩增PIN基因片段的引物为:
上游引物(P31):
5’-cagtgcgaaacggaaatggccttgcacgccagaaacatcgccatggagctcgaaatcagacc-3’
下游引物(P32):
5’-gatctagagtgtacactagcccacctgggcccacgtgag-3’
扩增pCMV-LicVB载体使其线性化的引物为:
上游引物(P33):
5’-tgtacactctagatcataatcagc-3’
下游引物(P34):
5’-catgaattcagcgtaatctgga-3’
全基因合成J23117启动子和rrnB转录终止子DNA序列,利用引物P35和P36、P37和P38对它们进行扩增获得J23117启动子和rrnB DNA片段,利用引物P39和P40以本实施例中的pCMV-LicVB质粒为模板扩增LicVB基因片段,利用重叠PCR将J23117、rrnB和LicVB连接起来,获得J23117-LicVB-rrnB重组DNA片段。利用P41和P42扩增pCDFDuet1载体(Novagen)去除原有的T7启动子和多克隆位点区使其线性化,利用一步法克隆试剂盒将J23117-LicVB-rrnB片段插入到线性化的pCDFDuet1载体中,得到的细菌表达质粒命名为pJ23117-LicVB,其编码重组光控核酸酶因子LicVB。
扩增J23117启动子的引物为:
上游引物(P35):
5’-gatggtgtccgggatggatccgcctatgcagcgac-3’
下游引物(P36):
5’-cttcgcaattttcatagatctctgcctgaagttatagtg-3’
扩增rrnB转录终止子的引物为:
上游引物(P37):
5’-acaaaaatcagataagagagtagggaactgccaggcatc-3’
下游引物(P38):
5’-cggtggcagcagttagctagcgcaaacaacagataaaac-3’
扩增LicVB基因片段的引物为:
上游引物(P39):
5’-atgaaaattgcgaaggtgatcaac-3’
下游引物(P40):
5’-ttatctgatttttgtaaaggtttg-3’
扩增pCDFDuet1载体使其线性化的引物为:
上游引物(P41):
5’-taactgctgccaccgctgagcaataac-3’
下游引物(P42):
5’-atcccggacaccatcgaatggcgc-3’
利用引物P43和P44以本实施例中的pCMV-LicVB质粒为模板扩增LicVB基因片段,利用P45和P46扩增pGADT7载体(Clontech)使其线性化,利用一步法克隆试剂盒将LicVB基因片段插入到线性化的pGADT7载体中,得到的酵母表达质粒命名为pGADT7-LicVB,其编码重组光控核酸酶因子LicVB。
扩增LicVB基因片段的引物为:
上游引物(P43):
5’-ccaagctttgcaaagatgaaaattgcgaaggtgatcaac-3’
下游引物(P44):
5’-catctgcagctcgagttatctgatttttgtaaaggtttg-3’
扩增pGADT7载体使其线性化的引物为:
上游引物(P45):
5’-ctcgagctgcagatgaatcgtagatac-3’
下游引物(P46):
5’-ctttgcaaagcttggagttgattg-3’
实施例2构建含有不同靶调控单元的表达载体
为了检测光控RNA效应因子对靶标RNA代谢的调控效果,需要构建含相应靶调控单元编码核苷酸序列的表达质粒。为了检测重组光控RNA核酸酶因子调控靶标RNA降解的效果,合成2xRATLicT编码DNA片段,利用P47和P48以其为模板进行扩增获得2xRATLicT片段,分别利用P49和P50、P51和P52以pU5-Gluc和pU5-mCherry(Wang et al.,Nature Methods,2012:266-269)为模板扩增Gluc和mCherry基因片段,利用重叠PCR分别将2xRATLicT片段与Gluc和mCherry连接起来,获得Gluc-2xRATLicT和mCherry-2xRATLicT片段。利用P53和P54扩增pCDNA3.1 hygro(+)载体(Invitrohen)使其线性化,利用一步法克隆试剂盒将Gluc-2xRATLicT和mCherry-2xRATLicT片段插入到线性化的pCDNA3.1载体中,得到的表达质粒分别命名为pCDNA3.1-Gluc-2xRATLicT和pCDNA3.1-mCherry-2xRATLicT,其分别编码Gluc-2xRATLicT和mCherry-2xRATLicT靶调控单元,其核苷酸序列为序列SEQ ID NO:18和19。
扩增2xRATLicT片段的引物为:
上游引物(P47):
5’-aaaatggtgggattgttactgc-3’
下游引物(P48):
5’-ccctctagactcgaggtttaaacgggccctctagac-3’
扩增Gluc基因片段的引物为:
上游引物(P49):
5’-cccaagctggctagcatgggagtcaaagttctgtttg-3’
下游引物(P50):
5’-caatcccaccattttttagtcaccaccggcccccttg-3’
扩增mCherry基因片段的引物为:
上游引物(P51):
5’-cccaagctggctagcatggtgagcaagggcgaggag-3’
下游引物(P52):
5’-caatcccaccattttctacttgtacagctcgtccatgccg-3’
扩增pCDNA3.1 hygro(+)载体使其线性化的引物为:
上游引物(P53):
5’-ctcgagtctagagggcccgtttaaac-3’
下游引物(P54):
5’-gctagccagcttgggtctccctatag-3’
为了直接观察重组光控RNA核酸酶因子对靶标RNA降解的调控效果,利用Pepper荧光RNA系统(Chen et al.,Nature Biotechonogy,2019,37:1287-1293)作为报告RNA。在商业化公司合成Pepper-RATLicT编码DNA片段,利用P55和P56以其为模板进行扩增获得Pepper-RATLicT片段。在商业化公司合成U6启动子DNA片段,利用P57和P58以其为模板进行扩增获得U6启动子片段。利用重叠PCR将Pepper-RAT与U6启动子连接起来,获得U6-Pepper-RATLicT片段。利用P59和P60扩增pEGFP-N1-FLAG载体去除CMV启动子和多克隆位点区使其线性化,利用一步法克隆试剂盒将U6-Pepper-RATLicT片段插入到线性化的pEGFP-N1-FLAG载体中,得到的表达质粒分别命名为pU6-Pepper-RATLicT,其编码Pepper-RATLicT靶调控单元,其核苷酸序列为序列SEQ ID NO:20。
扩增Pepper-RATLicT片段的引物为:
上游引物(P55):
5’-tggaaaggacgaaacgggcccccaatcgtggcgtgtcggc-3’
下游引物(P56):
5’-cgaggtcgagaattcaaaaaagggccccggcgccagtgcctgcctttc-3’
扩增U6启动子片段的引物为:
上游引物(P57):
5’-gccgcccccttcaccgagggcctatttcccatgattc-3’
下游引物(P58):
5’-gtttcgtcctttccacaagatatataaag-3’
扩增pEGFP-N1-FLAG载体使其线性化的引物为:
上游引物(P59):
5’-gaattctcgacctcgagacaaatggcagtattc-3’
下游引物(P60):
5’-ggtgaagggggcggccgctcgaggcta-3’
为了观察细菌和酵母细胞中重组光控RNA核酸酶因子调控靶标RNA降解的效果,利用引物P61和P62以pCDNA3.1-mCherry-2xRATLicT为模板扩增mCherry-2xRATLicT片段,在商业化公司合成J23106启动子DNA片段,利用引物P63和P64以其为模板扩增J23106启动子片段,利用重叠PCR将mCherry-2xRATLicT与J23106启动子连接起来,获得J23106-mCherry-2xRATLicT片段。利用P65和P66扩增实施例1中的pJ23117-LicVB载体去除J23117-LicVB使其线性化,利用一步法克隆试剂盒将J23106-mCherry-2xRATLicT片段插入到线性化的pJ23117-LicVB载体中,得到的细菌表达质粒命名为pJ23106-mCherry-2xRATLicT。利用P67和P68扩增pGADT7载体使其线性化,利用引物P69和P70以pCDNA3.1-mCherry-2xRATLicT为模板扩增mCherry-2xRATLicT片段,利用一步法克隆试剂盒将mCherry-2xRATLicT片段插入到线性化的pGADT7载体中,得到的酵母表达质粒命名为pGADT7-mCherry-2xRATLicT。
扩增mCherry-2xRATLicT片段的引物为:
上游引物(P61):
5’-atggtgagcaagggcgaggaggat-3’
下游引物(P62):
5’-cagttccctactctcgtttaaacgggccctctagac-3’
扩增J23106启动子片段的引物为:
上游引物(P63):
5’-cgatggtgtccgggaggatccgcctatgcagcgacaaa-3’
下游引物(P64):
5’-gcccttgctcaccatagatctctgcctgaagttatagtg-3’
扩增pJ23117-LicVB载体使其线性化的引物为:
上游引物(P65):
5’-gagagtagggaactgccaggcatc-3’
下游引物(P66):
5’-tcccggacaccatcgaatggcgc-3’
扩增mCherry-2xRATLicT片段用于构建pGADT7-mCherry-2xRATLicT的引物为:
上游引物(P67):
5’-ccaagctttgcaaagatggtgagcaagggcgaggaggat-3’
下游引物(P68):
5’-ccctctagactcgagcggatcccattttagggttttgcctg-3’
扩增pGADT7载体使其线性化的引物为:
上游引物(P69):
5’-ctcgagctgcagatgaatcgtagatac-3’
下游引物(P70):
5’-ctttgcaaagcttggagttgattg-3’
为了检测含不同第一多肽的重组光控RNA核酸酶因子调控靶标RNA降解的效果,在商业化公司合成2xRATBglG、2xRATSacY和2xRATGlcT片段,利用引物P71和P72、引物P73和P74、引物P75和P76分别对它们进行扩增,利用P77和P78扩增本实施例中的pCDNA3.1-Gluc-2xRATLicT载体去除2xRATLicT片段使其线性化,利用一步法克隆试剂盒分别将2xRATBglG、2xRATSacY和2xRATGlcT片段插入到线性化的pCDNA3.1-Gluc-2xRATLicT载体中,得到的表达质粒分别命名为pCDNA3.1-Gluc-2xRATBglG、pCDNA3.1-Gluc-2xRATSacY和pCDNA3.1-Gluc-2xRATGlcT,其分别编码Gluc-2xRATBglG、Gluc-2xRATSacY和Gluc-2xRATGlcT靶调控单元,其核苷酸序列为序列SEQ ID NO:21、22和23。
扩增2xRATBglG片段的引物为:
上游引物(P71):
5’-ctaaaaaatggtgggggattgtt-3’
下游引物(P72):
5’-ccctctagactcgaggggccctctagactcgagcgga-3’
扩增2xRATSacY片段的引物为:
上游引物(P73):
5’-ctaaaaaatggtgggggtttgtt-3’
下游引物(P74):
5’-ccctctagactcgaggggccctctagactcgagcgga-3’
扩增2xRATGlcT片段的引物为:
上游引物(P75):
5’-ctaaaaaatggtgggttactgatt-3’
下游引物(P76):
5’-ccctctagactcgaggggccctctagactcgagcgga-3’
扩增pCDNA3.1-Gluc-2xRATLicT使其线性化的引物为:
上游引物(P77):
5’-gagtctagagggcccctcgagtctagagggcccgt-3’
下游引物(P78):
5’-cccaccattttttagtcaccaccggcccccttg-3’
为了检测重组光控RNA翻译起始因子调控靶标RNA翻译的效果,在商业化公司合成4xRATLicT和EGFP DNA片段,分别采用引物P79和P80、P81和P82以它们为模板进行扩增,再利用重叠PCR将它们连接起来,获得4xRATLicT-EGFP片段。利用P83和P84扩增本实施例构建的pCDNA3.1-mCherry-2xRATLicT载体,去除2xRATLicT序列使其线性化,利用一步法克隆试剂盒将4xRATLicT-EGFP片段插入到线性化的pCDNA3.1-mCherry-2xRATLicT载体中,得到的表达质粒命名为pCDNA3.1-mCherry-4xRATLicT-EGFP,其编码mCherry-4xRATLicT-EGFP靶调控单元,核苷酸序列为序列SEQ ID NO:24。为了检测重组光控RNA剪接因子调控靶标RNA剪接的效果,利用引物P85和P86通过反向PCR将RATLicT片段插入pGZ3-GUM载体(中科院王泽峰老师课题组馈赠))中,线性化的片段通过磷酸化后连接,得到了质粒命名为pGZ3-GUM-RATLicT,其编码的靶调控单元核苷酸序列为序列SEQ ID NO:25。
扩增4xRATLicT片段的引物为:
上游引物(P79):
5’-gagctgtacaagtagaatacgactcactatagggag-3’
下游引物(P80):
5’-gcccttgctcaccatggtggccaagctttgtacag-3’
扩增EGFP片段的引物为:
上游引物(P81):
5’-atggtgagcaagggcgaggagctg-3’
下游引物(P82):
5’-ccctctagactcgagctacttgtacagctcgtccatg-3’
扩增pCDNA3.1-mCherry-2xRATLicT载体使其线性化的引物为:
上游引物(P83):
5’-ctcgagtctagagggcccgtttaaac-3’
下游引物(P84):
5’-ctacttgtacagctcgtccatgccgccggtggag-3’
扩增pGZ3-GUM载体使其线性化的引物为:
上游引物(P85):
5’-gcaggcaaaacccttcgggcccagcatcgctgga-3’
下游引物(P86):
5’-cgtagcagtaacaatcccattctcgagaaccatacgaactttg-3’
实施例3构建不同光控RNA定位因子表达质粒
为了检测重组光控RNA定位因子对靶标RNA定位的调控效果,全基因合成mKalama1蓝色荧光蛋白编码基因、CAAX细胞内膜定位信号编码基因和3xNLS核定位信号编码基因,利用P87和P88、P89和P90、P91和P92以其为模板分别进行扩增,利用P93和P94以pCMV-LicVB为模板扩增LicV编码基因,利用重叠PCR分别将LicV与mKalama1和CAAX或3xNLS连接,分别获得LicV-mKalama1-CAAX和LicV-mKalama1-3xNLS重组基因片段,利用引物P95和P96扩增pEGFP-N1-FLAG载体(Addgene:60360)使其线性化,利用一步法克隆试剂盒将上述重叠PCR获得的基因片段插入到线性化的pEGFP-N1-FLAG载体中,得到的质粒命名为pCMV-LicV-mKalama1-CAAX和LicV-mKalama1-3xNLS,分别编码重组光控定位因子的氨基酸序列分别为序列SEQ ID NO:26和27。
扩增mKalama1基因片段的引物为:
上游引物(P87):
5’-tggcggtggctcgggcggtggtgaattcatgatggtgagcaagggagaggagctg-3’
下游引物(P88):
5’-cttgtacagctcgtccatgccggg-3’
扩增CAAX基因片段的引物为:
上游引物(P89):
5’-gacgagctgtacaagggcagcggaagatctaaga-3’
下游引物(P90):
5’-agagtcgcggccgctttacatgatcacgcacttagtc-3’
扩增3xNLS基因片段的引物为:
上游引物(P91):
5’-gacgagctgtacaagctgtacaaggatccaaaaa-3’
下游引物(P92):
5’-agagtcgcggccgctttatacctttctcttcttttttggat-3’
扩增LicV基因片段的引物为:
上游引物(P93):
5’-agatccgctagcgctatgaaaattgcgaaggtgatcaac-3’
下游引物(P94):
5’-cccgagccaccgccaccagcgtaatctggaacatcgtatgggtactgcagttccgtttcgcactgg-3’
扩增pEGFP-N1-FLAG载体使其线性化的引物为:
上游引物(P95):
5’-agcggccgcgactctagatcataatcagcca-3’
下游引物(P96):
5’-agcgctagcggatctgacggttcac-3’
实施例4重组光控RNA核酸酶因子调控靶标RNA的降解
为了检测含不同第一多肽的重组光控RNA核酸酶因子对靶标RNA降解的调控效果,分别将pCDNA3.1-Gluc-2xRATLicT与pCMV-LicVB、pCDNA3.1-Gluc-2xRATBglG与pCMV-BglVB、pCDNA3.1-Gluc-2xRATSacY与pCMV-SacVB、pCDNA3.1-Gluc-2xRATGlcT与pCMV-GlcVB共转染HEK293T细胞,分别以不含RATLicT反应元件的pCDNA3.1-Gluc作为对照。转染6h后将细胞分别置于黑暗和蓝光(1.8W/m2)照射下培养24小时,检测细胞培养液上清中的Gluc活性。检测结果如附图4所示,光照条件下上清中的Gluc活性显著低于黑暗条件下的Gluc活性,对照细胞中的Gluc活性无明显变化,表明光照可以诱导LicVB、BglVB、SacVB和GlcVB重组光控RNA核酸酶因子结合相对应的反应元件,水解靶标Gluc mRNA,导致细胞中靶标Gluc mRNA水平降低,最终使得合成的Gluc蛋白水平下降,活性降低。该结果表明含不同第一多肽的重组光控RNA核酸酶因子可以调控靶标RNA的降解。
为了检测含不同第二多肽的重组光控RNA核酸酶因子对靶标RNA降解的调控效果,将pCDNA3.1-Gluc-2xRATLicT分别与pCMV-LicEB、pCMV-LicAsB、pCMV-LicAuB和pCMV-LicPB共转染HEK293T细胞,以不含RATLicT反应元件的pCDNA3.1-Gluc作为对照。转染6h后将细胞分别置于黑暗和蓝光(1.8W/m2)照射下培养24小时,检测细胞培养液上清中的Gluc活性。检测结果如附图5所示,光照条件下上清中的Gluc活性显著低于黑暗条件下的Gluc活性,对照细胞中的Gluc活性无明显变化,表明光照可以诱导LicEB、LicAsB、LicAuB和LicPB重组光控RNA核酸酶因子结合2xRATLicT反应元件,水解靶标Gluc mRNA,导致细胞中靶标Gluc mRNA水平降低,最终使得合成的Gluc蛋白水平下降,活性降低。该结果表明含不同第二多肽的重组光控RNA核酸酶因子可以调控靶标RNA的降解。
为了进一步检测重组光控RNA核酸酶因子对不同靶标RNA的降解调控效果,将pCMV-LicVB分别与pCDNA3.1-mCherry-2xRATLicT和pU6-Pepper-RATLicT共转染HEK293T细胞,将pCMV-LicVPIN和pU6-Pepper-RATLicT共转染HEK293T细胞,分别以不含RATLicT反应元件的pCDNA3.1-mCherry和pU6-Pepper作为对照。转染6h后将细胞分别置于黑暗和蓝光(1.8W/m2)照射下培养24小时,分别检测细胞中mCherry的荧光信号和Pepper485的荧光信号(表达Pepper-RATLicT靶调控单元的细胞需要加入1μM HBC485染料(Chen et al.,NatureBiotechonogy,2019,37:1287-1293)对靶标RNA进行特异性标记)。检测结果分别如附图6-8所示,在附图6中,光照条件下细胞中的mCherry信号显著低于黑暗条件下的mCherry信号,对照细胞中的mCherry信号无明显变化,表明光照可以诱导LicVB结合mCherry-2xRATLicT靶转录单元中的2xRATLicT反应元件,水解靶标mCherry mRNA,导致细胞中靶标mCherry mRNA水平降低,最终使得合成的mCherry蛋白水平下降,荧光降低。在附图7和8中,光照条件下细胞中的Pepper485信号显著低于黑暗条件下的Pepper485信号,对照细胞中的Pepper485信号无明显变化,表明光照可以诱导LicVB和LicVPIN结合Pepper-RATLicT靶转录单元中的RATLicT反应元件,水解Pepper靶标RNA,导致细胞中Pepper RNA水平下降,Pepper485荧光信号降低。
为了检测重组光控RNA核酸酶因子对不同宿主细胞中靶标RNA降级的调控效果,将pJ23106-mCherry-2xRATLicT和pJ23117-LicVB共转化大肠杆菌细胞,将pGADT7-mCherry-2xRATLicT和pGADT7-LicVB共转化BY4741酿酒酵母细胞,分别以不含RATLicT反应元件的pJ23106-mCherry和pGADT7-mCherry作为对照。分别挑取单克隆在黑暗条件下培养过夜,按照1:100稀释到新鲜培养基后分别光照和黑暗条件下培养到对数生长期后期,分别检测细胞中mCherry的荧光信号。检测结果分别如图9所示,光照条件下大肠杆菌和酵母细胞中的mCherry信号显著低于黑暗条件下的mCherry信号,对照细胞中的mCherry信号无明显变化,表明光照可以诱导LicVB结合mCherry-2xRATLicT靶转录单元中的2xRATLicT反应元件,水解靶标mCherry mRNA,导致细胞中mCherry靶标mRNA水平降低,最终使得合成的mCherry蛋白水平下降,荧光降低。该结果表明重组光控RNA核酸酶因子可以调控不同宿主细胞中靶标RNA的降解。
实施例5重组光控RNA剪接因子调控靶标RNA的剪接
为了检测重组光控RNA剪接因子调控靶标RNA的剪接,将pGZ3-GUM-RATLicT分别与pCMV-LicVA1和pCMV-LicVRS共转染HEK293T细胞,以不表达重组光控RNA剪接因子的空载体和表达不含第三多肽重组光控RNA剪接因子的质粒作为对照。转染6h后将细胞分别置于黑暗和蓝光(1.8W/m2)照射下培养24小时,检测细胞中靶标RNA的剪接结果。检测结果如附图10所示,细胞中靶标RNA两种剪接产物的比例在光照和黑暗条件下差异显著,而对照细胞中则无明显差异,表明重组光控RNA剪接因子LicVA1和LicVRS可用于靶标RNA的可变剪接。
实施例6重组光控RNA翻译起始因子调控靶标RNA的翻译
为了检测重组光控RNA翻译起始因子调控靶标RNA的翻译,将pCMV-LicV4E和pCDNA3.1-mCherry-4xRATLicT-EGFP共转染HEK293T细胞,以表达不含第三多肽重组光控RNA翻译起始因子的质粒作为对照。转染6h后将细胞分别置于黑暗和蓝光(1.8W/m2)照射下培养24小时,检测不同条件下细胞中mCherry和EGFP的荧光信号。检测结果如附图11所示,光照条件下EGFP的信号要远远强于黑暗条件下培养细胞中的EGFP信号,而对照细胞中的EGFP信号都很低,表明光照可以诱导LicV4E重组光控RNA翻译起始因子结合4xRATLicT反应元件,通过第三多肽eIF4E招募其他翻译因子一起启动靶标EGFP mRNA的翻译,合成EGFP蛋白。该结果表明重组光控RNA翻译起始因子可用于调控靶标RNA的翻译。
为了检测不同光照强度对重组光控RNA翻译起始因子调控靶标RNA翻译的影响,将转染了pCMV-LicV4E和pCDNA3.1-mCherry-4xRATLicT-EGFP的HEK293T细胞置于不光强度的蓝光下培养24h,对不同条件下细胞的mCherry和EGFP荧光信号进行成像。检测结果如附图12所示,随着蓝光光照强度的增加,EGFP的荧光信号也随之增加。该结果表明重组光控RNA翻译起始因子可用于定量调控靶标RNA的翻译。
为了检测重组光控RNA翻译起始因子是否可以在空间上精确调控靶标RNA翻译,将转染了pCMV-LicV4E和pCDNA3.1-mCherry-4xRATLicT-EGFP的HEK293T细胞置于只能透过特定区域光膜片的蓝光下培养,24h后对细胞的mCherry和EGFP荧光信号进行成像。检测结果如附图13所示,只有接收蓝光照射的细胞高水平表达EGFP蛋白,而旁边临近细胞的EGFP信号很弱。该结果表明重组光控RNA翻译起始因子可用于在空间上精确调控靶标RNA的翻译。
实施例7重组光控RNA定位因子调控靶标RNA的定位
为了检测重组光控RNA定位因子调控靶标RNA的定位,将pU6-Pepper-RATLicT分别与pCMV-LicV-mKalama1-CAAX和LicV-mKalama1-3xNLS共转染HEK293T细胞,以表达不含LicV重组光控RNA结合蛋白的质粒作为对照。转染6h后将细胞分别置于黑暗和蓝光(1.8W/m2)照射下培养24小时,加入1μM HBC620染料(Chen et al.,Nature Biotechonogy,2019,37:1287-1293)进行标记,利用荧光显微镜观察光照和黑暗条件下细胞中Pepper620的荧光分布情况。检测结果如附图14所示,在表达LicV-mKalama1-CAAX和Pepper-RATLicT的细胞中,光照可诱导Pepper-RATLicT靶标RNA向细胞内膜富集,而黑暗条件下Pepper-RATLicT靶标RNA则弥散分布在整个细胞中(图14A和B);在表达LicV-mKalama1-3xNLS和Pepper-RATLicT的细胞中,光照可诱导Pepper-RATLicT靶标RNA向细胞核富集,而黑暗条件下Pepper-RATLicT靶标RNA则弥散分布在整个细胞中(图14C和D)。因此,基于光控RNA结合蛋白的光控RNA定位因子可以用于调控靶标RNA在细胞内的分布与定位。
应该理解,在实施例或实验材料方法中所显示的成分用量、反应条件等数字或是本说明书所使用的其它参数均为近似值(除非特别注明),可根据所需要获得的结果而加以改变。并且,这些参数并非用来限定本发明保护范围,而是应用正常的操作技术下所得到的较佳数据。除非另行定义,文中所使用的所有专业与科学用语术与本领域技术人员所熟悉的意义相同。此外,任何与所记载内容相似或均等的方法及材料及均可以应用于本发明中。文中本说明书中所述的较佳实验方法与材料仅作为示范之用。本说明书提及的所有文献都全文引入作为参考。此外应该理解,在阅读了本发明的上述内容后,本领域的技术人员可以对本发明作各种改动或修改,这些等价形式同样落在本申请所附权利要求书所限定的范围内。
SEQUENCE LISTING
<110> 华东理工大学
<120> 一种光控RNA代谢调控系统
<130> 2021-08-17
<160> 27
<170> PatentIn version 3.3
<210> 1
<211> 214
<212> PRT
<213> Synthetic Sequence
<400> 1
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu
210
<210> 2
<211> 25
<212> RNA
<213> Synthetic Sequence
<400> 2
auuguuacug cuacggcagg caaaa 25
<210> 3
<211> 26
<212> RNA
<213> Synthetic Sequence
<400> 3
auuguuaccg cacuaagcgg gcaaaa 26
<210> 4
<211> 25
<212> RNA
<213> Synthetic Sequence
<400> 4
uuuguuacug auaaagcagg caaga 25
<210> 5
<211> 26
<212> RNA
<213> Synthetic Sequence
<400> 5
uuacugauuc gaucaggcau gaguga 26
<210> 6
<211> 338
<212> PRT
<213> Synthetic Sequence
<400> 6
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr
210 215 220
Ala Glu Phe Met Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp
225 230 235 240
Tyr Leu Gln Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile Thr Ala Ser
245 250 255
Glu Ala Gln Ala Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp
260 265 270
Val Ala Pro Gly Lys Ser Ile Gly Gly Asp Ile Phe Ser Asp Ala Glu
275 280 285
Gly Lys Leu Pro Gly Lys Ser Gly Arg Thr Trp Arg Ala Ala Asp Ile
290 295 300
Asn Tyr Thr Ser Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser
305 310 315 320
Asp Trp Leu Ile Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys
325 330 335
Ile Arg
<210> 7
<211> 337
<212> PRT
<213> Synthetic Sequence
<400> 7
Met Asn Met Gln Ile Thr Lys Ile Leu Asn Asn Asn Val Val Val Val
1 5 10 15
Ile Asp Asp Gln Gln Arg Glu Lys Val Val Met Gly Arg Gly Ile Gly
20 25 30
Phe Gln Lys Arg Ala Gly Glu Arg Ile Asn Ser Ser Gly Ile Glu Lys
35 40 45
Glu Tyr Ala Leu Ser Ser His Glu Leu Asn Gly Thr Trp Asp Val His
50 55 60
Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile Gln
65 70 75 80
Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp Thr
85 90 95
Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro Ile
100 105 110
Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn Ala
115 120 125
Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly Met
130 135 140
Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile Asn
145 150 155 160
Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu Val
165 170 175
Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr Met
180 185 190
Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly Phe
195 200 205
Gln Cys Glu Thr Glu Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
210 215 220
Glu Phe Met Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr
225 230 235 240
Leu Gln Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile Thr Ala Ser Glu
245 250 255
Ala Gln Ala Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val
260 265 270
Ala Pro Gly Lys Ser Ile Gly Gly Asp Ile Phe Ser Asp Ala Glu Gly
275 280 285
Lys Leu Pro Gly Lys Ser Gly Arg Thr Trp Arg Ala Ala Asp Ile Asn
290 295 300
Tyr Thr Ser Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp
305 310 315 320
Trp Leu Ile Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile
325 330 335
Arg
<210> 8
<211> 333
<212> PRT
<213> Synthetic Sequence
<400> 8
Met Lys Ile Lys Arg Ile Leu Asn His Asn Ala Ile Val Val Lys Asp
1 5 10 15
Gln Asn Glu Glu Lys Ile Leu Leu Gly Ala Gly Ile Ala Phe Asn Lys
20 25 30
Lys Lys Asn Asp Ile Val Asp Pro Ser Lys Ile Glu Lys Thr Phe Ile
35 40 45
Arg Lys Asp Thr Pro Asp Tyr Gly Gly Gly Ser His Thr Leu Tyr Ala
50 55 60
Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile Gln Ile Met Lys Arg
65 70 75 80
Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp Thr Ser Val Ala Leu
85 90 95
Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro Ile Val Tyr Ala Ser
100 105 110
Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn Ala Glu Val Leu Gly
115 120 125
Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly Met Val Lys Pro Lys
130 135 140
Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile Asn Thr Met Arg Lys
145 150 155 160
Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu Val Val Asn Phe Lys
165 170 175
Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr Met Ile Pro Val Arg
180 185 190
Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly Phe Gln Cys Glu Thr
195 200 205
Glu Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Glu Phe Met Ala
210 215 220
Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr Tyr
225 230 235 240
His Lys Leu Pro Asp Asn Tyr Ile Thr Ala Ser Glu Ala Gln Ala Leu
245 250 255
Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly Lys
260 265 270
Ser Ile Gly Gly Asp Ile Phe Ser Asp Ala Glu Gly Lys Leu Pro Gly
275 280 285
Lys Ser Gly Arg Thr Trp Arg Ala Ala Asp Ile Asn Tyr Thr Ser Gly
290 295 300
Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp Trp Leu Ile Tyr
305 310 315 320
Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile Arg
325 330
<210> 9
<211> 341
<212> PRT
<213> Synthetic Sequence
<400> 9
Met Asn Gly Ser Phe Thr Val Lys Lys Val Leu Asn Asn Asn Val Leu
1 5 10 15
Ile Ala Ser His His Lys Tyr Ser Glu Val Val Leu Ile Gly Lys Gly
20 25 30
Ile Gly Phe Gly Lys Lys Gln Asp Asp Val Ile Glu Asp Lys Gly Tyr
35 40 45
Asp Lys Met Phe Ile Leu Lys Asp Glu Lys Glu Gln Cys Arg Gly Thr
50 55 60
Cys Asp Leu His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly
65 70 75 80
Tyr Leu Ile Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly
85 90 95
Pro Val Asp Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys
100 105 110
Asp Thr Pro Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly
115 120 125
Tyr Ser Asn Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser
130 135 140
Pro Asp Gly Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser
145 150 155 160
Asn Thr Ile Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val
165 170 175
Gln Val Glu Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn
180 185 190
Phe Leu Thr Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr
195 200 205
Ser Met Gly Phe Gln Cys Glu Thr Glu Leu Gln Tyr Pro Tyr Asp Val
210 215 220
Pro Asp Tyr Ala Glu Phe Met Ala Gln Val Ile Asn Thr Phe Asp Gly
225 230 235 240
Val Ala Asp Tyr Leu Gln Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile
245 250 255
Thr Ala Ser Glu Ala Gln Ala Leu Gly Trp Val Ala Ser Lys Gly Asn
260 265 270
Leu Ala Asp Val Ala Pro Gly Lys Ser Ile Gly Gly Asp Ile Phe Ser
275 280 285
Asp Ala Glu Gly Lys Leu Pro Gly Lys Ser Gly Arg Thr Trp Arg Ala
290 295 300
Ala Asp Ile Asn Tyr Thr Ser Gly Phe Arg Asn Ser Asp Arg Ile Leu
305 310 315 320
Tyr Ser Ser Asp Trp Leu Ile Tyr Lys Thr Thr Asp His Tyr Gln Thr
325 330 335
Phe Thr Lys Ile Arg
340
<210> 10
<211> 353
<212> PRT
<213> Synthetic Sequence
<400> 10
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Asp Thr Ile Leu Gly Ser Pro
50 55 60
Ser Met Leu Asp Met Gly Gln Asp Arg Pro Ile Asp Gly Ser Gly Ala
65 70 75 80
Pro Gly Ala Asp Asp Thr Arg Val Glu Val Gln Pro Pro Ala Gln Trp
85 90 95
Val Leu Asp Leu Ile Glu Ala Ser Pro Ile Ala Ser Val Val Ser Asp
100 105 110
Pro Arg Leu Ala Asp Asn Pro Leu Ile Ala Ile Asn Gln Ala Phe Thr
115 120 125
Asp Leu Thr Gly Tyr Ser Glu Glu Glu Cys Val Gly Arg Asn Cys Arg
130 135 140
Phe Leu Ala Gly Ser Gly Thr Glu Pro Trp Leu Thr Asp Lys Ile Arg
145 150 155 160
Gln Gly Val Arg Glu His Lys Pro Val Leu Val Glu Ile Leu Asn Tyr
165 170 175
Lys Lys Asp Gly Thr Pro Phe Arg Asn Ala Val Leu Val Ala Pro Ile
180 185 190
Tyr Asp Asp Asp Asp Glu Leu Leu Tyr Phe Leu Gly Ser Gln Val Glu
195 200 205
Val Asp Asp Asp Gln Pro Asn Met Gly Met Ala Arg Arg Glu Arg Ala
210 215 220
Ala Glu Met Leu Lys Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
225 230 235 240
Glu Phe Met Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr
245 250 255
Leu Gln Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile Thr Ala Ser Glu
260 265 270
Ala Gln Ala Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val
275 280 285
Ala Pro Gly Lys Ser Ile Gly Gly Asp Ile Phe Ser Asp Ala Glu Gly
290 295 300
Lys Leu Pro Gly Lys Ser Gly Arg Thr Trp Arg Ala Ala Asp Ile Asn
305 310 315 320
Tyr Thr Ser Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp
325 330 335
Trp Leu Ile Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile
340 345 350
Arg
<210> 11
<211> 297
<212> PRT
<213> Synthetic Sequence
<400> 11
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Val Gly Ser Gln Gln Asn Phe Val
50 55 60
Ile Thr Asp Ala Ser Leu Pro Asp Asn Pro Ile Val Tyr Ala Ser Arg
65 70 75 80
Gly Phe Leu Thr Leu Thr Gly Tyr Ser Leu Asp Gln Ile Leu Gly Arg
85 90 95
Asn Cys Arg Phe Leu Gln Gly Pro Glu Thr Asp Pro Arg Ala Val Asp
100 105 110
Lys Ile Arg Asn Ala Ile Thr Lys Gly Val Asp Thr Ser Val Cys Leu
115 120 125
Leu Asn Tyr Arg Gln Asp Gly Thr Thr Phe Trp Asn Leu Phe Phe Val
130 135 140
Ala Gly Leu Arg Asp Ser Lys Gly Asn Ile Val Asn Tyr Val Gly Val
145 150 155 160
Gln Ser Lys Val Ser Glu Asp Tyr Ala Lys Leu Leu Val Leu Gln Tyr
165 170 175
Pro Tyr Asp Val Pro Asp Tyr Ala Glu Phe Met Ala Gln Val Ile Asn
180 185 190
Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr Tyr His Lys Leu Pro
195 200 205
Asp Asn Tyr Ile Thr Ala Ser Glu Ala Gln Ala Leu Gly Trp Val Ala
210 215 220
Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly Lys Ser Ile Gly Gly
225 230 235 240
Asp Ile Phe Ser Asp Ala Glu Gly Lys Leu Pro Gly Lys Ser Gly Arg
245 250 255
Thr Trp Arg Ala Ala Asp Ile Asn Tyr Thr Ser Gly Phe Arg Asn Ser
260 265 270
Asp Arg Ile Leu Tyr Ser Ser Asp Trp Leu Ile Tyr Lys Thr Thr Asp
275 280 285
His Tyr Gln Thr Phe Thr Lys Ile Arg
290 295
<210> 12
<211> 298
<212> PRT
<213> Synthetic Sequence
<400> 12
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ile Leu Val Gly Thr Gln Asn Phe
50 55 60
Val Ile Thr Asp Ala Ser Leu Pro Asp Asn Pro Ile Val Tyr Ala Ser
65 70 75 80
Arg Gly Phe Leu Thr Leu Thr Gly Tyr Ser Leu Asp Gln Ile Leu Gly
85 90 95
Arg Asn Cys Arg Phe Leu Gln Gly Pro Glu Thr Asp Pro Arg Ala Val
100 105 110
Asp Lys Ile Arg Asn Ala Ile Thr Lys Gly Val Asp Thr Ser Val Cys
115 120 125
Leu Leu Asn Tyr Arg Gln Asp Gly Thr Thr Phe Trp Asn Leu Phe Phe
130 135 140
Val Ala Gly Leu Arg Asp Ser Lys Gly Asn Ile Val Asn Tyr Val Gly
145 150 155 160
Val Gln Ser Lys Val Ser Glu Asp Tyr Ala Lys Leu Leu Val Leu Gln
165 170 175
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Glu Phe Met Ala Gln Val Ile
180 185 190
Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr Tyr His Lys Leu
195 200 205
Pro Asp Asn Tyr Ile Thr Ala Ser Glu Ala Gln Ala Leu Gly Trp Val
210 215 220
Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly Lys Ser Ile Gly
225 230 235 240
Gly Asp Ile Phe Ser Asp Ala Glu Gly Lys Leu Pro Gly Lys Ser Gly
245 250 255
Arg Thr Trp Arg Ala Ala Asp Ile Asn Tyr Thr Ser Gly Phe Arg Asn
260 265 270
Ser Asp Arg Ile Leu Tyr Ser Ser Asp Trp Leu Ile Tyr Lys Thr Thr
275 280 285
Asp His Tyr Gln Thr Phe Thr Lys Ile Arg
290 295
<210> 13
<211> 325
<212> PRT
<213> Synthetic Sequence
<400> 13
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Tyr Ser Arg Met Ile Asn Ala Gln
50 55 60
Leu Leu Gln Ser Met Val Asp Ala Ser Asn Asp Gly Ile Val Val Ala
65 70 75 80
Glu Lys Glu Gly Asp Asp Thr Ile Leu Ile Tyr Val Asn Ala Ala Phe
85 90 95
Glu Tyr Leu Thr Gly Tyr Ser Arg Asp Glu Ile Leu Tyr Gln Asp Cys
100 105 110
Arg Phe Leu Gln Gly Asp Asp Arg Asp Gln Leu Gly Arg Ala Arg Ile
115 120 125
Arg Lys Ala Met Ala Glu Gly Arg Pro Cys Arg Glu Val Leu Arg Asn
130 135 140
Tyr Arg Lys Asp Gly Ser Ala Phe Trp Asn Glu Leu Ser Ile Thr Pro
145 150 155 160
Val Lys Ser Asp Phe Asp Gln Arg Thr Tyr Phe Ile Gly Ile Gln Lys
165 170 175
Asp Val Ser Arg Gln Val Glu Leu Glu Arg Glu Leu Ala Glu Leu Arg
180 185 190
Ala Arg Pro Lys Pro Asp Glu Arg Ala Leu Gln Tyr Pro Tyr Asp Val
195 200 205
Pro Asp Tyr Ala Glu Phe Met Ala Gln Val Ile Asn Thr Phe Asp Gly
210 215 220
Val Ala Asp Tyr Leu Gln Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile
225 230 235 240
Thr Ala Ser Glu Ala Gln Ala Leu Gly Trp Val Ala Ser Lys Gly Asn
245 250 255
Leu Ala Asp Val Ala Pro Gly Lys Ser Ile Gly Gly Asp Ile Phe Ser
260 265 270
Asp Ala Glu Gly Lys Leu Pro Gly Lys Ser Gly Arg Thr Trp Arg Ala
275 280 285
Ala Asp Ile Asn Tyr Thr Ser Gly Phe Arg Asn Ser Asp Arg Ile Leu
290 295 300
Tyr Ser Ser Asp Trp Leu Ile Tyr Lys Thr Thr Asp His Tyr Gln Thr
305 310 315 320
Phe Thr Lys Ile Arg
325
<210> 14
<211> 364
<212> PRT
<213> Synthetic Sequence
<400> 14
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
210 215 220
Gly Gly Met Gly Arg Ser Gly Ser Gly Asn Phe Gly Gly Gly Arg Gly
225 230 235 240
Gly Gly Phe Gly Gly Asn Asp Asn Phe Gly Arg Gly Gly Asn Phe Ser
245 250 255
Gly Arg Gly Gly Phe Gly Gly Ser Arg Gly Gly Gly Gly Tyr Gly Gly
260 265 270
Ser Gly Asp Gly Tyr Asn Gly Phe Gly Asn Asp Gly Ser Asn Phe Gly
275 280 285
Gly Gly Gly Ser Tyr Asn Asp Phe Gly Asn Tyr Asn Asn Gln Ser Ser
290 295 300
Asn Phe Gly Pro Met Lys Gly Gly Asn Phe Gly Gly Arg Ser Ser Gly
305 310 315 320
Pro Tyr Gly Gly Gly Gly Gln Tyr Phe Ala Lys Pro Arg Asn Gln Gly
325 330 335
Gly Tyr Gly Gly Ser Ser Ser Ser Ser Ser Tyr Gly Ser Gly Arg Arg
340 345 350
Phe Gly Ile Leu Pro Pro Lys Lys Lys Arg Lys Val
355 360
<210> 15
<211> 357
<212> PRT
<213> Synthetic Sequence
<400> 15
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr
210 215 220
Ala Glu Phe Met Arg Tyr Ser Arg Arg Arg Arg Ser Arg Ser Arg Ser
225 230 235 240
Arg Ser His Ser Arg Ser Arg Gly Arg Arg Tyr Ser Arg Ser Arg Ser
245 250 255
Arg Ser Arg Gly Arg Arg Ser Arg Ser Ala Ser Pro Arg Arg Ser Arg
260 265 270
Ser Ile Ser Leu Arg Arg Ser Arg Ser Ala Ser Leu Arg Arg Ser Arg
275 280 285
Ser Gly Ser Ile Lys Gly Ser Arg Tyr Phe Gln Ser Pro Ser Arg Ser
290 295 300
Arg Ser Arg Ser Arg Ser Ile Ser Arg Pro Arg Ser Ser Arg Ser Lys
305 310 315 320
Ser Arg Ser Pro Ser Pro Lys Arg Ser Arg Ser Pro Ser Gly Ser Pro
325 330 335
Arg Arg Ser Ala Ser Pro Glu Arg Met Asp Gly Ile Leu Pro Pro Lys
340 345 350
Lys Lys Arg Lys Val
355
<210> 16
<211> 445
<212> PRT
<213> Synthetic Sequence
<400> 16
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr
210 215 220
Ala Glu Phe Met Met Ala Thr Val Glu Pro Glu Thr Thr Pro Thr Pro
225 230 235 240
Asn Pro Pro Thr Thr Glu Glu Glu Lys Thr Glu Ser Asn Gln Glu Val
245 250 255
Ala Asn Pro Glu His Tyr Ile Lys His Pro Leu Gln Asn Arg Trp Ala
260 265 270
Leu Trp Phe Phe Lys Asn Asp Lys Ser Lys Thr Trp Gln Ala Asn Leu
275 280 285
Arg Leu Ile Ser Lys Phe Asp Thr Val Glu Asp Phe Trp Ala Leu Tyr
290 295 300
Asn His Ile Gln Leu Ser Ser Asn Leu Met Pro Gly Cys Asp Tyr Ser
305 310 315 320
Leu Phe Lys Asp Gly Ile Glu Pro Met Leu Glu Asp Glu Lys Asn Lys
325 330 335
Arg Gly Gly Arg Trp Leu Ile Thr Leu Asn Lys Gln Gln Arg Arg Ser
340 345 350
Asp Leu Asp Arg Phe Trp Leu Glu Thr Leu Leu Cys Leu Ile Gly Glu
355 360 365
Ser Phe Asp Asp Tyr Ser Asp Asp Val Cys Gly Ala Val Val Asn Val
370 375 380
Arg Ala Lys Gly Asp Lys Ile Ala Ile Trp Thr Thr Glu Cys Glu Asn
385 390 395 400
Arg Glu Ala Val Thr His Ile Gly Arg Val Tyr Lys Glu Arg Leu Gly
405 410 415
Leu Pro Pro Lys Ile Val Ile Gly Tyr Gln Ser His Ala Asp Thr Ala
420 425 430
Thr Lys Ser Gly Ser Thr Thr Lys Asn Arg Phe Val Val
435 440 445
<210> 17
<211> 404
<212> PRT
<213> Synthetic Sequence
<400> 17
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu Met Ala Leu His Ala Arg Asn Ile Ala Met
210 215 220
Glu Leu Glu Ile Arg Pro Leu Phe Leu Val Pro Asp Thr Asn Gly Phe
225 230 235 240
Ile Asp His Leu Ala Ser Leu Ala Arg Leu Leu Glu Ser Arg Lys Tyr
245 250 255
Ile Leu Val Val Pro Leu Ile Val Ile Asn Glu Leu Asp Gly Leu Ala
260 265 270
Lys Gly Gln Glu Thr Asp His Arg Ala Gly Gly Tyr Ala Arg Val Val
275 280 285
Gln Glu Lys Ala Arg Lys Ser Ile Glu Phe Leu Glu Gln Arg Phe Glu
290 295 300
Ser Arg Asp Ser Cys Leu Arg Ala Leu Thr Ser Arg Gly Asn Glu Leu
305 310 315 320
Glu Ser Ile Ala Phe Arg Ser Glu Asp Ile Thr Gly Gln Leu Gly Asn
325 330 335
Asn Asp Asp Leu Ile Leu Ser Cys Cys Leu His Tyr Cys Lys Asp Lys
340 345 350
Ala Lys Asp Phe Met Pro Ala Ser Lys Glu Glu Pro Ile Arg Leu Leu
355 360 365
Arg Glu Val Val Leu Leu Thr Asp Asp Arg Asn Leu Arg Val Lys Ala
370 375 380
Leu Thr Arg Asn Val Pro Val Arg Asp Ile Pro Ala Phe Leu Thr Trp
385 390 395 400
Ala Gln Val Gly
<210> 18
<211> 677
<212> RNA
<213> Synthetic Sequence
<400> 18
augggaguca aaguucuguu ugcccugauc ugcaucgcug uggccgaggc caagcccacc 60
gagaacaacg aagacuucaa caucguggcc guggccagca acuucgcgac cacggaucuc 120
gaugcugacc gcgggaaguu gcccggcaag aagcugccgc uggaggugcu caaagagaug 180
gaagccaaug cccggaaagc uggcugcacc aggggcuguc ugaucugccu gucccacauc 240
aagugcacgc ccaagaugaa gaaguucauc ccaggacgcu gccacaccua cgaaggcgac 300
aaagaguccg cacagggcgg cauaggcgag gcgaucgucg acauuccgga gauuccuggg 360
uucaaggacu uggagcccau ggagcaguuc aucgcacagg ucgaucugug uguggacugc 420
acaacuggcu gccucaaagg gcuugccaac gugcaguguu cugaccugcu caagaagugg 480
cugccgcaac gcugugcgac cuuugccagc aagauccagg gccaggugga caagaucaag 540
ggggccggug gugacuaaaa aaugguggga uuguuacugc uacggcaggc aaaacccaaa 600
uggauccucg cgcggggauu guuacugcua cggcaggcaa aacccuaaaa ugggauccgc 660
ucgagucuag agggccc 677
<210> 19
<211> 772
<212> RNA
<213> Synthetic Sequence
<400> 19
auggagggcu ccgugaacgg ccacgaguuc gagaucgagg gcgagggcga gggccgcccc 60
uacgagggca cccagaccgc caagcugaag gugaccaagg guggcccccu gcccuucgcc 120
ugggacaucc uguccccuca guucauguac ggcuccaagg ccuacgugaa gcaccccgcc 180
gacauccccg acuacuugaa gcuguccuuc cccgagggcu ucaaguggga gcgcgugaug 240
aacuucgagg acggcggcgu ggugaccgug acccaggacu ccucccugca ggacggcgag 300
uucaucuaca aggugaagcu gcgcggcacc aacuuccccu ccgacggccc cguaaugcag 360
aagaagacca ugggcuggga ggccuccucc gagcggaugu accccgagga cggcgcccug 420
aagggcgaga ucaagcagag gcugaagcug aaggacggcg gccacuacga cgcugagguc 480
aagaccaccu acaaggccaa gaagcccgug cagcugcccg gcgccuacaa cgucaacauc 540
aaguuggaca ucaccuccca caacgaggac uacaccaucg uggaacagua cgaacgcgcc 600
gagggccgcc acuccaccgg cggcauggac gagcuguaca aguagaaaau ggugggauug 660
uuacugcuac ggcaggcaaa acccaaaugg auccucgcgc ggggauuguu acugcuacgg 720
caggcaaaac ccuaaaaugg gauccgcucg agucuagagg gcccguuuaa ac 772
<210> 20
<211> 96
<212> RNA
<213> Synthetic Sequence
<400> 20
gggcccccaa ucguggcgug ucggccugcc aaacccggga uuguuacugc uacggcaggc 60
aaaacccggg aaaggcaggc acuggcgccg gggccc 96
<210> 21
<211> 687
<212> RNA
<213> Synthetic Sequence
<400> 21
augggaguca aaguucuguu ugcccugauc ugcaucgcug uggccgaggc caagcccacc 60
gagaacaacg aagacuucaa caucguggcc guggccagca acuucgcgac cacggaucuc 120
gaugcugacc gcgggaaguu gcccggcaag aagcugccgc uggaggugcu caaagagaug 180
gaagccaaug cccggaaagc uggcugcacc aggggcuguc ugaucugccu gucccacauc 240
aagugcacgc ccaagaugaa gaaguucauc ccaggacgcu gccacaccua cgaaggcgac 300
aaagaguccg cacagggcgg cauaggcgag gcgaucgucg acauuccgga gauuccuggg 360
uucaaggacu uggagcccau ggagcaguuc aucgcacagg ucgaucugug uguggacugc 420
acaacuggcu gccucaaagg gcuugccaac gugcaguguu cugaccugcu caagaagugg 480
cugccgcaac gcugugcgac cuuugccagc aagauccagg gccaggugga caagaucaag 540
ggggccggug gugacuaaaa aauggugggg gauuguuacc gcacuaagcg ggcaaaaccc 600
ccaaauggau ccucgcgcgg ggggauuguu accgcacuaa gcgggcaaaa cccccuaaaa 660
ugggauccgc ucgagucuag agggccc 687
<210> 22
<211> 685
<212> RNA
<213> Synthetic Sequence
<400> 22
augggaguca aaguucuguu ugcccugauc ugcaucgcug uggccgaggc caagcccacc 60
gagaacaacg aagacuucaa caucguggcc guggccagca acuucgcgac cacggaucuc 120
gaugcugacc gcgggaaguu gcccggcaag aagcugccgc uggaggugcu caaagagaug 180
gaagccaaug cccggaaagc uggcugcacc aggggcuguc ugaucugccu gucccacauc 240
aagugcacgc ccaagaugaa gaaguucauc ccaggacgcu gccacaccua cgaaggcgac 300
aaagaguccg cacagggcgg cauaggcgag gcgaucgucg acauuccgga gauuccuggg 360
uucaaggacu uggagcccau ggagcaguuc aucgcacagg ucgaucugug uguggacugc 420
acaacuggcu gccucaaagg gcuugccaac gugcaguguu cugaccugcu caagaagugg 480
cugccgcaac gcugugcgac cuuugccagc aagauccagg gccaggugga caagaucaag 540
ggggccggug gugacuaaaa aauggugggg guuuguuacu gauaaagcag gcaagacccc 600
caaauggauc cucgcgcggg ggguuuguua cugauaaagc aggcaagacc cccuaaaaug 660
ggauccgcuc gagucuagag ggccc 685
<210> 23
<211> 679
<212> RNA
<213> Synthetic Sequence
<400> 23
augggaguca aaguucuguu ugcccugauc ugcaucgcug uggccgaggc caagcccacc 60
gagaacaacg aagacuucaa caucguggcc guggccagca acuucgcgac cacggaucuc 120
gaugcugacc gcgggaaguu gcccggcaag aagcugccgc uggaggugcu caaagagaug 180
gaagccaaug cccggaaagc uggcugcacc aggggcuguc ugaucugccu gucccacauc 240
aagugcacgc ccaagaugaa gaaguucauc ccaggacgcu gccacaccua cgaaggcgac 300
aaagaguccg cacagggcgg cauaggcgag gcgaucgucg acauuccgga gauuccuggg 360
uucaaggacu uggagcccau ggagcaguuc aucgcacagg ucgaucugug uguggacugc 420
acaacuggcu gccucaaagg gcuugccaac gugcaguguu cugaccugcu caagaagugg 480
cugccgcaac gcugugcgac cuuugccagc aagauccagg gccaggugga caagaucaag 540
ggggccggug gugacuaaaa aauggugggu uacugauucg aucaggcaug agugacccaa 600
auggauccuc gcgcgggguu acugauucga ucaggcauga gugacccuaa aaugggaucc 660
gcucgagucu agagggccc 679
<210> 24
<211> 1677
<212> RNA
<213> Synthetic Sequence
<400> 24
auggugagca agggcgagga ggauaacaug gccaucauca aggaguucau gcgcuucaag 60
gugcacaugg agggcuccgu gaacggccac gaguucgaga ucgagggcga gggcgagggc 120
cgccccuacg agggcaccca gaccgccaag cugaagguga ccaagggugg cccccugccc 180
uucgccuggg acauccuguc cccucaguuc auguacggcu ccaaggccua cgugaagcac 240
cccgccgaca uccccgacua cuugaagcug uccuuccccg agggcuucaa gugggagcgc 300
gugaugaacu ucgaggacgg cggcguggug accgugaccc aggacuccuc ccugcaggac 360
ggcgaguuca ucuacaaggu gaagcugcgc ggcaccaacu uccccuccga cggccccgua 420
augcagaaga agaccauggg cugggaggcc uccuccgagc ggauguaccc cgaggacggc 480
gcccugaagg gcgagaucaa gcagaggcug aagcugaagg acggcggcca cuacgacgcu 540
gaggucaaga ccaccuacaa ggccaagaag cccgugcagc ugcccggcgc cuacaacguc 600
aacaucaagu uggacaucac cucccacaac gaggacuaca ccaucgugga acaguacgaa 660
cgcgccgagg gccgccacuc caccggcggc auggacgagc uguacaagua gaauacgacu 720
cacuauaggg agacccaaaa cggugggauu guuacugcua cggcaggcaa aacccaaagg 780
cuagcaaaac ggugggauug uuacugcuac ggcaggcaaa acccaaagug uacaaaaacg 840
gugggauugu uacugcuacg gcaggcaaaa cccaaaggga uccaaaacgg ugggauuguu 900
acugcuacgg caggcaaaac ccaaagagcg cuggauccug uacaaagcuu ggccaccaug 960
gugagcaagg gcgaggagcu guucaccggg guggugccca uccuggucga gcuggacggc 1020
gacguaaacg gccacaaguu cagcgugucc ggcgagggcg agggcgaugc caccuacggc 1080
aagcugaccc ugaaguucau cugcaccacc ggcaagcugc ccgugcccug gcccacccuc 1140
gugaccaccc ugaccuacgg cgugcagugc uucagccgcu accccgacca caugaagcag 1200
cacgacuucu ucaaguccgc caugcccgaa ggcuacgucc aggagcgcac caucuucuuc 1260
aaggacgacg gcaacuacaa gacccgcgcc gaggugaagu ucgagggcga cacccuggug 1320
aaccgcaucg agcugaaggg caucgacuuc aaggaggacg gcaacauccu ggggcacaag 1380
cuggaguaca acuacaacag ccacaacguc uauaucaugg ccgacaagca gaagaacggc 1440
aucaagguga acuucaagau ccgccacaac aucgaggacg gcagcgugca gcucgccgac 1500
cacuaccagc agaacacccc caucggcgac ggccccgugc ugcugcccga caaccacuac 1560
cugagcaccc aguccgcccu gagcaaagac cccaacgaga agcgcgauca caugguccug 1620
cuggaguucg ugaccgccgc cgggaucacu cucggcaugg acgagcugua caaguag 1677
<210> 25
<211> 1802
<212> RNA
<213> Synthetic Sequence
<400> 25
auggugagca agggcgagga gcuguucacc gggguggugc ccauccuggu cgagcuggac 60
ggcgacguaa acggccacaa guucagcgug uccggcgagg gcgagggcga ugccaccuac 120
ggcaagcuga cccugaaguu caucugcacc accggcaagc ugcccgugcc cuggcccacc 180
cucgugacca cccugaccua cggcgugcag ugcuucagcc gcuaccccga ccacaugaag 240
cagcacgacu ucuucaaguc cgccaugccc gaaggcuacg uccagguaag ucucgacgaa 300
acaaggaugc uguuagaguu ucacaaaccu aaauccgguu gcuuuaguug ucuuaaaagc 360
uuugagcaaa aaccuugauu guucuccagu ggaggugcag ucacugcccu cuauccguug 420
gucauuucau uuguguccuu gccuguuggc aagacuccac ugaaaccucu cugggagauu 480
gguaggugga gggggcagga ggcccuacuu agaaaguguc auugaagcca auccuucuaa 540
cugaccaccu cugcccuccu aauaauucug gugugaaggc guaaugaugu gggcuucagg 600
gucuuuguuc uuccuccccu aagucuucag aauggguagu ugggaguaag ggugguagaa 660
ggggaacugg augaagugga cauggugggg gucuucccau agaggguccc ucauugacua 720
gagcuccuua uugcauauua gauugcacca cccgaaacac cugacuccaa aguucguaug 780
guucucgaga augggauugu uacugcuacg gcaggcaaaa cccuucgggc ccagcaucgc 840
uggaccgccu gaggcccaau ucaaggugag ggucuuuauu guuuuccaac gugcaaauca 900
agucgacuga acauggagga auugagguug gguauuuccc cugagguagg aaaaaggcug 960
ggucaguuuc ccguuagccg ucaaguccuc aucacaucuu uaagccuucc augcaggaua 1020
aagggcugca gagcuauuuu caaauugaca ucaaacugga uuucuguuga cuucgucuuc 1080
ccuuuuuaag guccacagaa gaagauggga aggaaagaag ucugagggca ucuuauuugc 1140
acuccgcugu cauuucuaag gaaggccuuu aaugccaaau ucucaucuuu uaugucccca 1200
cuaaauccua agguucuuga acuucugauc agacagccaa aaaaugaacc aucaacuagc 1260
uuaaccuaac auaugugagg auagaggacu gggacagcuc ucugggccac uggagaguca 1320
gacaggccug cccucugugu gacuugaccg cggucucuuu cuuccaggag cgcaccaucu 1380
ucuucaagga cgacggcaac uacaagaccc gcgccgaggu gaaguucgag ggcgacaccc 1440
uggugaaccg caucgagcug aagggcaucg acuucaagga ggacggcaac auccuggggc 1500
acaagcugga guacaacuac aacagccaca acgucuauau cauggccgac aagcagaaga 1560
acggcaucaa ggugaacuuc aagauccgcc acaacaucga ggacggcagc gugcagcucg 1620
ccgaccacua ccagcagaac acccccaucg gcgacggccc cgugcugcug cccgacaacc 1680
acuaccugag cacccagucc gcccugagca aagaccccaa cgagaagcgc gaucacaugg 1740
uccugcugga guucgugacc gccgccggga ucacucucgg cauggacgag cuguacaagu 1800
aa 1802
<210> 26
<211> 500
<212> PRT
<213> Synthetic Sequence
<400> 26
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr
210 215 220
Ala Gly Gly Gly Gly Ser Gly Gly Gly Glu Phe Met Met Val Ser Lys
225 230 235 240
Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Met Asp
245 250 255
Gly Asp Val Asn Gly Arg Lys Phe Ser Val Arg Gly Val Gly Glu Gly
260 265 270
Asp Ala Thr His Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Ser Gly
275 280 285
Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Ser Tyr Gly
290 295 300
Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe
305 310 315 320
Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe
325 330 335
Phe Lys Asp Asp Gly Ser Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu
340 345 350
Gly Asp Thr Leu Val Asn Arg Ile Val Leu Lys Gly Thr Asp Phe Lys
355 360 365
Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Met Asn Val
370 375 380
Gly Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala
385 390 395 400
Asn Phe Glu Ile Arg His Asn Val Glu Asp Gly Gly Val Gln Leu Ala
405 410 415
Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Ser Val Leu Leu
420 425 430
Pro Asp Asn His Tyr Leu Ser Val Gln Val Lys Leu Ser Lys Asp Pro
435 440 445
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Arg Thr Ala Ala
450 455 460
Gly Ile Thr Pro Gly Met Asp Glu Leu Tyr Lys Gly Ser Gly Arg Ser
465 470 475 480
Lys Met Ser Lys Asp Gly Lys Lys Lys Lys Lys Lys Ser Lys Thr Lys
485 490 495
Cys Val Ile Met
500
<210> 27
<211> 502
<212> PRT
<213> Synthetic Sequence
<400> 27
Met Lys Ile Ala Lys Val Ile Asn Asn Asn Val Ile Ser Val Val Asn
1 5 10 15
Glu Gln Gly Lys Glu Leu Val Val Met Gly Arg Gly Leu Ala Phe Gln
20 25 30
Lys Lys Ser Gly Asp Asp Val Asp Glu Ala Arg Ile Glu Lys Val Phe
35 40 45
Thr Leu Asp Asn Lys Asp Val Ser Ala Arg Ile Ser Lys Ala Ala Gly
50 55 60
His Thr Leu Tyr Ala Pro Gly Gly Tyr Asp Ile Met Gly Tyr Leu Ile
65 70 75 80
Gln Ile Met Lys Arg Pro Asn Pro Gln Val Glu Leu Gly Pro Val Asp
85 90 95
Thr Ser Val Ala Leu Ile Leu Cys Asp Leu Lys Gln Lys Asp Thr Pro
100 105 110
Ile Val Tyr Ala Ser Glu Ala Phe Leu Tyr Met Thr Gly Tyr Ser Asn
115 120 125
Ala Glu Val Leu Gly Arg Asn Cys Arg Phe Leu Gln Ser Pro Asp Gly
130 135 140
Met Val Lys Pro Lys Ser Thr Arg Lys Tyr Val Asp Ser Asn Thr Ile
145 150 155 160
Asn Thr Met Arg Lys Ala Ile Asp Arg Asn Ala Glu Val Gln Val Glu
165 170 175
Val Val Asn Phe Lys Lys Asn Gly Gln Arg Phe Val Asn Phe Leu Thr
180 185 190
Met Ile Pro Val Arg Asp Glu Thr Gly Glu Tyr Arg Tyr Ser Met Gly
195 200 205
Phe Gln Cys Glu Thr Glu Leu Gln Tyr Pro Tyr Asp Val Pro Asp Tyr
210 215 220
Ala Gly Gly Gly Gly Ser Gly Gly Gly Glu Phe Met Met Val Ser Lys
225 230 235 240
Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Met Asp
245 250 255
Gly Asp Val Asn Gly Arg Lys Phe Ser Val Arg Gly Val Gly Glu Gly
260 265 270
Asp Ala Thr His Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Ser Gly
275 280 285
Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Ser Tyr Gly
290 295 300
Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe
305 310 315 320
Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe
325 330 335
Phe Lys Asp Asp Gly Ser Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu
340 345 350
Gly Asp Thr Leu Val Asn Arg Ile Val Leu Lys Gly Thr Asp Phe Lys
355 360 365
Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Met Asn Val
370 375 380
Gly Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala
385 390 395 400
Asn Phe Glu Ile Arg His Asn Val Glu Asp Gly Gly Val Gln Leu Ala
405 410 415
Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Ser Val Leu Leu
420 425 430
Pro Asp Asn His Tyr Leu Ser Val Gln Val Lys Leu Ser Lys Asp Pro
435 440 445
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Arg Thr Ala Ala
450 455 460
Gly Ile Thr Pro Gly Met Asp Glu Leu Tyr Lys Leu Tyr Lys Asp Pro
465 470 475 480
Lys Lys Lys Arg Lys Val Asp Pro Lys Lys Lys Arg Lys Val Asp Pro
485 490 495
Lys Lys Lys Arg Lys Val
500
Claims (18)
1.一种光控RNA代谢调控系统,其特征在于,包括:
a)重组光控RNA效应因子,所述重组光控RNA效应因子包括作为RNA结合结构域的第一多肽,作为光敏结构域的第二多肽和作为RNA效应结构域的第三多肽;
b)靶调控单元:包括被所述第一多肽识别/结合的至少一个反应元件、被第三多肽调节的靶标RNA核苷酸序列。
2.根据权利要求1所述的光控RNA代谢调控系统,其中所述第一多肽、第二多肽和第三多肽之间可操作性连接,和/或其中反应元件和靶标RNA核苷酸序列之间可操作性连接。
3.根据权利要求1所述的光控RNA代谢调控系统,其中所述第一多肽可选自抗转录终止因子蛋白的RNA识别结合结构域、RNA衰减子RNA识别结合结构域、RNA干扰酶RNA识别结合结构域、小调节RNA结合蛋白RNA识别结合结构域、RNA解旋酶RNA识别结合结构域、核酶RNA识别结合结构域、tRNA结合蛋白RNA识别结合结构域、rRNA结合蛋白RNA识别结合结构域。
4.根据权利要求3所述的光控RNA代谢调控系统,其中所述第一多肽可选自BglG/SacY家族抗转录终止因子蛋白的RNA识别结合结构域。
5.根据权利要求4所述的光控RNA代谢调控系统,其中所述第一多肽选自粗糙芽孢杆菌LicT蛋白的RNA识别结合结构域、大肠杆菌BglG蛋白的RNA识别结合结构域、枯草芽胞杆菌SacY蛋白的RNA识别结合结构域、枯草芽胞杆菌GlcT蛋白的RNA识别结合结构域、枯草芽胞杆菌SacT蛋白的RNA识别结合结构域、菊欧氏杆菌Arbg蛋白的RNA识别结合结构域、乳酸菌BglR蛋白的RNA识别结合结构域、干酪乳杆菌LacT蛋白的RNA识别结合结构域、肉葡萄球菌GlcT蛋白的RNA识别结合结构域及其截短体或具有80%以上序列相似性且具有所限定的氨基酸序列的功能的突变体,优选为具有RNA结合活性的截短体或突变体。
6.根据权利要求1所述的光控RNA代谢调控系统,其中所述第二多肽选自含黄素类生色团的光敏蛋白的光敏结构域。
7.根据权利要求6所述的光控RNA代谢调控系统,其中所述第二多肽选自含LOV结构域的光敏蛋白的光敏结构域。
8.根据权利要求7所述的光控RNA代谢调控系统,其中所述第二多肽选自粗糙链孢霉菌的VIVID LOV结构域、细菌Erythrobacter litoralis的EL222 LOV结构域、燕麦光敏色素1基因的LOV结构域AsLOV2、无隔藻金色素蛋白1的LOV结构域AuLOV、恶臭假单胞菌的PpLOVLOV结构域及其截短体或具有80%以上序列相似性且具有所限定的氨基酸序列的功能的突变体,优选为具有光诱导寡聚化能力发生改变的截短体或突变体。
9.根据权利要求1所述的光控RNA代谢调控系统,其中所述第一多肽和第二多肽构成一种光控RNA结合蛋白。
10.根据权利要求1所述的光控RNA代谢调控系统,其中所述第三多肽选自RNA剪接调控结构域、RNA翻译调控因子结构域、RNA核酸酶结构域、RNA表观遗传学修饰酶结构域。
11.如权利要求1所述的光控RNA代谢调控系统,其中还包含促进所述重组光控RNA效应蛋白向不同细胞器运输的定位信号肽第四多肽,所述第四多肽与第一、第二、第三多肽直接或通过接头肽相连。所述第四多肽可选自细胞核定位信号肽、线粒体定位信号肽、高尔基体定位信号肽、内质网定位信号肽、细胞质定位信号肽。
12.根据权利要求1所述的光控RNA代谢调控系统,其中所述反应元件为可被所述第一多肽特异性识别和结合的RNA基序。
13.如权利要求11所述的光控RNA代谢调控系统,其中所述反应元件选自LicT蛋白结合元件、BglG蛋白结合元件、SacY蛋白结合元件、枯草芽胞杆菌GlcT蛋白结合元件、SacT蛋白结合元件、Arbg蛋白结合元件、BglR蛋白结合元件、LacT蛋白结合元件和肉葡萄球菌GlcT蛋白结合元件。
14.含有权利要求1-13之一所述光控RNA代谢调控系统的表达载体。
15.根据权利要求14所述的表达载体,所述表达载体含有所述光控RNA效应因子编码基因,和/或所述靶调控单元编码核苷酸序列,所述靶调控单元中含有反应元件但待调控RNA编码核苷酸序列空缺。
16.用权利要求1-13之一所述光控RNA代谢调控系统在宿主细胞中调控靶标RNA代谢的方法,包括步骤:a)将所述光控RNA代谢调控系统构建在原核或真核质粒表达载体中;b)将重组质粒引入宿主细胞;c)光照诱导所述宿主细胞,调控宿主细胞中靶标RNA的代谢。
17.如权利要求16所述的调控靶标RNA代谢的方法,其中还包括光源的选择和照射方法的选择,所述光源包括LED灯、荧光灯、激光和白炽灯,所述照射方法是持续的光照或不连续的光照。其中所述光源的选择和照射方法的选择包括用光扫描、投影、光模具在空间上控制不同位置的细胞的靶标RNA的代谢。
18.一种试剂盒,装有权利要求14所述表达载体或/和含有所述光控RNA效应因子表达载体的原核或真核细胞,或/和装有含反应元件但待调控靶标RNA编码核苷酸序列空缺的靶调控单元的表达载体,及相应的说明书。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111024451.5A CN113652430B (zh) | 2021-09-02 | 2021-09-02 | 一种光控rna代谢调控系统 |
PCT/CN2022/115877 WO2023030330A1 (zh) | 2021-09-02 | 2022-08-30 | 一种光控rna代谢调控系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111024451.5A CN113652430B (zh) | 2021-09-02 | 2021-09-02 | 一种光控rna代谢调控系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113652430A true CN113652430A (zh) | 2021-11-16 |
CN113652430B CN113652430B (zh) | 2024-08-27 |
Family
ID=78482682
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111024451.5A Active CN113652430B (zh) | 2021-09-02 | 2021-09-02 | 一种光控rna代谢调控系统 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN113652430B (zh) |
WO (1) | WO2023030330A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023030330A1 (zh) * | 2021-09-02 | 2023-03-09 | 华东理工大学 | 一种光控rna代谢调控系统 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102643852A (zh) * | 2011-02-28 | 2012-08-22 | 华东理工大学 | 光可控的基因表达系统 |
CN103031327A (zh) * | 2012-08-02 | 2013-04-10 | 华东理工大学 | 原核细菌光诱导基因表达系统及其调控基因表达的方法 |
US20170355977A1 (en) * | 2016-06-09 | 2017-12-14 | The Trustees Of Princeton University | Optogenetic tool for rapid and reversible clustering of proteins |
CN110831669A (zh) * | 2017-03-07 | 2020-02-21 | 匹兹堡大学联邦高等教育系统 | 神经退行性疾病病理的光遗传学诱导 |
US20210080472A1 (en) * | 2019-09-12 | 2021-03-18 | Massachusetts Institute Of Technology | Methods for simultaneous measurement of multiple biological signals from spectrally identical fluorescent reporters |
CN113136396A (zh) * | 2020-01-20 | 2021-07-20 | 华东理工大学 | 细菌光控基因表达系统及其调控基因表达的方法 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011002977A2 (en) * | 2009-07-01 | 2011-01-06 | The University Of North Carolina At Chapel Hill | Genetically encoded photomanipulation of protein and peptide activity |
CN113652430B (zh) * | 2021-09-02 | 2024-08-27 | 华东理工大学 | 一种光控rna代谢调控系统 |
-
2021
- 2021-09-02 CN CN202111024451.5A patent/CN113652430B/zh active Active
-
2022
- 2022-08-30 WO PCT/CN2022/115877 patent/WO2023030330A1/zh active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102643852A (zh) * | 2011-02-28 | 2012-08-22 | 华东理工大学 | 光可控的基因表达系统 |
CN103031327A (zh) * | 2012-08-02 | 2013-04-10 | 华东理工大学 | 原核细菌光诱导基因表达系统及其调控基因表达的方法 |
US20170355977A1 (en) * | 2016-06-09 | 2017-12-14 | The Trustees Of Princeton University | Optogenetic tool for rapid and reversible clustering of proteins |
CN110831669A (zh) * | 2017-03-07 | 2020-02-21 | 匹兹堡大学联邦高等教育系统 | 神经退行性疾病病理的光遗传学诱导 |
US20210080472A1 (en) * | 2019-09-12 | 2021-03-18 | Massachusetts Institute Of Technology | Methods for simultaneous measurement of multiple biological signals from spectrally identical fluorescent reporters |
CN113136396A (zh) * | 2020-01-20 | 2021-07-20 | 华东理工大学 | 细菌光控基因表达系统及其调控基因表达的方法 |
Non-Patent Citations (3)
Title |
---|
CAO J.C.等: "Characterization of an optogenetic translation activation system", 《40TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE(NEBEC)》, 27 April 2014 (2014-04-27), pages 1 - 2, XP032692757, DOI: 10.1109/NEBEC.2014.6972749 * |
CAO J.C.等: "Characterization of an optogenetic translation activation system", 《40TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE(NEBEC)》, pages 1 - 2 * |
FUX L.等: "Interactions between the PTS regulation domains of the BglG transcriptional antiterminator from Escherichia coli", 《JOURNAL OF BIOLOGICAL CHEMISTRY》, vol. 278, no. 47, 21 November 2003 (2003-11-21), pages 46203 - 46209 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023030330A1 (zh) * | 2021-09-02 | 2023-03-09 | 华东理工大学 | 一种光控rna代谢调控系统 |
Also Published As
Publication number | Publication date |
---|---|
CN113652430B (zh) | 2024-08-27 |
WO2023030330A1 (zh) | 2023-03-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2020200256B2 (en) | A three-component CRISPR/Cas complex system and uses thereof | |
CN102643852B (zh) | 光可控的基因表达系统 | |
CA2905229C (en) | Protein enriched microvesicles and methods of making and using the same | |
CN103031327B (zh) | 原核细菌光诱导基因表达系统及其调控基因表达的方法 | |
AU2019222568B2 (en) | Engineered Cas9 systems for eukaryotic genome modification | |
US10221422B2 (en) | Blue light-inducible system for gene expression | |
JP2014509196A5 (zh) | ||
CN106244608B (zh) | 诱导的t7 rna聚合酶 | |
CA3066798A1 (en) | Synthetic guide rna for crispr/cas activator systems | |
CN109337904B (zh) | 基于C2c1核酸酶的基因组编辑系统和方法 | |
Sharma et al. | Rather rule than exception? How to evaluate the relevance of dual protein targeting to mitochondria and chloroplasts | |
CN113652430A (zh) | 一种光控rna代谢调控系统 | |
Osher et al. | Intracellular production of cyclic peptide libraries with SICLOPPS | |
JP2015509957A (ja) | ヒストンアセチル化の標的化 | |
Warren et al. | Ligation-independent cloning and self-cleaving intein as a tool for high-throughput protein purification | |
Liu et al. | GAL4–NF-κB fusion protein augments transgene expression from neuronal promoters in the rat brain | |
KR20220097869A (ko) | 비자연 아미노산의 부위-특이적 도입을 위한 아미노아실-tRNA 신테타제 및 세포주 | |
CN112876536A (zh) | 一种多肽标签及其在体外蛋白合成中的应用 | |
Ikeda et al. | Artificial acceleration of mammalian cell reprogramming by bacterial proteins | |
US20240093206A1 (en) | System of stable gene expression in cell lines and methods of making and using the same | |
Wang et al. | Expression, purification and oligomerization of the S-adenosylmethionine transporter | |
Bhat et al. | The tobacco BY-2 cell line as a model system to understand in planta nuclear coactivator interactions | |
Ren et al. | An improved Tet-on system to tightly conditionally regulate reporter gene expression |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |