WO2018148516A1 - Methods of making proteins with non-standard amino acids - Google Patents
Methods of making proteins with non-standard amino acids Download PDFInfo
- Publication number
- WO2018148516A1 WO2018148516A1 PCT/US2018/017581 US2018017581W WO2018148516A1 WO 2018148516 A1 WO2018148516 A1 WO 2018148516A1 US 2018017581 W US2018017581 W US 2018017581W WO 2018148516 A1 WO2018148516 A1 WO 2018148516A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- amino acid
- cell
- target polypeptide
- protecting group
- standard
- Prior art date
Links
- 150000001413 amino acids Chemical class 0.000 title claims abstract description 424
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 107
- 238000000034 method Methods 0.000 title claims abstract description 101
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 98
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 claims abstract description 49
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 claims abstract description 49
- 238000010348 incorporation Methods 0.000 claims abstract description 23
- 210000004027 cell Anatomy 0.000 claims description 202
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 167
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 160
- 229920001184 polypeptide Polymers 0.000 claims description 157
- 125000006239 protecting group Chemical group 0.000 claims description 88
- 101150038575 clpS gene Proteins 0.000 claims description 72
- 108091005804 Peptidases Proteins 0.000 claims description 67
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 67
- 239000004365 Protease Substances 0.000 claims description 66
- 150000007523 nucleic acids Chemical group 0.000 claims description 54
- 108020004566 Transfer RNA Proteins 0.000 claims description 50
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 49
- 238000006467 substitution reaction Methods 0.000 claims description 47
- 241000588724 Escherichia coli Species 0.000 claims description 44
- 102000004190 Enzymes Human genes 0.000 claims description 43
- 108090000790 Enzymes Proteins 0.000 claims description 43
- 102000035181 adaptor proteins Human genes 0.000 claims description 38
- 108091005764 adaptor proteins Proteins 0.000 claims description 38
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 31
- 102000003960 Ligases Human genes 0.000 claims description 25
- 108090000364 Ligases Proteins 0.000 claims description 25
- 230000001939 inductive effect Effects 0.000 claims description 25
- 108020004707 nucleic acids Proteins 0.000 claims description 22
- 102000039446 nucleic acids Human genes 0.000 claims description 22
- 241000894006 Bacteria Species 0.000 claims description 21
- 108020004705 Codon Proteins 0.000 claims description 21
- 108090000848 Ubiquitin Proteins 0.000 claims description 20
- 102000044159 Ubiquitin Human genes 0.000 claims description 20
- 230000000593 degrading effect Effects 0.000 claims description 18
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 17
- 238000004519 manufacturing process Methods 0.000 claims description 15
- 108020005206 Amino Acyl Transfer RNA Proteins 0.000 claims description 11
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 11
- 102220076790 rs570768038 Human genes 0.000 claims description 8
- 108020004485 Nonsense Codon Proteins 0.000 claims description 6
- 238000012360 testing method Methods 0.000 claims description 4
- 238000012546 transfer Methods 0.000 claims description 2
- 238000012216 screening Methods 0.000 abstract description 4
- 235000001014 amino acid Nutrition 0.000 description 373
- 229940024606 amino acid Drugs 0.000 description 345
- 235000018102 proteins Nutrition 0.000 description 77
- 230000015556 catabolic process Effects 0.000 description 33
- 238000006731 degradation reaction Methods 0.000 description 33
- 230000000368 destabilizing effect Effects 0.000 description 31
- 230000014509 gene expression Effects 0.000 description 31
- 239000013598 vector Substances 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 21
- 102100037399 Alanine-tRNA ligase, cytoplasmic Human genes 0.000 description 19
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 19
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 19
- 101000879354 Homo sapiens Alanine-tRNA ligase, cytoplasmic Proteins 0.000 description 19
- 239000005090 green fluorescent protein Substances 0.000 description 18
- 230000002068 genetic effect Effects 0.000 description 15
- 238000001727 in vivo Methods 0.000 description 14
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 12
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 12
- 230000037361 pathway Effects 0.000 description 11
- 230000001105 regulatory effect Effects 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 239000000758 substrate Substances 0.000 description 10
- 230000014616 translation Effects 0.000 description 9
- -1 conditions Substances 0.000 description 8
- 230000017854 proteolysis Effects 0.000 description 8
- 238000012552 review Methods 0.000 description 8
- 108020005038 Terminator Codon Proteins 0.000 description 7
- 239000013604 expression vector Substances 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 230000000087 stabilizing effect Effects 0.000 description 7
- 229930101283 tetracycline Natural products 0.000 description 7
- OFVLGDICTFRJMM-WESIUVDSSA-N tetracycline Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O OFVLGDICTFRJMM-WESIUVDSSA-N 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- 241000206602 Eukaryota Species 0.000 description 6
- 101710146427 Probable tyrosine-tRNA ligase, cytoplasmic Proteins 0.000 description 6
- 108010076818 TEV protease Proteins 0.000 description 6
- 101710107268 Tyrosine-tRNA ligase, mitochondrial Proteins 0.000 description 6
- 229960003767 alanine Drugs 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 102000037865 fusion proteins Human genes 0.000 description 6
- 108020001507 fusion proteins Proteins 0.000 description 6
- 238000010362 genome editing Methods 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 238000003752 polymerase chain reaction Methods 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 241000894007 species Species 0.000 description 6
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 5
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 5
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 5
- 101710123256 Pyrrolysine-tRNA ligase Proteins 0.000 description 5
- 235000004279 alanine Nutrition 0.000 description 5
- 125000000266 alpha-aminoacyl group Chemical group 0.000 description 5
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 5
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 5
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 5
- 108010041758 cleavase Proteins 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 125000000524 functional group Chemical group 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- 238000002703 mutagenesis Methods 0.000 description 5
- 231100000350 mutagenesis Toxicity 0.000 description 5
- 230000001915 proofreading effect Effects 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- PAJPWUMXBYXFCZ-UHFFFAOYSA-N 1-aminocyclopropanecarboxylic acid Chemical compound OC(=O)C1(N)CC1 PAJPWUMXBYXFCZ-UHFFFAOYSA-N 0.000 description 4
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 4
- 241000701022 Cytomegalovirus Species 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 241000194033 Enterococcus Species 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 4
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- 239000004472 Lysine Substances 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 241001135223 Prevotella melaninogenica Species 0.000 description 4
- 241000235070 Saccharomyces Species 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 241000194017 Streptococcus Species 0.000 description 4
- 102000006275 Ubiquitin-Protein Ligases Human genes 0.000 description 4
- 108010083111 Ubiquitin-Protein Ligases Proteins 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 229960001230 asparagine Drugs 0.000 description 4
- 235000009582 asparagine Nutrition 0.000 description 4
- 229940009098 aspartate Drugs 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 4
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 4
- 235000004554 glutamine Nutrition 0.000 description 4
- 229960002743 glutamine Drugs 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- 229960003646 lysine Drugs 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 230000001323 posttranslational effect Effects 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 238000002864 sequence alignment Methods 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- 229960004441 tyrosine Drugs 0.000 description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 4
- 229960004295 valine Drugs 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 3
- 108091093088 Amplicon Proteins 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 241000606108 Bartonella quintana Species 0.000 description 3
- 241001647378 Chlamydia psittaci Species 0.000 description 3
- 241000606153 Chlamydia trachomatis Species 0.000 description 3
- 241000193403 Clostridium Species 0.000 description 3
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 3
- 101000747867 Homo sapiens Upstream-binding protein 1 Proteins 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- 241000714474 Rous sarcoma virus Species 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 241000223230 Trichosporon Species 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- 102000018378 Tyrosine-tRNA ligase Human genes 0.000 description 3
- 102100025336 Tyrosine-tRNA ligase, mitochondrial Human genes 0.000 description 3
- 102100040065 Upstream-binding protein 1 Human genes 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 101150063416 add gene Proteins 0.000 description 3
- 101150110896 bipA gene Proteins 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 229960005091 chloramphenicol Drugs 0.000 description 3
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 238000012239 gene modification Methods 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 229960003136 leucine Drugs 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 229960004452 methionine Drugs 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 229960005190 phenylalanine Drugs 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 230000008672 reprogramming Effects 0.000 description 3
- 101150071242 tolC gene Proteins 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 229960004799 tryptophan Drugs 0.000 description 3
- 210000005253 yeast cell Anatomy 0.000 description 3
- JPZXHKDZASGCLU-LBPRGKRZSA-N β-(2-naphthyl)-alanine Chemical compound C1=CC=CC2=CC(C[C@H](N)C(O)=O)=CC=C21 JPZXHKDZASGCLU-LBPRGKRZSA-N 0.000 description 3
- PEMUHKUIQHFMTH-QMMMGPOBSA-N (2s)-2-amino-3-(4-bromophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(Br)C=C1 PEMUHKUIQHFMTH-QMMMGPOBSA-N 0.000 description 2
- KWIPUXXIFQQMKN-VIFPVBQESA-N (2s)-2-amino-3-(4-cyanophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(C#N)C=C1 KWIPUXXIFQQMKN-VIFPVBQESA-N 0.000 description 2
- NEMHIKRLROONTL-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-azidophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N=[N+]=[N-])C=C1 NEMHIKRLROONTL-QMMMGPOBSA-N 0.000 description 2
- SNZIFNXFAFKRKT-NSHDSACASA-N (2s)-2-azaniumyl-3-[4-[(2-methylpropan-2-yl)oxy]phenyl]propanoate Chemical compound CC(C)(C)OC1=CC=C(C[C@H]([NH3+])C([O-])=O)C=C1 SNZIFNXFAFKRKT-NSHDSACASA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- UQTZMGFTRHFAAM-ZETCQYMHSA-N 3-iodo-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(I)=C1 UQTZMGFTRHFAAM-ZETCQYMHSA-N 0.000 description 2
- PZNQZSRPDOEBMS-QMMMGPOBSA-N 4-iodo-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(I)C=C1 PZNQZSRPDOEBMS-QMMMGPOBSA-N 0.000 description 2
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- 101000640990 Arabidopsis thaliana Tryptophan-tRNA ligase, chloroplastic/mitochondrial Proteins 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 241000194107 Bacillus megaterium Species 0.000 description 2
- 241000606660 Bartonella Species 0.000 description 2
- 241000221198 Basidiomycota Species 0.000 description 2
- 241000588832 Bordetella pertussis Species 0.000 description 2
- 241000722885 Brettanomyces Species 0.000 description 2
- 101000909256 Caldicellulosiruptor bescii (strain ATCC BAA-1888 / DSM 6725 / Z-1320) DNA polymerase I Proteins 0.000 description 2
- 241001647372 Chlamydia pneumoniae Species 0.000 description 2
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 2
- 241000193163 Clostridioides difficile Species 0.000 description 2
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 2
- OUYCCCASQSFEME-MRVPVSSYSA-N D-tyrosine Chemical compound OC(=O)[C@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-MRVPVSSYSA-N 0.000 description 2
- 229930195709 D-tyrosine Natural products 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 101710177611 DNA polymerase II large subunit Proteins 0.000 description 2
- 101710184669 DNA polymerase II small subunit Proteins 0.000 description 2
- 241001468179 Enterococcus avium Species 0.000 description 2
- 241000605986 Fusobacterium nucleatum Species 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 241000590002 Helicobacter pylori Species 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- 241000235649 Kluyveromyces Species 0.000 description 2
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 2
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- 150000008575 L-amino acids Chemical class 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- 241000186660 Lactobacillus Species 0.000 description 2
- 240000001046 Lactobacillus acidophilus Species 0.000 description 2
- 235000013956 Lactobacillus acidophilus Nutrition 0.000 description 2
- 241000186359 Mycobacterium Species 0.000 description 2
- 241000204051 Mycoplasma genitalium Species 0.000 description 2
- 241000588652 Neisseria gonorrhoeae Species 0.000 description 2
- 108091060545 Nonsense suppressor Proteins 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 102000004245 Proteasome Endopeptidase Complex Human genes 0.000 description 2
- 108090000708 Proteasome Endopeptidase Complex Proteins 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 2
- 101000902592 Pyrococcus furiosus (strain ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) DNA polymerase Proteins 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 244000057717 Streptococcus lactis Species 0.000 description 2
- 235000014897 Streptococcus lactis Nutrition 0.000 description 2
- 102100036407 Thioredoxin Human genes 0.000 description 2
- 241000589884 Treponema pallidum Species 0.000 description 2
- 102000002501 Tryptophan-tRNA Ligase Human genes 0.000 description 2
- 108010066496 Ubiquitin-Specific Proteases Proteins 0.000 description 2
- 241000607598 Vibrio Species 0.000 description 2
- 241000607626 Vibrio cholerae Species 0.000 description 2
- 241000607265 Vibrio vulnificus Species 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229960003121 arginine Drugs 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 235000009697 arginine Nutrition 0.000 description 2
- JCZLABDVDPYLRZ-AWEZNQCLSA-N biphenylalanine Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1C1=CC=CC=C1 JCZLABDVDPYLRZ-AWEZNQCLSA-N 0.000 description 2
- 108091005948 blue fluorescent proteins Proteins 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 230000006652 catabolic pathway Effects 0.000 description 2
- 230000006800 cellular catabolic process Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 229940038705 chlamydia trachomatis Drugs 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 108010082025 cyan fluorescent protein Proteins 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000009510 drug design Methods 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 238000010230 functional analysis Methods 0.000 description 2
- 235000003869 genetically modified organism Nutrition 0.000 description 2
- 229940049906 glutamate Drugs 0.000 description 2
- 229930195712 glutamate Natural products 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 229960002885 histidine Drugs 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 229940039696 lactobacillus Drugs 0.000 description 2
- 229940039695 lactobacillus acidophilus Drugs 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010079892 phosphoglycerol kinase Proteins 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 229960002429 proline Drugs 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 238000002708 random mutagenesis Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000008684 selective degradation Effects 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 108060008226 thioredoxin Proteins 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 239000012137 tryptone Substances 0.000 description 2
- 125000005454 tryptophanyl group Chemical group 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 2
- FXTLFZWJXBBXGX-QMMMGPOBSA-N (2R)-2-anilino-3-selanylpropanoic acid Chemical compound OC(=O)[C@H](C[SeH])NC1=CC=CC=C1 FXTLFZWJXBBXGX-QMMMGPOBSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- DOCYTUNUHIGJTI-QMMMGPOBSA-N (2r)-2-[(2-nitrophenyl)methylamino]-3-sulfanylpropanoic acid Chemical compound OC(=O)[C@H](CS)NCC1=CC=CC=C1[N+]([O-])=O DOCYTUNUHIGJTI-QMMMGPOBSA-N 0.000 description 1
- VKBLQCDGTHFOLS-NSHDSACASA-N (2s)-2-(4-benzoylanilino)propanoic acid Chemical compound C1=CC(N[C@@H](C)C(O)=O)=CC=C1C(=O)C1=CC=CC=C1 VKBLQCDGTHFOLS-NSHDSACASA-N 0.000 description 1
- LDQAVQLJHUEMEY-JTQLQIEISA-N (2s)-2-(ethylamino)-3-(4-hydroxyphenyl)propanoic acid Chemical compound CCN[C@H](C(O)=O)CC1=CC=C(O)C=C1 LDQAVQLJHUEMEY-JTQLQIEISA-N 0.000 description 1
- CCQGGCWKGAMHGT-KKMMWDRVSA-N (2s)-2-[(4-aminocyclohexyl)amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC1CCC(N)CC1 CCQGGCWKGAMHGT-KKMMWDRVSA-N 0.000 description 1
- JCIYZTBXUJCAMW-JTQLQIEISA-N (2s)-2-[[5-(dimethylamino)naphthalen-1-yl]sulfonylamino]propanoic acid Chemical compound C1=CC=C2C(S(=O)(=O)N[C@@H](C)C(O)=O)=CC=CC2=C1N(C)C JCIYZTBXUJCAMW-JTQLQIEISA-N 0.000 description 1
- POGSZHUEECCEAP-ZETCQYMHSA-N (2s)-2-amino-3-(3-amino-4-hydroxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(N)=C1 POGSZHUEECCEAP-ZETCQYMHSA-N 0.000 description 1
- BABTYIKKTLTNRX-QMMMGPOBSA-N (2s)-2-amino-3-(3-iodophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(I)=C1 BABTYIKKTLTNRX-QMMMGPOBSA-N 0.000 description 1
- NFIVJOSXJDORSP-QMMMGPOBSA-N (2s)-2-amino-3-(4-boronophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(B(O)O)C=C1 NFIVJOSXJDORSP-QMMMGPOBSA-N 0.000 description 1
- JSXMFBNJRFXRCX-NSHDSACASA-N (2s)-2-amino-3-(4-prop-2-ynoxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OCC#C)C=C1 JSXMFBNJRFXRCX-NSHDSACASA-N 0.000 description 1
- LJHYWUVYIKCPGU-VIFPVBQESA-N (2s)-2-amino-3-[4-(carboxymethyl)phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(CC(O)=O)C=C1 LJHYWUVYIKCPGU-VIFPVBQESA-N 0.000 description 1
- GTVVZTAFGPQSPC-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-nitrophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C([N+]([O-])=O)C=C1 GTVVZTAFGPQSPC-QMMMGPOBSA-N 0.000 description 1
- IBCKYXVMEMSMQM-JTQLQIEISA-N (2s)-3-(3-acetylphenyl)-2-aminopropanoic acid Chemical compound CC(=O)C1=CC=CC(C[C@H](N)C(O)=O)=C1 IBCKYXVMEMSMQM-JTQLQIEISA-N 0.000 description 1
- ZXSBHXZKWRIEIA-JTQLQIEISA-N (2s)-3-(4-acetylphenyl)-2-azaniumylpropanoate Chemical compound CC(=O)C1=CC=C(C[C@H](N)C(O)=O)C=C1 ZXSBHXZKWRIEIA-JTQLQIEISA-N 0.000 description 1
- QHGDJQUCSGUYMF-QMMMGPOBSA-N (2s)-3-hydroxy-2-[(2-nitrophenyl)methylamino]propanoic acid Chemical compound OC[C@@H](C(O)=O)NCC1=CC=CC=C1[N+]([O-])=O QHGDJQUCSGUYMF-QMMMGPOBSA-N 0.000 description 1
- OJTJKAUNOLVMDX-LBPRGKRZSA-N (2s)-6-amino-2-(phenylmethoxycarbonylamino)hexanoic acid Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)OCC1=CC=CC=C1 OJTJKAUNOLVMDX-LBPRGKRZSA-N 0.000 description 1
- DQUHYEDEGRNAFO-QMMMGPOBSA-N (2s)-6-amino-2-[(2-methylpropan-2-yl)oxycarbonylamino]hexanoic acid Chemical compound CC(C)(C)OC(=O)N[C@H](C(O)=O)CCCCN DQUHYEDEGRNAFO-QMMMGPOBSA-N 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- NILQLFBWTXNUOE-UHFFFAOYSA-N 1-aminocyclopentanecarboxylic acid Chemical compound OC(=O)C1(N)CCCC1 NILQLFBWTXNUOE-UHFFFAOYSA-N 0.000 description 1
- JTTIOYHBNXDJOD-UHFFFAOYSA-N 2,4,6-triaminopyrimidine Chemical compound NC1=CC(N)=NC(N)=N1 JTTIOYHBNXDJOD-UHFFFAOYSA-N 0.000 description 1
- OGNSCSPNOLGXSM-UHFFFAOYSA-N 2,4-diaminobutyric acid Chemical compound NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- OMGHIGVFLOPEHJ-UHFFFAOYSA-N 2,5-dihydro-1h-pyrrol-1-ium-2-carboxylate Chemical compound OC(=O)C1NCC=C1 OMGHIGVFLOPEHJ-UHFFFAOYSA-N 0.000 description 1
- VYCNOBNEBXGHKT-UHFFFAOYSA-N 2-(2-methylhydrazinyl)acetic acid Chemical compound CNNCC(O)=O VYCNOBNEBXGHKT-UHFFFAOYSA-N 0.000 description 1
- CLTMYNWFSDZKKI-UHFFFAOYSA-N 2-(aminomethyl)benzoic acid Chemical compound NCC1=CC=CC=C1C(O)=O CLTMYNWFSDZKKI-UHFFFAOYSA-N 0.000 description 1
- CQJAWZCYNRBZDL-UHFFFAOYSA-N 2-(methylazaniumyl)butanoate Chemical compound CCC(NC)C(O)=O CQJAWZCYNRBZDL-UHFFFAOYSA-N 0.000 description 1
- QDGAVODICPCDMU-UHFFFAOYSA-N 2-amino-3-[3-[bis(2-chloroethyl)amino]phenyl]propanoic acid Chemical compound OC(=O)C(N)CC1=CC=CC(N(CCCl)CCCl)=C1 QDGAVODICPCDMU-UHFFFAOYSA-N 0.000 description 1
- JINGUCXQUOKWKH-UHFFFAOYSA-N 2-aminodecanoic acid Chemical compound CCCCCCCCC(N)C(O)=O JINGUCXQUOKWKH-UHFFFAOYSA-N 0.000 description 1
- AKVBCGQVQXPRLD-UHFFFAOYSA-N 2-aminooctanoic acid Chemical compound CCCCCCC(N)C(O)=O AKVBCGQVQXPRLD-UHFFFAOYSA-N 0.000 description 1
- SNDPXSYFESPGGJ-UHFFFAOYSA-N 2-aminopentanoic acid Chemical compound CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 1
- JVPFOKXICYJJSC-UHFFFAOYSA-N 2-azaniumylnonanoate Chemical compound CCCCCCCC(N)C(O)=O JVPFOKXICYJJSC-UHFFFAOYSA-N 0.000 description 1
- JVGVDSSUAVXRDY-UHFFFAOYSA-N 3-(4-hydroxyphenyl)lactic acid Chemical compound OC(=O)C(O)CC1=CC=C(O)C=C1 JVGVDSSUAVXRDY-UHFFFAOYSA-N 0.000 description 1
- GSWYUZQBLVUEPH-UHFFFAOYSA-N 3-(azaniumylmethyl)benzoate Chemical compound NCC1=CC=CC(C(O)=O)=C1 GSWYUZQBLVUEPH-UHFFFAOYSA-N 0.000 description 1
- FBTSQILOGYXGMD-LURJTMIESA-N 3-nitro-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C([N+]([O-])=O)=C1 FBTSQILOGYXGMD-LURJTMIESA-N 0.000 description 1
- ADVPTQAUNPRNPO-REOHCLBHSA-N 3-sulfino-L-alanine Chemical compound OC(=O)[C@@H](N)C[S@@](O)=O ADVPTQAUNPRNPO-REOHCLBHSA-N 0.000 description 1
- CMUHFUGDYMFHEI-QMMMGPOBSA-N 4-amino-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N)C=C1 CMUHFUGDYMFHEI-QMMMGPOBSA-N 0.000 description 1
- KHABBYNLBYZCKP-UHFFFAOYSA-N 4-aminopiperidin-1-ium-4-carboxylate Chemical compound OC(=O)C1(N)CCNCC1 KHABBYNLBYZCKP-UHFFFAOYSA-N 0.000 description 1
- NIGWMJHCCYYCSF-QMMMGPOBSA-N 4-chloro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(Cl)C=C1 NIGWMJHCCYYCSF-QMMMGPOBSA-N 0.000 description 1
- XWHHYOYVRVGJJY-QMMMGPOBSA-N 4-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-QMMMGPOBSA-N 0.000 description 1
- XWHHYOYVRVGJJY-UHFFFAOYSA-N 4-fluorophenylalanine Chemical compound OC(=O)C(N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-UHFFFAOYSA-N 0.000 description 1
- LDCYZAJDBXYCGN-VIFPVBQESA-N 5-hydroxy-L-tryptophan Chemical compound C1=C(O)C=C2C(C[C@H](N)C(O)=O)=CNC2=C1 LDCYZAJDBXYCGN-VIFPVBQESA-N 0.000 description 1
- 229940000681 5-hydroxytryptophan Drugs 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- 102100022900 Actin, cytoplasmic 1 Human genes 0.000 description 1
- 241000186041 Actinomyces israelii Species 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 108700023418 Amidases Proteins 0.000 description 1
- 102000003991 Aminoacyltransferases Human genes 0.000 description 1
- 108090000438 Aminoacyltransferases Proteins 0.000 description 1
- 241000605281 Anaplasma phagocytophilum Species 0.000 description 1
- 108020005098 Anticodon Proteins 0.000 description 1
- 241001346367 Apiotrichum mycotoxinivorans Species 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- KDZOASGQNOPSCU-WDSKDSINSA-N Argininosuccinic acid Chemical compound OC(=O)[C@@H](N)CCC\N=C(/N)N[C@H](C(O)=O)CC(O)=O KDZOASGQNOPSCU-WDSKDSINSA-N 0.000 description 1
- 241000235349 Ascomycota Species 0.000 description 1
- 241000894009 Azorhizobium caulinodans Species 0.000 description 1
- 241000589149 Azotobacter vinelandii Species 0.000 description 1
- 241000193738 Bacillus anthracis Species 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 241000194106 Bacillus mycoides Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000606125 Bacteroides Species 0.000 description 1
- 241000606124 Bacteroides fragilis Species 0.000 description 1
- 241001518086 Bartonella henselae Species 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 241000186000 Bifidobacterium Species 0.000 description 1
- 241000760366 Blastocladiomycota Species 0.000 description 1
- 241000588807 Bordetella Species 0.000 description 1
- 241000588779 Bordetella bronchiseptica Species 0.000 description 1
- 241000589969 Borreliella burgdorferi Species 0.000 description 1
- 241001522017 Brettanomyces anomalus Species 0.000 description 1
- 244000027711 Brettanomyces bruxellensis Species 0.000 description 1
- 235000000287 Brettanomyces bruxellensis Nutrition 0.000 description 1
- 241000193764 Brevibacillus brevis Species 0.000 description 1
- 241000589567 Brucella abortus Species 0.000 description 1
- 241001148106 Brucella melitensis Species 0.000 description 1
- 241001453380 Burkholderia Species 0.000 description 1
- 241000589513 Burkholderia cepacia Species 0.000 description 1
- 241000722910 Burkholderia mallei Species 0.000 description 1
- 241001136175 Burkholderia pseudomallei Species 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 241000589877 Campylobacter coli Species 0.000 description 1
- 241000589874 Campylobacter fetus Species 0.000 description 1
- 241000589875 Campylobacter jejuni Species 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 241000144583 Candida dubliniensis Species 0.000 description 1
- 244000206911 Candida holmii Species 0.000 description 1
- 235000002965 Candida holmii Nutrition 0.000 description 1
- 241000222173 Candida parapsilosis Species 0.000 description 1
- 241000222157 Candida viswanathii Species 0.000 description 1
- 102100026548 Caspase-8 Human genes 0.000 description 1
- 108090000538 Caspase-8 Proteins 0.000 description 1
- 241000010804 Caulobacter vibrioides Species 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 241001508813 Clavispora lusitaniae Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 241000193449 Clostridium tetani Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108010073254 Colicins Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- 241000606678 Coxiella burnetii Species 0.000 description 1
- 241001527609 Cryptococcus Species 0.000 description 1
- 241001522864 Cryptococcus gattii VGI Species 0.000 description 1
- 241000223233 Cutaneotrichosporon cutaneum Species 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- COLNVLDHVKWLRT-MRVPVSSYSA-N D-phenylalanine Chemical compound OC(=O)[C@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-MRVPVSSYSA-N 0.000 description 1
- 229930182832 D-phenylalanine Natural products 0.000 description 1
- 230000009946 DNA mutation Effects 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 241000605310 Ehrlichia chaffeensis Species 0.000 description 1
- 241000588697 Enterobacter cloacae Species 0.000 description 1
- 241000520130 Enterococcus durans Species 0.000 description 1
- 241000194032 Enterococcus faecalis Species 0.000 description 1
- 241000194031 Enterococcus faecium Species 0.000 description 1
- 241000194030 Enterococcus gallinarum Species 0.000 description 1
- 241001465321 Eremothecium Species 0.000 description 1
- 241001465328 Eremothecium gossypii Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000660147 Escherichia coli str. K-12 substr. MG1655 Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000589602 Francisella tularensis Species 0.000 description 1
- 241001621835 Frateuria aurantia Species 0.000 description 1
- 241000207201 Gardnerella vaginalis Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 241000159512 Geotrichum Species 0.000 description 1
- 244000168141 Geotrichum candidum Species 0.000 description 1
- 235000017388 Geotrichum candidum Nutrition 0.000 description 1
- 241001583499 Glomeromycotina Species 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241000606766 Haemophilus parainfluenzae Species 0.000 description 1
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 1
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 1
- 101001023784 Heteractis crispa GFP-like non-fluorescent chromoprotein Proteins 0.000 description 1
- 101000724418 Homo sapiens Neutral amino acid transporter B(0) Proteins 0.000 description 1
- YZJSUQQZGCHHNQ-UHFFFAOYSA-N Homoglutamine Chemical compound OC(=O)C(N)CCCC(N)=O YZJSUQQZGCHHNQ-UHFFFAOYSA-N 0.000 description 1
- 206010020460 Human T-cell lymphotropic virus type I infection Diseases 0.000 description 1
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 1
- 206010020429 Human ehrlichiosis Diseases 0.000 description 1
- LCWXJXMHJVIJFK-UHFFFAOYSA-N Hydroxylysine Chemical class NCC(O)CC(N)CC(O)=O LCWXJXMHJVIJFK-UHFFFAOYSA-N 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical class O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 241001534216 Klebsiella granulomatis Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- QUOGESRFPZDMMT-UHFFFAOYSA-N L-Homoarginine Natural products OC(=O)C(N)CCCCNC(N)=N QUOGESRFPZDMMT-UHFFFAOYSA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical compound OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-YFKPBYRVSA-N L-homoarginine Chemical compound OC(=O)[C@@H](N)CCCCNC(N)=N QUOGESRFPZDMMT-YFKPBYRVSA-N 0.000 description 1
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical compound OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- ZKZBPNGNEQAJSX-REOHCLBHSA-N L-selenocysteine Chemical compound [SeH]C[C@H](N)C(O)=O ZKZBPNGNEQAJSX-REOHCLBHSA-N 0.000 description 1
- 229930182853 L-selenocysteine Natural products 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- XUIIKFGFIJCVMT-LBPRGKRZSA-N L-thyroxine Chemical compound IC1=CC(C[C@H]([NH3+])C([O-])=O)=CC(I)=C1OC1=CC(I)=C(O)C(I)=C1 XUIIKFGFIJCVMT-LBPRGKRZSA-N 0.000 description 1
- 241000858110 Lachancea Species 0.000 description 1
- 241000235087 Lachancea kluyveri Species 0.000 description 1
- 244000199885 Lactobacillus bulgaricus Species 0.000 description 1
- 235000013960 Lactobacillus bulgaricus Nutrition 0.000 description 1
- 244000199866 Lactobacillus casei Species 0.000 description 1
- 235000013958 Lactobacillus casei Nutrition 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 108010071170 Leucine-tRNA ligase Proteins 0.000 description 1
- 102100023342 Leucine-tRNA ligase, mitochondrial Human genes 0.000 description 1
- 241000221479 Leucosporidium Species 0.000 description 1
- 241000186779 Listeria monocytogenes Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 241001134775 Lysinibacillus fusiformis Species 0.000 description 1
- 241000202974 Methanobacterium Species 0.000 description 1
- 241000203407 Methanocaldococcus jannaschii Species 0.000 description 1
- 241000203353 Methanococcus Species 0.000 description 1
- 241000205276 Methanosarcina Species 0.000 description 1
- 101710181812 Methionine aminopeptidase Proteins 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 241000235048 Meyerozyma guilliermondii Species 0.000 description 1
- 241000191938 Micrococcus luteus Species 0.000 description 1
- 241000243190 Microsporidia Species 0.000 description 1
- 241000588621 Moraxella Species 0.000 description 1
- 241001149965 Mrakia frigida Species 0.000 description 1
- 241000186367 Mycobacterium avium Species 0.000 description 1
- 241000186366 Mycobacterium bovis Species 0.000 description 1
- 241000186364 Mycobacterium intracellulare Species 0.000 description 1
- 241000186362 Mycobacterium leprae Species 0.000 description 1
- 241000908167 Mycobacterium lepraemurium Species 0.000 description 1
- 241000187480 Mycobacterium smegmatis Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- 241000202952 Mycoplasma fermentans Species 0.000 description 1
- 241000204048 Mycoplasma hominis Species 0.000 description 1
- 241001135743 Mycoplasma penetrans Species 0.000 description 1
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 1
- DTERQYGMUDWYAZ-ZETCQYMHSA-N N(6)-acetyl-L-lysine Chemical compound CC(=O)NCCCC[C@H]([NH3+])C([O-])=O DTERQYGMUDWYAZ-ZETCQYMHSA-N 0.000 description 1
- CYZKJBZEIFWZSR-LURJTMIESA-N N(alpha)-methyl-L-histidine Chemical compound CN[C@H](C(O)=O)CC1=CNC=N1 CYZKJBZEIFWZSR-LURJTMIESA-N 0.000 description 1
- NPKISZUVEBESJI-AWEZNQCLSA-N N-benzoyl-L-phenylalanine Chemical compound C([C@@H](C(=O)O)NC(=O)C=1C=CC=CC=1)C1=CC=CC=C1 NPKISZUVEBESJI-AWEZNQCLSA-N 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- NPKISZUVEBESJI-UHFFFAOYSA-N Nalpha-benzoyl-L-phenylalanine Natural products C=1C=CC=CC=1C(=O)NC(C(=O)O)CC1=CC=CC=C1 NPKISZUVEBESJI-UHFFFAOYSA-N 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 241000760367 Neocallimastigomycetes Species 0.000 description 1
- 102100028267 Neutral amino acid transporter B(0) Human genes 0.000 description 1
- CIQHWLTYGMYQQR-QMMMGPOBSA-N O(4')-sulfo-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OS(O)(=O)=O)C=C1 CIQHWLTYGMYQQR-QMMMGPOBSA-N 0.000 description 1
- 241001112159 Ogataea Species 0.000 description 1
- 241001099341 Ogataea polymorpha Species 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 241000606860 Pasteurella Species 0.000 description 1
- 241000606856 Pasteurella multocida Species 0.000 description 1
- 241000191992 Peptostreptococcus Species 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000235645 Pichia kudriavzevii Species 0.000 description 1
- 102220539780 Piwi-like protein 1_M40A_mutation Human genes 0.000 description 1
- 241000605862 Porphyromonas gingivalis Species 0.000 description 1
- 241000192137 Prochlorococcus marinus Species 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 241000223252 Rhodotorula Species 0.000 description 1
- 241000223254 Rhodotorula mucilaginosa Species 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 241000606697 Rickettsia prowazekii Species 0.000 description 1
- 241000606695 Rickettsia rickettsii Species 0.000 description 1
- 241000203719 Rothia dentocariosa Species 0.000 description 1
- 102000051619 SUMO-1 Human genes 0.000 description 1
- 108700038981 SUMO-1 Proteins 0.000 description 1
- 235000018370 Saccharomyces delbrueckii Nutrition 0.000 description 1
- 244000253911 Saccharomyces fragilis Species 0.000 description 1
- 235000018368 Saccharomyces fragilis Nutrition 0.000 description 1
- 241001123227 Saccharomyces pastorianus Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241001354013 Salmonella enterica subsp. enterica serovar Enteritidis Species 0.000 description 1
- 241000293871 Salmonella enterica subsp. enterica serovar Typhi Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 241000235060 Scheffersomyces stipitis Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000607715 Serratia marcescens Species 0.000 description 1
- 241000607764 Shigella dysenteriae Species 0.000 description 1
- 241000228389 Sporidiobolus Species 0.000 description 1
- 241000228393 Sporidiobolus salmonicolor Species 0.000 description 1
- 241000297588 Sporobolomyces koalae Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 241000191963 Staphylococcus epidermidis Species 0.000 description 1
- 241000122973 Stenotrophomonas maltophilia Species 0.000 description 1
- 241000193985 Streptococcus agalactiae Species 0.000 description 1
- 241000194043 Streptococcus criceti Species 0.000 description 1
- 241000194049 Streptococcus equinus Species 0.000 description 1
- 241001134658 Streptococcus mitis Species 0.000 description 1
- 241000194019 Streptococcus mutans Species 0.000 description 1
- 241000194025 Streptococcus oralis Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 241000194052 Streptococcus ratti Species 0.000 description 1
- 241000194024 Streptococcus salivarius Species 0.000 description 1
- 241000194023 Streptococcus sanguinis Species 0.000 description 1
- 241000193987 Streptococcus sobrinus Species 0.000 description 1
- 241001312524 Streptococcus viridans Species 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 241000192707 Synechococcus Species 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 241000723792 Tobacco etch virus Species 0.000 description 1
- 241000006364 Torula Species 0.000 description 1
- 241000235006 Torulaspora Species 0.000 description 1
- 244000288561 Torulaspora delbrueckii Species 0.000 description 1
- 235000014681 Torulaspora delbrueckii Nutrition 0.000 description 1
- GYDJEQRTZSCIOI-UHFFFAOYSA-N Tranexamic acid Chemical compound NCC1CCC(C(O)=O)CC1 GYDJEQRTZSCIOI-UHFFFAOYSA-N 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 241000589886 Treponema Species 0.000 description 1
- 241000589892 Treponema denticola Species 0.000 description 1
- 102000003441 UBR1 Human genes 0.000 description 1
- 101150118716 UBR1 gene Proteins 0.000 description 1
- 102000012275 Ubiquitin domains Human genes 0.000 description 1
- 108050002897 Ubiquitin domains Proteins 0.000 description 1
- 241000607272 Vibrio parahaemolyticus Species 0.000 description 1
- 108091093126 WHP Posttrascriptional Response Element Proteins 0.000 description 1
- 241000604961 Wolbachia Species 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- 241000607734 Yersinia <bacteria> Species 0.000 description 1
- 241000607447 Yersinia enterocolitica Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 241000607477 Yersinia pseudotuberculosis Species 0.000 description 1
- 241000758405 Zoopagomycotina Species 0.000 description 1
- 241000235017 Zygosaccharomyces Species 0.000 description 1
- 241000235029 Zygosaccharomyces bailii Species 0.000 description 1
- 241000235033 Zygosaccharomyces rouxii Species 0.000 description 1
- 241000588901 Zymomonas Species 0.000 description 1
- 241000222126 [Candida] glabrata Species 0.000 description 1
- 241000192286 [Candida] stellata Species 0.000 description 1
- 241000606834 [Haemophilus] ducreyi Species 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- ADVPTQAUNPRNPO-UHFFFAOYSA-N alpha-amino-beta-sulfino-propionic acid Natural products OC(=O)C(N)CS(O)=O ADVPTQAUNPRNPO-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 102000005922 amidase Human genes 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229960002684 aminocaproic acid Drugs 0.000 description 1
- QCTBMLYLENLHLA-UHFFFAOYSA-N aminomethylbenzoic acid Chemical compound NCC1=CC=C(C(O)=O)C=C1 QCTBMLYLENLHLA-UHFFFAOYSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 150000003934 aromatic aldehydes Chemical class 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 229960005261 aspartic acid Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 239000010426 asphalt Substances 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 229940065181 bacillus anthracis Drugs 0.000 description 1
- 229940092523 bartonella quintana Drugs 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 229940056450 brucella abortus Drugs 0.000 description 1
- 229940038698 brucella melitensis Drugs 0.000 description 1
- 229940074375 burkholderia mallei Drugs 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- 208000032343 candida glabrata infection Diseases 0.000 description 1
- 229940055022 candida parapsilosis Drugs 0.000 description 1
- 125000001314 canonical amino-acid group Chemical group 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000036978 cell physiology Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 101150074451 clpP gene Proteins 0.000 description 1
- 101150043719 clpP1 gene Proteins 0.000 description 1
- 101150102296 clpP2 gene Proteins 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229960002433 cysteine Drugs 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Chemical class NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 230000009504 deubiquitination Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Chemical class OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 229940032049 enterococcus faecalis Drugs 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical class NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000003631 expected effect Effects 0.000 description 1
- 108010021843 fluorescent protein 583 Proteins 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 230000001890 gluconeogenic effect Effects 0.000 description 1
- 229960002989 glutamic acid Drugs 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- 229960002449 glycine Drugs 0.000 description 1
- 229940047650 haemophilus influenzae Drugs 0.000 description 1
- 229940037467 helicobacter pylori Drugs 0.000 description 1
- 239000008241 heterogeneous mixture Substances 0.000 description 1
- 244000052637 human pathogen Species 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- QJHBJHUKURJDLG-UHFFFAOYSA-N hydroxy-L-lysine Chemical class NCCCCC(NO)C(O)=O QJHBJHUKURJDLG-UHFFFAOYSA-N 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 230000017730 intein-mediated protein splicing Effects 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 229940045505 klebsiella pneumoniae Drugs 0.000 description 1
- 229940031154 kluyveromyces marxianus Drugs 0.000 description 1
- 229940004208 lactobacillus bulgaricus Drugs 0.000 description 1
- 229940017800 lactobacillus casei Drugs 0.000 description 1
- 229940115932 legionella pneumophila Drugs 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000012737 microarray-based gene expression Methods 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012243 multiplex automated genomic engineering Methods 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- LDCYZAJDBXYCGN-UHFFFAOYSA-N oxitriptan Natural products C1=C(O)C=C2C(CC(N)C(O)=O)=CNC2=C1 LDCYZAJDBXYCGN-UHFFFAOYSA-N 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- TVIDEEHSOPHZBR-AWEZNQCLSA-N para-(benzoyl)-phenylalanine Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1C(=O)C1=CC=CC=C1 TVIDEEHSOPHZBR-AWEZNQCLSA-N 0.000 description 1
- 229940051027 pasteurella multocida Drugs 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 230000007026 protein scission Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000007634 remodeling Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 229940046939 rickettsia prowazekii Drugs 0.000 description 1
- 229940075118 rickettsia rickettsii Drugs 0.000 description 1
- 102220067346 rs138630815 Human genes 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 229960001153 serine Drugs 0.000 description 1
- 229940007046 shigella dysenteriae Drugs 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- IFGCUJZIWBUILZ-UHFFFAOYSA-N sodium 2-[[2-[[hydroxy-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyphosphoryl]amino]-4-methylpentanoyl]amino]-3-(1H-indol-3-yl)propanoic acid Chemical compound [Na+].C=1NC2=CC=CC=C2C=1CC(C(O)=O)NC(=O)C(CC(C)C)NP(O)(=O)OC1OC(C)C(O)C(O)C1O IFGCUJZIWBUILZ-UHFFFAOYSA-N 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- NPDBDJFLKKQMCM-UHFFFAOYSA-N tert-butylglycine Chemical compound CC(C)(C)C(N)C(O)=O NPDBDJFLKKQMCM-UHFFFAOYSA-N 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 229960002898 threonine Drugs 0.000 description 1
- XUIIKFGFIJCVMT-UHFFFAOYSA-N thyroxine-binding globulin Natural products IC1=CC(CC([NH3+])C([O-])=O)=CC(I)=C1OC1=CC(I)=C(O)C(I)=C1 XUIIKFGFIJCVMT-UHFFFAOYSA-N 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 229940035722 triiodothyronine Drugs 0.000 description 1
- 229940118696 vibrio cholerae Drugs 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 229940098232 yersinia enterocolitica Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/06—Preparation of peptides or proteins produced by the hydrolysis of a peptide bond, e.g. hydrolysate products
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Definitions
- the present invention relates in general to methods of making proteins with nonstandard amino acids.
- Naturally-occurring (standard) amino acids are the 20 unique building blocks composing all proteins derived from biological systems.
- Non-standard amino acids have been developed bearing functional groups beyond those encoded by the 20 standard amino acids.
- more than 70 non-standard amino acids have been developed for in vivo protein translation. See Liu et al., Annual Review of Biochemistry 79:413-444 (2010).
- the present disclosure provides a method for selectively degrading proteins having a standard amino acid at an amino acid target location within a mixture or collection or population of proteins including proteins having a nonstandard amino acid at the amino acid target location.
- the standard amino acid may be considered an undesired amino acid.
- the present disclosure also provides a method for selectively degrading proteins having an undesired non-target NSAA at an amino acid target location within a mixture or collection or population of proteins including proteins having a desired nonstandard amino acid at the amino acid target location.
- the undesired non-target NSAA may be considered an undesired amino acid.
- aspects of the disclosure utilize methods and materials described herein to distinguish between different NSAAs at an amino acid target location when more than one NSAA is used during the synthesis procedure, such as when multiple different NSAAs are used simultaneously in vivo.
- an in vivo methods are provided that can discriminate between NSAAs competing for incorporation at the same site of a protein.
- the methods may be carried out in vivo, i.e. within a cell .
- methods of making target polypeptides having a non-standard amino acid substitution at an amino acid target location and methods of degrading target polypeptides having a standard amino acid substitution or an undesired NSAA (either referred to as an undesired amino acid) at the amino acid target location may be carried out in vivo, i.e. within a cell.
- a target polypeptide is made which includes a non-standard amino acid at an amino acid target location using an engineered amino-acyl tRNA synthetase and a transfer RNA corresponding to the non-standard amino acid.
- a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location.
- an undesired amino acid i.e., a standard amino acid or an undesired NSAA
- methods are provided herein for degrading the target polypeptide including the undesired amino acid, i.e. a standard amino acid or an undesired NSAA, at the amino acid target location, such as by using the N-end rule pathway for protein degradation.
- post-translational proofreading is used to distinguish correctly translated from incorrectly translated proteins.
- the disclosure provides a post-translational proofreading concept that results in protein degradation in a cell using the N-end rule unless the desired NSAA is incorporated at the N-terminus of the protein.
- the target polypeptide is made within a cell, i.e. in vivo, and the cell produces a protease and may also produce an adapter protein for the protease, i.e. a degradation system.
- a protease i.e. a degradation system.
- Certain cells, such as E. coli naturally produce the protease system ClpAP and the adaptor ClpS, and both components are present in order for the N-end rule degradation pathway to function.
- adaptor variants such as ClpS variants may be used which have different specificities than the natural ClpS.
- cell production of the natural ClpS be inactivated, for example, such as by gene modification where the ClpS gene sequence in the genome is modified to introduce premature stop codons.
- Such degradation systems are representative of a cellular system that degrades proteins according to the N-end rule pathway for protein degradation as is known in the art.
- the half-life of a protein is determined by its N-terminal residue.
- An N-terminal residue is said to be destabilizing if it is recognized by a cellular degradation system, which in turn leads to degradation of the protein. If the N-terminal residue is not destabilizing, i.e, it is not recognized by a cellular degradation system, then the protein is not subject to the N-end rule pathway for protein degradation.
- Destabilizing amino acids may be natural amino acids.
- Destabilizing amino acids may be certain NSAAs.
- Destabilizing amino acids may be referred to as undesired amino acids.
- Stabilizing amino acids may be referred to as desired amino acids.
- Non-standard amino acids such as analogs to natural amino acids, may be stabilizing or destabilizing amino acids.
- Certain non-standard amino acids, such as analogs to natural amino acids may be resistant to degradation by cellular systems. In this manner, the method increases production (yield) of proteins having a desired amino acid, i.e., a stabilizing amino acid or an amino acid which is not destabilizing, such as certain nonstandard amino acids, insofar as such a protein may not be subject to the N-end rule pathway for protein degradation.
- the method decreases production (yield) of proteins having an undesired amino acid, such as a standard amino acid or a certain NSAA, insofar as such a protein may be subject to the N-end rule pathway for protein degradation.
- the method determines successful desired NSAA incorporation to the extent that target polypeptides that include an undesired amino acid, such as a standard amino acid or a certain destabilizing NSAA (undesired NSAA), at the amino acid target location may be degraded and target polypeptides that include a desired non-standard amino acid at the amino acid target location remain and may not be degraded.
- the present disclosure provides a method of optimizing production of proteins including a non-standard amino acid.
- Reaction conditions are provided for making a target polypeptide including a non-standard amino acid substitution at an amino acid target location using an engineered amino-acyl t NA synthetase and a transfer RNA as is known in the art.
- a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location. Should the engineered amino-acyl tRNA synthetase and a transfer RNA add a standard amino acid or undesired NSAA.
- methods are provided herein for degrading the target polypeptide including the standard amino acid or undesired NSAA at the amino acid target location.
- the amount of proteins having a desired nonstandard amino acid is determined. Given the amount of protein produced, the reaction conditions and/or the amino-acyl tRNA synthetase and/or tRNA are altered and the amount of proteins having the desired non-standard amino acid is again determined. The process is repeated until the process is optimized for a desired yield of protein including desired NSAA.
- Exemplar ⁇ ' reaction conditions which may be altered according to the present disclosure include changes of culture media or concentration of desired NSAA, Alterations or changes to the amino-acyl tRNA synthetase and/or tRNA include one or more mutations that may improve performance of the amino-acyl tRNA synthetase and/or tRNA. Such mutations may be made by methods known to those of skill in the art such as random mutagenesis approaches such as error-prone polymerase chain reaction (PCR) or directed approaches such as site-saturation mutagenesis or rational point mutagenesis.
- PCR polymerase chain reaction
- the present disclosure provides a method of distinguishing a protein having a nonstandard amino acid, i.e. a desired nonstandard amino acid, at an amino acid target location from a protein having a standard amino acid or undesired nonstandard amino acid at the amino acid target location by degrading the protein if a destabilizing amino acid (i.e., standard amino acid or undesired destabilizing NSAA) recognized by a protease is present at the amino acid target location.
- a destabilizing amino acid i.e., standard amino acid or undesired destabilizing NSAA
- the present disclosure provides a method of making a target protein using an amino- acyl tRN A synthetase engineered or designed to incorporate a target nonstandard amino acid (i.e., a desired NSAA) in the target protein at an amino acid target location, wherein a removable protecting group is adjacent the amino acid target location.
- a target nonstandard amino acid i.e., a desired NSAA
- the removable protecting group is removed to expose the amino acid target location in the presence of a degradation enzyme or degradation enzyme/adaptor complex.
- the adaptor discriminates between amino acids and accordingly whether degradation will occur by the degradation enzyme.
- the degradation enzyme/adaptor complex degrades the target protein if a standard amino acid or undesired NS AA is present at the amino acid target location. If the desired nonstandard amino acid is present at the amino acid target location, then the degradation enzyme/adaptor complex is ineffective to degrade the target protein.
- Figs. 1A and IB illustrate methods described herein to selectively degrade proteins that contain one or more undesired amino acids at one or more target amino acid locations based on a N-end rule degradation strategy.
- Fig. 1 A and Fig. IB depict aspects of the present disclosure where a target protein including a standard amino acid at an amino acid target location is selectively degraded over the target protein including a non-standard amino acid at the amino acid target location. Degradation is carried out in vivo according to the N-end rule.
- Fig. 1 A shows the three candidate site-specific OTS systems used previously in E. coli.
- Fig. 1C shows measurements of in vivo fluorescence normalized to optical density (FL/OD) corresponding to different combinations of post-translational proofreading components.
- FL/OD optical density
- Fig, ID shows a control experiment that demonstrates the expected effect of Ubpl /ClpS expression on FL/OD signal resulting from GFP protein with N-terminal UCG, UAC, or LTAG codons.
- Fig. IE shows the FL/OD as a function of the number of U AG codons.
- Figs. 2A and 2B shows the activities of different AARSs.
- Fig. 2A shows the activity of different amino-acyl tRNA synthetases ("AARS") in the absence and presence of NSAA.
- Fig. 2B shows the activity of the AARSs when Ubpl and ClpS are expressed.
- Figs. 3A-3E shows the effect of various engineered ClpS variants and that ClpS variants may be engineered to be tunable to a particular substrate.
- Fig. 3 A shows the activity profi le of synthetase- 1 on an NSAA panel to determine their natural N-end rule classification of these NSAAs.
- Fig. 3B shows the activity profile of synthetase-2 on structurally smaller NSAAs that were not substrates of synthetase-1.
- Fig. 3C shows the four ClpS residues within four A of the para position of the N-terminal phenylalanine (F) on the substrate peptide.
- Fig. 3D shows the tunability achieved with rationally engineered ClpS variants.
- Fig. 3E shows further the properties of one ClpS variant of interest that leads to the classification of only two tested NSAAs as stabilizing.
- Figs. 4A-4B show sequence alignments of ClpS homologs.
- Fig. 4A shows the sequence alignments of 10 bacterial ClpS homologs including E. coli ClpS with the four ClpS residues of interest highlighted.
- Fig. 4B shows the sequence alignments of E. coli ClpS together with portions of the yeast and human UBRl E3 ubiquitin ligase protein homologs.
- the sequence alignments identify the region of UBRl as suitable for rational engineering to adjust the N-end rule status of various NSAAs using methods known to those of skill in the art.
- Wild-type UBRl in yeast and human cells contains an isoleucine (I) rather than valine (V) at the position equivalent to "ClpS_V65F. Accordingly, one of skill will recognize that the natural N-end status of NSAAs may be different in eukaryotes than in prokaryotes.
- the present disclosure provides a method of making a target polypeptide in a cell, wherein the target polypeptide includes a desired non-standard amino acid substitution at an amino acid target location, i.e. the non-standard amino acid withstands degradation as described herein.
- the terms "polypeptide” and "protein” include compounds that include amino acids joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid residues.
- Exemplary cells include prokaryotic cells and eukaryotic cells.
- Exemplary prokaryotic cells include bacteria, such as E. coli, such as genetically modified E. coli.
- the method includes genetically modifying the cell to express the target polypeptide including a desired non-standard amino acid substitution at an amino acid target location using an engineered amino-acyi tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid, and wherein the cell expresses the target polypeptide including a standard amino acid or an undesired NSAA at the amino acid target location when the engineered amino-acyl tRNA synthetase and transfer RNA pair nonselective ⁇ ' adds the standard amino acid or undesired NSAA at the amino acid target location.
- a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N- end amino acid is exposed at the amino acid target location.
- the removable protecting group is orthogonal within the cell in which it is being used.
- the cell includes a protease system for degrading the target polypeptide when the N-end amino acid is a standard amino acid.
- the cell includes a protease system for degrading the target polypeptide when the N- end amino acid is an undesired NSAA.
- the protease system includes an adapter protein and a corresponding protease. The adapter protein coordinates with the protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid.
- the protease system is endogenous.
- the protease and adaptor can be expressed constitutively.
- the protease system is exogenous.
- the protease system is under influence of a promoter.
- the adapter protein of the protease system is under influence of an inducible promoter.
- the adapter protein is upregulated.
- overexpression of adaptor to produce adaptor levels in excess of that found normally within a cell improves degradation of polypeptides having an undesired amino acid at the amino acid target location.
- an adaptor protein is provided that facilitates N-end rule classification of an NSAA.
- Ubiquitin is a suitable protecting group in prokaryotic cells because it is orthogonal but it is not a suitable protecting group in eukaryotic cells because it is not orthogonal .
- ubiquitin is N-terminally added to proteins often to initiate the process of protein degradation in the proteasome.
- the adaptor proteins in eukaryotic cells are homoiogs of ClpS known as Ubiquitin E3 ligases. According to the present disclosure, ubiquitin E3 ligase domain is altered in order to change the N-end rule classification of an NSAA.
- the removable protecting group is removed to generate an N- end amino acid, and the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA.
- the target polypeptide including a desired non-standard amino acid substitution i.e. which is resistant to degradation, is enriched within the cell.
- embodiments of the disclosure are directed to methods that allow selective degradation of proteins having a standard amino acid or undesired NSAA instead of a desired nonstandard amino acid at their N-termini in a cell. The methods can be used for producing proteins with desired nonstandard amino acids at their N-termini with no detectable impurities.
- a method of identifying the presence of a target polypeptide including a desired non-standard amino acid, i.e. one which is resistant to degradation is provided.
- the target polypeptide includes a detectable moiety attached to the C-end of the target polypeptide. In this manner, if the target polypeptide (and detectable moiety) that is made by the cell is not subject to degradation as described above, then the detectable moiety is detected as a measure of the amount of target polypeptide generated by the cell.
- a detectable moiety is present at the C-end of the target polypeptide, the removable protecting group is removed to generate an N-end amino acid, the protease (whether accompanied by an adapter protein or not depending upon the protease system being used) degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA, for example, to thereby enrich the target polypeptide including a desired non-standard amino acid substitution, and the detectable moiety is detected as a measure of the amount of the target polypeptide including a desired non-standard amino acid substitution.
- a method for engineering synthetases that are more selective for incorporating non-standard amino acids versus standard amino acids at a selected site in a protein. Since all or substantially ail of proteins bearing a standard amino acid or an undesired NSAA at their N-terminus are degraded leaving only proteins with a desired nonstandard amino at their N-terminus, no or substantially no background signal due to standard amino acid or undesired NSAA incorporation results from the method. Synthetases can be evolved and their variants screened in a high-throughput fashion for their function of producing a protein incorporating a nonstandard amino acid, such as a desired NSAA. In this manner, those synthetases with improved function can be identified and modified further to further improve efficiency and selectivity .
- a cell is genetically modified to include a nucleic acid sequence which encodes for the target polypeptide that includes one or more non-standard amino acids within its amino acid sequence.
- the cell can be genomicaliy recoded, ("a genoniicallv recoded organsim") to the extent that one or more codons have been reassigned to encode for a non-standard amino acid.
- an amino-acyl tRNA synthetase/tRNA pair is engineered and the cell is capable of using the amino-acyl tRNA synthetase/tRNA pair to add the corresponding non-standard amino acid (when present in the cell) to a growing peptide sequence.
- Materials, conditions, and reagents for genetically modifying a cell to make a target protein having one or more amino acid sequences are described in the following references, each of which are hereby incorporated by reference in their entireties.
- MAGE multiplex automatable genome engineering
- CAGE hierarchical conjugative assembly genome engineering
- portions of recoded genomes can be synthesized and subsequently assembled, as described recently in an effort to construct a 57-codon organism (for example, as described in Ostrov, Nili, et al. "Design, synthesis, and testing toward a 57-codon genome.” Science 353.6301 (2016): 819-822 hereby incorporated by reference in its entirety).
- the modification of an organism, whether recoded or not recoded, in order to express a polypeptide containing a site-specific non-standard amino acid has been described extensively in the literature (for example, as described in Wang, Lei, et al.
- Basic to the present disclosure is the use of an amino-acyl tRNA synthetase/tRNA pair cognate to a nonstandard amino acid.
- Exemplar ⁇ ' amino-acyl tRNA synthetase/tRNA pairs cognate to a nonstandard amino acid are known to those of skill in the art or may be designed for particular non-standard amino acids, as is known in the art or as described in Wang, Lei, and Peter G. Schultz. "Expanding the genetic code " Angewandte chemie international edition 44.1 (2005): 34-66; Liu, Chang C, and Peter G. Schultz.
- the amino-acyl tRNA synthetase/tRNA pair cognate to a nonstandard amino acid is orthogonal to the cellular components of the ceil in which it is used.
- the orthogonality (and therefore the suitability) of exogenous amino-acyl tRNA synthetase/tRNA pairs is dependent on the type of host organism.
- the JW/TyrRS/tRNAcuA pair is orthogonal in E. coli but not in eukaryotic cells.
- the EcTyrRS/tR ' NAcuA pair and the cLeuRS/tRNAcuA pair are orthogonal in eukaryotic ceils but not in E. coli, whereas the PylRS/tRNAcuA pair is orthogonal in bacteria, eukaryotic ceils, and animals (see Chin, Jason W. "Expanding and reprogramming the genetic code of cells and animals.” Annual review of biochemistry 83 (2014): 379-408 hereby incorporated by reference in its entirety).
- the exogenous amino acyl tRNA synthetase should not recognize any native amino acids or native tRNA.
- the tRNA should not be recognized by any native amino-acyl tRNA synthetases.
- the non-standard amino acid should not be recognized by any native amino acyl tRNA synthetases, "Orthogonal" pairs meet one or more of the above conditions. It is to be understood that “orthogonal" pairs may lead to some mischarging, i.e. such as insubstantial mischarging for example, of orthogonal tRNA with native amino acids so long as sufficient efficiency of charging to the designed NSAA occurs.
- Exemplary families of synthetases for bacteria in addition to those described above and incorporated by reference include the PylRS/tRNAcuA pair and the Saccharomyces cerevisiae tryptophanyl-tRNA synthetase (ScWRS)/tRNAcuA pair. These exemplary synthetase families have natural analogs (lysine and tryptophan) that are N-end destabilizing amino acids.
- the following references describe useful synthetase famiiies and their associated NSAAs. Blight, Sherry K., et al.
- the target polypeptide includes a removable protecting group adjacent to the amino acid target location such that when the removable protecting group is removed, the amino acid target location is an N-end amino acid.
- exemplary removable protecting groups are known to those of skill in the art and can be readily identified in the literature based on the present disclosure.
- the removable protecting is a peptide sequence produced by the cell when making the target polypeptide.
- the removable protecting is a peptide sequence produced by the cell when making the target polypeptide, such that the removable peptide and the target polypeptide is a fusion.
- the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
- the removable protecting group is foreign to the cell, i.e. it is not endogenous to the cell. In this manner, the removable protecting is orthogonal to endogenous enzymes or other conditions within the cell.
- An exemplary removable protecting group includes a cleavable protecting group, such as an enzyme cleavable protecting group.
- the cell produces an enzyme that cleaves the removable protecting group to generate an N-end amino acid.
- An exemplary removable protecting group is a protein that is cleavable by a corresponding enzyme.
- a removable protecting group is foreign to the cell and is not endogenous.
- the enzyme that cleaves the removable protecting group is foreign to the cell and is not endogenous.
- an exemplary removable protecting group for prokaryotic cells is ubiquitin that is cleavable by Ubpl .
- an exemplar ⁇ ' removable protecting group for eukaryotic cells is the sequence MENLYFQ/*, where "*" is the target position for the NSAA (known in the field as the ⁇ position), where "/" represents the cut site, and where "ENLYFQ/*” is the sequence that is cleavable by certain variants of TEV protease.
- TEV protease cleavage efficiency is influenced by the choice of the amino acid at the ⁇ position.
- mutants of TEV protease have been engineered which have increased or altered substrate tolerance at the ⁇ position (see Renicke, Christian, Roberta Spadaccini, and Christof Taxis.
- an exemplary removable protecting protein domain includes a self-splicing domain, such as an intein, or other cleavable domains such as small ubiquitin modifiers (SUMO proteins).
- An exemplary removable protecting group may be a protein cleavage sequence along with its cognate partner, such as the TEV cleavage site and TEV protease.
- any of the strategies used to remove N-terminal affinity tags in protein purification can serve as alternative ways to expose the N-end target residue.
- An exemplary system to expose the N-end target residue includes a class of enzymes known as methionine aminopeptidases which can remove the first N-terminal residue, such as when the second residue is the amino acid target location which is the desired site of addition of a NSAA.
- the amino acid target location may be the N-terminal location or it may be any location between the N-terminal location and the C -terminal location. Accordingly, methods are provided for removing a protecting group and/or ail amino acids up to the amino acid target location, thereby rendering the amino acid target location being the N-terminal amino acid.
- the target polypeptide includes a detectable moiety attached to the C-end of the target polypeptide.
- detectable moieties are known to those of skill in the art and can be readily identified in the literature based on the present disclosure.
- the detectable moiety is a peptide sequence produced by the cell when making the target polypeptide.
- the detectable moiety is a peptide sequence produced by the cell when making the target polypeptide, such that the detectable moiety and the target polypeptide is a fusion.
- the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location and a detectable moiety attached to the target polypeptide, for example, at the C-end of the target polypeptide.
- the detectable moiety is foreign to the cell, i.e. it is not endogenous to the cell.
- An exemplary detectable moiety is a fluorescent moiety, such as GFP, that can be detected by fluorimetry, for example.
- An exemplary detectable moiety is a reporter protein.
- An exemplary detectable moiety includes a protein that confers antibiotic resistance which can be detected in the presence of an antibiotic.
- An exemplary detectable moiety includes an enzyme that performs a function (such as Beta-Gaiactosidase) that can lead to easy colorimetric output.
- epitope tags include histidine (His) tags, V5 tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags.
- reporter genes include, but are not limited to, glutathione-S-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, betaglucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP).
- GST glutathione-S-transferase
- HRP horseradish peroxidase
- CAT chloramphenicol acetyltransferase
- beta-galactosidase beta-galactosidase
- betaglucuronidase beta-galactosidase
- luciferase green fluorescent protein
- GFP green fluorescent protein
- HcRed HcRed
- DsRed cyan fluorescent protein
- YFP yellow
- aspects of the present disclosure include the genetic modification of a cell to include foreign genetic material which can then be expressed by the cell .
- the cell may be modified to include any other genetic material or elements useful in the expression of a nucleic acid sequence.
- Foreign genetic elements may be introduced or provided to a cell using methods known to those of skill in the art.
- the ceil may be genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a nonstandard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide.
- the nonstandard amino acid may be encoded by a corresponding nonsense or sense codon.
- the cell may be genomically receded to recognize an engineered amino-acyl tRNA synthetase corresponding or cognate to a non-standard amino acid.
- the cell may be genetically modified to include a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and/or a transfer RNA corresponding or cognate to the nonstandard amino acid and wherein the nonstandard amino acid is provided to the cell and the cell expresses the synthetase and the transfer RNA to include the nonstandard amino acid at the amino acid target location.
- the cell is genetically modified to include a foreign nucleic acid sequence encoding an enzyme for cleaving the removable protecting group under influence of an inducible promoter.
- the cell is genetically modified to include an inducible promoter influencing the production of an enzyme system for removal of the removable protecting group.
- the enzyme system or component thereof may be under influence of the inducible promoter.
- the adapter which helps associate the cleavage enzyme with the removable protecting group may be under influence of an inducible promoter.
- nucleic acids may be introduced into a cell using any method known to those skilled in the art for such introduction. Such methods include transfection, transduction, viral transduction, microinjection, lipofection, nucleofection, nanoparticie bombardment, transformation, conjugation and the like. One of skill in the art will readily understand and adapt such methods using readily identifiable literature sources.
- vectors include a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked.
- Vectors used to deliver the nucleic acids to cells as described herein include vectors known to those of skill in the art and used for such purposes.
- Certain exemplary vectors may be plasmids, lentiviruses or adeno-associated vimses known to those of skill in the art.
- Vectors include, but are not limited to, nucleic acid molecules that are single- stranded, double-stranded, or partially double-stranded, nucleic acid molecules that comprise one or more free ends, no free ends (e.g.
- vectors refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques.
- viral vector wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, lentivimses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses).
- Viral vectors also include polynucleotides carried by a virus for transfection into a host cell.
- vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as "expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.
- Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed.
- "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
- regulatory element is intended to include promoters, enhancers, internal ribosoma! entry sites (IRES), and other expression control elements (e.g. transcription termination signals, such as polyadenylation signals and poly-U sequences).
- IRES internal ribosoma! entry sites
- regulatory elements e.g. transcription termination signals, such as polyadenylation signals and poly-U sequences.
- transcription termination signals such as polyadenylation signals and poly-U sequences.
- Regulatory elements include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences).
- Regulatory elements useful in eukaryotic cells include a tissue-specific promoter that may direct expression primarily in a desired tissue of interest, such as muscle, neuron, bone, skin, blood, specific organs (e.g. liver, pancreas), or particular cell types (e.g. lymphocytes). Regulatory elements may also direct expression in a temporal-dependent manner, such as in a cell-cycle dependent or developmental stage-dependent manner, which may or may not also be tissue or cell-type specific.
- a vector may comprise one or more pol III promoter (e.g. 1, 2, 3, 4, 5, or more pol III promoters), one or more pol II promoters (e.g.
- pol I promoters 1, 2, 3, 4, 5, or more ⁇ II promoters
- pol III promoters include, but are not limited to, U6 and HI promoters.
- pol II promoters include, but are not limited to, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et al, Cell, 41 :521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the ⁇ -actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EFl a promoter and Pol II promoters described herein.
- RSV Rous sarcoma virus
- CMV cytomegalovirus
- PGK phosphoglycerol kinase
- enhancer elements such as WPRE; CMV enhancers; the R-U5' segment in LTR of HTLV-I (Mol. Cell. Biol, Vol. 8(1), p. 466-472, 1988), SV40 enhancer, and the intron sequence between exons 2 and 3 of rabbit ⁇ -globin (Proc. Natl. Acad, Sci , USA., Vol. 78(3), p. 1527-31 , 1981).
- a vector can be introduced into host cells to thereby produce transcripts, proteins, or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., clustered regularly interspersed short palindromic repeats (CRISPR) transcripts, proteins, enzymes, mutant forms thereof, fusion proteins thereof, etc.).
- CRISPR clustered regularly interspersed short palindromic repeats
- Common prokaryotic promoters include IPTG (isopropyl B-D-l-thiogalactopyranoside) inducible, anhydrotetracycline inducible, or arabinose inducible promoters.
- Such promoters express genes only in the presence of IPTG, anhydrotetracycline, or arabinose in the medium.
- An exemplary promoter for use in bacteria such as E. coli to express aminoacyl tRNA synthetase is an arabinose inducible promoter.
- An exemplar ⁇ ' promoter for use in bacteria such as E. coli to express a reporter protein is an anhydrotetracycline inducible promoter.
- a terminator sequence includes a section of nucleic acid sequence that marks the end of a gene or operon in genomic DNA during transcription. This sequence mediates transcriptional termination by providing signals in the newly synthesized mRNA that trigger processes which release the mRNA from the transcriptional complex. These processes include the direct interaction of the mRNA secondary structure with the complex and/or the indirect activities of recmited termination factors. Release of the transcriptional complex frees RNA polymerase and related transcriptional machinery to begin transcription of new mRNAs. Terminator sequences include those known in the art and identified and described herein.
- the cell includes a protease system for degrading the target polypeptide when the N-end amino acid is a standard amino acid.
- the protease system may be endogenous or exogenous.
- the cell may include an adapter or discriminator protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid.
- the adapter protein may be under influence of an inducible promoter.
- the adapter protein is ClpS or a variant or mutant thereof.
- adapter proteins may have different levels of selectivity for certain amino acids.
- adapter proteins such as ClpS may be altered to improve selectivity, such as between standard amino acids and non-standard amino acids or between a desired NSAA and an undesired NSAA.
- the protease system is a ClpS-ClpAP protease system.
- protease systems include Clps or homologs or mutants thereof, such as ClpS V65I.
- the N-end rule is mediated by homologs of ClpS/ClpAP in bacteria.
- the N-end rule involves more distant homologs of ClpS (UBR1, ubiquitin E3 ligases) and degradation by the proteasome.
- UBR1 ubiquitin E3 ligases
- the present disclosure contemplates use of many of the bacterial ClpS homologs to perform similar functions with slightly different amino acid recognition specificity.
- the present disclosure also contemplates use of eukaryotic protease systems, such as UBRl and related variants to mediate N-end rule recognition with different amino acid recognition specificity in eukaryotes.
- cells according to the present disclosure include prokaryotic cells and eukaryotic cells.
- prokaryotic cells include bacteria.
- Microorganisms which may serve as host cells and which may be genetically modified to produce recombinant microorganisms as described herein may include one or members of the genera Clostridium, Escherichia, Rhodococcus, Pseudomonas, Bacillus, Lactobacillus Saccharomyces, and Enterococcus.
- Particularly suitable microorganisms include bacteria and archaea.
- Exemplary microorganisms include Escherichia coli, Bacillus s htilis, and Saccharomyces cerevisiae.
- Exemplary eukaryotic cells include animal cells, such as human cells, plant cells, fungal cells and the like.
- Bacillus subtil is. Bacillus megaterium, Bifidobacterium bifida,' a. Caulobacter crescentus, Clostridium difficile, Chlamydia trachomatis, Corynebacterium glutamicum, Lactobacillus acidophilus, Lactococcus laciis, Mycoplasma genitalium, Neisseria gonorrhoeae, Prochlorococcus marinus, Pseudomonas aeruginosa, Psuedomonas put Ida. Treponema pallidum, Streptomyces coelicolor, Synechococcus elongates, Vibrio natrigiens, and Zymomonas mob His.
- Exemplary genus and species of bacteria cells include Acetobacter aurantius, Acinetobacter bitumen, Actinomyces israelii, Agrobacterium radiobacter, Agrobacterium tumefaeiens, Anaplasma Anaplasma phagocytophilum, Azorhizobium caulinodans, Azotobacter vinelandii, viridans streptococci, Bacillus anthracis, Bacillus brevis, Bacillus cereus, Bacillus fusiformis, Bacillus licheniformis, Bacillus megaterium, Bacillus mycoides, Bacillus stearothermophilus, Bacillus subtilis, Bacteroides, Bacteroides fragilis, Bacteroides gingivalis, Bacteroides melaninogenicus (also referred to as Prevotella melaninogenica ), Bartonella ,Bartonelia henselae, Bartonella quintana, Bordetella, Bordetella
- Prevotella melaninogenica also known as Bacteroides melaninogenicus
- Pseudomonas aeruginosa Rhizobium radiobacter
- Rickettsia Rickettsia prowazekii
- Rickettsia psittaci Rickettsia quintana
- Rickettsia rickettsii Rickettsia trachomae
- Rochalimaea Rochalimaea henselae
- Rochalimaea quintana Rothia dentocariosa
- Salmonella Salmonella enteritidis, Salmonella typhi, Salmonella typhimurium
- Serratia marcescens Shigella dysenteriae, Staphylococcus, Staphylococcus aureus, Staphylococcus epidermidis, Stenotrophomonas maltophilia
- Streptococcus bovis Streptococcus cricetus, Streptococcus faceium, Streptococcus faecaiis, Streptococcus feais, Streptococcus gallinarum, Streptococcus lactis, Streptococcus mitior, Streptococcus mitis, Streptococcus mutans, Streptococcus oralis, Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus rattus, Streptococcus salivarius, Streptococcus sanguis, Streptococcus sobrinus, Treponema, Treponema pallidum, Treponema denticola, Vibrio, Vibrio cholerae, Vibrio comma, Vibrio parahemolyticus, Vibrio vulnificus, Wolbachia, Yersinia, Yers
- Exemplary genus and species of yeast cells include Saccharomyces, Saccharomyces cerevisiae, Torula, Saccharomyces bouiardii, Schizosaccharomyces, Schizosaccharomyces pombe, Candida, Candida glabrata, Candida tropicaiis, Yarrowia, Candida parapsilosis, Candida krusei, Saccharomyces pastorianus, Brettanomyces, Brettanomyces bruxellensis, Pichia, Pichia guilliermondii, Cryptococcus, Cryptococcus gattii, Torulaspora, Torulaspora delbrueckii, Zygosaccharomyces, Zygosaccharomyces bailii, Candida lusitaniae, Candida stellata, Geotrichum, Geotrichum candidum, Pichia pastoris, Kluyveromyces, Kluyveromyces marxianus, Candida dubliniensis, Kluy
- Exemplar ⁇ ' genus and species of fungal cells include Sac fungi, Basidiomycota, Zygomycota, Chtridiomycota, Basidiomycetes, Hyphomycet.es, Glomeromycota, Microsporidia, Blastocladiomycota, and Neocallimastigomycota, and other genus and species known to those of skill in the art.
- Exemplary eukaryotic cells include mammalian cells, plant cells, yeast cells and fungal cells.
- SAA standard amino acid
- standard amino acids include one of the L-amino acids that typically naturally occur in proteins on Earth and includes alanine, arginine, asparagine, aspartic acid, cysteine, glutamic acid, glutamine, glycine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, serine, threonine, tyrosine, tryptophan, proline and valine.
- the standard amino acids that are naturally N-end destabilizing in most bacteria include tyrosine, phenylalanine, tryptophan, leucine, lysine, and arginine.
- the amino acid at the amino acid target location is an NSAA that is stabilizing.
- the natural analog of the NSAA is destabilizing and is present at the amino acid target location, degradation of the polypeptide occurs.
- Standard amino acids that are not naturally destabilizing via the N-end rule using natural ClpS, can be destabilizing when the ClpS is engineered to recognize such standard amino acid.
- the N-end rule in bacteria may also be engineered to recognize isoleucine, valine, aspartate, glutamate, asparagine, and glutamine as destabilizing using methods known to those of skill in the art which is useful when the desired NSAA is an analog of these amino acids.
- isoleucine and valine can be converted into N-end destabilizing residues by introducing a ClpS variant (M40A) that recognizes these amino acids as N-terminal destabilizing residues see (Roman-Hernandez G, Grant RA, Sauer RT, & Baker TA (2009) Molecular basis of substrate selection by the N-end rule adaptor protein ClpS.
- Aspartate and glumatate may be converted into N-end destabilizing residues by introducing a bacterial aminoacyl -transferase from Vibrio vulnificus (Bpt) that is a homoiog of eukaryotic transferases and N-terminally appends a leucine (L) to peptides containing N ⁇ terminally exposed aspartate or glutamate (see Graciet E, et al. (2006) Aminoacyl - transferases and the N-end rule pathway of prokaryotic/eukaryotic specificity in a human pathogen.
- Bpt Vibrio vulnificus
- L leucine
- NTA1 cerevisiae
- an N-end rule destabilizing pathway may be provided for all 20 standard amino acids as a basis for a system where a desired amino acid from among the 20 standard amino acids is N-end destabilizing in at least one context (see Chen, Shun-Jia, et al. "An N-end rule pathway that recognizes proline and destroys gluconeogenic enzymes.” Science 355.6323 (2017): eaa!3655 hereby incorporated by rweterence in its entirety).
- One of skill in the art can identify the eukaryotic proteins required for conferring expanded N-end destabiiization and transfer them to prokaryotes as needed.
- NSAA refers to an unmodified amino acid that is not one of the 20 naturally occurring standard L-amino acids.
- NSAAs also include synthetic amino acids which have been designed to include a non-standard functional group not present in the standard amino acids or are naturally occurring amino acids bearing functional groups not present in the set of standard amino acids.
- a non-standard amino acid may include the structure of a standard amino acid and which includes a non-standard functional group.
- a non-standard amino acid may include the basic amino acid portion of a standard amino acid and include a non-standard functional group.
- NSAAs also refer to natural amino acids that are not used by all organisms (e.g. L- pyrrolysine (B. Hao et al ., A new uag-encoded residue in the structure of a methanogen methyltransferase. Science. 296: 1462) and L-selenocysteine (S. Osawa et al., Recent evidence for evolution of the genetic code. Microbiol Mol Biol Rev. 56:229)).
- NSAAs are also known in the art as unnatural amino acids (UAAs) and non-canonical amino acids (NCAAs).
- NSAAs include, but are not limited to, p-Acetylphenyl alanine, m- Acetylphenylalanine, O-aliyltyrosine, Phenylselenocysteine, p-Propargyloxyphenylalanine, p-Azidophenylalanine, p-Boronophenyl alanine, O-m ethyl tyrosine, p-Aminophenylalanine, p- Cyanophenylalanine, m-Cyanophenyl alanine, p-Fluorophenylalanine, p-Iodophenyl alanine, p-Bromophenylalanine, p-Nitrophenylaianine, L-DOPA, 3-Aminotyrosine, 3-Iodotyrosine, p- Isopropyiphenyl alanine, 3-(2-
- Modified or unusual amino acids include D-amino acids, hydroxy-lysine, 4-hydroxyproline, N-Cbz-protected amino acids, 2,4-diaminobutyric acid, homoarginine, norleucine, ' N -methyl aminobutyric acid, naphthyialanine, phenylglycine, -phenylproline, tert-leucine, 4-aminocyclohexylalanine, N- methyl -norleueine, 3,4-dehydroproline, ⁇ , ⁇ -dimethylaminoglycine, N-methylaminoglycine, 4-aminopiperidine-4-carboxylic acid, 6-aminocaproic acid, trans-4-(aminomethyl)- cyclohexanecarboxylic acid, 2-, 3-, and 4-(aminomethyl)-benzoic acid, 1- aminocyclopentanecarboxylic acid, 1 -aminocyclopropanecar
- NSAAs also include amino acids that are functionalized, e.g., alkyne-functionaiized, azide-functionalized, ketone-functionaiized, aminooxy- functionaiized and the like.
- amino acids that are functionalized, e.g., alkyne-functionaiized, azide-functionalized, ketone-functionaiized, aminooxy- functionaiized and the like.
- an NSAA of the subject invention has a corresponding aminoacyl tRNA synthetase (aaRSVtRNA pair.
- the aminoacyl tRNA synthetase/tRNA pair is orthogonal to those in a genetically modified organism such as, e.g., a prokaryotic cell, a bacterium (e.g., E. coif), a eukaryotic cell, a yeast, a plant cell, an insect cell, a mammalian cell, a virus, etc.
- an NSAA of the subject invention is non-toxic when expressed in a genetically modified organism such as, e.g., a prokaryotic cell, a bacterium (e.g., E. coli), a eukaryotic cell, a yeast, a plant cell, an insect cell, a mammalian cell, a virus, etc.
- a genetically modified organism such as, e.g., a prokaryotic cell, a bacterium (e.g., E. coli), a eukaryotic cell, a yeast, a plant cell, an insect cell, a mammalian cell, a virus, etc.
- an NSAA of the subject invention is not or does not resemble a natural product present in a cell or organism.
- an NSAA of the subject invention is hydrophobic, hydrophilic, polar, positively charged, or negatively charged.
- an NSAA of the subject invention is commercially available (such as, e.g., L-4,4- bipnehyiaianine (bipA) and L-2-Naphthylalanine (napA)) or synthesized according to published protocols.
- bipA bipnehyiaianine
- napA L-2-Naphthylalanine
- the disclosure provides a method of making a protein having a non-standard amino acid incorporated therein, such as at its N-terminus, in a cell .
- the cell is provided with a nucleic acid sequence encoding a ubiquitin fused to the N-terminus of the protein wherein the N-terminus of the protein is an amino acid target location intended to have a nonstandard amino acid.
- the nonstandard amino acid may be encoded by a nonsense or sense codon.
- the cell is provided with a ubiquitin cleavase.
- the cell may include an endogenous protease system, such as a ClpS-ClpAP system.
- the cell is provided with a nonstandard amino acid.
- the cell expresses the fusion protein having either a standard or a nonstandard amino acid incorporated at the amino acid target location.
- the ubiquitin cleavase cleaves the ubiquitin to produce a protein having either the standard or non-standard intervening amino acid at its N-terminus. If a standard amino acid is present at the N- terminus, the ClpS recognizes the standard amino acid at the N-terminus and targets the protein having the standard amino acid at its N-terminus to ClpP for degradation. If a nonstandard amino acid is present at the N-terminus, the Clps does not recognize the nonstandard amino acid and the protein is not targeted for degradation.
- ClpS is an essential component of the N-end rule pathway in Escherichia coli. Nature 439(7077):753-756 and Wang KH, Oakes ESC, Sauer RT, & Baker TA (2008) Tuning the Strength of a Bacterial N- end Rule Degradation Signal. Journal of Biological Chemistry 283(36):24600-24607; Schmidt R, Zahn R, Bukau B, & Mogk A (2009) ClpS is the recognition component for Escherichia coli substrates of the N-end rule degradation pathway.
- the disclosure provides a method of screening for an amino acyl tRNA synthetase variant that preferentially selects a non-standard amino acid against its standard amino acid counterpart for incorporation into a protein in a cell .
- the cell is provided with an amino acyl tRN A synthetase variant.
- the cell is provided with a nucleic acid sequence encoding a ubiquitin fused to the N-terminus of the protein wherein the N-terminus of the protein is an amino acid target location intended to have a nonstandard amino acid, and wherein GFP is fused to the C-end of the protein.
- the nonstandard amino acid may be encoded by a nonsense or sense codon.
- the cell is provided with a ubiquitin cleavase, such as Ubpl .
- the cell may include an endogenous protease system, such as a ClpS-ClpAP system.
- the cell is provided with a non-standard amino acid.
- the cell expresses the fusion protein having either a standard or a non-standard amino acid incorporated at the amino acid target location.
- the ubiquitin cleavase cleaves the ubiquitin to produce a protein having either the standard or non-standard intervening amino acid at its N- terminus.
- the ClpS recognizes the standard amino acid at the N-terminus and targets the protein having the standard amino acid at its N-terminus to ClpP for degradation, including the GFP portion. If a nonstandard amino acid is present at the N-terminus, the Clps does not recognize the nonstandard amino acid and the protein is not targeted for degradation.
- the GFP is detected and is indicative of the presence of a synthetase variant that preferentially selects the non-standard amino acid against its standard amino acid counterpart for incorporation into the protein.
- the strength of the signal detected from the GFP is indicative of the amount of protein produced that included the nonstandard amino acid.
- methods are provided for screening and evolving an amino acyl tRNA synthetase variant that preferentially selects a non-standard amino acid against its standard amino acid counterpart for incorporation into a protein in a cell.
- a fusion protein was constructed having a ubiquitin fused to the N-terminus of a GFP reporter protein having an intervening amino acid encoded by a UAG codon in between the ubiquitin and GFP.
- the plasmid containing the nucleic acid sequence of the fusion protein was introduced into the E. coli.
- the expression of a ubiquitin cleavase from yeast (Ubpl) was induced in E.
- ClpS Since ClpS is endogenous to E. coli, it need not be overexpressed. However, overexpression, such as by use of an inducible promoter, increases the concentration of ClpS and therefore the ability of ClpS to discriminate between desired nonstandard amino acids and undesired standard amino acids or undesired NS AAs.
- the crystal structure of the ClpS protein was used to rationally engineer ClpS variants that have tunable specificities for degrading different sizes of NSAAs, including undetectable degradation of tested analogs of Y/F while maintaining full degradation of Y/F with the ClpS V65I mutant.
- V65I is a standard notation for describing a mutation where the amino acid I has been substituted for amino acid V at position 65
- the ClpS_V65I mutant was capable of distinguishing between standard amino acids and their nonstandard amino acid counterparts when positioned at the N-terminus of a protein for purposes of degradation.
- One aspect of the present disclosure is to inactivate endogenous ClpS if the ClpS V65I mutant is being used. Inactivation of ClpS prevents indiscriminate or nonselective degradation of proteins including smaller nonstandard amino acids.
- Another useful mutant is ClpS_L32F, which causes biphenylalanine (bipA) and p- benzoylphenylalanine (pBnzylF) to be N-end stabilizing and all other tested NSAAs to be N- end destabilizing.
- the ClpS L32F mutant can be used to discriminate between incorporated bipA and all other incorporated NSAAs other than pBnzylF (see Fig. 3E).
- aspects of the present disclosure are directed to local sense codon reassignment.
- a sense codon can be used which creates competition for an N-terminal sense codon between a standard amino acid and a nonstandard amino acid. If the standard amino acid corresponding to the sense codon is destabilizing by the N-end rule, all of the proteins which do not include a sense codon reassigned to a nonstandard amino acid will be degraded. As shown in Fig. IE with respect to the UAC codon which normally encodes tyrosine, when expression of Ubpl/ClpS is induced, nearly all of the GFP containing tyrosine gets degraded. If competition for the UAC codon with nonstandard amino acid is present, then nonstandard amino acid incorporation into the N-terminus of the GFP reporter protein can be detected by measuring the FL/OD increase,
- E. coli strain C321.AA (CP006698.1), which was previously engineered to be devoid of UAG codons and RFl, was used as the E coli strain.
- E. coli strain C321AA. (CP006698.1) is described in M. J. Lajoie et ah, Genomically Recoded Organisms Expand Biological Functions. Science (80- ). 342, 357-360 (2013) hereby incorporated by reference in its entirety.
- a starting reporter construct of an Ubiquitin-*-LFVQEL-sfGFP-His6x fusion under an anhydrotetracycline (ATC) inducible expression system was genomically integrated using ⁇ Red recombineering as described in .
- ATC anhydrotetracycline
- Datsenko, B. !,. Wanner One-step inactivation of chromosomal genes in Escherichia coli K-12 using PGR products. Proc Natl Acad Sci USA. 97, 6640-6645 (2000) and D. Yu et ah, An efficient recombination system for chromosome engineering in Escherichia coli. Proc. Natl. Acad. Sci. U. S.
- Table 1 includes sequences of key constructs such as the reporter construct.
- Multiplex automatable genome engineering as described H. H. Wang et al. Programming ceils by multiplex genome engineering and accelerated evolution. Nature. 460, 894-898 (2009) hereby incorporated by reference in its entirety was used to inactivate the endogenous mutS and clpS genes when needed and to add or remove UAG codons in the integrated reporter, in brief, overnight cultures were diluted 100-fold into 3 mL LB L containing appropriate antibiotics and grown until mid-log.
- MAGE Multiplex automatable genome engineering
- Lambda Red was induced in a shaking water bath (42 °C, 300 rpm, 15 minutes), then induced culture tubes were cooled rapidly in an ice slurry for at least two minutes, Electrocompetent cells were prepared at 4 °C by pelleting 1 mL of culture (centrifuge at 16,000 rcf for 20 seconds) and washing the cell pellet twice with 1 mL ice cold deionized water (dH20). Electrocompetent pellets were resuspended in 50 of dH20 containing the desired DNA. For MAGE oligonucleotides, 5 uM of each oligonucleotide was used. Table 2 includes a list of all oligonucleotides used.
- Allele-specific colony PCR was used to identify desired colonies resulting from MAGE as described in F. J. Isaacs et al. Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science (80-. ). 333, 348-353 (2011) hereby incorporated by reference in its entirety. Colony PCR was performed using Kapa 2G Fast HotStart ReadyMix according to manufacturer protocols and Sanger sequencing was performed by Genewiz to verify strain engineering. The strains C321.AA.Ubiq-UAG-sfGFP and C321.AA.Ubiq-UAG-sfGFP_UAG151 are available from Addgene.
- UbiGFPins-F TAAAGAGGAGAAAGGTACCATGCAGATTTTTGTGAAGACTTT
- UbiGFPins-R TGGGATCCCCCATCAAGCTTTTAGTGGTGGTGGTGGTGGT pZEbb one4Ubi GF ACCACCACCACCACCACTAAAAGCTTGATGGGGGATCCCA P-F
- TetR-ins-F tatctgttgtttgtcggtgaacgtctcattttcgccagat
- BipARS_P144Q-F cgcgcgtgaagacgaaaccagaaagttgcggaagttatctac
- PyllFRS and tRNA ⁇ were provided by Professor Dieter Soil (Yale University) as described in L.-T. Guo et al , Polyspecific pyrrolysyl-tRNA synthetases from directed evolution, doi: 10.1073/pnas.14197371 1 1 hereby incorporated by reference in its entirety.
- the PyllFRS was retained on its original pCDF plasmid but the tRNA was cloned into the pZE21 vector for higher expression.
- the pCDF plasmid was maintained using spectinomycin and the pZE21 plasmid was maintained using kanamycin.
- the Pyl-ABK and tRNA ' c ' UA AARS/tRNA pair were provided by Dr.
- E. coli and cloned into the pZE21 vector (Kanamycin resistance, ColEl origin, TET promoter) (Expressys).
- the E. coli genes clpS and clpP were amplified from E. coli MG1655 and cloned into artificial operons downstream of the ubp l gene in the pZE21 vector using Gibson assembly. Artificial operons were created by inserting the following RBS sequence between the ubpl and clp genes: TAATAAAAGGAGATATACC. This RBS was originally designed using the RBS calculator (see H. M. Salis, E. A. Mirsky, C. A. Voigt, Automated design of synthetic ribosome binding sites to control protein expression.
- the four initial positions of interest in the clpS gene correspond to amino acids 32, 43, 65, and 99.
- Gibson assembly was used to ligate both amplicons and the backbone plasmid.
- the pZE/Ubpl/ClpS and pZE Ubpl/ClpS V65I plasmids are available from Addgene.
- the ubpl/clpS_V65I operon was also placed under weak constitutive expression and integrated using Cionetegration as described in F. St-Pierre et al, One-step cloning and chromosomal integration of DNA. ACS Synth. Biol. 2, 537-541 (2013). This strain (C321. AA.Nendint) was used as the host for FACS experiments.
- Cultures for general culturing used herein were grown in LB-Lennox medium (LB L : 10 g/L bacto tryptone, 5 g/L sodium chloride, 5 g/L yeast extract). Cultures for experiments in Figs 3A-3E were grown in 2X YT medium (2XYT: 16 g/L bacto tryptone, 10 g/L bacto yeast extract, 5 g/L sodium chloride) given improved observed final culture densities compared to LB L upon expression of ClpS variants. Unless otherwise indicated, ail cultures were grown in biological triplicate in 96-well deep-well plates in 300 ⁇ , culture volumes at 34C and 400 rpm.
- NSAAs used in this study were purchased from PepTech Corporation, Sigma Aldrich, Santa Cruz Biotechnology, and Toronto Research Chemicals. The following NSAAs were purchased: L-4,4-Biphenylalanine (BipA), L-4-Benzoylphenylalanine (pBenzoylF), O-tert- Butyl-L-tyrosine (tButylY), L-2-Naphthylalanine (NapA), L-4-Acetyiphenyl alanine (pAcF), L-4-Iodophenyl alanine (piF), L-4-Bromophenylalanine (pBromoF), L-4-Chlorophenylalanine (pChloroF), L-4-Fluorophenylalanine (pFluoroF), L-4-Azidophenylalanine (pAzF), L-4- Nitrophenylalanine, L-4-Cyanophenyla
- EP-PCR Error-prone PGR
- MjTyrRS- derived AARSs roughly 175 ng of PGR template was used in each 25 uL. of PCR mix containing primers that have roughly 40 base pairs of homology flanking the AARS coding region. The reaction mixture was subject to 30 cycles with Tm of 63°C and extension time of I min.
- AARS libraries were subject to three rounds of fluorescence activated sorting in a Beckman Coulter MoFlo Astrios. Prior to each round, the usual NSAA incorporation assay procedure was followed such that cells would express GFP reporter proportional to the activity of the AARS library member. One notable deviation from that procedure was the use of a higher and variable inoculum volume to screen the full library at each stage. Cells displaying the top 0.5% of fluorescence activation (50k cells) were collected after Round 1, expanded overnight, and used to inoculate experimental cultures for the next round. Because the next round was a negative screening round, the desired NSAA was not added into culture medium.
- the present disclosure provides a method of making a target polypeptide in a cell, wherein the target polypeptide includes a non-standard amino acid (NSAA) substitution at an amino acid target location, including genetically modifying the cell to express the target polypeptide including a non-standard amino acid substitution at an amino acid target location using an engineered amino-acyl tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid, and wherein the cell expresses the target polypeptide including a standard amino acid or undesired NSAA at the amino acid target location when the engineered amino-acyl tRNA synthetase and transfer RNA pair non-selectively adds the standard amino acid or undesired NSAA at the amino acid target location, wherein a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location.
- NSAA non-standard amino acid
- the removable protecting group is a cleavable protecting group that is orthogonal within the ceil.
- the removable protecting group is an enzyme cleavable protecting group.
- the removable protecting group is a protein that is cleavable by a corresponding enzyme.
- the removable protecting group is ubiquitin that is cleavable by Ubpl .
- the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a nonstandard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location ,
- a detectable moiety is attached to the C-end of the target polypeptide.
- a detectable moiety is attached to the C-end of the target polypeptide, wherein the detectable moiety is a fluorescent moiety.
- a detectable moiety is attached to the C-end of the target polypeptide, wherein the detectable moiety is a reporter protein.
- the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide, wherein the nonstandard amino acid is encoded by a corresponding nonsense or sense codon.
- the cell is genetically modified to include a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer RNA corresponding to the nonstandard amino acid and wherein the nonstandard amino acid is provided to the cell and the cell expresses the synthetase and the transfer RNA to include the nonstandard amino acid at the amino acid target location.
- the ceil is genetically modified to include a foreign nucleic acid sequence encoding an enzyme for cleaving the removable protecting group under influence of a constitutive or an inducible promoter.
- the cell includes an adapter protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA.
- the cell includes an adapter protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA, wherein the adapter protein is under influence of a constitutive or an inducible promoter.
- the method further includes the ceil expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid.
- the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the cell expressing a protease wherein the protease degrades the target polypeptide when the N- end amino acid is a standard amino acid or an undesired NSAA.
- the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the ceil expressing a protease wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a non-standard amino acid substitution within the cell .
- the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N- end amino acid, and the cell expressing an adapter protein that coordinates with a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA.
- the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the cell expressing an adapter protein that coordinates with a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA, and wherein the adapter protein is under influence of an inducible promoter.
- the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the ceil expressing a CipS-ClpAP protease system wherein the ClpS-ClpAP protease system degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the cell.
- the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N- end amino acid, and the cell expressing a ClpS-ClpAP protease system wherein the ClpS- ClpAP protease system degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the cell, and wherein the ClpS protein is a natural homolog or a ClpS V65I mutant.
- the method further includes a detectable moiety is attached to the C-end of the target polypeptide and further including the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the cell expressing an adapter protein for a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the ceil, and detecting the detectable moiety as a measure of the amount of the target polypeptide including a non-standard amino acid substitution within the cell.
- the cell is a prokaryotic cell or a eukaryotic cell.
- the cell is a microorganism such as a bacterium.
- the cell is E. coli.
- the cell is a genetically modified E. coli.
- the disclosure provides a method of designing an amino acyl tRNA synthetase variant for preferential selection of a desired non-standard amino acid against its standard amino acid counterpart or undesired NSAAs for incorporation into a protein in a cell including genetically modifying the ceil to express the target polypeptide including a nonstandard amino acid substitution at an amino acid target location using an engineered amino- acyl tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid or undesired NSAA, and wherein the cell expresses the target polypeptide including a standard amino acid or undesired NSAA at the amino acid target location when the engineered amino- acyl tRNA synthetase and transfer RNA pair non-selectively adds the standard amino acid or undesired NSAA at the amino acid target location, wherein a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid
- the disclosure provides an engineered cell including a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
- the disclosure provides an engineered cell including a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide.
- the disclosure provides an engineered cell including (a) a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide; (b) a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer RNA corresponding to the nonstandard amino acid; (c) an adapter protein for a protease for degrading the target polypeptide having a standard amino acid or undesired NSAA as the N-end amino acid, wherein the adapter protein is under influence of a constitutive promoter or an inducible promoter.
- the disclosure provides a nucleic acid construct encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
- the disclosure provides a nucleic acid construct encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-eml of the target polypeptide.
- the disclosure provides a nucleic acid construct encoding adapter protein Clps or mutants or variants thereof.
- the disclosure provides a nucleic acid construct encoding ClpS_V65I mutant.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The disclosure provides methods of making a protein having a desired non-standard amino acid incorporated at its N-terminus in a cell and methods of screening for an amino acyl tRNA synthetase variant that preferentially selects a non-standard amino acid against its standard amino acid counterpart or undesired non-standard amino acids for incorporation into a protein in a cell.
Description
RELATED APPLICATION DATA
This application claims priority to U.S. Provisional Application No. 62/457,353 filed on February 10, 2017 and U.S. Provisional Application No. 62/526,671 filed on June 29, 2017, each of which is hereby incorporated herein by reference in its entirety for all purposes.
STATEMENT OF GOVERNMENT INTERESTS
This invention was made with government support under DE-FG02-02ER63445 awarded by Department of Energy. The government has certain rights in the invention.
FIELD
The present invention relates in general to methods of making proteins with nonstandard amino acids.
BACKGROUND
Naturally-occurring (standard) amino acids (SAAs) are the 20 unique building blocks composing all proteins derived from biological systems. Non-standard amino acids (NSAAs) have been developed bearing functional groups beyond those encoded by the 20 standard amino acids. To date, more than 70 non-standard amino acids (NSAAs) have been developed for in vivo protein translation. See Liu et al., Annual Review of Biochemistry 79:413-444 (2010). Methods of incorporating NSAAs into proteins using engineered amino-acyl tRNA synthetases and transfer RNAs also inadvertently incorporate standard amino acids and non- target NSAAs due to promiscuity or nonspecificity of the engineered amino-acyl tRNA synthetases and transfer RNAs corresponding to the NSAAs. Recently, in vivo Escherichia
coli orthogonal translation systems (OTSs) having engineered tRNA and ami noacyl -tRNA synthetase pairs that reassign the amber stop codon (UAG) were developed to characterize the performance of these orthogonal translation systems (OTSs) with respect to the efficiency and accuracy of NSAA incorporation. It was shown that in vivo incorporation of amino acids into reporter proteins (and thus reporter synthesis) at reassigned amber codons may not discriminate between standard and non-standard amino acids. See Monk JW, et al. (2016) Rapid and Inexpensive Evaluation of Nonstandard Amino Acid Incorporation in Escherichia coli. ACS Synthetic Biology. One method of determining the fidelity of NSAA incorporation includes in vitro protein purification followed by low-throughput mass spectrometry. Such an approach confirms the promiscuity of at least one E. coli OTS system (Methanococcus jannaschii tyrosyl-tRNA synthetase (Mj TyrRS)-deri ved orthogonal synthetases). See Young TS, Ahmad I, Yin J A, & Schultz PG (2010) An Enhanced System for Unnatural Amino Acid Mutagenesis in E. coli. Journal of Molecular Biology 395(2):361-374. Misincorporation of a standard amino acid for a nonstandard amino acid may be detrimental in the production of antibodies containing NSAAs for conjugation or the production of biomaterials containing NSAAs for tunable properties. The incorporation of SAAs instead of N SAAs represents the formation of impurities and the heterogeneous mixture lowers yields. Improvements in synthetase and tRNA specificity or selectivity can improve rate of NSAA incorporation. However, analyzing the polypeptide for rate of NSAA incorporation is laborious making modification of the synthetase or tRNA in response to experimental results a slow process. See Santoro SW, Wang L, Herberich B, King DS, & Schultz PG (2002) An efficient system for the evolution of aminoacyi-tRNA synthetase specificity. Nat Biotech 20(10): 1044-1048; and Amiram M, et al. (2015) Evolution of translation machinery in receded bacteria enables multi-site incorporation of nonstandard amino acids. Nat Biotech 33(12): 1272- 1279.
?
There is a continuing need in the art to develop methods of making proteins with NSAAs with improved efficiency and accuracy and for modifying synthetases or tRNA to result in improved efficiency and accuracy.
SUMMARY
The present disclosure provides a method for selectively degrading proteins having a standard amino acid at an amino acid target location within a mixture or collection or population of proteins including proteins having a nonstandard amino acid at the amino acid target location. In this context, the standard amino acid may be considered an undesired amino acid. The present disclosure also provides a method for selectively degrading proteins having an undesired non-target NSAA at an amino acid target location within a mixture or collection or population of proteins including proteins having a desired nonstandard amino acid at the amino acid target location. In this context, the undesired non-target NSAA may be considered an undesired amino acid. Aspects of the disclosure utilize methods and materials described herein to distinguish between different NSAAs at an amino acid target location when more than one NSAA is used during the synthesis procedure, such as when multiple different NSAAs are used simultaneously in vivo. According to one aspect, an in vivo methods are provided that can discriminate between NSAAs competing for incorporation at the same site of a protein.
According to one aspect, the methods may be carried out in vivo, i.e. within a cell . According to one aspect, methods of making target polypeptides having a non-standard amino acid substitution at an amino acid target location and methods of degrading target polypeptides having a standard amino acid substitution or an undesired NSAA (either referred to as an undesired amino acid) at the amino acid target location may be carried out in vivo, i.e. within a cell. According to one aspect, a target polypeptide is made which includes
a non-standard amino acid at an amino acid target location using an engineered amino-acyl tRNA synthetase and a transfer RNA corresponding to the non-standard amino acid. A removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location. Should the engineered amino-acyl tRNA synthetase and a transfer RNA add an undesired amino acid, i.e., a standard amino acid or an undesired NSAA, instead of the desired non-standard amino acid at the amino acid target location, methods are provided herein for degrading the target polypeptide including the undesired amino acid, i.e. a standard amino acid or an undesired NSAA, at the amino acid target location, such as by using the N-end rule pathway for protein degradation. According to one aspect, post-translational proofreading is used to distinguish correctly translated from incorrectly translated proteins. The disclosure provides a post-translational proofreading concept that results in protein degradation in a cell using the N-end rule unless the desired NSAA is incorporated at the N-terminus of the protein.
According to one aspect, the target polypeptide is made within a cell, i.e. in vivo, and the cell produces a protease and may also produce an adapter protein for the protease, i.e. a degradation system. Certain cells, such as E. coli, naturally produce the protease system ClpAP and the adaptor ClpS, and both components are present in order for the N-end rule degradation pathway to function. According to certain aspects, adaptor variants such as ClpS variants may be used which have different specificities than the natural ClpS. In this context, cell production of the natural ClpS be inactivated, for example, such as by gene modification where the ClpS gene sequence in the genome is modified to introduce premature stop codons. Such degradation systems are representative of a cellular system that degrades proteins according to the N-end rule pathway for protein degradation as is known in the art. According to the N-end rule, the half-life of a protein is determined by its N-terminal residue.
An N-terminal residue is said to be destabilizing if it is recognized by a cellular degradation system, which in turn leads to degradation of the protein. If the N-terminal residue is not destabilizing, i.e, it is not recognized by a cellular degradation system, then the protein is not subject to the N-end rule pathway for protein degradation. Destabilizing amino acids may be natural amino acids. Destabilizing amino acids may be certain NSAAs. Destabilizing amino acids may be referred to as undesired amino acids. Stabilizing amino acids may be referred to as desired amino acids. Non-standard amino acids, such as analogs to natural amino acids, may be stabilizing or destabilizing amino acids. Certain non-standard amino acids, such as analogs to natural amino acids, may be resistant to degradation by cellular systems. In this manner, the method increases production (yield) of proteins having a desired amino acid, i.e., a stabilizing amino acid or an amino acid which is not destabilizing, such as certain nonstandard amino acids, insofar as such a protein may not be subject to the N-end rule pathway for protein degradation. In this manner, the method decreases production (yield) of proteins having an undesired amino acid, such as a standard amino acid or a certain NSAA, insofar as such a protein may be subject to the N-end rule pathway for protein degradation. In this manner, the method determines successful desired NSAA incorporation to the extent that target polypeptides that include an undesired amino acid, such as a standard amino acid or a certain destabilizing NSAA (undesired NSAA), at the amino acid target location may be degraded and target polypeptides that include a desired non-standard amino acid at the amino acid target location remain and may not be degraded.
The present disclosure provides a method of optimizing production of proteins including a non-standard amino acid. Reaction conditions are provided for making a target polypeptide including a non-standard amino acid substitution at an amino acid target location using an engineered amino-acyl t NA synthetase and a transfer RNA as is known in the art. A removable protecting group is attached to the target polypeptide adjacent to the amino acid
target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location. Should the engineered amino-acyl tRNA synthetase and a transfer RNA add a standard amino acid or undesired NSAA. instead of the desired non-standard amino acid at the amino acid target location, methods are provided herein for degrading the target polypeptide including the standard amino acid or undesired NSAA at the amino acid target location. The amount of proteins having a desired nonstandard amino acid is determined. Given the amount of protein produced, the reaction conditions and/or the amino-acyl tRNA synthetase and/or tRNA are altered and the amount of proteins having the desired non-standard amino acid is again determined. The process is repeated until the process is optimized for a desired yield of protein including desired NSAA. Exemplar}' reaction conditions which may be altered according to the present disclosure include changes of culture media or concentration of desired NSAA, Alterations or changes to the amino-acyl tRNA synthetase and/or tRNA include one or more mutations that may improve performance of the amino-acyl tRNA synthetase and/or tRNA. Such mutations may be made by methods known to those of skill in the art such as random mutagenesis approaches such as error-prone polymerase chain reaction (PCR) or directed approaches such as site-saturation mutagenesis or rational point mutagenesis.
The present disclosure provides a method of distinguishing a protein having a nonstandard amino acid, i.e. a desired nonstandard amino acid, at an amino acid target location from a protein having a standard amino acid or undesired nonstandard amino acid at the amino acid target location by degrading the protein if a destabilizing amino acid (i.e., standard amino acid or undesired destabilizing NSAA) recognized by a protease is present at the amino acid target location.
The present disclosure provides a method of making a target protein using an amino- acyl tRN A synthetase engineered or designed to incorporate a target nonstandard amino acid
(i.e., a desired NSAA) in the target protein at an amino acid target location, wherein a removable protecting group is adjacent the amino acid target location. The removable protecting group is removed to expose the amino acid target location in the presence of a degradation enzyme or degradation enzyme/adaptor complex. According to certain aspects, the adaptor discriminates between amino acids and accordingly whether degradation will occur by the degradation enzyme. According to one aspect, the degradation enzyme/adaptor complex degrades the target protein if a standard amino acid or undesired NS AA is present at the amino acid target location. If the desired nonstandard amino acid is present at the amino acid target location, then the degradation enzyme/adaptor complex is ineffective to degrade the target protein.
BRIEF DESCRIPTION OF THE DRAWINGS
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee. The foregoing and other features and advantages of the present invention will be more fully understood from the following detailed description of illustrative embodiments taken in conjunction with the accompanying drawings in which:
Figs. 1A and IB illustrate methods described herein to selectively degrade proteins that contain one or more undesired amino acids at one or more target amino acid locations based on a N-end rule degradation strategy. Fig. 1 A and Fig. IB depict aspects of the present disclosure where a target protein including a standard amino acid at an amino acid target location is selectively degraded over the target protein including a non-standard amino acid at the amino acid target location. Degradation is carried out in vivo according to the N-end rule. Fig. 1 A shows the three candidate site-specific OTS systems used previously in E. coli. Fig.
"7
IB shows the components of the selective degradation methods using the N-end rule described herein. Fig. 1C shows measurements of in vivo fluorescence normalized to optical density (FL/OD) corresponding to different combinations of post-translational proofreading components. Expression of the orthogonal tRNA alone was not responsible for GFP synthesis in the absence of BipA, but expression of the BipARS/tRNA pair resulted in levels of FL/OD nearly as high in the absence of BipA as in the presence of BipA. Significant reduction of the FL/OD signal in the absence of BipA occurs upon expression of Ubp l , which cleaves the ubiquitin domain and exposes the amino acid corresponding to the first UAG codon as the N- terminai residue. The decrease in signal only in the absence of BipA suggests that BipARS was primarily mischarging tRNA with N-terminally destabilizing residues. The FL/OD signal in the absence of BipA further decreased upon expression of ClpS, suggesting that the rate of N-end di scrimination was previously limiting. Fig, ID shows a control experiment that demonstrates the expected effect of Ubpl /ClpS expression on FL/OD signal resulting from GFP protein with N-terminal UCG, UAC, or LTAG codons. Fig. IE shows the FL/OD as a function of the number of U AG codons.
Figs. 2A and 2B shows the activities of different AARSs. Fig. 2A shows the activity of different amino-acyl tRNA synthetases ("AARS") in the absence and presence of NSAA. Fig. 2B shows the activity of the AARSs when Ubpl and ClpS are expressed. These results demonstrate that many tested AARSs mischarge tRNA in the absence of their intended N SAA, and that post-translational proofreading with wild-type ClpS can resolve differences for several of these AARS/NSAA pairs.
Figs. 3A-3E shows the effect of various engineered ClpS variants and that ClpS variants may be engineered to be tunable to a particular substrate. Fig. 3 A shows the activity profi le of synthetase- 1 on an NSAA panel to determine their natural N-end rule classification of these NSAAs. Fig. 3B shows the activity profile of synthetase-2 on structurally smaller
NSAAs that were not substrates of synthetase-1. Fig. 3C shows the four ClpS residues within four A of the para position of the N-terminal phenylalanine (F) on the substrate peptide. Fig. 3D shows the tunability achieved with rationally engineered ClpS variants. Fig. 3E shows further the properties of one ClpS variant of interest that leads to the classification of only two tested NSAAs as stabilizing.
Figs. 4A-4B show sequence alignments of ClpS homologs. Fig. 4A shows the sequence alignments of 10 bacterial ClpS homologs including E. coli ClpS with the four ClpS residues of interest highlighted. Fig. 4B shows the sequence alignments of E. coli ClpS together with portions of the yeast and human UBRl E3 ubiquitin ligase protein homologs. According to certain aspects, the sequence alignments identify the region of UBRl as suitable for rational engineering to adjust the N-end rule status of various NSAAs using methods known to those of skill in the art. Wild-type UBRl in yeast and human cells contains an isoleucine (I) rather than valine (V) at the position equivalent to "ClpS_V65F. Accordingly, one of skill will recognize that the natural N-end status of NSAAs may be different in eukaryotes than in prokaryotes.
DETAILED DESCRIPTION
The present disclosure provides a method of making a target polypeptide in a cell, wherein the target polypeptide includes a desired non-standard amino acid substitution at an amino acid target location, i.e. the non-standard amino acid withstands degradation as described herein. As used herein, the terms "polypeptide" and "protein" include compounds that include amino acids joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid residues. Exemplary cells include prokaryotic cells and eukaryotic cells. Exemplary prokaryotic cells include bacteria, such as E. coli, such as genetically modified E. coli. The method includes genetically modifying the cell to express
the target polypeptide including a desired non-standard amino acid substitution at an amino acid target location using an engineered amino-acyi tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid, and wherein the cell expresses the target polypeptide including a standard amino acid or an undesired NSAA at the amino acid target location when the engineered amino-acyl tRNA synthetase and transfer RNA pair nonselective^' adds the standard amino acid or undesired NSAA at the amino acid target location. A removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N- end amino acid is exposed at the amino acid target location. According to one aspect, the removable protecting group is orthogonal within the cell in which it is being used.
According to certain aspects, the cell includes a protease system for degrading the target polypeptide when the N-end amino acid is a standard amino acid. According to certain aspects, the cell includes a protease system for degrading the target polypeptide when the N- end amino acid is an undesired NSAA. According to certain aspects, the protease system includes an adapter protein and a corresponding protease. The adapter protein coordinates with the protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid. According to one aspect, the protease system is endogenous. According to one aspect, the protease and adaptor can be expressed constitutively. According to one aspect, the protease system is exogenous. According to one aspect, the protease system is under influence of a promoter. According to one aspect, the adapter protein of the protease system is under influence of an inducible promoter. According to one aspect, the adapter protein is upregulated. According to one aspect, overexpression of adaptor to produce adaptor levels in excess of that found normally within a cell improves degradation of polypeptides having an undesired amino acid at the amino acid target location.
According to one aspect, an adaptor protein is provided that facilitates N-end rule classification of an NSAA.
Because the N-end rule pathway of protein degradation is conserved across prokaryotes and eukaryotes, methods described herein are useful in prokaryotes and eukaryotes. The removable protecting groups should be orthogonal in the ceil within which it is being used. Ubiquitin is a suitable protecting group in prokaryotic cells because it is orthogonal but it is not a suitable protecting group in eukaryotic cells because it is not orthogonal . In eukaryotic cells, ubiquitin is N-terminally added to proteins often to initiate the process of protein degradation in the proteasome. In addition, the adaptor proteins in eukaryotic cells are homoiogs of ClpS known as Ubiquitin E3 ligases. According to the present disclosure, ubiquitin E3 ligase domain is altered in order to change the N-end rule classification of an NSAA.
According to one aspect, the removable protecting group is removed to generate an N- end amino acid, and the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA. In this manner, the target polypeptide including a desired non-standard amino acid substitution, i.e. which is resistant to degradation, is enriched within the cell. According to one aspect, embodiments of the disclosure are directed to methods that allow selective degradation of proteins having a standard amino acid or undesired NSAA instead of a desired nonstandard amino acid at their N-termini in a cell. The methods can be used for producing proteins with desired nonstandard amino acids at their N-termini with no detectable impurities.
According to one aspect, a method of identifying the presence of a target polypeptide including a desired non-standard amino acid, i.e. one which is resistant to degradation, is provided. According to this aspect, the target polypeptide includes a detectable moiety attached to the C-end of the target polypeptide. In this manner, if the target polypeptide (and
detectable moiety) that is made by the cell is not subject to degradation as described above, then the detectable moiety is detected as a measure of the amount of target polypeptide generated by the cell. Accordingly, a method is provided where a detectable moiety is present at the C-end of the target polypeptide, the removable protecting group is removed to generate an N-end amino acid, the protease (whether accompanied by an adapter protein or not depending upon the protease system being used) degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA, for example, to thereby enrich the target polypeptide including a desired non-standard amino acid substitution, and the detectable moiety is detected as a measure of the amount of the target polypeptide including a desired non-standard amino acid substitution.
According to one aspect, a method is provided for engineering synthetases that are more selective for incorporating non-standard amino acids versus standard amino acids at a selected site in a protein. Since all or substantially ail of proteins bearing a standard amino acid or an undesired NSAA at their N-terminus are degraded leaving only proteins with a desired nonstandard amino at their N-terminus, no or substantially no background signal due to standard amino acid or undesired NSAA incorporation results from the method. Synthetases can be evolved and their variants screened in a high-throughput fashion for their function of producing a protein incorporating a nonstandard amino acid, such as a desired NSAA. In this manner, those synthetases with improved function can be identified and modified further to further improve efficiency and selectivity .
I. METHODS OF MAKING A TARGET POLYPEPTIDE WITH AN NSAA
In general, methods of making a target polypeptide that includes a non-standard amino acid are known. In general, a cell is genetically modified to include a nucleic acid sequence which encodes for the target polypeptide that includes one or more non-standard
amino acids within its amino acid sequence. The cell can be genomicaliy recoded, ("a genoniicallv recoded organsim") to the extent that one or more codons have been reassigned to encode for a non-standard amino acid. For each different non-standard amino acid, an amino-acyl tRNA synthetase/tRNA pair is engineered and the cell is capable of using the amino-acyl tRNA synthetase/tRNA pair to add the corresponding non-standard amino acid (when present in the cell) to a growing peptide sequence. Materials, conditions, and reagents for genetically modifying a cell to make a target protein having one or more amino acid sequences are described in the following references, each of which are hereby incorporated by reference in their entireties.
Approaches to genomicaliy recede organisms include multiplex automatable genome engineering (MAGE), (for example, as described in Wang, Harris H., et al. "Programming cells by multiplex genome engineering and accelerated evolution." Nature 460.7257 (2009): 894-898 hereby incorporated by reference in its entirety) and hierarchical conjugative assembly genome engineering (CAGE) (for example, as described in Isaacs, Farren J., et al. "Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. " Science 333.6040 (201 1): 348-353 hereby incorporated by reference in its entirety). In addition, portions of recoded genomes can be synthesized and subsequently assembled, as described recently in an effort to construct a 57-codon organism (for example, as described in Ostrov, Nili, et al. "Design, synthesis, and testing toward a 57-codon genome." Science 353.6301 (2016): 819-822 hereby incorporated by reference in its entirety). The modification of an organism, whether recoded or not recoded, in order to express a polypeptide containing a site-specific non-standard amino acid has been described extensively in the literature (for example, as described in Wang, Lei, et al. "Expanding the genetic code of Escherichia coli ." Science 292,5516 (2001): 498-500; Chin, Jason W., et al. "An expanded eukaryotic genetic code." Science 301.5635 (2003): 964-967; Wang, Lei, and
Peter G. Schultz. "Expanding the genetic code." Angewandte chemie international edition 44.1 (2005): 34-66; Liu, Chang C, and Peter G. Schultz. "Adding new chemistries to the genetic code." Annual review of biochemistry 79 (2010): 413-444; Chin, Jason W. "Expanding and reprogramming the genetic code of cells and animals." Annual review of biochemistry 83 (2014): 379-408 each of which is hereby incorporated by reference in its entirety). In brief, foreign nucleic acid sequences containing a gene encoding an orthogonal amino-acyl tRNA synthetase and an associated tRNA are introduced into an organism, typically in an expression vector. In addition, a desired non-standard amino acid is added to the cell culture medium. A nucleic acid sequence corresponding to a target protein is modified so that a free codon, such as the UAG codon, is formed at the target site of the gene encoding the target protein. In the presence of these four components - aminoacyl tRNA synthetase protein, tRNA, NSAA, and target protein mRNA - the target protein containing the NSAA is made.
Basic to the present disclosure is the use of an amino-acyl tRNA synthetase/tRNA pair cognate to a nonstandard amino acid. Exemplar}' amino-acyl tRNA synthetase/tRNA pairs cognate to a nonstandard amino acid are known to those of skill in the art or may be designed for particular non-standard amino acids, as is known in the art or as described in Wang, Lei, and Peter G. Schultz. "Expanding the genetic code " Angewandte chemie international edition 44.1 (2005): 34-66; Liu, Chang C, and Peter G. Schultz. " Adding new chemistries to the genetic code." A nnnal review of ' biochemistry Ί 9 (2010): 413-444; and Chin, Jason W. "Expanding and reprogramming the genetic code of ceils and animals." Annual review of biochemistry 83 (2014): 379-408 each of which are hereby incorporated by reference in its entirety.
According to one aspect, the amino-acyl tRNA synthetase/tRNA pair cognate to a nonstandard amino acid is orthogonal to the cellular components of the ceil in which it is
used. The orthogonality (and therefore the suitability) of exogenous amino-acyl tRNA synthetase/tRNA pairs is dependent on the type of host organism. Four main orthogonal aminoacyl-tRNA synthetases have been developed for genetic code expansion: the Methanococcus janaschii tyrosyl-tRNA synthetase (V/TyrRS)/tRNAcuA pair, the Escherichia coli tyrosyl-tRNA synthetase (EcTyrRS)/tRNAcuA pair, the E. coli leucyl-tRNA synthetase (EcLeuRS)/tRNACUA pair, and pyrrolysyl-tRNA synthetase (PylRS)/tRNAcuA pairs from certain Methanosarcina. The JW/TyrRS/tRNAcuA pair is orthogonal in E. coli but not in eukaryotic cells. The EcTyrRS/tR'NAcuA pair and the cLeuRS/tRNAcuA pair are orthogonal in eukaryotic ceils but not in E. coli, whereas the PylRS/tRNAcuA pair is orthogonal in bacteria, eukaryotic ceils, and animals (see Chin, Jason W. "Expanding and reprogramming the genetic code of cells and animals." Annual review of biochemistry 83 (2014): 379-408 hereby incorporated by reference in its entirety). To maintain orthogonality, the exogenous amino acyl tRNA synthetase should not recognize any native amino acids or native tRNA. To maintain orthogonality, the tRNA should not be recognized by any native amino-acyl tRNA synthetases. To maintain orthogonality, the non-standard amino acid should not be recognized by any native amino acyl tRNA synthetases, "Orthogonal" pairs meet one or more of the above conditions. It is to be understood that "orthogonal" pairs may lead to some mischarging, i.e. such as insubstantial mischarging for example, of orthogonal tRNA with native amino acids so long as sufficient efficiency of charging to the designed NSAA occurs.
Exemplary families of synthetases for bacteria in addition to those described above and incorporated by reference include the PylRS/tRNAcuA pair and the Saccharomyces cerevisiae tryptophanyl-tRNA synthetase (ScWRS)/tRNAcuA pair. These exemplary synthetase families have natural analogs (lysine and tryptophan) that are N-end destabilizing amino acids. The following references describe useful synthetase famiiies and their
associated NSAAs. Blight, Sherry K., et al. "Direct charging of tRNAcuA with pyrrolysine in vitro and in vivo." Nature 431.7006 (2004): 333-335; Namy, Olivier, et al. "Adding pyrrolysine to the Escherichia coli genetic code." FEBS letters 581 ,27 (2007): 5282-5288; Hughes, Randall A., and Andrew D. Ellington. "Rational design of an orthogonal tryptophanyl nonsense suppressor tRNA." Nucleic acids research 38.19 (2010): 6813-6830; Eliefson, Jared W., et al. "Directed evolution of genetic parts and circuits by compartmentalized partnered replication." Nature Biotechnology 32.1 (2014): 97-101; and Chatterj ee, Abhishek, et al. "A Tryptophanyl -tRNA Synthetase/tRNA Pair for Unnatural Amino Acid Mutagenesis in E. coli." Angewandte Chemie International Edition 52.19 (2013): 5106-5109 each of which are hereby incorporated by reference in its entirety. As is known in the art, the synthetase catalyzes a reaction that attaches the nonstandard amino acid to the correct tRNA. The amino-acyl tRNA then migrates to the ribosome. The ribosome adds the nonstandard amino acid where the tRNA anticodon corresponds to the reverse complement of the codon on the mRNA. of the target protein to be translated.
II. REMOVABLE PROTECTING GROUPS
According to one aspect, the target polypeptide includes a removable protecting group adjacent to the amino acid target location such that when the removable protecting group is removed, the amino acid target location is an N-end amino acid. Exemplary removable protecting groups are known to those of skill in the art and can be readily identified in the literature based on the present disclosure. According to one aspect, the removable protecting is a peptide sequence produced by the cell when making the target polypeptide. According to one aspect, the removable protecting is a peptide sequence produced by the cell when making the target polypeptide, such that the removable peptide and the target polypeptide is a fusion. According to this aspect, the cell is genetically modified to include a foreign nucleic acid
sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location. According to one aspect, the removable protecting group is foreign to the cell, i.e. it is not endogenous to the cell. In this manner, the removable protecting is orthogonal to endogenous enzymes or other conditions within the cell.
An exemplary removable protecting group includes a cleavable protecting group, such as an enzyme cleavable protecting group. According to one aspect, the cell produces an enzyme that cleaves the removable protecting group to generate an N-end amino acid. An exemplary removable protecting group is a protein that is cleavable by a corresponding enzyme. According to one aspect, a removable protecting group is foreign to the cell and is not endogenous. According to one aspect, the enzyme that cleaves the removable protecting group is foreign to the cell and is not endogenous. According to one aspect, an exemplary removable protecting group for prokaryotic cells is ubiquitin that is cleavable by Ubpl . According to another aspect, an exemplar}' removable protecting group for eukaryotic cells is the sequence MENLYFQ/*, where "*" is the target position for the NSAA (known in the field as the Ρ position), where "/" represents the cut site, and where "ENLYFQ/*" is the sequence that is cleavable by certain variants of TEV protease. Ordinarily, TEV protease cleavage efficiency is influenced by the choice of the amino acid at the Ρ position. However, mutants of TEV protease have been engineered which have increased or altered substrate tolerance at the Ρ position (see Renicke, Christian, Roberta Spadaccini, and Christof Taxis. "A Tobacco Etch Virus Protease with Increased Substrate Tolerance at the Pl'position." PloS one 8.6 (2013): e67915 hereby incorporated by reference in its entirety). The use of TEV protease in vivo in mammalian cells has been demonstrated and is described in Oberst, Andrew, et al. "Inducible dimerization and inducible cleavage reveal a requirement
for both processes in caspase-8 activation." Journal of Biological Chemistry 285.22 (2010): 16632-16642 hereby incorporated by reference in its entirety. One of skill will readily understand based on the present disclosure that the methods described herein are useful in prokaryotic cells and eukaryotic cells.
According to the present disclosure, the N-end target residue is exposed using materials and methods that are or will become apparent to one of skill based on the present disclosure. An exemplary removable protecting protein domain includes a self-splicing domain, such as an intein, or other cleavable domains such as small ubiquitin modifiers (SUMO proteins). An exemplary removable protecting group may be a protein cleavage sequence along with its cognate partner, such as the TEV cleavage site and TEV protease. In general, any of the strategies used to remove N-terminal affinity tags in protein purification can serve as alternative ways to expose the N-end target residue. An exemplary system to expose the N-end target residue includes a class of enzymes known as methionine aminopeptidases which can remove the first N-terminal residue, such as when the second residue is the amino acid target location which is the desired site of addition of a NSAA. According to one aspect, the amino acid target location may be the N-terminal location or it may be any location between the N-terminal location and the C -terminal location. Accordingly, methods are provided for removing a protecting group and/or ail amino acids up to the amino acid target location, thereby rendering the amino acid target location being the N-terminal amino acid.
III. DETECTABLE MOIETY
According to one aspect, the target polypeptide includes a detectable moiety attached to the C-end of the target polypeptide. Exemplary detectable moieties are known to those of skill in the art and can be readily identified in the literature based on the present disclosure.
According to one aspect, the detectable moiety is a peptide sequence produced by the cell when making the target polypeptide. According to one aspect, the detectable moiety is a peptide sequence produced by the cell when making the target polypeptide, such that the detectable moiety and the target polypeptide is a fusion. According to this aspect, the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location and a detectable moiety attached to the target polypeptide, for example, at the C-end of the target polypeptide. According to one aspect, the detectable moiety is foreign to the cell, i.e. it is not endogenous to the cell.
An exemplary detectable moiety is a fluorescent moiety, such as GFP, that can be detected by fluorimetry, for example. An exemplary detectable moiety is a reporter protein. An exemplary detectable moiety includes a protein that confers antibiotic resistance which can be detected in the presence of an antibiotic. An exemplary detectable moiety includes an enzyme that performs a function (such as Beta-Gaiactosidase) that can lead to easy colorimetric output.
Aspects of the methods described herein may make use of epitope tags and reporter gene sequences as detectable moieties. Non-limiting examples of epitope tags include histidine (His) tags, V5 tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags. Examples of reporter genes include, but are not limited to, glutathione-S-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, betaglucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP).
IV. GENETIC MODIFICATIONS
Aspects of the present disclosure include the genetic modification of a cell to include foreign genetic material which can then be expressed by the cell . The cell may be modified to include any other genetic material or elements useful in the expression of a nucleic acid sequence. Foreign genetic elements may be introduced or provided to a cell using methods known to those of skill in the art. For example, the ceil may be genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a nonstandard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide. The nonstandard amino acid may be encoded by a corresponding nonsense or sense codon. The cell may be genomically receded to recognize an engineered amino-acyl tRNA synthetase corresponding or cognate to a non-standard amino acid. The cell may be genetically modified to include a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and/or a transfer RNA corresponding or cognate to the nonstandard amino acid and wherein the nonstandard amino acid is provided to the cell and the cell expresses the synthetase and the transfer RNA to include the nonstandard amino acid at the amino acid target location. The cell is genetically modified to include a foreign nucleic acid sequence encoding an enzyme for cleaving the removable protecting group under influence of an inducible promoter. The cell is genetically modified to include an inducible promoter influencing the production of an enzyme system for removal of the removable protecting group. The enzyme system or component thereof may be under influence of the inducible promoter. For example, the adapter which helps associate the cleavage enzyme with the removable protecting group may be under influence of an inducible promoter.
In general, nucleic acids may be introduced into a cell using any method known to
those skilled in the art for such introduction. Such methods include transfection, transduction, viral transduction, microinjection, lipofection, nucleofection, nanoparticie bombardment, transformation, conjugation and the like. One of skill in the art will readily understand and adapt such methods using readily identifiable literature sources.
Aspects of the methods described herein may make use of vectors. The term "vector" includes a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors used to deliver the nucleic acids to cells as described herein include vectors known to those of skill in the art and used for such purposes. Certain exemplary vectors may be plasmids, lentiviruses or adeno-associated vimses known to those of skill in the art. Vectors include, but are not limited to, nucleic acid molecules that are single- stranded, double-stranded, or partially double-stranded, nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular), nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a "plasmid," which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, lentivimses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein
as "expression vectors." Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
Aspects of the methods described herein may make use of regulatory elements. The term "regulatory element" is intended to include promoters, enhancers, internal ribosoma! entry sites (IRES), and other expression control elements (e.g. transcription termination signals, such as polyadenylation signals and poly-U sequences). Such regulatory elements are described, for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory elements include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). Regulatory elements useful in eukaryotic cells include a tissue-specific promoter that may direct expression primarily in a desired tissue of interest, such as muscle, neuron, bone, skin, blood, specific organs (e.g. liver, pancreas), or particular cell types (e.g. lymphocytes). Regulatory elements may also direct expression in a temporal-dependent manner, such as in a cell-cycle dependent or developmental stage- dependent manner, which may or may not also be tissue or cell-type specific. In some embodiments, a vector may comprise one or more pol III promoter (e.g. 1, 2, 3, 4, 5, or more
pol III promoters), one or more pol II promoters (e.g. 1, 2, 3, 4, 5, or more ροΐ II promoters), one or more pol I promoters (e.g. 1, 2, 3, 4, 5, or more pol I promoters), or combinations thereof. Examples of pol III promoters include, but are not limited to, U6 and HI promoters. Examples of pol II promoters include, but are not limited to, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et al, Cell, 41 :521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EFl a promoter and Pol II promoters described herein. Also encompassed by the term "regulatory element" are enhancer elements, such as WPRE; CMV enhancers; the R-U5' segment in LTR of HTLV-I (Mol. Cell. Biol, Vol. 8(1), p. 466-472, 1988), SV40 enhancer, and the intron sequence between exons 2 and 3 of rabbit β-globin (Proc. Natl. Acad, Sci , USA., Vol. 78(3), p. 1527-31 , 1981). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression desired, etc, A vector can be introduced into host cells to thereby produce transcripts, proteins, or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., clustered regularly interspersed short palindromic repeats (CRISPR) transcripts, proteins, enzymes, mutant forms thereof, fusion proteins thereof, etc.). Common prokaryotic promoters include IPTG (isopropyl B-D-l-thiogalactopyranoside) inducible, anhydrotetracycline inducible, or arabinose inducible promoters. Such promoters express genes only in the presence of IPTG, anhydrotetracycline, or arabinose in the medium. An exemplary promoter for use in bacteria such as E. coli to express aminoacyl tRNA synthetase is an arabinose inducible promoter. An exemplar}' promoter for use in bacteria such as E. coli to express a reporter protein is an anhydrotetracycline inducible promoter.
Aspects of the methods described herein may make use of terminator sequences. A
terminator sequence includes a section of nucleic acid sequence that marks the end of a gene or operon in genomic DNA during transcription. This sequence mediates transcriptional termination by providing signals in the newly synthesized mRNA that trigger processes which release the mRNA from the transcriptional complex. These processes include the direct interaction of the mRNA secondary structure with the complex and/or the indirect activities of recmited termination factors. Release of the transcriptional complex frees RNA polymerase and related transcriptional machinery to begin transcription of new mRNAs. Terminator sequences include those known in the art and identified and described herein.
V. ADAPTER PROTEIN PROTEASE SYSTEMS
According to one aspect, the cell includes a protease system for degrading the target polypeptide when the N-end amino acid is a standard amino acid. The protease system may be endogenous or exogenous. The cell may include an adapter or discriminator protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid. The adapter protein may be under influence of an inducible promoter. According to one aspect, the adapter protein is ClpS or a variant or mutant thereof. According to one aspect adapter proteins may have different levels of selectivity for certain amino acids. According to certain aspects, adapter proteins, such as ClpS may be altered to improve selectivity, such as between standard amino acids and non-standard amino acids or between a desired NSAA and an undesired NSAA. According to one aspect, the protease system is a ClpS-ClpAP protease system.
According to one aspect, protease systems include Clps or homologs or mutants thereof, such as ClpS V65I. The N-end rule is mediated by homologs of ClpS/ClpAP in bacteria. In eukaryotes, the N-end rule involves more distant homologs of ClpS (UBR1, ubiquitin E3 ligases) and degradation by the proteasome. Accordingly, the present disclosure
contemplates use of many of the bacterial ClpS homologs to perform similar functions with slightly different amino acid recognition specificity. The present disclosure also contemplates use of eukaryotic protease systems, such as UBRl and related variants to mediate N-end rule recognition with different amino acid recognition specificity in eukaryotes.
VI. CELLS
According to certain aspects, cells according to the present disclosure include prokaryotic cells and eukaryotic cells. Exemplary prokaryotic cells include bacteria. Microorganisms which may serve as host cells and which may be genetically modified to produce recombinant microorganisms as described herein may include one or members of the genera Clostridium, Escherichia, Rhodococcus, Pseudomonas, Bacillus, Lactobacillus Saccharomyces, and Enterococcus. Particularly suitable microorganisms include bacteria and archaea. Exemplary microorganisms include Escherichia coli, Bacillus s htilis, and Saccharomyces cerevisiae. Exemplary eukaryotic cells include animal cells, such as human cells, plant cells, fungal cells and the like.
In addition to E. coli, other useful bacteria include but are not limited to Bacillus subtil is. Bacillus megaterium, Bifidobacterium bifida,' a. Caulobacter crescentus, Clostridium difficile, Chlamydia trachomatis, Corynebacterium glutamicum, Lactobacillus acidophilus, Lactococcus laciis, Mycoplasma genitalium, Neisseria gonorrhoeae, Prochlorococcus marinus, Pseudomonas aeruginosa, Psuedomonas put Ida. Treponema pallidum, Streptomyces coelicolor, Synechococcus elongates, Vibrio natrigiens, and Zymomonas mob His.
Exemplary genus and species of bacteria cells include Acetobacter aurantius, Acinetobacter bitumen, Actinomyces israelii, Agrobacterium radiobacter, Agrobacterium
tumefaeiens, Anaplasma Anaplasma phagocytophilum, Azorhizobium caulinodans, Azotobacter vinelandii, viridans streptococci, Bacillus anthracis, Bacillus brevis, Bacillus cereus, Bacillus fusiformis, Bacillus licheniformis, Bacillus megaterium, Bacillus mycoides, Bacillus stearothermophilus, Bacillus subtilis, Bacteroides, Bacteroides fragilis, Bacteroides gingivalis, Bacteroides melaninogenicus (also referred to as Prevotella melaninogenica ), Bartonella ,Bartonelia henselae, Bartonella quintana, Bordetella, Bordetella bronchiseptica, Bordetella pertussis, Borrelia burgdorferi, Brucella abortus, Brucella melitensis, Bmcella suis, Burkholderia, Burkholderia mallei, Burkholderia pseudomallei, Burkholderia cepacia, Calymmatobacterium granulomatis, Campylobacter, Campylobacter coli, Campylobacter fetus, Campylobacter jejuni, Campylobacter pylori, Chlamydia, Chlamydia trachomatis, Chlamydophila Chlamydophila pneumoniae (also known as Chlamydia pneumoniae) Chlamydophila psittaci (also known as Chlamydia psittaci), Clostridium, Clostridium botulinum, Clostridium difficile, Clostridium perfringens (also known as Clostridium welchii), Clostridium tetani, Corynebacterium, Corynebacterium diphtheriae, Cory neb acterium fusiforme, Coxiella burnetii, Ehrlichia chaffeensis, Enterobacter cloacae, Enterococcus, Enterococcus avium, Enterococcus durans, Enterococcus faecalis, Enterococcus faecium, Enterococcus galliinarum, Enterococcus maloratus, Escherichia coli, Franciseila tuiarensis, Fusobacterium nucleatum, Gardnereila vaginalis, Haemophilus, Haemophilus ducreyi, Haemophilus influenzae, Haemophilus parainfluenzae, Haemophilus pertussis, Haemophilus vaginalis, Helicobacter pylori, Klebsiella pneumoniae, Lactobacillus, Lactobacillus acidophilus, Lactobacillus bulgaricus, Lactobacillus casei, Lactococcus lactis, Legionella pneumophila, Listeria monocytogenes, Methanob acterium extroquens, Mi crob acterium multiforme, Micrococcus luteus, Moraxella catarrhal! s, Mycobacterium, Mycobacterium avium, Mycobacterium bovis, Mycobacterium diphtheriae, Mycobacterium intracellulare, Mycobacterium leprae, Mycobacterium lepraemurium, Mycobacterium phiei,
Mycobacterium smegmatis, Mycobacterium tuberculosis, Mycoplasma, Mycoplasma fermentans, Mycoplasma genitalium, Mycoplasma hominis, Mycoplasma penetrans, Mycoplasma pneumoniae, Neisseria, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella, Pasteurella multocida, Pasteurella tularensis, Peptostreptococcus, Porphyron! onas gingivalis, Prevotella melaninogenica (also known as Bacteroides melaninogenicus), Pseudomonas aeruginosa, Rhizobium radiobacter, Rickettsia, Rickettsia prowazekii, Rickettsia psittaci, Rickettsia quintana, Rickettsia rickettsii, Rickettsia trachomae, Rochalimaea, Rochalimaea henselae, Rochalimaea quintana, Rothia dentocariosa, Salmonella, Salmonella enteritidis, Salmonella typhi, Salmonella typhimurium, Serratia marcescens, Shigella dysenteriae, Staphylococcus, Staphylococcus aureus, Staphylococcus epidermidis, Stenotrophomonas maltophilia, Streptococcus Streptococcus agalactiae, Streptococcus avium. Streptococcus bovis, Streptococcus cricetus, Streptococcus faceium, Streptococcus faecaiis, Streptococcus feais, Streptococcus gallinarum, Streptococcus lactis, Streptococcus mitior, Streptococcus mitis, Streptococcus mutans, Streptococcus oralis, Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus rattus, Streptococcus salivarius, Streptococcus sanguis, Streptococcus sobrinus, Treponema, Treponema pallidum, Treponema denticola, Vibrio, Vibrio cholerae, Vibrio comma, Vibrio parahemolyticus, Vibrio vulnificus, Wolbachia, Yersinia, Yersinia enterocolitica, Yersinia pestis, and Yersinia pseudotuberculosis, and other genus and species known to those of skill in the art.
Exemplary genus and species of yeast cells include Saccharomyces, Saccharomyces cerevisiae, Torula, Saccharomyces bouiardii, Schizosaccharomyces, Schizosaccharomyces pombe, Candida, Candida glabrata, Candida tropicaiis, Yarrowia, Candida parapsilosis, Candida krusei, Saccharomyces pastorianus, Brettanomyces, Brettanomyces bruxellensis, Pichia, Pichia guilliermondii, Cryptococcus, Cryptococcus gattii, Torulaspora, Torulaspora delbrueckii, Zygosaccharomyces, Zygosaccharomyces bailii, Candida lusitaniae, Candida
stellata, Geotrichum, Geotrichum candidum, Pichia pastoris, Kluyveromyces, Kluyveromyces marxianus, Candida dubliniensis, Kluyveromyces, Kluyveromyces lactis, Trichosporon, Trichosporon uvarum, Eremothecium, Eremothecium gossypii, Pichia stipitis, Candida mi fieri, Ogataea, Ogataea polymorpha, Candida oleophilia, Zygosaccharomyces rouxii, Candida albicans, Leucosporidium, Leucosporidium frigidum, Candida viswanathii, Candida biankii, Saccharaomyces teiluris, Saccharomyces florentinus, Sporidiobolus, Sporidiobolus salmonicolor, Dekkera, Dekkera anomala, Lachancea, Lachancea kluyveri, Trichosporon, Trichosporon mycotoxinivorans, Rhodotorula, Rhodotorula rubra, Saccharomyces exiguus, Sporobolomyces koalae, and Trichosporon cutaneum, and other genus and species known to those of skill in the art.
Exemplar}' genus and species of fungal cells include Sac fungi, Basidiomycota, Zygomycota, Chtridiomycota, Basidiomycetes, Hyphomycet.es, Glomeromycota, Microsporidia, Blastocladiomycota, and Neocallimastigomycota, and other genus and species known to those of skill in the art.
Exemplary eukaryotic cells include mammalian cells, plant cells, yeast cells and fungal cells.
VII. STANDARD AMINO ACID
As used herein, the term "SAA" (standard amino acid) include one of the L-amino acids that typically naturally occur in proteins on Earth and includes alanine, arginine, asparagine, aspartic acid, cysteine, glutamic acid, glutamine, glycine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, serine, threonine, tyrosine, tryptophan, proline and valine. The standard amino acids that are naturally N-end destabilizing in most bacteria include tyrosine, phenylalanine, tryptophan, leucine, lysine, and arginine. According to one aspect, the amino acid at the amino acid target location is an NSAA that is stabilizing. When
the natural analog of the NSAA is destabilizing and is present at the amino acid target location, degradation of the polypeptide occurs. Standard amino acids that are not naturally destabilizing via the N-end rule using natural ClpS, can be destabilizing when the ClpS is engineered to recognize such standard amino acid.
The N-end rule in bacteria may also be engineered to recognize isoleucine, valine, aspartate, glutamate, asparagine, and glutamine as destabilizing using methods known to those of skill in the art which is useful when the desired NSAA is an analog of these amino acids. For example, isoleucine and valine can be converted into N-end destabilizing residues by introducing a ClpS variant (M40A) that recognizes these amino acids as N-terminal destabilizing residues see (Roman-Hernandez G, Grant RA, Sauer RT, & Baker TA (2009) Molecular basis of substrate selection by the N-end rule adaptor protein ClpS. Proceedings of the National Academy of Sciences 106(22):8888-8893 hereby incorporated by reference in its entirety). Aspartate and glumatate may be converted into N-end destabilizing residues by introducing a bacterial aminoacyl -transferase from Vibrio vulnificus (Bpt) that is a homoiog of eukaryotic transferases and N-terminally appends a leucine (L) to peptides containing N~ terminally exposed aspartate or glutamate (see Graciet E, et al. (2006) Aminoacyl - transferases and the N-end rule pathway of prokaryotic/eukaryotic specificity in a human pathogen. Proceedings of the National Academy of Sciences of the United States of America 103(9):3078-3083 hereby incorporated by reference in its entirety). The ability of Bpt to catalyze this reaction has been demonstrated in E. coli and shows that components of the N- end rule, which includes many more conditionally destabilizing residues in eukaryotes, can be transferred across kingdoms. Asparagine and glutamine can be converted into N-end destabilizing residues by using an N-terminal amidase from S. cerevisiae (NTA1), which converts N-terminal asparagine into aspartate or N-terminal glutamine into glumate, respectively (see Tasaki T, Sriram SM, Park S, & Kwon YT (2012) The N-End Rule
Pathway. Annual Review of Biochemistry 81(l):261-289 hereby incorporated by reference in its entirety). Indeed, in many eukaryotic cells these amino acids and more are naturally conditionally N-end destabilizing. One of skill will understand that an N-end rule destabilizing pathway may be provided for all 20 standard amino acids as a basis for a system where a desired amino acid from among the 20 standard amino acids is N-end destabilizing in at least one context (see Chen, Shun-Jia, et al. "An N-end rule pathway that recognizes proline and destroys gluconeogenic enzymes." Science 355.6323 (2017): eaa!3655 hereby incorporated by rweterence in its entirety). One of skill in the art can identify the eukaryotic proteins required for conferring expanded N-end destabiiization and transfer them to prokaryotes as needed. Similarly, in eukaryotic cells one can constitutiveiy express components required for conferring expanded N-end destabiiization such that degradation of proteins containing N-end standard amino acids no longer remains conditional. One of skill will recognize that some amino acids rendered destabilizing may have adverse consequences for cell physiology. For example, most native proteins begin with methionine and if methionine is made N-end destabilizing then most proteins would degrade. Aspects of converting an N-end stabilizing amino acid to an N-end destabilizing amino acid can be tested in a particular organism.
VIII NON-STANDARD AMINO ACID
As used herein, the term "NSAA" refers to an unmodified amino acid that is not one of the 20 naturally occurring standard L-amino acids. NSAAs also include synthetic amino acids which have been designed to include a non-standard functional group not present in the standard amino acids or are naturally occurring amino acids bearing functional groups not present in the set of standard amino acids. Accordingly, a non-standard amino acid may include the structure of a standard amino acid and which includes a non-standard functional
group. A non-standard amino acid may include the basic amino acid portion of a standard amino acid and include a non-standard functional group.
NSAAs also refer to natural amino acids that are not used by all organisms (e.g. L- pyrrolysine (B. Hao et al ., A new uag-encoded residue in the structure of a methanogen methyltransferase. Science. 296: 1462) and L-selenocysteine (S. Osawa et al., Recent evidence for evolution of the genetic code. Microbiol Mol Biol Rev. 56:229)). NSAAs are also known in the art as unnatural amino acids (UAAs) and non-canonical amino acids (NCAAs).
NSAAs include, but are not limited to, p-Acetylphenyl alanine, m- Acetylphenylalanine, O-aliyltyrosine, Phenylselenocysteine, p-Propargyloxyphenylalanine, p-Azidophenylalanine, p-Boronophenyl alanine, O-m ethyl tyrosine, p-Aminophenylalanine, p- Cyanophenylalanine, m-Cyanophenyl alanine, p-Fluorophenylalanine, p-Iodophenyl alanine, p-Bromophenylalanine, p-Nitrophenylaianine, L-DOPA, 3-Aminotyrosine, 3-Iodotyrosine, p- Isopropyiphenyl alanine, 3-(2-Naphthyl)alanine, biphenylalanine, homoglutamine, D-tyrosine, p-Hydroxyphenyllactic acid, 2-Aminocaprylic acid, bipyridylalanine, HQ-alanine, p~ Benzoylphenylalanine, o-Nitrobenzylcysteine, o-Nitrobenzylserine, 4,5-Dimethoxy-2- Nitrobenzyiserine, o-Nitrobenzyilysine, o-Nitrobenzyityrosine, 2-Nitrophenyf alanine, dansylalanine, p-Carboxymethylphenylalanine, 3-Nitrotyrosine, sulfotyrosine, acetyllysine, methylhistidine, 2-Aminononanoic acid, 2-Aminodecanoic acid, pyrrolysine, Cbz-lysine, Boc-lysine, allyloxycarbony] lysine, arginosuccinic acid, citrulline, cysteine sulfinic acid, 3,4- dihydroxyphenylalanine, homocysteine, homoserine, ornithine, 3-monoiodotyrosine, 3,5- diiodotryosine, 3, 5, 5, -triiodothyronine, and 3,3 ',5,5'-tetraiodothyronine. Modified or unusual amino acids include D-amino acids, hydroxy-lysine, 4-hydroxyproline, N-Cbz-protected amino acids, 2,4-diaminobutyric acid, homoarginine, norleucine, 'N -methyl aminobutyric acid, naphthyialanine, phenylglycine, -phenylproline, tert-leucine, 4-aminocyclohexylalanine, N-
methyl -norleueine, 3,4-dehydroproline, Ν,Ν-dimethylaminoglycine, N-methylaminoglycine, 4-aminopiperidine-4-carboxylic acid, 6-aminocaproic acid, trans-4-(aminomethyl)- cyclohexanecarboxylic acid, 2-, 3-, and 4-(aminomethyl)-benzoic acid, 1- aminocyclopentanecarboxylic acid, 1 -aminocyclopropanecarboxylic acid, and 2~benzyl-5~ aminopentanoic acid, and the like. NSAAs also include amino acids that are functionalized, e.g., alkyne-functionaiized, azide-functionalized, ketone-functionaiized, aminooxy- functionaiized and the like. For reviews of NSAAs and lists of NSAAs suitable for use in certain embodiments of the subject invention, see Liu and Schultz (2010) Ann. Rev. Biochem. 79:413, and Kim et al. (2013) Curr. Opin. ( "hem. Biol. 1 7:412, each of which is incorporated herein by reference in its entirety for all purposes.
In certain aspects, an NSAA of the subject invention has a corresponding aminoacyl tRNA synthetase (aaRSVtRNA pair. In certain aspects, the aminoacyl tRNA synthetase/tRNA pair is orthogonal to those in a genetically modified organism such as, e.g., a prokaryotic cell, a bacterium (e.g., E. coif), a eukaryotic cell, a yeast, a plant cell, an insect cell, a mammalian cell, a virus, etc. In certain aspects, an NSAA of the subject invention is non-toxic when expressed in a genetically modified organism such as, e.g., a prokaryotic cell, a bacterium (e.g., E. coli), a eukaryotic cell, a yeast, a plant cell, an insect cell, a mammalian cell, a virus, etc. In certain aspects, an NSAA of the subject invention is not or does not resemble a natural product present in a cell or organism. In certain aspects, an NSAA of the subject invention is hydrophobic, hydrophilic, polar, positively charged, or negatively charged. In other aspects, an NSAA of the subject invention is commercially available (such as, e.g., L-4,4- bipnehyiaianine (bipA) and L-2-Naphthylalanine (napA)) or synthesized according to published protocols.
EXAMPLE I
Exemplary Degradation Materials and Methods
According to one aspect, the disclosure provides a method of making a protein having a non-standard amino acid incorporated therein, such as at its N-terminus, in a cell . The cell is provided with a nucleic acid sequence encoding a ubiquitin fused to the N-terminus of the protein wherein the N-terminus of the protein is an amino acid target location intended to have a nonstandard amino acid. The nonstandard amino acid may be encoded by a nonsense or sense codon. The cell is provided with a ubiquitin cleavase. The cell may include an endogenous protease system, such as a ClpS-ClpAP system. The cell is provided with a nonstandard amino acid. The cell expresses the fusion protein having either a standard or a nonstandard amino acid incorporated at the amino acid target location. The ubiquitin cleavase cleaves the ubiquitin to produce a protein having either the standard or non-standard intervening amino acid at its N-terminus. If a standard amino acid is present at the N- terminus, the ClpS recognizes the standard amino acid at the N-terminus and targets the protein having the standard amino acid at its N-terminus to ClpP for degradation. If a nonstandard amino acid is present at the N-terminus, the Clps does not recognize the nonstandard amino acid and the protein is not targeted for degradation. A residue is destabilizing if it is recognized by the ClpS adaptor protein, which is the discriminator of the N-end rule in E. coli such as is described in Erbse A, et al. (2006) ClpS is an essential component of the N-end rule pathway in Escherichia coli. Nature 439(7077):753-756 and Wang KH, Oakes ESC, Sauer RT, & Baker TA (2008) Tuning the Strength of a Bacterial N- end Rule Degradation Signal. Journal of Biological Chemistry 283(36):24600-24607; Schmidt R, Zahn R, Bukau B, & Mogk A (2009) ClpS is the recognition component for Escherichia coli substrates of the N-end rule degradation pathway. Molecular Microbiology 72(2):506-517.; Roman-Hernandez G, Grant RA, Sauer RT, & Baker TA (2009) Molecular
basis of substrate selection by the N-end rule adaptor protein ClpS. Proceedings of the National Academy of Sciences 106(22): 8888-8893; Schuenemann VJ, et al. (2009) Structural basis of N-end rule substrate recognition in Escherichia coli by the ClpAP adaptor protein ClpS. EMBO reports 10(5):508-514; Roman-Hernandez G, Hou Jennifer Y, Grant Robert A, Sauer Robert T, & Baker Tania A (201 ) The ClpS Adaptor Mediates Staged Delivery of N- End Rule Substrates to the AAA+ ClpAP Protease. Molecular Cell 43(2):217-228; and Hou JY, Sauer RT, & Baker TA (2008) Distinct structural elements of the adaptor ClpS are required for regulating degradation by ClpAP, Nat Struct Mo I Biol 15(3):288-294 each of which is hereby incorporated by reference in its entirety.
According to another aspect, the disclosure provides a method of screening for an amino acyl tRNA synthetase variant that preferentially selects a non-standard amino acid against its standard amino acid counterpart for incorporation into a protein in a cell . The cell is provided with an amino acyl tRN A synthetase variant. As shown in Figs. L and IB, the cell is provided with a nucleic acid sequence encoding a ubiquitin fused to the N-terminus of the protein wherein the N-terminus of the protein is an amino acid target location intended to have a nonstandard amino acid, and wherein GFP is fused to the C-end of the protein. The nonstandard amino acid may be encoded by a nonsense or sense codon. The cell is provided with a ubiquitin cleavase, such as Ubpl . The cell may include an endogenous protease system, such as a ClpS-ClpAP system. The cell is provided with a non-standard amino acid. The cell expresses the fusion protein having either a standard or a non-standard amino acid incorporated at the amino acid target location. The ubiquitin cleavase cleaves the ubiquitin to produce a protein having either the standard or non-standard intervening amino acid at its N- terminus. If a standard amino acid is present at the N-terminus, the ClpS recognizes the standard amino acid at the N-terminus and targets the protein having the standard amino acid at its N-terminus to ClpP for degradation, including the GFP portion. If a nonstandard amino
acid is present at the N-terminus, the Clps does not recognize the nonstandard amino acid and the protein is not targeted for degradation. The GFP is detected and is indicative of the presence of a synthetase variant that preferentially selects the non-standard amino acid against its standard amino acid counterpart for incorporation into the protein.
According to another aspect, the strength of the signal detected from the GFP is indicative of the amount of protein produced that included the nonstandard amino acid. In this manner, methods are provided for screening and evolving an amino acyl tRNA synthetase variant that preferentially selects a non-standard amino acid against its standard amino acid counterpart for incorporation into a protein in a cell.
EXAMPLE II
A method of making a protein having a non-standard amino acid incorporated at. its N- terminus in an engineered E. coli having orthogonal translation systems
To evaluate and classify various orthogonal translation systems in engineered E. coli for NSAA incorporation and proofreading, a fusion protein was constructed having a ubiquitin fused to the N-terminus of a GFP reporter protein having an intervening amino acid encoded by a UAG codon in between the ubiquitin and GFP. The plasmid containing the nucleic acid sequence of the fusion protein was introduced into the E. coli. In addition, the expression of a ubiquitin cleavase from yeast (Ubpl) was induced in E. coli as described in Tobias JW & Varshavsky A (1991) Cloning and functional analysis of the ubiquitin-specific protease gene UBP1 of Saccharomyces cerevisiae. J Biol Chem 266 hereby incorporated by reference in its entirety.
Finally, the native ClpS and the ClpP protease were overexpressed to determine their effect on the kinetics of GFP reporter degradation. The abundance of the GFP was measured as green fluorescence normalized by optical density (FL/OD) using a standard plate reader for fluorimetric and spectrophotometric assays. As shown in Figs 2A-2B, background protein
synthesis (FL/OD) was nearly eliminated for all tested orthogonal translation systems upon expression of Ubpl/CipS. Wild-type ClpS was not strictly selective as it recognized several small sized NSAAs and led to their degradation.
Since ClpS is endogenous to E. coli, it need not be overexpressed. However, overexpression, such as by use of an inducible promoter, increases the concentration of ClpS and therefore the ability of ClpS to discriminate between desired nonstandard amino acids and undesired standard amino acids or undesired NS AAs.
EXAMPLE III
Engineered ClpS V65I mutant
As shown in Figs. 4A-4B, the crystal structure of the ClpS protein was used to rationally engineer ClpS variants that have tunable specificities for degrading different sizes of NSAAs, including undetectable degradation of tested analogs of Y/F while maintaining full degradation of Y/F with the ClpS V65I mutant. The term "V65I" is a standard notation for describing a mutation where the amino acid I has been substituted for amino acid V at position 65, The ClpS_V65I mutant was capable of distinguishing between standard amino acids and their nonstandard amino acid counterparts when positioned at the N-terminus of a protein for purposes of degradation. One aspect of the present disclosure is to inactivate endogenous ClpS if the ClpS V65I mutant is being used. Inactivation of ClpS prevents indiscriminate or nonselective degradation of proteins including smaller nonstandard amino acids. Another useful mutant is ClpS_L32F, which causes biphenylalanine (bipA) and p- benzoylphenylalanine (pBnzylF) to be N-end stabilizing and all other tested NSAAs to be N- end destabilizing. In other words, the ClpS L32F mutant can be used to discriminate between incorporated bipA and all other incorporated NSAAs other than pBnzylF (see Fig. 3E).
EXAMPLE IV
Local Sense Codon Reassignment
Aspects of the present disclosure are directed to local sense codon reassignment. Instead of using a stop codon for nonstandard amino acid incorporation, a sense codon can be used which creates competition for an N-terminal sense codon between a standard amino acid and a nonstandard amino acid. If the standard amino acid corresponding to the sense codon is destabilizing by the N-end rule, all of the proteins which do not include a sense codon reassigned to a nonstandard amino acid will be degraded. As shown in Fig. IE with respect to the UAC codon which normally encodes tyrosine, when expression of Ubpl/ClpS is induced, nearly all of the GFP containing tyrosine gets degraded. If competition for the UAC codon with nonstandard amino acid is present, then nonstandard amino acid incorporation into the N-terminus of the GFP reporter protein can be detected by measuring the FL/OD increase,
EXAMPLE V
Materials and Methods
Strains and strain engineering
E. coli strain C321.AA (CP006698.1), which was previously engineered to be devoid of UAG codons and RFl, was used as the E coli strain. E. coli strain C321AA. (CP006698.1) is described in M. J. Lajoie et ah, Genomically Recoded Organisms Expand Biological Functions. Science (80- ). 342, 357-360 (2013) hereby incorporated by reference in its entirety. A starting reporter construct of an Ubiquitin-*-LFVQEL-sfGFP-His6x fusion under an anhydrotetracycline (ATC) inducible expression system was genomically integrated using λ Red recombineering as described in . A. Datsenko, B. !,. Wanner, One-step inactivation of chromosomal genes in Escherichia coli K-12 using PGR products. Proc Natl Acad Sci USA. 97, 6640-6645 (2000) and D. Yu et ah, An efficient recombination system for
chromosome engineering in Escherichia coli. Proc. Natl. Acad. Sci. U. S. A, 97, 5978-83 (2000) each of which are hereby incorporated by reference in its entirety and tolC negative selection using Colicin El as described in J, A, DeVito, Recombineering with tolC as a Selectable/Counter-selectable Marker: remodeling the rRNA Operons of Escherichia coli . Nucleic Acids Res. 36, e4 (2008) and C. J. Gregg et al., Rational optimization of tolC as a powerful dual selectable marker for genome engineering. Nucleic Acids Res. 42, 4779-4790 (2014) each of which are hereby incorporated by reference in its entirety. This resulted in strain C321.AA.Ubiq-UAG-sfGFP. Table 1 includes sequences of key constructs such as the reporter construct. Multiplex automatable genome engineering (MAGE) as described H. H. Wang et al. Programming ceils by multiplex genome engineering and accelerated evolution. Nature. 460, 894-898 (2009) hereby incorporated by reference in its entirety was used to inactivate the endogenous mutS and clpS genes when needed and to add or remove UAG codons in the integrated reporter, in brief, overnight cultures were diluted 100-fold into 3 mL LBL containing appropriate antibiotics and grown until mid-log. Lambda Red was induced in a shaking water bath (42 °C, 300 rpm, 15 minutes), then induced culture tubes were cooled rapidly in an ice slurry for at least two minutes, Electrocompetent cells were prepared at 4 °C by pelleting 1 mL of culture (centrifuge at 16,000 rcf for 20 seconds) and washing the cell pellet twice with 1 mL ice cold deionized water (dH20). Electrocompetent pellets were resuspended in 50
of dH20 containing the desired DNA. For MAGE oligonucleotides, 5 uM of each oligonucleotide was used. Table 2 includes a list of all oligonucleotides used. For integration of dsDNA cassettes, 50 ng was used. Allele-specific colony PCR (ASC-PCR) was used to identify desired colonies resulting from MAGE as described in F. J. Isaacs et al. Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science (80-. ). 333, 348-353 (2011) hereby incorporated by reference in its entirety. Colony PCR was performed using Kapa 2G Fast HotStart ReadyMix according to manufacturer
protocols and Sanger sequencing was performed by Genewiz to verify strain engineering. The strains C321.AA.Ubiq-UAG-sfGFP and C321.AA.Ubiq-UAG-sfGFP_UAG151 are available from Addgene.
Table 1 . Sequences of key constructs
GTGACATTGCAGAGACGAAAGCCAAGGAGGCGACCGACTTGGG
GAAGGAAGCAGGTTTTCCTTTGATGTTCACGACTGAGCCCGAGG
AGTAA
pAzFRS. l .tl GTTATGcactacGATggtgttgacgttTACgttggtggtatggaacagcgtaaaatccacatgct gBlock ggcgcgtgaactgctgccgaaaaaagttgtttgcatccacaacccggttctgaccggtctggacggtgaag gtaaaatgtcttcitctaaaggtaacttcatcgcggtigacgactctccggaagaaatccgtgcgaaaatcaa aaaagcgtactgcccggcgggtgttgttgaaggtaacccgatcatggaaatcgcgaaatacttcctggaat acc cgctgaccatc aaaGGT
Ubpl_Sc_trunc ATGGGGAGTGGGTCTTTCATTGCTGGGCTTGTCAACGATGGTAA
TACGTGTTTTATGAACTCGGTTCTTCAGTCCCTTGCTAGTAGCCG
TGAACTTATGGAGTTTTTGGATAATAATGTAATCCGTACATATG
AAGAAATTGAACAGAACGAGCACAATGAGGAAGGTAATGGCCA
AGAGAGCGCACAAGATGAGGCAACTCACAAAAAAAACACTCGC
AAGGGAGGTAAGGTCTATGGGAAGCATAAAAAGAAATTAAACC
GCAAATCTTCTAGCAAGGAAGACGAAGAAAAGTCGCAAGAACC
AGACATTACGTTTTCGGTGGCGTTGCGTGATCTGCTGAGCGCAT
TAAATGCTAAGTATTATCGCGACAAACCCTACTTTAAGACTAAC
TCTTTATTAAAAGCGATGAGCAAGTCCCCGCGCAAAAATATCTT
GCTTGGGTACGATCAAGAAGACGCTCAGGAATTTTTTCAAAACA
TTCTTGCGGAGTTAGAATCTAATGTCAAGTCGTTAAACACAGAA
AAGCTTGATACTACACCGGTAGCCAAGTCCGAACTTCCAGACGA
TGCTCTGGTTGGCCAATTAAACCTTGGTGAGGTAGGCACCGTGT
ACATTCCCACAGAACAAATTGACCCCAATTCGATTTTACATGAC
AAATCGATTCAAAACTTTACCCCCTTTAAACTGATGACCCCGTT
GGATGGGATCACGGCTGAGCGCATCGGCTGCCTGCAATGCGGA
GAGAACGGGGGAATTCGCTACAGTGTTTTCAGCGGATTAAGTTT
GAACCTGCCGAATGAAAATATTGGAAGCACTCTTAAACTGTCCC
AGTTACTGTCCGATTGGTCGAAACCCGAGATTATCGAGGGTGTT
GAATGCAACCGTTGCGCTTTAACAGCTGCGCACTCACACTTGTT
TGGCCAATTAAAGGAGTTTGAGAAGAAACCTGAAGGCTCGATTC
CCGAAAAACTTATTAATGCCGTAAAGGACCGCGTGCACCAGATC
GAAGAGGTCTTGGCAAAGCCGGTTATCGACGATGAAGATTATA
AAAAATTGCATACTGCGAATATGGTCCGCAAGTGTTCAAAAAGT
AAACAAATTCTTATCTCTCGTCCACCACCTTTGTTGTCTATTCAT
ATCAACCGCTCTGTTTTCGACCCGCGCACCTACATGATTCGCAA
GAACAACTCCAAGGTTTTGTTCAAGTCACGCTTGAACCTGGCAC
CCTGGTGCTGTGATATCAACGAAATCAATCTTGACGCACGCCTT
CCGATGTCGAAGAAGGAAAAAGCAGCTCAACAAGATTCTTCTG
AAGACGAGAACATTGGCGGAGAGTACTATACTAAATTGCATGA
ACGTTTTGAGCAGGAGTTTGAAGATTCTGAAGAAGAGAAGGAA
TACGATGATGCAGAGGGTAATTATGCATCGCATTATAACCATAC
CAAGGACATCTCCAACTACGATCCATTGAATGGAGAAGTCGACG
GTGTGACTTCCGATGATGAGGATGAATACATTGAAGAGACAGA
CGCGTTGGGGAATACCATCAAAAAACGTATTATTGAACACTCCG
ACGTGGAGAACGAAAACGTGAAGGATAATGAAGAACTTCAGGA
GATCGATAACGTTAGCTTGGATGAGCCAAAAATTAATGTCGAGG
ACCAGCTTGAAACGAGTTCTGATGAGGAAGACGTTATTCCTGCT
CCACCCATCAACTACGCTCGCAGCTTTAGTACGGTCCCAGCGAC
CCCTTTAACTTACTCTTTGCGCAGCGTCATCGTGCACTATGGGAC
TCACAACTACGGACATTATATTGCATTTCGCAAGTATCGTGGAT
GTTGGTGGCGCATCTCCGATGAGACGGTCTATGTGGTAGATGAG
GCCGAAGTACTGTCAACACCGGGGGTATTTATGCTTTTCTACGA
GT ATGATTTCG AC GAGG AG ACC GGA AAAATGAAAGACG ACTT A
GAAGC T ATCC AGA GC AAT A ATG AGGAAG AT GACG AG A A AG A AC
AGGAACAGAAGGGTGTCCAGGAGCCAAAAGAATCCCAGGAGCA
AGGCGAAGGCGAAGAACAAGAAGAAGGGCAAGAGCAAATGAA
ATTTGAGCGTACGGAGGATCATCGCGACATTTCAGGGAAGGATG
TGAATTAA
Table 2, Oligonucleotides used
Upbl-P2-R TGATGATGATGGTCGACTTATTAATTCACATCCTTCCCTGA pUbi-*-Ndeg- TGCGTCTGCGTGGAGGATAGTTGTTTGTGCAGGAGCTTGC
GFP-F
pUbi-*-Ndeg- AAGCTCCTGCACAAACAACTATCCTCCACGCAGACGC
GFP-R
Ubpl_int-seq-F GCTTGGGTACGATCAAGAAG
Ubpl_int-seq-R CCTTGGTATGGTTATAATGCG
pZE21bbone4Ubp CAGGGAAGGATGTGAATTAAAAGCTTGATGGGGGATCCCA 1-F
pZE21bbone4Ubp ATGAAAGACCCACTCCCCATGGTACCTTTCTCCTCTTTAATGA 1-R AT
Ubpl-ins-F TTAAAGAGGAGAAAGGTACCATGGGGAGTGGGTCTTTCAT
Ubpl-ins-R TGGGATCCCCCATCAAGCTTTTAATTCACATCCTTCCCTGA
UbiGFPins-F TAAAGAGGAGAAAGGTACCATGCAGATTTTTGTGAAGACTTT
AAC
UbiGFPins-R TGGGATCCCCCATCAAGCTTTTAGTGGTGGTGGTGGTGGT pZEbb one4Ubi GF ACCACCACCACCACCACTAAAAGCTTGATGGGGGATCCCA P-F
pZEbb one4Ubi GF GTCTTCACAAAAATCTGCATGGTACCTTTCTCCTCTTTAATGA P-R AT
reporter to geno TTACGGGCTAATTACAGGCAGAAATGCGTGATGTGTGCCACA me-F CTTGTTGATCCCTATCAGTGATAGAGATTGAC
reporter to geno CCAGCGGGCTAACTTTCCTCGCCGGAAGAGTGGTTAACAAAA
me-R TAGTAACGTCACCGACAAACAACAGATAAAAC
SIR-seq-F C C AAAGTGAGTTGAGT AT AAC
SIR-seq-R TTTCTCCTTATTATCAATGC
r2g-extend-F GCCGCAGCAAGCCAAAGTGAGTTGAGTATAACGCAAATTTGC
TACTGGTCCGATGGGTGCAATGGTCTGAATTACGGGCTAATT
ACAGGC
r2g-extend-R AACGCAATCGCAACCGCTAAACCACTGGCCATGTGCACGAGT
TTCATTCATTTCTCCTTATTATCAATGCACCAGCGGGCTAACT TTC
MAGE_*toS t*a*aagagctcctcgcccttggatgcAAGCTCCTGCACAAACAACgATCCTC
CACGCAGACGCAGAACCAAATGAAGGGTAGATTCTTTCT
asPCR-S-F C GTC TGC GTGGAGG ATC
asPCR-*-F C GTC TGC GTGGAGG AT A
pZE- TTCTGACCCATCGTAATTAAaagcttgatgggggatccca
Ubp lbbone4ClpP-
F
pZE- tGGTATATCTCCTTTTATTATTAATTCACATCCTTCCCTGAAAT
Ubplbbone4ClpP- R
clpPins-F GTGAATTAATAATAAAAGGAGATATACCatgTCATACAGCGGC
GA
clpPins-R tgggatcccccatcaagcttTTAATTACGATGGGTCAGAATCG
pEVOLtRNA-pl- ctgccaacttactgatttagtgtatgatggtgtttttgagg
F
pEVOLtRNA-pl- gccgcttagttagccgtgcaaacttatatcgtatggggctg
R
pEVOLtRNA-p2- agccccatacgatataagtttgcacggctaactaagcggc
F
pEVOLtRNA-p2- ctcaaaaacaccatcatacactaaatcagtaagttggcagcatca
R
pZE- TGTGT AC GC T AGA A A A AGC C T AAaagcttgatgggggatc
Ubplbbone4ClpS- pZE- GTTCGTTTTACCcatGGTATATCTCCTTTTATTATTAATTCACAT
Ubplbbone4ClpS- R
ClpSins-F ATAATAAAAGGAGATATACCatgGGTAAAACGAACGACTG
ClpSins-R gatcccccatcaagcttTTAGGCTTTTTCTAGCGTACACA
AARSlibraryins-F tactgtttctccatacccgtttttttgggctaacaggaggaattagatct
pEVOLbbone41ib- agatctaattcctcctgttagcc
R
mutS null mut-2* A*C*CCCATGAGTGCAATAGAAAATTTCGACGCCCATACGCCC
ATGATGCAGCAGTGATAGTCGCTGAAAGCCCAGCATCCCGAG ATCCTGC
mutS null reveri- A* C * CC C ATG AGTGC AAT AGAA AATTTCGACGCC C AT ACGC CC 2* ATGATGCAGCAGTATCTCAGGCTGAAAGCCCAGCATCCCGAG
ATCCTGC
mutS- CCATGATGCAGCAGTATCTCAG
2__ascPCR.__wt~F
mutS- CCATGATGCAGCAGTGATAGTC
2 ascPCR mut-F
mutS-2 ascPCR- AGGTTGTCCTGACGCTCCTG
R
ASPCR- 151UAG- GTATAATTTCAATTCCCATAATGTATAG
F
ASPCR- 151UAC- GTATAATTTCAATTCCCATAATGTATAC
F
ASPCR-151-R ctcgagcttatagagctcatc
Rem ove 151 U AG- c*t*taaaattcgccttaatgccattcttttgcttatctgcggtaatgtatacattatgggaattgaaattata MAGE_corrected ctccagcttatggccgag
ClpS.inact-MAGE C*T*TTTTCTTCCGCCAGTTGATCAAAGTCCAGCCAGTCGTTCta
TTatCaCATTGTCAGTTATCATCTTCGGTTACGGTTATCGGCAG
AAC
ASPCR- C C GAT A AC C GT A AC C GA AG ATG AT A AC TGAC A ATGG
ClpS_WT-F
ASPCR- C C GAT A AC C GT A AC C GA AG ATG AT A AC TGAC A ATGT
ClpS.inact-F
ASPCR-ClpS-R CGTACTTGTTCACCATCGCCACTTTGGT
pZE-U- CGACTGAGCCCGAGGAGTAAaagcttgatgggggatccca
bbone4ClpS2_At-
F
pZE-U- TCAACAGGACTATCAGACATGGTATATCTCCTTTTATTATTAA bbone4ClpS2_At- TTCACATCC
R
ClpS2_At-ins-F ATAATAAAAGGAGATATACCATGTCTGATAGTCCTGTTGACT
T
C!pS2 At-ins-R tgggatcccccatcaagcttTTACTCCTCGGGCTCAGTCG
ClpS_M40A-F ATGATGATTACACTCCGGCGGAGTTTGTTATTGACGTGT
ClpS_M40A-R CGTCAATAACAAACTCCGCCGGAGTGTAATCATCATTGAC pOSIPbbone-F taacciaaactgacaggcat
pOSIPbbone-R ttccgatccccaattcct
pEVOL-araC-seq- GGATCATTTTGCGCTTCAG
1
pEVOL-araC-seq- GAATATAACCTTTCATTCCC
2
PylRSmiddle-seq GTGTTTCGACTAGCATTTC
PylRSeml-seq GGTCAAACATGATTTCAAAAAC
pEVOLCmR-seq- caacagtactgcgatgag
R
upstreamClpS-F GC AAAT AAGC TC TTGTC AGC
ClpS_L32-NTC-F CATCTATGTATAAAGTGATANTCGTCAATGATGATTACACTCC
G
ClpS_32-R TATCACTTTATACATAGATG
ClpS-V43-NTT-F ATTACACTCCGATGGAGTTTNTTATTGACGTGTTACAAAAATT
C
ClpS_43-R AAACTCCATCGGAGTGTAAT
ClpS_V65-NTT-F CAACGCAATTGATGCTCGCTNTTCACTACCAGGGGAAGG
ClpS_65-R AGCGAGCATCAATTGCGTTG
ClpS_L99-NTC-F CGAGGGAGAATGAGCATCCA TCCTGTGTACGCTAGAAAAAG
C
ClpS_99-R TGGATGCTCATTCTCCCTCG
Alt_ClpS- gcggatttgtcctactcag
R_forL99
AARS-inducible- gctaacaggaggaattagatct
only-F
AARS-inducible- ttgataatctaacaaggattatggg
only-R
pEVOLbbone- cccataatccttgttagattatcaaaggcattttgctattaaggg
Ind-only-F
pEVOL-bbone- agatctaattcctcctgttagc
ind-only-R
protosens-bbone-F TAACTCGAGGCTGTTTTGG
protosens-bbone- CATATGTATATCTCCTTGTGCATC
R
Ubpl ClpS4protos GATGCACAAGGAGATATACATATGGGGAGTGGGTCTTTCAT ens-F
Ubp 1 ClpS4protos CCAAAACAGCCTCGAGTTAGGCTTTTTCTAGCGTACA ens-R
pAzFRS. l .tl-ins- acccgatcatgcaggttaacGTTATGcactacGATggtgt pAzFRS. l .tl-ins- tcaccaccgaatttttccggACCtttgatggtcagcg
R
bbone4pAzFRS. l . ccggaaaaattcggtggtga
tl-F
bbone4pAzFRS. l . gttaacctgcatgatcgggt
tl-R
pZEbbone4tetR-F acgctctcctgagtaggac
pZEbbone4tetR-R tcaccgacaaacaacagataaaac
TetR-ins-F tatctgttgtttgtcggtgaacgtctcattttcgccagat
TetR-ins-R gtcctactcaggagagcgtagtgtcaactttatggctagc
pDULE-ABK- cgacctgaatggaagcc
bbone-F
pDULE-ABK- catacacggtgcctgac
bbone-R
CrnRms4pDULE- aacgcagtcaggcaccgtgtatggagaaaaaaatcactggatatac
F
CmR4pDULE-R gccggcttccattcaggtcgaaaaaattacgccccgc
pCNFRS-65-67- CAAAATGCTGGATTTGATATAATTATAJNNKTTG NKGATTTA 70-NNK-F NN GCCTATTTAAACCAGAAAGGAGAG
pCNFRS-65-R TATAATTATATCAAATCCAGCATTTTGTAAATC
pCNFRS-108- GGCAAAATATGTTT ATGGAAGTGAANNKNNKCTTGATAAGGA 109-114- K-F TNNKACACTGAATGTCTATAGATTGGC
pCNFRS-108-R TTCACTTCCATAAACATATTTTGCC
pCNFRS-155- GAAGTTATCTATCCAATAATGNNKGTTNNKGGTGCTCATNNK 157-161 -NNK-F CTTGGCGTTGATGTTGCAG
pCNFRS-155-R CATTATTGGATAGATAACTTCAGCAAC
library INS-seq-R CGCATCAGGCAATTTAGC
BipARS_P144Q-F cgcgcgtgaagacgaaaaccagaaagttgcggaagttatctac
BipARS_P144Q- ggttttcgtcttcacgcg
R
BipARS_N157K- tacccgatcatgcaggttaaaggtatccactacaaaggtgttg
BipARS_N157K- ttaacctgcatgatcgggta
R
BipARS R!8!C- gtaaaatccacatgctggcgtgtgaactgctgccgaaa
BipARS R!8!C- cgccagcatgtggatttta
R
BipARS 12551·'-!·" tcctggaatacccgctgaccttcaaacgtccggaaaaattc
BipARS J255F-R ggtcagcgggtattccag
BipARS_E259V- gctgaccatcaaacgtccggtaaaattcggtggtgacctg
F
BipARS_E259V- ccggacgtttgatggtc
R
BipARS_P284S-F tcaaaaacaaagaactgcactcgatgcgtctgaaaaacg
BipARS R284S- gtgcagttctttgtttttgaac
pEVOLbbone41ib ctgcagtttcaaacgctaa attg
v2-F
AARSlibraryinsv2 taggcctgataagcgtagcgcatcaggcaatttagcgtttgaaactgcag ~R
BipARS _G257R- aatacccgctgaccatcaaacgtccggaaaaattcggtg
F
BipARS_G257R- accaccgaatttttccggacgtttgatggtcagcgggtat
R
BipARS-lOOAA-F gcgaaatacgtttacggttc
BipARS-lOOAA- gaaccgtaaacgtatttcgc
R
BipARS-200AA-F ggacggtgaaggtaaaatgtc
BipARS-200AA- gacattttaccttcaccgtcc
R
pZErepbb one4pyl cggcgccagggttgtttttcacgctctcctgagtaggaca
T-F
pZErepbb one4pyl ttccattcaggtcgaaaaaaagtgtcaactttatggctagc
T-R
pylTpDULE-F ttttttcgacctgaatggaagc
pylTpDULE-R gaaaaacaac cctggcgc
pZEbbone4pyl Ton cggcgccagggttgtttttcacgctctcctgagtaggaca
ly-F
pZEbb one4pyl Ton ttccattcaggtcgaaaaaactcgaggtgaagacgaaagg
ly-R
ClpS-Lib-F ACATTTCAGGGAAGGATGTGAATTAATAATAAAAGGAGATAT
ACC
ClpS-Lib-R gcgtaccatgggatcccccatcaagcttTTA
pZEbbone4ClpSli TAAaagcttgatgggggatc
b-F
pZEbbone4ClpSli GGTATATCTCCTTTTATTATTAATTCACATCC
b-R
ClpS-Lib-Seq GGATCATCGCGACATTTC
Plasmids and plasmid construction
Two copies of orthogonal MjTyrRS-derived AARSs and tRNAc' y AOpt were provided in pEVOL plasmids by Dr. Peter Schultz (Scripps Institute) and as described in T. S. Young, I. Ahmad, J. A. Yin, P. G. Schultz, An Enhanced System for Unnatural Amino Acid Mutagenesis in E, coli. J. Mol. Biol. 395, 361-374 (2010) hereby incorporated by reference in its entirety. The pEVOL plasmids were maintained using chloramphenicol. Original plasmids harboring two AARS copies were used for synthetase promiscuity comparison experiments as shown in Figs lA-lE, Figs, 2A-2B and Figs, 3A-3E. For generation and characterization of synthetase variants, plasmids harboring only one AARS copy under inducible expression were constructed using Gibson assembly as described in D. G. Gibson et al.. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods. 6, 343-345 (2009) hereby incorporated by reference in its entirety. Plasmids containing
PyllFRS and tRNA^^ were provided by Professor Dieter Soil (Yale University) as described in L.-T. Guo et al , Polyspecific pyrrolysyl-tRNA synthetases from directed evolution, doi: 10.1073/pnas.14197371 1 1 hereby incorporated by reference in its entirety. The PyllFRS was retained on its original pCDF plasmid but the tRNA was cloned into the pZE21 vector for higher expression. The pCDF plasmid was maintained using spectinomycin and the pZE21 plasmid was maintained using kanamycin. The Pyl-ABK and tRNA' c'UA AARS/tRNA pair were provided by Dr. Peter Schultz (pDULE-ABK, Addgene plasmid #49086) as described in H. Ai, W. Shen, A. Sagi, P. R. Chen, P. G, Schultz, Probing Protei -Protein Interactions with a Genetically Encoded Photo-crosslinking Amino Acid. ChemBioChem . 12,
1854-1857 (2011) hereby incorporated by reference in its entirety. The antibiotic resistance marker on the pDULE plasmid was swapped for chloramphenicol using Gibson assembly. The ScWRS-R3-13 AARS was synthesized as codon-optimized for expression in E. coli and cloned into the pEVOL plasmid along with its associated tRNA as described in R. A. Hughes, A. D. Ellington, Rational design of an orthogonal tryptophanyl nonsense suppressor tRNA. Nucleic Acids Res. 38, 6813-6830 (2010) and J. W. Ellefson ei al, Directed evolution of genetic parts and circuits by compartmentalized partnered replication. Nat. BiotechnoL 32, 97-101 (2014) each of which is hereby incorporated by reference in its entirety. In all cases, tRNA is constitutively expressed and AARS expression is either arabinose inducible or constitutive.
An N-terminally truncated form of the ubpl gene from Saccharomyces cerevisiae as described in J, W. Tobias, A. Varshavsky, Cloning and functional analysis of the ubiquitin- specific protease gene UBP1 of Saccharomyces cerevisiae. J. Biol. Chem. 266, 12021-8 (1991) or A. Wojtowicz et ah. Expression of yeast deubiquitination enzyme UBP1 analogues in E, coli. Microh. Cell Fact 4, 1-12 (2005) each of which is hereby incorporated by reference in its entirety was synthesized as codon-optimized for expression in E. coli and cloned into the pZE21 vector (Kanamycin resistance, ColEl origin, TET promoter) (Expressys). The E. coli genes clpS and clpP were amplified from E. coli MG1655 and cloned into artificial operons downstream of the ubp l gene in the pZE21 vector using Gibson assembly. Artificial operons were created by inserting the following RBS sequence between the ubpl and clp genes: TAATAAAAGGAGATATACC. This RBS was originally designed using the RBS calculator (see H. M. Salis, E. A. Mirsky, C. A. Voigt, Automated design of synthetic ribosome binding sites to control protein expression. Nat Biotech, 27, 946-950 (2009)) and previously validated in the context of another artificial operon as described in A. M. Kunjapur, Y. Tarasova, K. L. J. Prather, Synthesis and Accumulation of Aromatic
Aldehydes in an Engineered Strain of Escherichia coli. J. Am. ( 'hem. Soc. 136, 11644-54 (2014) hereby incorporated by reference in its entirety. Rational engineering of ClpS variants was performed by dividing the clpS gene into two amplicons where the second amplicon contained a degenerate NTC or NTT sequence in the oligo corresponding to each codon of interest. The four initial positions of interest in the clpS gene correspond to amino acids 32, 43, 65, and 99. In each case, Gibson assembly was used to ligate both amplicons and the backbone plasmid. The pZE/Ubpl/ClpS and pZE Ubpl/ClpS V65I plasmids are available from Addgene. The ubpl/clpS_V65I operon was also placed under weak constitutive expression and integrated using Cionetegration as described in F. St-Pierre et al, One-step cloning and chromosomal integration of DNA. ACS Synth. Biol. 2, 537-541 (2013). This strain (C321. AA.Nendint) was used as the host for FACS experiments.
Culture Conditions
Cultures for general culturing used herein were grown in LB-Lennox medium (LBL: 10 g/L bacto tryptone, 5 g/L sodium chloride, 5 g/L yeast extract). Cultures for experiments in Figs 3A-3E were grown in 2X YT medium (2XYT: 16 g/L bacto tryptone, 10 g/L bacto yeast extract, 5 g/L sodium chloride) given improved observed final culture densities compared to LBL upon expression of ClpS variants. Unless otherwise indicated, ail cultures were grown in biological triplicate in 96-well deep-well plates in 300 μΐ, culture volumes at 34C and 400 rpm.
NSAA Incorporation Assays
Strains harboring integrated GFP reporters and AARS/tRNA plasmids were inoculated from frozen stocks of biological triplicate and grown to confluence overnight in deep well plates. Experimental cultures were inoculated at 1 : 100 dilution in either LBL or
2XYT media supplemented with chloramphenicol, arabinose, and the appropriate NSAA. Cultures were incubated at 34 °C to an ODeoo of 0.5-0.8 in a shaking plate incubator at 400 rpm (-4-5 h). GFP expression was induced by addition of anhydrotetracycline, and cells were incubated at 34 °C for an additional 16-20 h.
All assays were performed in 96-weli plate format. Ceils were centrifuged at 5,000g- for 3 min, washed with PBS, and resuspended in PBS after a second spin. GFP fluorescence was measured on a Biotek spectrophotometnc plate reader using excitation and emission wavelengths of 485 and 525 nm. Fluorescence signals were corrected for autofluorescence as a linear function of ODeoo using the parent C321.AA strain that does not contain a reporter. Fluorescence was then normalized by the ODeoo reading to obtain FL/OD.
Chemicals
NSAAs used in this study were purchased from PepTech Corporation, Sigma Aldrich, Santa Cruz Biotechnology, and Toronto Research Chemicals. The following NSAAs were purchased: L-4,4-Biphenylalanine (BipA), L-4-Benzoylphenylalanine (pBenzoylF), O-tert- Butyl-L-tyrosine (tButylY), L-2-Naphthylalanine (NapA), L-4-Acetyiphenyl alanine (pAcF), L-4-Iodophenyl alanine (piF), L-4-Bromophenylalanine (pBromoF), L-4-Chlorophenylalanine (pChloroF), L-4-Fluorophenylalanine (pFluoroF), L-4-Azidophenylalanine (pAzF), L-4- Nitrophenylalanine, L-4-Cyanophenylalanine, L-3-Iodophenylalanine, L-phenylalanine, L- tyrosine, L-tryptophan, D-phenylalanine, D-tyrosine, and 5-Hydroxytryptophan. Solutions of NSAAs (50 or 100 mM) were made in 10-50 niM NaOH.
Library Generation
Error-prone PGR (EP-PCR) is the method of choice for introducing random mutations into a defined segment of DNA that is too long to be chemically synthesized as a degenerate
sequence. EP-PCR was performed using the GeneMorph II Random Mutagenesis Kit (Stratagene Catalog #200550), following manufacturer instructions to obtain approximately an average of 2-4 DNA mutations per library member. To generate libraries of MjTyrRS- derived AARSs, roughly 175 ng of PGR template was used in each 25 uL. of PCR mix containing primers that have roughly 40 base pairs of homology flanking the AARS coding region. The reaction mixture was subject to 30 cycles with Tm of 63°C and extension time of I min. Four separate 25 uL EP-PCR reactions were performed per AARS and then pooled. Plasmid backbone PCRs were performed using KOD Xtreme Hot Start Polymerase (Miliipore Catalog #71795). Both PCR products were isolated by 1.5% agarose gel electrophoresis and Gibson assembled in 8 parallel 20 uL volumes per library. Assemblies were pooled, washed by ethanol precipitation, and resuspended in 50 μΤ, of dEbO, which was drop dialyzed (EMD Miliipore, Billerica, MA) and electroporated into E. cloni supreme cells (Lucigen, Middleton, WI). Libraries were expanded in culture and miniprepped (Qiagen, Valencia, CA) to roughly 100 ng/ul aliquots. 1 μg of library was drop dialyzed and electroporated into C321.AA.Nendint for subsequent FACS experiments. Colony counts on appropriate antibiotic containing plates within one doubling time after transformation revealed library sizes of roughly 1 x 106 for AARS libraries in Ecloni hosts and 1 x 107 in C321.AA.Nendint hosts.
Flow Cytometry and Cell Sorting
AARS libraries were subject to three rounds of fluorescence activated sorting in a Beckman Coulter MoFlo Astrios. Prior to each round, the usual NSAA incorporation assay procedure was followed such that cells would express GFP reporter proportional to the activity of the AARS library member. One notable deviation from that procedure was the use of a higher and variable inoculum volume to screen the full library at each stage. Cells
displaying the top 0.5% of fluorescence activation (50k cells) were collected after Round 1, expanded overnight, and used to inoculate experimental cultures for the next round. Because the next round was a negative screening round, the desired NSAA was not added into culture medium. The rest of the NSAA incorporation assay procedure was followed in order to eliminate cells that remained fluorescence due to promiscuous AARS activity on standard amino acids. In the second sort, cells displaying the lowest 10% of visible fluorescence (500k cells) were collected. Cells passing the second round were expanded overnight and used to inoculate the third and final round of sorting. The experimental cultures for the third round were treated as the first round and were sorted for the upper 0.05%> of fluorescence activation (Ik cells). The final ceils collected were expanded overnight and plated for sequencing and downstream testing. Libraries were frozen at each stage before and after sorting. FlowJo X software was used to analyze the flow cytometry data. Constructs of interest were grown overnight, miniprepped, and transformed into C321.AA.Ubiq-UAG-sfGFP for further analysis in plate reader assays.
EXAMPLE VI EMBODIMENTS
The present disclosure provides a method of making a target polypeptide in a cell, wherein the target polypeptide includes a non-standard amino acid (NSAA) substitution at an amino acid target location, including genetically modifying the cell to express the target polypeptide including a non-standard amino acid substitution at an amino acid target location using an engineered amino-acyl tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid, and wherein the cell expresses the target polypeptide including a standard amino acid or undesired NSAA at the amino acid target location when the engineered amino-acyl tRNA synthetase and transfer RNA pair non-selectively adds the
standard amino acid or undesired NSAA at the amino acid target location, wherein a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location. According to one aspect, the removable protecting group is a cleavable protecting group that is orthogonal within the ceil. According to one aspect, the removable protecting group is an enzyme cleavable protecting group. According to one aspect, the removable protecting group is a protein that is cleavable by a corresponding enzyme. According to one aspect, the removable protecting group is ubiquitin that is cleavable by Ubpl . According to one aspect, the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a nonstandard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location , According to one aspect a detectable moiety is attached to the C-end of the target polypeptide. According to one aspect, a detectable moiety is attached to the C-end of the target polypeptide, wherein the detectable moiety is a fluorescent moiety. According to one aspect, a detectable moiety is attached to the C-end of the target polypeptide, wherein the detectable moiety is a reporter protein. According to one aspect, the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide, wherein the nonstandard amino acid is encoded by a corresponding nonsense or sense codon. According to one aspect, the cell is genetically modified to include a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer RNA corresponding to the nonstandard amino acid and wherein the nonstandard amino acid is provided to the cell and
the cell expresses the synthetase and the transfer RNA to include the nonstandard amino acid at the amino acid target location. According to one aspect, the ceil is genetically modified to include a foreign nucleic acid sequence encoding an enzyme for cleaving the removable protecting group under influence of a constitutive or an inducible promoter. According to one aspect, the cell includes an adapter protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA. According to one aspect, the cell includes an adapter protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA, wherein the adapter protein is under influence of a constitutive or an inducible promoter. According to one aspect, the method further includes the ceil expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid. According to one aspect, the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the cell expressing a protease wherein the protease degrades the target polypeptide when the N- end amino acid is a standard amino acid or an undesired NSAA. According to one aspect, the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the ceil expressing a protease wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a non-standard amino acid substitution within the cell . According to one aspect, the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N- end amino acid, and the cell expressing an adapter protein that coordinates with a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA. According to one aspect, the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end
amino acid, and the cell expressing an adapter protein that coordinates with a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA, and wherein the adapter protein is under influence of an inducible promoter. According to one aspect, the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the ceil expressing a CipS-ClpAP protease system wherein the ClpS-ClpAP protease system degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the cell. According to one aspect,, the method further includes the cell expressing an enzyme that cleaves the removable protecting group to generate an N- end amino acid, and the cell expressing a ClpS-ClpAP protease system wherein the ClpS- ClpAP protease system degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the cell, and wherein the ClpS protein is a natural homolog or a ClpS V65I mutant. According to one aspect, the method further includes a detectable moiety is attached to the C-end of the target polypeptide and further including the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the cell expressing an adapter protein for a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the ceil, and detecting the detectable moiety as a measure of the amount of the target polypeptide including a non-standard amino acid substitution within the cell. According to one aspect, the cell is a prokaryotic cell or a eukaryotic cell. According to one aspect, the cell is a microorganism such as a bacterium.
According to one aspect, the cell is E. coli. According to one aspect, the cell is a genetically modified E. coli.
The disclosure provides a method of designing an amino acyl tRNA synthetase variant for preferential selection of a desired non-standard amino acid against its standard amino acid counterpart or undesired NSAAs for incorporation into a protein in a cell including genetically modifying the ceil to express the target polypeptide including a nonstandard amino acid substitution at an amino acid target location using an engineered amino- acyl tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid or undesired NSAA, and wherein the cell expresses the target polypeptide including a standard amino acid or undesired NSAA at the amino acid target location when the engineered amino- acyl tRNA synthetase and transfer RNA pair non-selectively adds the standard amino acid or undesired NSAA at the amino acid target location, wherein a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location, and wherein a detectable moiety is attached to the C-end of the target polypeptide, wherein the cell is genetically modified to include a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer RNA corresponding to the nonstandard amino acid and wherein the nonstandard amino acid is provided to the ceil and the cell expresses the synthetase and the transfer RNA to include the nonstandard amino acid at the amino acid target location, the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and the ceil expressing an adapter protein for a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the cell, detecting the detectable moiety as a measure of the amount of the target polypeptide including a non-
standard amino acid substitution within the cell, and repeatedly testing a modified synthetase in the genetically modified cell for improved production of the target polypeptide including a non-standard amino acid substitution.
The disclosure provides an engineered cell including a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
The disclosure provides an engineered cell including a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide.
The disclosure provides an engineered cell including (a) a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide; (b) a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer RNA corresponding to the nonstandard amino acid; (c) an adapter protein for a protease for degrading the target polypeptide having a standard amino acid or undesired NSAA as the N-end amino acid, wherein the adapter protein is under influence of a constitutive promoter or an inducible promoter.
The disclosure provides a nucleic acid construct encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
The disclosure provides a nucleic acid construct encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-eml of the target polypeptide.
The disclosure provides a nucleic acid construct encoding adapter protein Clps or mutants or variants thereof.
The disclosure provides a nucleic acid construct encoding ClpS_V65I mutant.
The above disclosure generally describes the present invention. All references disclosed herein are expressly incorporated by reference. A more complete understanding can be obtained by reference to the following specific examples which are provided herein for purposes of illustration only, and are not intended to limit the scope of the invention.
Claims
1. A method of making a target polypeptide in a cell, wherein the target polypeptide includes a non-standard amino acid (NSAA) substitution at an amino acid target location, comprising
genetically modifying the cell to express the target polypeptide including a nonstandard amino acid substitution at an amino acid target location using an engineered amino- acyl tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid, and wherein the cell expresses the target polypeptide including a standard amino acid or undesired NSAA at the amino acid target location when the engineered amino-acyl tRNA synthetase and transfer RNA pair non-selectively adds the standard amino acid or undesired NSAA at the amino acid target location,
wherein a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location.
2. The method of claim 1 wherein the removable protecting group is a cleavable protecting group that is orthogonal within the cell.
3. The method of claim 1 wherein the removable protecting group is an enzyme cleavable protecting group.
4. The method of claim 1 wherein the removable protecting group is a protein that is cleavable by a corresponding enzyme.
5. The method of claim 1 wherein the removable protecting group is ubiquitin that is cleavable by Ubpl .
6. The method of claim 1 wherein the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
7. The method of claim 1 wherein a detectable moiety is attached to the C-end of the target polypeptide.
8. The method of claim 1 wherein a detectable moiety is attached to the C-end of the target polypeptide, wherein the detectable moiety is a fluorescent moiety.
9. The method of claim 1 wherein a detectable moiety is attached to the C-end of the target polypeptide, wherein the detectable moiety is a reporter protein.
10. The method of claim 1 wherein the cell is genetically modified to include a foreign nucleic acid sequence encoding the target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide, wherein the nonstandard amino acid is encoded by a corresponding nonsense or sense codon.
1 1. The method of claim 1 wherein the cell is genetically modified to include a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer RNA corresponding to the nonstandard amino acid and wherein the nonstandard amino acid is provided to the cell and the cell expresses the synthetase and the transfer RNA to include the nonstandard amino acid at the amino acid target location.
12. The method of claim 1 wherein the cell is genetically modified to include a foreign nucleic acid sequence encoding an enzyme for cleaving the removable protecting group under influence of a constitutive or an inducible promoter.
13. The method of claim 1 wherein the cell includes an adapter protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA.
14. The method of claim 1 wherein the cell includes an adapter protein that coordinates with a protease for degrading the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA, wherein the adapter protein is under influence of a constitutive or an inducible promoter.
15. The method of claim 1 further comprising
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid.
16. The method of claim 1 further comprising
the ceil expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing a protease wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA.
17. The method of claim 1 further comprising
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing a protease wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a non-standard amino acid substitution within the cell.
18. The method of claim 1 further comprising
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing an adapter protein that coordinates with a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA.
19. The method of claim 1 further comprising
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing an adapter protein that coordinates with a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid
or an undesired NSAA, and wherein the adapter protein is under influence of an inducible promoter.
20. The method of claim 1 further comprising
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing a ClpS-ClpAP protease system wherein the ClpS-ClpAP protease system degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired nonstandard amino acid substitution within the cell.
21 . The method of claim 1 further comprising
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing a ClpS-ClpAP protease system wherein the ClpS-ClpAP protease system degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby enrich the target polypeptide including a desired nonstandard amino acid substitution within the cell, and wherein the ClpS protein is a natural homolog or a ClpS V65I mutant.
22. The method of claim 1 wherein a detectable moiety is attached to the C-end of the target polypeptide and further comprising
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing an adapter protein for a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or an undesired NSAA to thereby emich the target polypeptide including a desired non-standard amino acid substitution within the cell, and
detecting the detectable moiety as a measure of the amount of the target polypeptide including a non-standard amino acid substitution within the cell.
23. The method of claim 1 wherein the cell is a prokaryotic cell or a eukaryotic cell.
24. The method of claim 1 wherein the cell is a bacterium.
25. The method of claim 1 wherein the cell is a genetically modified E. coli.
26. A method of designing an amino acyl tRNA synthetase variant for preferential selection of a desired non-standard amino acid against its standard amino acid counterpart or undesired NSAAs for incorporation into a protein in a cell comprising
genetically modifying the cell to express the target polypeptide including a nonstandard amino acid substitution at an amino acid target location using an engineered amino- acyl tRNA synthetase and transfer RNA pair corresponding to the non-standard amino acid or undesired NSAA, and wherein the cell expresses the target polypeptide including a standard amino acid or undesired NSAA at the amino acid target location when the engineered amino- acyl tRNA synthetase and transfer RNA pair non-selectively adds the standard amino acid or undesired NSAA at the amino acid target location,
wherein a removable protecting group is attached to the target polypeptide adjacent to the amino acid target location, such that when the removable protecting group is removed, an N-end amino acid is exposed at the amino acid target location, and wherein a detectable moiety is attached to the C-end of the target polypeptide,
wherein the cell is genetically modified to include a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer RNA corresponding to the nonstandard amino acid and wherein the nonstandard amino acid is provided to the cell and the cell expresses the synthetase and the transfer RNA to include the nonstandard amino acid at the amino acid target location,
the cell expressing an enzyme that cleaves the removable protecting group to generate an N-end amino acid, and
the cell expressing an adapter protein for a protease, wherein the protease degrades the target polypeptide when the N-end amino acid is a standard amino acid or undesired NSAA to thereby enrich the target polypeptide including a desired non-standard amino acid substitution within the cell,
detecting the detectable moiety as a measure of the amount of the target polypeptide including a non-standard amino acid substitution within the cell, and
repeatedly testing a modified synthetase in the genetically modified cell for improved production of the target polypeptide including a non-standard amino acid substitution.
27. An engineered cell including a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
28. An engineered cell including a foreign nucleic acid sequence encoding a target polypeptide including a non-standard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide.
29. An engineered cell including
(a) a foreign nucleic acid sequence encoding a target polypeptide including a nonstandard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide;
(b) a foreign nucleic acid sequence encoding an amino-acyl tRNA synthetase and a transfer R A corresponding to the nonstandard amino acid;
(c) an adapter protein for a protease for degrading the target polypeptide having a standard amino acid or undesired NSAA as the N-end amino acid, wherein the adapter protein is under influence of a constitutive promoter or an inducible promoter,
30. A nucleic acid construct encoding a target polypeptide including a nonstandard amino acid substitution at an amino acid target location and a removable protecting group attached to the target polypeptide adjacent to the amino acid target location.
31. A nucleic acid construct encoding a target polypeptide including a nonstandard amino acid substitution at an amino acid target location, a removable protecting group attached to the target polypeptide adjacent to the amino acid target location and a detectable moiety attached to the C-end of the target polypeptide.
32. A nucleic acid construct encoding adapter protein Clps or mutants or variants thereof,
33. A nucleic acid construct encoding ClpS_V65I mutant.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762457353P | 2017-02-10 | 2017-02-10 | |
US62/457,353 | 2017-02-10 | ||
US201762526671P | 2017-06-29 | 2017-06-29 | |
US62/526,671 | 2017-06-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018148516A1 true WO2018148516A1 (en) | 2018-08-16 |
Family
ID=63107847
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2018/017581 WO2018148516A1 (en) | 2017-02-10 | 2018-02-09 | Methods of making proteins with non-standard amino acids |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2018148516A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019006260A1 (en) * | 2017-06-29 | 2019-01-03 | President And Fellows Of Harvard College | Methods of making proteins with non-standard amino acids |
CN111518735A (en) * | 2019-02-02 | 2020-08-11 | 江南大学 | A kind of Corynebacterium glutamicum recombinant bacteria, preparation method and application |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000071723A2 (en) * | 1999-05-21 | 2000-11-30 | Roche Diagnostics Gmbh | Methods for regulating protein conformation using molecular chaperones |
US20050287639A1 (en) * | 2004-05-17 | 2005-12-29 | California Institute Of Technology | Methods of incorporating amino acid analogs into proteins |
US20080090999A1 (en) * | 2004-08-31 | 2008-04-17 | Goldman Barry S | Nucleotide and amino acid sequences from xenorhabdus and uses thereof |
US20080153130A1 (en) * | 2005-01-10 | 2008-06-26 | Andrzej Plucienniczak | Ubp1 Protease Mutant and Its Coding Sequence, Their Applications and Heterogonous Protein Expression System |
-
2018
- 2018-02-09 WO PCT/US2018/017581 patent/WO2018148516A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000071723A2 (en) * | 1999-05-21 | 2000-11-30 | Roche Diagnostics Gmbh | Methods for regulating protein conformation using molecular chaperones |
US20050287639A1 (en) * | 2004-05-17 | 2005-12-29 | California Institute Of Technology | Methods of incorporating amino acid analogs into proteins |
US20080090999A1 (en) * | 2004-08-31 | 2008-04-17 | Goldman Barry S | Nucleotide and amino acid sequences from xenorhabdus and uses thereof |
US20080153130A1 (en) * | 2005-01-10 | 2008-06-26 | Andrzej Plucienniczak | Ubp1 Protease Mutant and Its Coding Sequence, Their Applications and Heterogonous Protein Expression System |
Non-Patent Citations (4)
Title |
---|
CRACICT, C ET AL.: "Aminoacyl-transferases and the N-end rule pathway of prokaryotic/eukaryotic specificity in a human pathogen", PROCEEDINGS OF THE NATIONAL ACADEMY O SCIENCE, vol. 103, no. 9, 28 February 2006 (2006-02-28), pages 3078 - 3083, XP055533734 * |
KUNJAPUR, AM ET AL.: "Engineering post-translational proofreading to discriminate non-standard amino acids", BIORXIV, 30 June 2017 (2017-06-30), pages 1 - 11, XP055533753 * |
KUNJAPUR, AM ET AL.: "Engineering post-translational proofreading to discriminate non-standard amino acids", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE USA, vol. 115, no. 3, 16 January 2018 (2018-01-16), pages 619 - 624, XP055533758 * |
SUNDAR, S ET AL.: "Multiple sequence signals direct recognition and degradation of protein substrates by the AAA+ protease HsIUV", JOURNAL OF MOLECULAR BIOLOGY, vol. 403, no. 3, 29 October 2010 (2010-10-29), pages 420 - 429, XP055533742 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019006260A1 (en) * | 2017-06-29 | 2019-01-03 | President And Fellows Of Harvard College | Methods of making proteins with non-standard amino acids |
US11649450B2 (en) | 2017-06-29 | 2023-05-16 | President And Fellows Of Harvard College | Methods of making proteins with non-standard amino acids |
CN111518735A (en) * | 2019-02-02 | 2020-08-11 | 江南大学 | A kind of Corynebacterium glutamicum recombinant bacteria, preparation method and application |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230133277A1 (en) | S. pyogenes cas9 mutant genes and polypeptides encoded by same | |
US20240309402A1 (en) | S. pyogenes cas9 mutant genes and polypeptides encoded by same | |
EP3473728B1 (en) | Method for screening targeted genetic scissors by using multiple target system of on-target and off-target activity and use thereof | |
AU2018289077B2 (en) | Nucleic acid-guided nucleases | |
EP3269809B1 (en) | MODIFIED AMINOACYL-tRNA SYNTHETASE AND USE THEREOF | |
Yanagida et al. | The evolutionary potential of phenotypic mutations | |
EP3487998B1 (en) | Compositions and methods for identifying rna binding polypeptide targets | |
EP3822349B1 (en) | Dna polymerase mutant suited to nucleic acid amplification from rna | |
CN113490748A (en) | Cell-free protein synthesis platform derived from Clostridium extract | |
Dohmen et al. | Heat‐inducible degron and the making of conditional mutants | |
US20190390205A1 (en) | Incorporation of internal polya-encoded poly-lysine sequence tags and their variations for the tunable control of protein synthesis in bacterial and eukaryotic cells | |
US20240279629A1 (en) | Crispr-transposon systems for dna modification | |
Sorrells et al. | Intrinsic cooperativity potentiates parallel cis-regulatory evolution | |
Wencker et al. | Another layer of complexity in Staphylococcus aureus methionine biosynthesis control: unusual RNase III-driven T-box riboswitch cleavage determines met operon mRNA stability and decay | |
WO2018148516A1 (en) | Methods of making proteins with non-standard amino acids | |
WO2007094620A1 (en) | Preparation of cell extract and its application for cell-free protein systhesis | |
WO2019005973A1 (en) | Synthetase variants for incorporation of biphenylalanine into a peptide | |
US11649450B2 (en) | Methods of making proteins with non-standard amino acids | |
US20240158829A1 (en) | Methods for biomolecule analysis employing multi-component detection agent and related kits | |
JP2020531572A (en) | TALE RVD that specifically recognizes DNA bases modified by methylation and its use | |
Collins et al. | Variation in ubiquitin system genes creates substrate-specific effects on proteasomal protein degradation | |
WO2023069816A2 (en) | Compositions and methods for multiplex decoding of quadruplet codons | |
JP4808361B2 (en) | New DNA synthase | |
US20220259606A1 (en) | Methods of modulating nucleic acid stability and protein expression | |
US20230002758A1 (en) | Tethered ribosomes and methods of making and using thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18750844 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18750844 Country of ref document: EP Kind code of ref document: A1 |