CA2332380C - Process and materials for production of glucosamine - Google Patents
Process and materials for production of glucosamine Download PDFInfo
- Publication number
- CA2332380C CA2332380C CA2332380A CA2332380A CA2332380C CA 2332380 C CA2332380 C CA 2332380C CA 2332380 A CA2332380 A CA 2332380A CA 2332380 A CA2332380 A CA 2332380A CA 2332380 C CA2332380 C CA 2332380C
- Authority
- CA
- Canada
- Prior art keywords
- glucosamine
- leu
- ala
- ile
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- MSWZFWKMSRAUBD-UHFFFAOYSA-N beta-D-galactosamine Natural products NC1C(O)OC(CO)C(O)C1O MSWZFWKMSRAUBD-UHFFFAOYSA-N 0.000 title claims abstract description 247
- 229960002442 glucosamine Drugs 0.000 title claims abstract description 247
- MSWZFWKMSRAUBD-IVMDWMLBSA-N 2-amino-2-deoxy-D-glucopyranose Chemical compound N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O MSWZFWKMSRAUBD-IVMDWMLBSA-N 0.000 title claims abstract description 244
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 105
- 238000000034 method Methods 0.000 title claims abstract description 94
- 230000008569 process Effects 0.000 title claims description 11
- 239000000463 material Substances 0.000 title abstract description 6
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 271
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 237
- 244000005700 microbiome Species 0.000 claims abstract description 234
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 193
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 193
- 238000000855 fermentation Methods 0.000 claims abstract description 93
- 230000004151 fermentation Effects 0.000 claims abstract description 93
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 82
- 108090001031 Glutamine-fructose-6-phosphate transaminase (isomerizing) Proteins 0.000 claims description 151
- 102000004894 Glutamine-fructose-6-phosphate transaminase (isomerizing) Human genes 0.000 claims description 149
- XHMJOUIAFHJHBW-VFUOTHLCSA-N glucosamine 6-phosphate Chemical class N[C@H]1[C@H](O)O[C@H](COP(O)(O)=O)[C@H](O)[C@@H]1O XHMJOUIAFHJHBW-VFUOTHLCSA-N 0.000 claims description 118
- 235000018102 proteins Nutrition 0.000 claims description 80
- 102000004190 Enzymes Human genes 0.000 claims description 79
- 108090000790 Enzymes Proteins 0.000 claims description 79
- 238000012239 gene modification Methods 0.000 claims description 77
- 230000005017 genetic modification Effects 0.000 claims description 77
- 235000013617 genetically modified food Nutrition 0.000 claims description 77
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 72
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 71
- 239000008103 glucose Substances 0.000 claims description 71
- 230000000694 effects Effects 0.000 claims description 58
- 230000004048 modification Effects 0.000 claims description 53
- 238000012986 modification Methods 0.000 claims description 53
- 241000588724 Escherichia coli Species 0.000 claims description 48
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 47
- 229910019142 PO4 Inorganic materials 0.000 claims description 39
- 239000010452 phosphate Substances 0.000 claims description 39
- 238000006467 substitution reaction Methods 0.000 claims description 39
- 238000012217 deletion Methods 0.000 claims description 35
- 230000037430 deletion Effects 0.000 claims description 35
- 230000014509 gene expression Effects 0.000 claims description 34
- 230000001965 increasing effect Effects 0.000 claims description 34
- 125000000539 amino acid group Chemical group 0.000 claims description 33
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 claims description 30
- 229910052799 carbon Inorganic materials 0.000 claims description 30
- 230000005764 inhibitory process Effects 0.000 claims description 30
- 229950006780 n-acetylglucosamine Drugs 0.000 claims description 30
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 claims description 29
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 28
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 claims description 28
- 230000015572 biosynthetic process Effects 0.000 claims description 27
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 claims description 25
- 238000012258 culturing Methods 0.000 claims description 21
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 claims description 20
- 229910052921 ammonium sulfate Inorganic materials 0.000 claims description 20
- 230000002829 reductive effect Effects 0.000 claims description 19
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 claims description 18
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 claims description 18
- 235000011130 ammonium sulphate Nutrition 0.000 claims description 18
- 230000003247 decreasing effect Effects 0.000 claims description 18
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 claims description 17
- 108010071189 phosphoenolpyruvate-glucose phosphotransferase Proteins 0.000 claims description 17
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 claims description 16
- 230000003834 intracellular effect Effects 0.000 claims description 15
- 102100041034 Glucosamine-6-phosphate isomerase 1 Human genes 0.000 claims description 13
- 230000007423 decrease Effects 0.000 claims description 13
- 108010022717 glucosamine-6-phosphate isomerase Proteins 0.000 claims description 13
- 108010032867 phosphoglucosamine mutase Proteins 0.000 claims description 13
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 claims description 12
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 claims description 12
- 241000894006 Bacteria Species 0.000 claims description 11
- 108010069483 N-acetylglucosamine-6-phosphate deacetylase Proteins 0.000 claims description 11
- 230000001105 regulatory effect Effects 0.000 claims description 11
- 108010082433 UDP-glucose-hexose-1-phosphate uridylyltransferase Proteins 0.000 claims description 10
- 125000001931 aliphatic group Chemical group 0.000 claims description 10
- -1 manXYZ Proteins 0.000 claims description 10
- 229910052757 nitrogen Inorganic materials 0.000 claims description 10
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 9
- 229910000402 monopotassium phosphate Inorganic materials 0.000 claims description 9
- 101150008111 nagA gene Proteins 0.000 claims description 9
- 101150027065 nagE gene Proteins 0.000 claims description 9
- 101100450592 Dictyostelium discoideum hexa1 gene Proteins 0.000 claims description 8
- 239000007836 KH2PO4 Substances 0.000 claims description 8
- 229910052943 magnesium sulfate Inorganic materials 0.000 claims description 8
- 235000019341 magnesium sulphate Nutrition 0.000 claims description 8
- 235000019796 monopotassium phosphate Nutrition 0.000 claims description 8
- 101150070589 nagB gene Proteins 0.000 claims description 8
- 101150006297 nagC gene Proteins 0.000 claims description 8
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 claims description 8
- 102000001105 Phosphofructokinases Human genes 0.000 claims description 7
- 108010069341 Phosphofructokinases Proteins 0.000 claims description 7
- 101150061429 nagD gene Proteins 0.000 claims description 7
- YMJBYRVFGYXULK-QZABAPFNSA-N alpha-D-glucosamine 1-phosphate Chemical compound N[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(O)=O YMJBYRVFGYXULK-QZABAPFNSA-N 0.000 claims description 6
- 230000009466 transformation Effects 0.000 claims description 6
- 108091026890 Coding region Proteins 0.000 claims description 5
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 claims description 5
- 235000019797 dipotassium phosphate Nutrition 0.000 claims description 5
- 229910000396 dipotassium phosphate Inorganic materials 0.000 claims description 5
- 101150100557 pfkB gene Proteins 0.000 claims description 5
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 claims description 4
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 claims description 4
- 239000001110 calcium chloride Substances 0.000 claims description 4
- 235000011148 calcium chloride Nutrition 0.000 claims description 4
- 229910001628 calcium chloride Inorganic materials 0.000 claims description 4
- XLYOFNOQVPJJNP-ZSJDYOACSA-N heavy water Substances [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 claims description 4
- 229910052717 sulfur Inorganic materials 0.000 claims description 4
- 239000011593 sulfur Substances 0.000 claims description 4
- 230000001131 transforming effect Effects 0.000 claims description 4
- 101100190555 Dictyostelium discoideum pkgB gene Proteins 0.000 claims description 3
- 101100453320 Pyrococcus furiosus (strain ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) pfkC gene Proteins 0.000 claims description 3
- 101100029403 Synechocystis sp. (strain PCC 6803 / Kazusa) pfkA2 gene Proteins 0.000 claims description 3
- 101150073660 glmM gene Proteins 0.000 claims description 3
- 101150111330 glmU gene Proteins 0.000 claims description 3
- 230000006698 induction Effects 0.000 claims description 3
- 101150038284 pfkA gene Proteins 0.000 claims description 3
- 101150004013 pfkA1 gene Proteins 0.000 claims description 3
- 101150060387 pfp gene Proteins 0.000 claims description 3
- 238000012216 screening Methods 0.000 claims description 3
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 claims 6
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 claims 4
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 claims 4
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 claims 4
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 claims 4
- 102100031317 Alpha-N-acetylgalactosaminidase Human genes 0.000 claims 2
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 claims 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims 2
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 claims 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 claims 1
- 210000004027 cell Anatomy 0.000 description 97
- 239000000047 product Substances 0.000 description 85
- 239000002609 medium Substances 0.000 description 84
- 239000013612 plasmid Substances 0.000 description 73
- 108020004414 DNA Proteins 0.000 description 72
- 241000282326 Felis catus Species 0.000 description 69
- 230000035772 mutation Effects 0.000 description 66
- 150000002337 glycosamines Chemical class 0.000 description 46
- 230000009471 action Effects 0.000 description 43
- 241000588722 Escherichia Species 0.000 description 42
- 101150117187 glmS gene Proteins 0.000 description 37
- 239000002773 nucleotide Substances 0.000 description 36
- 125000003729 nucleotide group Chemical group 0.000 description 36
- 230000037361 pathway Effects 0.000 description 33
- 230000012010 growth Effects 0.000 description 32
- 108010047495 alanylglycine Proteins 0.000 description 29
- 235000001014 amino acid Nutrition 0.000 description 29
- 239000013615 primer Substances 0.000 description 29
- 229940024606 amino acid Drugs 0.000 description 27
- 150000001413 amino acids Chemical class 0.000 description 27
- 230000037353 metabolic pathway Effects 0.000 description 27
- 108091000080 Phosphotransferase Proteins 0.000 description 26
- 102000020233 phosphotransferase Human genes 0.000 description 26
- 101150066555 lacZ gene Proteins 0.000 description 25
- 108010068380 arginylarginine Proteins 0.000 description 24
- 230000010354 integration Effects 0.000 description 24
- 238000003752 polymerase chain reaction Methods 0.000 description 24
- 238000006243 chemical reaction Methods 0.000 description 22
- 230000032258 transport Effects 0.000 description 21
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 20
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 19
- 238000003556 assay Methods 0.000 description 19
- 210000000349 chromosome Anatomy 0.000 description 19
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 18
- 230000006870 function Effects 0.000 description 18
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 18
- GSXOAOHZAIYLCY-UHFFFAOYSA-N D-F6P Natural products OCC(=O)C(O)C(O)C(O)COP(O)(O)=O GSXOAOHZAIYLCY-UHFFFAOYSA-N 0.000 description 17
- BGWGXPAPYGQALX-ARQDHWQXSA-N beta-D-fructofuranose 6-phosphate Chemical compound OC[C@@]1(O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BGWGXPAPYGQALX-ARQDHWQXSA-N 0.000 description 17
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 16
- 239000000758 substrate Substances 0.000 description 16
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 15
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 15
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 15
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 14
- 230000000295 complement effect Effects 0.000 description 14
- 239000000284 extract Substances 0.000 description 14
- 101150023497 mcrA gene Proteins 0.000 description 14
- 101150079876 mcrB gene Proteins 0.000 description 14
- 239000000203 mixture Substances 0.000 description 14
- 239000013598 vector Substances 0.000 description 14
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 13
- 230000015556 catabolic process Effects 0.000 description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 13
- 238000009396 hybridization Methods 0.000 description 13
- 108010009298 lysylglutamic acid Proteins 0.000 description 13
- 238000010361 transduction Methods 0.000 description 13
- 230000026683 transduction Effects 0.000 description 13
- 229920001817 Agar Polymers 0.000 description 12
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 12
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 12
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 12
- 239000004098 Tetracycline Substances 0.000 description 12
- 230000035508 accumulation Effects 0.000 description 12
- 238000009825 accumulation Methods 0.000 description 12
- 238000007792 addition Methods 0.000 description 12
- 239000008272 agar Substances 0.000 description 12
- 108010070944 alanylhistidine Proteins 0.000 description 12
- 230000003321 amplification Effects 0.000 description 12
- 108010087823 glycyltyrosine Proteins 0.000 description 12
- 108010085325 histidylproline Proteins 0.000 description 12
- 238000003199 nucleic acid amplification method Methods 0.000 description 12
- 229960002180 tetracycline Drugs 0.000 description 12
- 229930101283 tetracycline Natural products 0.000 description 12
- 235000019364 tetracycline Nutrition 0.000 description 12
- 150000003522 tetracyclines Chemical class 0.000 description 12
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 11
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 11
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 11
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 11
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 11
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 11
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 241001515965 unidentified phage Species 0.000 description 11
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 10
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 10
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 10
- 108010044940 alanylglutamine Proteins 0.000 description 10
- 229960000723 ampicillin Drugs 0.000 description 10
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 10
- 230000001580 bacterial effect Effects 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 231100000350 mutagenesis Toxicity 0.000 description 10
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 9
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 9
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 9
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 9
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 9
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 9
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 9
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 9
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 9
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 9
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 9
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 9
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 9
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 9
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 9
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 9
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 9
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 9
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 9
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 9
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 9
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 9
- PRCHKVGXZVTALR-KKUMJFAQSA-N Lys-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N PRCHKVGXZVTALR-KKUMJFAQSA-N 0.000 description 9
- AVTWKENDGGUWDC-BQBZGAKWSA-N Met-Cys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O AVTWKENDGGUWDC-BQBZGAKWSA-N 0.000 description 9
- 102100031324 N-acetylglucosamine-6-phosphate deacetylase Human genes 0.000 description 9
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 9
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 9
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 9
- 108010079005 RDV peptide Proteins 0.000 description 9
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 9
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 9
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 9
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 9
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 9
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 9
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 9
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 9
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 9
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 9
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 9
- 150000001875 compounds Chemical class 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- 108010078144 glutaminyl-glycine Proteins 0.000 description 9
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 9
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 9
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 9
- 108010078274 isoleucylvaline Proteins 0.000 description 9
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 9
- 108010017391 lysylvaline Proteins 0.000 description 9
- 238000002703 mutagenesis Methods 0.000 description 9
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 8
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 8
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 8
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 8
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 8
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 8
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 8
- 101100246032 Corynebacterium glutamicum (strain ATCC 13032 / DSM 20300 / BCRC 11384 / JCM 1318 / LMG 3730 / NCIMB 10025) ptsM gene Proteins 0.000 description 8
- 101100138654 Escherichia coli (strain K12) manZ gene Proteins 0.000 description 8
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 8
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 8
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 8
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 8
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 8
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 8
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 8
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 8
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 8
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 8
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 8
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 8
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 8
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 8
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 8
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 8
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 8
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 8
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 8
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 8
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 8
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 8
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 8
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 8
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 8
- 230000004075 alteration Effects 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 8
- 108010054813 diprotin B Proteins 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 230000002068 genetic effect Effects 0.000 description 8
- 108010036413 histidylglycine Proteins 0.000 description 8
- 239000001301 oxygen Substances 0.000 description 8
- 229910052760 oxygen Inorganic materials 0.000 description 8
- 238000012163 sequencing technique Methods 0.000 description 8
- 238000011144 upstream manufacturing Methods 0.000 description 8
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 7
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 7
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 7
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 7
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 7
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 7
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 7
- WSEITRHJRVDTRX-QTKMDUPCSA-N His-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N)O WSEITRHJRVDTRX-QTKMDUPCSA-N 0.000 description 7
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 7
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 7
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 7
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 108700026244 Open Reading Frames Proteins 0.000 description 7
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 7
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 7
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 7
- YTVJTXJTNRWJCR-JBACZVJFSA-N Trp-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N YTVJTXJTNRWJCR-JBACZVJFSA-N 0.000 description 7
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 7
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 7
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 7
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 7
- 238000002835 absorbance Methods 0.000 description 7
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 238000006731 degradation reaction Methods 0.000 description 7
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 7
- 108010020688 glycylhistidine Proteins 0.000 description 7
- 230000002779 inactivation Effects 0.000 description 7
- 230000004060 metabolic process Effects 0.000 description 7
- 238000012546 transfer Methods 0.000 description 7
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 6
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 6
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 6
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 6
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 6
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 6
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 6
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 6
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 6
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 6
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 6
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 6
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 6
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 6
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 6
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 6
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 6
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 6
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 6
- 239000006137 Luria-Bertani broth Substances 0.000 description 6
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- 108020005038 Terminator Codon Proteins 0.000 description 6
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 6
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 6
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 6
- 238000005842 biochemical reaction Methods 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 125000001475 halogen functional group Chemical group 0.000 description 6
- 229920002521 macromolecule Polymers 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 108091008146 restriction endonucleases Proteins 0.000 description 6
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 5
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 5
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 5
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 5
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 5
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 5
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 5
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 5
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 5
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 5
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 5
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 5
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 5
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 5
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 5
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 5
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 5
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 5
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 5
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 5
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 5
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 5
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 5
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 5
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 5
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 5
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 5
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 5
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 5
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 5
- VISRCHQHQCLODA-NAKRPEOUSA-N Ile-Pro-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N VISRCHQHQCLODA-NAKRPEOUSA-N 0.000 description 5
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 5
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 5
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 5
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 5
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 5
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 5
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 5
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 5
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 5
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 5
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 5
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 5
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 5
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 5
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 5
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 5
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 5
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 5
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- 238000012408 PCR amplification Methods 0.000 description 5
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 5
- 101710148171 Pyrokinin-4 Proteins 0.000 description 5
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 5
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 5
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 5
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 5
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 5
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 5
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 5
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 5
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 229960005091 chloramphenicol Drugs 0.000 description 5
- 238000007796 conventional method Methods 0.000 description 5
- 230000034659 glycolysis Effects 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 108010037850 glycylvaline Proteins 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 239000006166 lysate Substances 0.000 description 5
- 108010005942 methionylglycine Proteins 0.000 description 5
- 238000012261 overproduction Methods 0.000 description 5
- 230000036961 partial effect Effects 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 4
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 4
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 4
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 4
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 4
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 4
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 4
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 4
- ITHMWNNUDPJJER-ULQDDVLXSA-N Arg-His-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ITHMWNNUDPJJER-ULQDDVLXSA-N 0.000 description 4
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 4
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 4
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 229930091371 Fructose Natural products 0.000 description 4
- 239000005715 Fructose Substances 0.000 description 4
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 4
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 4
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 4
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 4
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 4
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 4
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 4
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 4
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 4
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 4
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 4
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 4
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 4
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 4
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 4
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 4
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 4
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 4
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 4
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 4
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 4
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 4
- 108010013639 Peptidoglycan Proteins 0.000 description 4
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 4
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 4
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 4
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 4
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 4
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 4
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 4
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 4
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 4
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 4
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 4
- 101710137500 T7 RNA polymerase Proteins 0.000 description 4
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 4
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 4
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 4
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 4
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 4
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 4
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 4
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 4
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000000903 blocking effect Effects 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 4
- 239000012228 culture supernatant Substances 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 239000002243 precursor Substances 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 230000035945 sensitivity Effects 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- KPVQNXLUPNWQHM-RBEMOOQDSA-N 3-acetylpyridine adenine dinucleotide Chemical compound CC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 KPVQNXLUPNWQHM-RBEMOOQDSA-N 0.000 description 3
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 3
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 3
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 3
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 3
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 3
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 3
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 3
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 3
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 3
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 3
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 3
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 3
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 3
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 3
- VTMSUKSRIKCCAD-ULQDDVLXSA-N His-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N VTMSUKSRIKCCAD-ULQDDVLXSA-N 0.000 description 3
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 3
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 3
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 3
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 3
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 3
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 3
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 3
- 239000006142 Luria-Bertani Agar Substances 0.000 description 3
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 3
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 3
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 3
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 3
- CAEZLMGDJMEBKP-AVGNSLFASA-N Met-Pro-His Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC=N1 CAEZLMGDJMEBKP-AVGNSLFASA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 101100205180 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-6 gene Proteins 0.000 description 3
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 3
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 3
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 3
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 3
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 3
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 3
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 3
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 3
- WGHVMKFREWGCGR-SRVKXCTJSA-N Val-Arg-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WGHVMKFREWGCGR-SRVKXCTJSA-N 0.000 description 3
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 3
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 3
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 3
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 229940041514 candida albicans extract Drugs 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 239000000306 component Substances 0.000 description 3
- 239000000470 constituent Substances 0.000 description 3
- 238000002425 crystallisation Methods 0.000 description 3
- 230000008025 crystallization Effects 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- 230000002950 deficient Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 239000002158 endotoxin Substances 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- 150000004676 glycans Chemical class 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 230000001976 improved effect Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 229920006008 lipopolysaccharide Polymers 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 238000007747 plating Methods 0.000 description 3
- 229920001282 polysaccharide Polymers 0.000 description 3
- 239000005017 polysaccharide Substances 0.000 description 3
- 101150098466 rpsL gene Proteins 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000001509 sodium citrate Substances 0.000 description 3
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 230000002194 synthesizing effect Effects 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000002463 transducing effect Effects 0.000 description 3
- 239000012138 yeast extract Substances 0.000 description 3
- BGNGWHSBYQYVRX-UHFFFAOYSA-N 4-(dimethylamino)benzaldehyde Chemical compound CN(C)C1=CC=C(C=O)C=C1 BGNGWHSBYQYVRX-UHFFFAOYSA-N 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- 229920002101 Chitin Polymers 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- MYMOFIZGZYHOMD-UHFFFAOYSA-N Dioxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 2
- DGMPVYSXXIOGJY-UHFFFAOYSA-N Fusaric acid Chemical compound CCCCC1=CC=C(C(O)=O)N=C1 DGMPVYSXXIOGJY-UHFFFAOYSA-N 0.000 description 2
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 2
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 2
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 2
- BRGMHAYQAZFZDJ-PVFLNQBWSA-N N-Acetylglucosamine 6-phosphate Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](COP(O)(O)=O)[C@@H](O)[C@@H]1O BRGMHAYQAZFZDJ-PVFLNQBWSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- VOGXLRKCWFLJBY-HSHDSVGOSA-N Thr-Arg-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VOGXLRKCWFLJBY-HSHDSVGOSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- ABZWHLRQBSBPTO-RNXOBYDBSA-N Tyr-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ABZWHLRQBSBPTO-RNXOBYDBSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 2
- 238000005903 acid hydrolysis reaction Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 238000005273 aeration Methods 0.000 description 2
- 238000013019 agitation Methods 0.000 description 2
- FZHXIRIBWMQPQF-SLPGGIOYSA-N aldehydo-D-glucosamine Chemical compound O=C[C@H](N)[C@@H](O)[C@H](O)[C@H](O)CO FZHXIRIBWMQPQF-SLPGGIOYSA-N 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000008238 biochemical pathway Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 230000006652 catabolic pathway Effects 0.000 description 2
- 230000006037 cell lysis Effects 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- 230000001351 cycling effect Effects 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- 238000001212 derivatisation Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 229910001882 dioxygen Inorganic materials 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000001952 enzyme assay Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000004907 flux Effects 0.000 description 2
- 230000006377 glucose transport Effects 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 239000000411 inducer Substances 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 108091005573 modified proteins Proteins 0.000 description 2
- 102000035118 modified proteins Human genes 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000002417 nutraceutical Substances 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- 229920001542 oligosaccharide Polymers 0.000 description 2
- 150000002482 oligosaccharides Chemical class 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 239000001226 triphosphate Substances 0.000 description 2
- 235000011178 triphosphate Nutrition 0.000 description 2
- 239000012137 tryptone Substances 0.000 description 2
- 230000003827 upregulation Effects 0.000 description 2
- OGQYJDHTHFAPRN-UHFFFAOYSA-N 2-fluoro-6-(trifluoromethyl)benzonitrile Chemical compound FC1=CC=CC(C(F)(F)F)=C1C#N OGQYJDHTHFAPRN-UHFFFAOYSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- WKGJGVGTEZGFSW-FXQIFTODSA-N Asp-Asn-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O WKGJGVGTEZGFSW-FXQIFTODSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 229920001661 Chitosan Polymers 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- DIUBVGXMXONJCF-KKUMJFAQSA-N Cys-His-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DIUBVGXMXONJCF-KKUMJFAQSA-N 0.000 description 1
- NBSCHQHZLSJFNQ-GASJEMHNSA-N D-Glucose 6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@H]1O NBSCHQHZLSJFNQ-GASJEMHNSA-N 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 101100186363 Danio rerio nags gene Proteins 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000702191 Escherichia virus P1 Species 0.000 description 1
- 102000018711 Facilitative Glucose Transport Proteins Human genes 0.000 description 1
- VFRROHXSMXFLSN-UHFFFAOYSA-N Glc6P Natural products OP(=O)(O)OCC(O)C(O)C(O)C(O)C=O VFRROHXSMXFLSN-UHFFFAOYSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- 108091052347 Glucose transporter family Proteins 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- MUFXDFWAJSPHIQ-XDTLVQLUSA-N Ile-Tyr Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 MUFXDFWAJSPHIQ-XDTLVQLUSA-N 0.000 description 1
- PMAOIIWHZHAPBT-HJPIBITLSA-N Ile-Tyr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N PMAOIIWHZHAPBT-HJPIBITLSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 108010044467 Isoenzymes Proteins 0.000 description 1
- FFEARJCKVFRZRR-UHFFFAOYSA-N L-Methionine Natural products CSCCC(N)C(O)=O FFEARJCKVFRZRR-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- FZHXIRIBWMQPQF-VANKVMQKSA-N L-glucosamine Chemical compound O=C[C@@H](N)[C@H](O)[C@@H](O)[C@@H](O)CO FZHXIRIBWMQPQF-VANKVMQKSA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 229930195722 L-methionine Natural products 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- DUKURNFHYQXCJG-UHFFFAOYSA-N Lewis A pentasaccharide Natural products OC1C(O)C(O)C(C)OC1OC1C(OC2C(C(O)C(O)C(CO)O2)O)C(NC(C)=O)C(OC2C(C(OC3C(OC(O)C(O)C3O)CO)OC(CO)C2O)O)OC1CO DUKURNFHYQXCJG-UHFFFAOYSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 101100186371 Mus musculus Nags gene Proteins 0.000 description 1
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- VZUNGTLZRAYYDE-UHFFFAOYSA-N N-methyl-N'-nitro-N-nitrosoguanidine Chemical compound O=NN(C)C(=N)N[N+]([O-])=O VZUNGTLZRAYYDE-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- WDIJBEWLXLQQKD-ULQDDVLXSA-N Tyr-Arg-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O WDIJBEWLXLQQKD-ULQDDVLXSA-N 0.000 description 1
- ONWMQORSVZYVNH-UWVGGRQHSA-N Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ONWMQORSVZYVNH-UWVGGRQHSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- QPBJXNYYQTUTDD-KKUMJFAQSA-N Tyr-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QPBJXNYYQTUTDD-KKUMJFAQSA-N 0.000 description 1
- 108010061048 UDPacetylglucosamine pyrophosphorylase Proteins 0.000 description 1
- 241000218220 Ulmaceae Species 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 229960000583 acetic acid Drugs 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- RNBGYGVWRKECFJ-ZXXMMSQZSA-N alpha-D-fructofuranose 1,6-bisphosphate Chemical compound O[C@H]1[C@H](O)[C@](O)(COP(O)(O)=O)O[C@@H]1COP(O)(O)=O RNBGYGVWRKECFJ-ZXXMMSQZSA-N 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 235000015278 beef Nutrition 0.000 description 1
- 235000021028 berry Nutrition 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 229960003185 chlortetracycline hydrochloride Drugs 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 230000001332 colony forming effect Effects 0.000 description 1
- 238000007398 colorimetric assay Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000003413 degradative effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000003628 erosive effect Effects 0.000 description 1
- 229940023064 escherichia coli Drugs 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 229940025237 fructose 1,6-diphosphate Drugs 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000012362 glacial acetic acid Substances 0.000 description 1
- 101150024902 glnS gene Proteins 0.000 description 1
- 108010084034 glucosamine-1-phosphate acetyltransferase Proteins 0.000 description 1
- 239000006481 glucose medium Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- VKYKSIONXSXAKP-UHFFFAOYSA-N hexamethylenetetramine Chemical compound C1N(C2)CN3CN1CN2C3 VKYKSIONXSXAKP-UHFFFAOYSA-N 0.000 description 1
- 239000011261 inert gas Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000007154 intracellular accumulation Effects 0.000 description 1
- 230000037041 intracellular level Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 150000002597 lactoses Chemical class 0.000 description 1
- 231100001231 less toxic Toxicity 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000012533 medium component Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 238000012269 metabolic engineering Methods 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 229960004452 methionine Drugs 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 235000010755 mineral Nutrition 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 235000021436 nutraceutical agent Nutrition 0.000 description 1
- 239000006916 nutrient agar Substances 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 101150093139 ompT gene Proteins 0.000 description 1
- 230000003349 osteoarthritic effect Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000013823 prenylation Effects 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000009103 reabsorption Effects 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000001223 reverse osmosis Methods 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 238000012807 shake-flask culturing Methods 0.000 description 1
- 238000007086 side reaction Methods 0.000 description 1
- 235000021309 simple sugar Nutrition 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000006491 synthase reaction Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 229910021654 trace metal Inorganic materials 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- WUUHFRRPHJEEKV-UHFFFAOYSA-N tripotassium borate Chemical compound [K+].[K+].[K+].[O-]B([O-])[O-] WUUHFRRPHJEEKV-UHFFFAOYSA-N 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H5/00—Compounds containing saccharide radicals in which the hetero bonds to oxygen have been replaced by the same number of hetero bonds to halogen, nitrogen, sulfur, selenium, or tellurium
- C07H5/04—Compounds containing saccharide radicals in which the hetero bonds to oxygen have been replaced by the same number of hetero bonds to halogen, nitrogen, sulfur, selenium, or tellurium to nitrogen
- C07H5/06—Aminosugars
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H11/00—Compounds containing saccharide radicals esterified by inorganic acids; Metal salts thereof
- C07H11/04—Phosphates; Phosphites; Polyphosphates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1096—Transferases (2.) transferring nitrogenous groups (2.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y206/00—Transferases transferring nitrogenous groups (2.6)
- C12Y206/01—Transaminases (2.6.1)
- C12Y206/01016—Glutamine-fructose-6-phosphate transaminase (isomerizing) (2.6.1.16), i.e. glucosamine-6-phosphate-synthase
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Saccharide Compounds (AREA)
Abstract
The present invention relates to a method and materials for producing glucosamine by fermentation of a genetically modified microorganism. Included in the present invention are genetically modified microorganisms useful in the present method for producing glucosamine, as well as recombinant nucleic acid molecules and the proteins produced by such recombinant nucleic acid molecules.
Description
2 PCT/US99/15976 PROCESS AND MATERIALS FOR PRODUCTION OF GLUCOSAMINE
FIELD OF THE INVENTION
The present invention relates to a method for producing glucosamine by fermentation. The present invention also relates to genetically modified strains of microorganisms useful for producing glucosamine.
BACKGROUND OF THE INVENTION
Amino sugars are usually found as monomer residues in complex oligosaccharides and polysaccharides. Glucosamine is an amino derivative of the simple sugar, glucose.
Glucosamine and other amino sugars are important constituents of many natural polysaccharides. For example, polysaccharides containing amino sugars can form structural materials for cells, analogous to structural proteins.
Glucosamine is manufactured as a nutraceutical product with applications in the treatment of osteoarthritic conditions in animals and humans. The market for glucosamine is experiencing tremendous growth. Furthermore, significant erosion of the world market price for glucosamine is not expected.
Glucosamine is currently obtained by acid hydrolysis of chitin, a complex carbohydrate derived from N-acetyl-D-glucosamine. Alternatively, glucosamine can also be produced by acid hydrolysis of variously acetylated chitosans. These processes suffer from poor product yields (in the range of 50% conversion of substrate to glucosamine).
Moreover, the availability of raw material (i.e., a source of chitin, such as crab shells) is becoming increasingly limited. Therefore, there is a need in the industry for a cost-effective method for producing high yields of glucosamine for commercial sale and use.
SUMMARY OF THE INVENTION
One embodiment of the present invention relates to a method to produce glucosamine by fermentation of a microorganism. This method includes the steps of (a) culturing in a fermentation medium a microorganism having a genetic modification in an amino sugar metabolic pathway; and (b) recovering a product produced from the step of culturing which is selected from the group of glucosamine-6-phosphate and glucosamine.
Such an amino sugar metabolic pathway is selected from the group of a pathway for converting glucosamine-6-phosphate into another compound, a pathway for synthesizing glucosamine-6-phosphate, a pathway for transport of glucosamine or glucosamine-phosphate out of the microorganism, a pathway for transport of glucosamine into the microorganism, and a pathway which competes for substrates involved in the production of glucosamine-6-phosphate. The fermentation medium includes assimilable sources of carbon, nitrogen and phosphate. In a preferred embodiment, the microorganism is a bacterium or a yeast, and more preferably, a bacterium of the genus Escherichia, and even more preferably, Escherichia coll.
In one embodiment, the product can be recovered by recovering intracellular glucosamine-6-phosphate from the microorganism and/or recovering extracellular glucosamine from the fermentation medium. In further embodiments, the step of recovering can include purifying glucosamine from the fermentation medium, isolating glucosamine-6-phosphate from the microorganism, and/or dephosphorylating the glucosamine-6-phosphate to produce glucosamine. In one embodiment, at least about 1 g/L of product is recovered.
In yet another embodiment, the step of culturing includes the step of maintaining the carbon source at a concentration of from about 0.5% to about 5% in the fermentation medium. In another embodiment, the step of culturing is performed at a temperature of from about 28 C to about 37 C. In yet another embodiment, the step of culturing is performed in a fermentor.
In one embodiment ofthe present invention, the microorganism has a modification in a gene which encodes a protein including, but not limited to, N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'9, glucosamine-6-phosphate synthase, phosphoglucosamine mutase, glucosamine- l -phosphate acetyltransferase-N-acetylglucosamine- 1 -phosphate uridyltransferase, phosphofructokinase, enzyme III" of the PEP:glucose PTS, EIIM,P/III' of the PEP:mannose PTS, and/or a phosphatase.
In another embodiment, the genetic modification includes a genetic modification which increases the action of glucosamine-6-phosphate synthase in the microorganism.
Such a genetic modification includes the transformation of the microorganism with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase to increase the action of glucosamine-6-phosphate synthase and/or to overexpress the
FIELD OF THE INVENTION
The present invention relates to a method for producing glucosamine by fermentation. The present invention also relates to genetically modified strains of microorganisms useful for producing glucosamine.
BACKGROUND OF THE INVENTION
Amino sugars are usually found as monomer residues in complex oligosaccharides and polysaccharides. Glucosamine is an amino derivative of the simple sugar, glucose.
Glucosamine and other amino sugars are important constituents of many natural polysaccharides. For example, polysaccharides containing amino sugars can form structural materials for cells, analogous to structural proteins.
Glucosamine is manufactured as a nutraceutical product with applications in the treatment of osteoarthritic conditions in animals and humans. The market for glucosamine is experiencing tremendous growth. Furthermore, significant erosion of the world market price for glucosamine is not expected.
Glucosamine is currently obtained by acid hydrolysis of chitin, a complex carbohydrate derived from N-acetyl-D-glucosamine. Alternatively, glucosamine can also be produced by acid hydrolysis of variously acetylated chitosans. These processes suffer from poor product yields (in the range of 50% conversion of substrate to glucosamine).
Moreover, the availability of raw material (i.e., a source of chitin, such as crab shells) is becoming increasingly limited. Therefore, there is a need in the industry for a cost-effective method for producing high yields of glucosamine for commercial sale and use.
SUMMARY OF THE INVENTION
One embodiment of the present invention relates to a method to produce glucosamine by fermentation of a microorganism. This method includes the steps of (a) culturing in a fermentation medium a microorganism having a genetic modification in an amino sugar metabolic pathway; and (b) recovering a product produced from the step of culturing which is selected from the group of glucosamine-6-phosphate and glucosamine.
Such an amino sugar metabolic pathway is selected from the group of a pathway for converting glucosamine-6-phosphate into another compound, a pathway for synthesizing glucosamine-6-phosphate, a pathway for transport of glucosamine or glucosamine-phosphate out of the microorganism, a pathway for transport of glucosamine into the microorganism, and a pathway which competes for substrates involved in the production of glucosamine-6-phosphate. The fermentation medium includes assimilable sources of carbon, nitrogen and phosphate. In a preferred embodiment, the microorganism is a bacterium or a yeast, and more preferably, a bacterium of the genus Escherichia, and even more preferably, Escherichia coll.
In one embodiment, the product can be recovered by recovering intracellular glucosamine-6-phosphate from the microorganism and/or recovering extracellular glucosamine from the fermentation medium. In further embodiments, the step of recovering can include purifying glucosamine from the fermentation medium, isolating glucosamine-6-phosphate from the microorganism, and/or dephosphorylating the glucosamine-6-phosphate to produce glucosamine. In one embodiment, at least about 1 g/L of product is recovered.
In yet another embodiment, the step of culturing includes the step of maintaining the carbon source at a concentration of from about 0.5% to about 5% in the fermentation medium. In another embodiment, the step of culturing is performed at a temperature of from about 28 C to about 37 C. In yet another embodiment, the step of culturing is performed in a fermentor.
In one embodiment ofthe present invention, the microorganism has a modification in a gene which encodes a protein including, but not limited to, N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'9, glucosamine-6-phosphate synthase, phosphoglucosamine mutase, glucosamine- l -phosphate acetyltransferase-N-acetylglucosamine- 1 -phosphate uridyltransferase, phosphofructokinase, enzyme III" of the PEP:glucose PTS, EIIM,P/III' of the PEP:mannose PTS, and/or a phosphatase.
In another embodiment, the genetic modification includes a genetic modification which increases the action of glucosamine-6-phosphate synthase in the microorganism.
Such a genetic modification includes the transformation of the microorganism with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase to increase the action of glucosamine-6-phosphate synthase and/or to overexpress the
3 glucosamine-6-phosphate synthase by the microorganism. In one embodiment, the recombinant nucleic acid molecule is operatively linked to a transcription control sequence. In a further embodiment, the recombinant nucleic acid molecule is integrated into the genome of the microorganism. In yet another embodiment, the recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase has a genetic modification which increases the action of the synthase. Such genetic modifications can result in reduced glucosamine-6-phosphate product inhibition of the glucosamine-6-phosphate synthase, for example.
In one embodiment, a recombinant nucleic acid molecule of the present invention which comprises a nucleic acid sequence encoding a glucosamine-6-phosphate synthase encodes an amino acid sequence SEQ ID NO: 16. In another embodiment, such a recombinant nucleic acid molecule comprises a nucleic acid sequence selected from the group of SEQ ID NO: 13, SEQ ID NO: 14 or SEQ ID NO: 15. Preferred recombinant nucleic acid molecules of the present invention include pKLN23-28, nglmS-and nglmS-281830=
Also included in the present invention are recombinant nucleic acid molecules encoding a glucosamine-6-phosphate synthase which comprises a genetic modification which increases the action ofthe glucosamine-6-phosphate synthase (i.e., a glucosamine-6-phosphate synthase homologue). Such a genetic modification can reduce glucosamine-6-phosphate product inhibition of the synthase, for example. In one embodiment, such a genetic modification in a recombinant nucleic acid molecule ofthe present invention which encodes a glucosamine-6-phosphate synthase results in at least one amino acid modification selected from the group of an addition, substitution, deletion, and/or derivatization of an amino acid residue of the glucosamine-6-phosphate synthase. In one embodiment, such an amino acid modification is in an amino acid sequence position in the modified protein (i.e., homologue) which corresponds to one or more of the following amino acid positions in amino acid sequence SEQ ID NO: 16: Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469). In another embodiment, such an amino acid modification is selected from the group of a substitution of. (a) an amino acid residue having an aliphatic hydroxyl side group for Ile(4); (b) an amino acid residue having an aliphatic hydroxyl side group for Ile(272); (c) an amino acid residue having an aliphatic
In one embodiment, a recombinant nucleic acid molecule of the present invention which comprises a nucleic acid sequence encoding a glucosamine-6-phosphate synthase encodes an amino acid sequence SEQ ID NO: 16. In another embodiment, such a recombinant nucleic acid molecule comprises a nucleic acid sequence selected from the group of SEQ ID NO: 13, SEQ ID NO: 14 or SEQ ID NO: 15. Preferred recombinant nucleic acid molecules of the present invention include pKLN23-28, nglmS-and nglmS-281830=
Also included in the present invention are recombinant nucleic acid molecules encoding a glucosamine-6-phosphate synthase which comprises a genetic modification which increases the action ofthe glucosamine-6-phosphate synthase (i.e., a glucosamine-6-phosphate synthase homologue). Such a genetic modification can reduce glucosamine-6-phosphate product inhibition of the synthase, for example. In one embodiment, such a genetic modification in a recombinant nucleic acid molecule ofthe present invention which encodes a glucosamine-6-phosphate synthase results in at least one amino acid modification selected from the group of an addition, substitution, deletion, and/or derivatization of an amino acid residue of the glucosamine-6-phosphate synthase. In one embodiment, such an amino acid modification is in an amino acid sequence position in the modified protein (i.e., homologue) which corresponds to one or more of the following amino acid positions in amino acid sequence SEQ ID NO: 16: Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469). In another embodiment, such an amino acid modification is selected from the group of a substitution of. (a) an amino acid residue having an aliphatic hydroxyl side group for Ile(4); (b) an amino acid residue having an aliphatic hydroxyl side group for Ile(272); (c) an amino acid residue having an aliphatic
4 side group for Ser(450); (d) an amino acid residue having an aliphatic hydroxyl side group for Ala(39); (e) an amino acid residue having a sulfur-containing side group for Arg(250);
(f) an amino acid residue having an aliphatic hydroxyl side group for Gly(472); (g) an amino acid residue having an aliphatic side group for Leu(469); and, (h) combinations of (a)-(g).
In yet another embodiment of the present invention, an amino acid modification as described above is preferably a substitution selected from the group of:
Ile(4) to Thr, Ile(272) to Thr, Ser(450) to Pro, Ala(39) to Thr, Arg(250) to Cys, Gly(472) to Ser, Leu(469) to Pro, and combinations thereof.
In another embodiment, a genetically modified recombinant nucleic acid molecule of the present invention comprises a nucleic acid sequence encoding glucosamine-6-phosphate synthase comprising an amino acid sequence selected from the group of SEQ
ID NO:19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28 or SEQ ID NO:31. In another embodiment, a genetically modified recombinant nucleic acid molecule of the present invention comprises a nucleic acid sequence selected from the group of SEQ ID
NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID
NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:29 and SEQ ID NO:30. Preferred genetically modified recombinant nucleic acid molecule of the present invention include pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149, pKLN23-151, nglmS-492184, nglmS-491830, nglmS-542184, nglmS-541830, nglmS-1242184, nglmS-1241830, nglmS-1492184, nglmS-1491830, nglmS-1512184 and nglmS-1511830=
Another embodiment of the present invention relates to a glucosamine-6-phosphate synthase which has glucosamine-6-phosphate synthase action, such synthase being encoded by a nucleic acid sequence having a genetic modification that results in increased glucosamine-6-phosphate synthase action. Such a nucleic acid sequence has been describe above with regard to recombinant nucleic acid molecules of the present invention.
Yet another embodiment of the present invention relates to a method to produce glucosamine by fermentation, such method comprising: (a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, a genetically modified microorganism having increased glucosamine-6-phosphate synthase action, wherein the genetically modified microorganism is produced by a process comprising the steps of 1) generating modifications in an isolated nucleic acid molecule comprising a nucleic acid sequence encoding glucosamine-6-phosphate synthase to create a plurality
(f) an amino acid residue having an aliphatic hydroxyl side group for Gly(472); (g) an amino acid residue having an aliphatic side group for Leu(469); and, (h) combinations of (a)-(g).
In yet another embodiment of the present invention, an amino acid modification as described above is preferably a substitution selected from the group of:
Ile(4) to Thr, Ile(272) to Thr, Ser(450) to Pro, Ala(39) to Thr, Arg(250) to Cys, Gly(472) to Ser, Leu(469) to Pro, and combinations thereof.
In another embodiment, a genetically modified recombinant nucleic acid molecule of the present invention comprises a nucleic acid sequence encoding glucosamine-6-phosphate synthase comprising an amino acid sequence selected from the group of SEQ
ID NO:19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28 or SEQ ID NO:31. In another embodiment, a genetically modified recombinant nucleic acid molecule of the present invention comprises a nucleic acid sequence selected from the group of SEQ ID
NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID
NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:29 and SEQ ID NO:30. Preferred genetically modified recombinant nucleic acid molecule of the present invention include pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149, pKLN23-151, nglmS-492184, nglmS-491830, nglmS-542184, nglmS-541830, nglmS-1242184, nglmS-1241830, nglmS-1492184, nglmS-1491830, nglmS-1512184 and nglmS-1511830=
Another embodiment of the present invention relates to a glucosamine-6-phosphate synthase which has glucosamine-6-phosphate synthase action, such synthase being encoded by a nucleic acid sequence having a genetic modification that results in increased glucosamine-6-phosphate synthase action. Such a nucleic acid sequence has been describe above with regard to recombinant nucleic acid molecules of the present invention.
Yet another embodiment of the present invention relates to a method to produce glucosamine by fermentation, such method comprising: (a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, a genetically modified microorganism having increased glucosamine-6-phosphate synthase action, wherein the genetically modified microorganism is produced by a process comprising the steps of 1) generating modifications in an isolated nucleic acid molecule comprising a nucleic acid sequence encoding glucosamine-6-phosphate synthase to create a plurality
5 of modified nucleic acid sequences; (2) transforming microorganisms with the modified nucleic acid sequences to produce genetically modified microorganisms;
(3)screening the genetically modified microorganisms for glucosamine-6-phosphate synthase action; and, (4) selecting the genetically modified microorganisms which have increased glucosamine-6-phosphate synthase action; ; and, (b) recovering the product.
The step of culturing produces a product selected from the group of glucosamine-6-phosphate and glucosamine from the microorganism.
In another embodiment, a microorganism ofthe present invention has an additional genetic modification in genes encoding N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IV'e, phosphoglucosamine mutase, glucosamine- 1 -phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase, phosphofructokinase, Enzyme IIG1` of the PEP:glucose PTS, EUM,P/III'' of the PEP:mannose PTS, wherein the modification decreases the action of such proteins. In another embodiment, a microorganism of the present invention has an additional genetic modification in a gene encoding a phosphatase, wherein the modification increases the action of the phosphatase. In a preferred embodiment, a microorganism of the present invention has an additional genetic modification in the genes encoding the following proteins: N-acetylglucosamine-
(3)screening the genetically modified microorganisms for glucosamine-6-phosphate synthase action; and, (4) selecting the genetically modified microorganisms which have increased glucosamine-6-phosphate synthase action; ; and, (b) recovering the product.
The step of culturing produces a product selected from the group of glucosamine-6-phosphate and glucosamine from the microorganism.
In another embodiment, a microorganism ofthe present invention has an additional genetic modification in genes encoding N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IV'e, phosphoglucosamine mutase, glucosamine- 1 -phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase, phosphofructokinase, Enzyme IIG1` of the PEP:glucose PTS, EUM,P/III'' of the PEP:mannose PTS, wherein the modification decreases the action of such proteins. In another embodiment, a microorganism of the present invention has an additional genetic modification in a gene encoding a phosphatase, wherein the modification increases the action of the phosphatase. In a preferred embodiment, a microorganism of the present invention has an additional genetic modification in the genes encoding the following proteins: N-acetylglucosamine-
6-phosphate deacetylase, glucosamine-6-phosphate deaminase and N-acetyl-glucosamine-specific enzyme II''''g, such modifications including, but not limited to, a deletion of at least a portion of such genes.
Another embodiment of the present invention relates to a method to produce glucosamine by fermentation which includes the steps of (a) culturing an Escherichia coli transformed with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate to produce a product, and (b) recovering the product. The product includes intracellular glucosamine-6-phosphate which is recovered from the Escherichia coli and/or extracellular glucosamine which is recovered from the fermentation medium. In this embodiment, the recombinant nucleic acid molecule increases expression of the glucosamine-6-phosphate synthase by the Escherichia coil, and is operatively linked to a transcription control sequence. In one embodiment, the recombinant nucleic acid molecule comprises a genetic modification which reduces glucosamine-6-phosphate product inhibition ofthe glucosamine-6-phosphate synthase. In another embodiment, the Escherichia colt has an additional genetic modification in at least one gene selected from the group of nagA, nagB, nagC, nagD, nagE, manXYZ, glmM, p, f kB, p jlcA, gtm U, glmS, ptsG and/or a phosphatase gene. In yet another embodiment, the additional modification comprises a deletion of nagA, nagB, nagC, nagD, nagE, and a mutation in manXYZ, wherein the modification results in decreased enzymatic activity ofN-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase and N-acetyl-glucosamine-specific enzyme IIN'g.
Yet another embodiment of the present invention relates to a microorganism for producing glucosamine by a biosynthetic process. The microorganism is transformed with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase, wherein the recombinant nucleic acid molecule is operatively linked to a transcription control sequence. The recombinant nucleic acid molecule further comprises a genetic modification which increases the action of the glucosamine-6-phosphate synthase. The expression of the recombinant nucleic acid molecule increases production of the glucosamine by the microorganism. In a preferred embodiment, the recombinant nucleic acid molecule is integrated into the genome of the microorganism. In yet another embodiment, the microorganism has at least one additional genetic modification in a gene encoding a protein selected from the group consisting of N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'9, phosphoglucosamine mutase, glucosamine- l -phosphate acetyltransferase-N-acetylglucosamine- l -phosphate uridyltransferase, phosphofiuctokinase, Enzyme 1TGk of the PEP:glucose PTS, and/or EIIM,P/III'' of the PEP:mannose PTS, wherein the genetic modification decreases the action of the protein.
In another embodiment, the microorganism has a modification in a gene encoding a phosphatase, wherein the genetic modification increases the action of the phosphatase.
Another embodiment of the present invention relates to a method to produce glucosamine by fermentation which includes the steps of (a) culturing an Escherichia coli transformed with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate to produce a product, and (b) recovering the product. The product includes intracellular glucosamine-6-phosphate which is recovered from the Escherichia coli and/or extracellular glucosamine which is recovered from the fermentation medium. In this embodiment, the recombinant nucleic acid molecule increases expression of the glucosamine-6-phosphate synthase by the Escherichia coil, and is operatively linked to a transcription control sequence. In one embodiment, the recombinant nucleic acid molecule comprises a genetic modification which reduces glucosamine-6-phosphate product inhibition ofthe glucosamine-6-phosphate synthase. In another embodiment, the Escherichia colt has an additional genetic modification in at least one gene selected from the group of nagA, nagB, nagC, nagD, nagE, manXYZ, glmM, p, f kB, p jlcA, gtm U, glmS, ptsG and/or a phosphatase gene. In yet another embodiment, the additional modification comprises a deletion of nagA, nagB, nagC, nagD, nagE, and a mutation in manXYZ, wherein the modification results in decreased enzymatic activity ofN-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase and N-acetyl-glucosamine-specific enzyme IIN'g.
Yet another embodiment of the present invention relates to a microorganism for producing glucosamine by a biosynthetic process. The microorganism is transformed with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase, wherein the recombinant nucleic acid molecule is operatively linked to a transcription control sequence. The recombinant nucleic acid molecule further comprises a genetic modification which increases the action of the glucosamine-6-phosphate synthase. The expression of the recombinant nucleic acid molecule increases production of the glucosamine by the microorganism. In a preferred embodiment, the recombinant nucleic acid molecule is integrated into the genome of the microorganism. In yet another embodiment, the microorganism has at least one additional genetic modification in a gene encoding a protein selected from the group consisting of N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'9, phosphoglucosamine mutase, glucosamine- l -phosphate acetyltransferase-N-acetylglucosamine- l -phosphate uridyltransferase, phosphofiuctokinase, Enzyme 1TGk of the PEP:glucose PTS, and/or EIIM,P/III'' of the PEP:mannose PTS, wherein the genetic modification decreases the action of the protein.
In another embodiment, the microorganism has a modification in a gene encoding a phosphatase, wherein the genetic modification increases the action of the phosphatase.
7 In yet another embodiment, the microorganism has a modification in genes encoding N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase and N-acetyl-glucosamine-specific enzyme IIN'~ wherein the genetic modification decreases enzymatic activity of the protein. In a preferred embodiment, the genetic modification is a deletion of at least a portion of the genes.
In a further embodiment, the microorganism is Escherichia coil, having a modification in a gene selected from the group of nagA, nagB, nagC, nagD, nagE, manXYZ, g1mM, pfkB, pfkA, g1mU, ptsG and/or a phosphatase gene. In one embodiment, such an Escherichia coil has a deletion of nag regulon genes, and in another embodiment, such an Escherichia coil has a deletion of nag regulon genes and a genetic modification in manXYZ genes such that the proteins encoded by the manXYZ genes have decreased action.
Yet another embodiment of the present invention is a microorganism as described above which produces at least about 1 g/L ofglucosamine when cultured for about 10-60 hours at from about 28 C to about 37 C to a cell density of at least about 8 g/L by dry cell weight, in a pH 7.0 fermentation medium comprising: 14 g/L K2HPO4i 16 g/L
KH2PO4i1 g/L Na3Citrate-2H2O, 5 g/L (NH4)2SO4i 20 g/L glucose, l0 mM MgSO4,1 mM
CaC12, and from about 0.2mM to about 1 mM IPTG.
Another embodiment of the present invention is a microorganism for producing glucosamine by a biosynthetic process, which includes: (a) a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence; and, (b) at least one genetic modification in a gene encoding a protein selected from the group ofN-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'g, phosphoglucosamine mutase, glucosamine- 1 -phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase, phosphofructokinase, Enzyme IIGk of the PEP:glucose PTS, and/or EUM,P/HI'`4ai of the PEP:mannose PTS, wherein the genetic modification decreases the action of the protein. In another embodiment, the microorganism includes at least one genetic modification in a gene encoding a phosphatase, wherein the genetic modification increases the action of the phosphatase. Expression of the recombinant nucleic acid molecule increases the action of the glucosamine-6-phosphate synthase in the
In a further embodiment, the microorganism is Escherichia coil, having a modification in a gene selected from the group of nagA, nagB, nagC, nagD, nagE, manXYZ, g1mM, pfkB, pfkA, g1mU, ptsG and/or a phosphatase gene. In one embodiment, such an Escherichia coil has a deletion of nag regulon genes, and in another embodiment, such an Escherichia coil has a deletion of nag regulon genes and a genetic modification in manXYZ genes such that the proteins encoded by the manXYZ genes have decreased action.
Yet another embodiment of the present invention is a microorganism as described above which produces at least about 1 g/L ofglucosamine when cultured for about 10-60 hours at from about 28 C to about 37 C to a cell density of at least about 8 g/L by dry cell weight, in a pH 7.0 fermentation medium comprising: 14 g/L K2HPO4i 16 g/L
KH2PO4i1 g/L Na3Citrate-2H2O, 5 g/L (NH4)2SO4i 20 g/L glucose, l0 mM MgSO4,1 mM
CaC12, and from about 0.2mM to about 1 mM IPTG.
Another embodiment of the present invention is a microorganism for producing glucosamine by a biosynthetic process, which includes: (a) a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence; and, (b) at least one genetic modification in a gene encoding a protein selected from the group ofN-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'g, phosphoglucosamine mutase, glucosamine- 1 -phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase, phosphofructokinase, Enzyme IIGk of the PEP:glucose PTS, and/or EUM,P/HI'`4ai of the PEP:mannose PTS, wherein the genetic modification decreases the action of the protein. In another embodiment, the microorganism includes at least one genetic modification in a gene encoding a phosphatase, wherein the genetic modification increases the action of the phosphatase. Expression of the recombinant nucleic acid molecule increases the action of the glucosamine-6-phosphate synthase in the
8 microorganism. In a further embodiment, the recombinant nucleic acid molecule is integrated into the genome of the microorganism.
BRIEF DESCRIPTION OF THE FIGURES OF THE INVENTION
Fig. 1 is a schematic representation of the pathways for the biosynthesis and catabolism ofglucosamine and N-acetyl-glucosamine and their phosphorylated derivatives in Escherichia coll.
Fig. 2 is a schematic representation of the modifications to the pathways related to amino sugar metabolism for the overproduction of glucosamine in Escherichia coll.
Fig. 3 is a schematic representation of the production of Escherichia coli strains containing combinations of the manXYZ, ptsG, and Onag mutations.
Fig. 4 is a line graph illustrating the effects on glucosamine accumulation of feeding additional glucose and ammonium sulfate to cultures.
Fig. 5 is a line graph which shows that glucosamine-6-phosphate synthase is inhibited by glucosamine-6-phosphate and glucosamine.
Fig. 6 is a line graph illustrating product inhibition of glucosamine-6-phosphate synthase activity in mutant g1mS clones.
Fig. 7 is a schematic representation of the strategy for constructions of Escherichia coil strains containing mutant gimS genes.
Fig. 8 is a line graph illustrating product inhibition of glucosamine-6-phosphate synthase in Escherichia coil strains with integrated mutant glmS genes.
Fig. 9 is a line graph showing glucosamine production in mutant Escherichia coil strains with integrated mutant g1mS genes.
Fig. 10 is a line graph showing inhibition of glucosamine-6-phosphate synthase in glucosamine-producing strains.
Fig. 11 A is a line graph showing the thermal stability at 45 C of glucosamine-6-phosphate synthase in glucosamine-producing strains.
Fig. 11 B is a line graph illustrating the thermal stability at 50'C of glucosamine-6-phosphate synthase in glucosamine-producing strains.
Fig. 12 is a line graph showing the effect of IPTG concentration on glucosamine production.
BRIEF DESCRIPTION OF THE FIGURES OF THE INVENTION
Fig. 1 is a schematic representation of the pathways for the biosynthesis and catabolism ofglucosamine and N-acetyl-glucosamine and their phosphorylated derivatives in Escherichia coll.
Fig. 2 is a schematic representation of the modifications to the pathways related to amino sugar metabolism for the overproduction of glucosamine in Escherichia coll.
Fig. 3 is a schematic representation of the production of Escherichia coli strains containing combinations of the manXYZ, ptsG, and Onag mutations.
Fig. 4 is a line graph illustrating the effects on glucosamine accumulation of feeding additional glucose and ammonium sulfate to cultures.
Fig. 5 is a line graph which shows that glucosamine-6-phosphate synthase is inhibited by glucosamine-6-phosphate and glucosamine.
Fig. 6 is a line graph illustrating product inhibition of glucosamine-6-phosphate synthase activity in mutant g1mS clones.
Fig. 7 is a schematic representation of the strategy for constructions of Escherichia coil strains containing mutant gimS genes.
Fig. 8 is a line graph illustrating product inhibition of glucosamine-6-phosphate synthase in Escherichia coil strains with integrated mutant glmS genes.
Fig. 9 is a line graph showing glucosamine production in mutant Escherichia coil strains with integrated mutant g1mS genes.
Fig. 10 is a line graph showing inhibition of glucosamine-6-phosphate synthase in glucosamine-producing strains.
Fig. 11 A is a line graph showing the thermal stability at 45 C of glucosamine-6-phosphate synthase in glucosamine-producing strains.
Fig. 11 B is a line graph illustrating the thermal stability at 50'C of glucosamine-6-phosphate synthase in glucosamine-producing strains.
Fig. 12 is a line graph showing the effect of IPTG concentration on glucosamine production.
9 Fig. 13 is a line graph demonstrating the effects of IPTG concentration and temperature on glucosamine production.
Fig. 14A is a line graph illustrating growth and glucosamine production by glucosamine-producing strain 2123-54 at 30 C.
Fig. 14B is a line graph illustrating growth and glucosamine production by glucosamine-producing strain 2123-54 at 37 C.
Fig. 15A is a line graph showing glucosamine production by strain 2123-49 at 30 C.
Fig. 15B is a line graph showing glucosamine production by strain 2123-124 at 30 C.
Fig. 16A is a line graph illustrating glucosamine production by a glucosamine-producing strain in a glucose limited fermentor at 37 C.
Fig. 16B is a line graph illustrating glucosamine production by a glucosamine-producing strain in a glucose limited fermentor at 30 C.
Fig. 16A is a line graph illustrating glucosamine production by a glucosamine-producing strain in a glucose excess fermentor at 30 C.
DETAILED DESCRIPTION OF THE INVENTION
The present invention relates to a biosynthetic method for producing glucosamine.
Such a method includes fermentation of a genetically modified microorganism to produce glucosamine. The present invention also relates to genetically modified microorganisms, such as strains ofEscherichia coli, useful for producing glucosamine. As used herein, the terms glucosamine and N-glucosamine can be used interchangeably. Similarly, the terms glucosamine-6-phosphate and N-glucosanune-6-phosphate can be used interchangeably.
Glucosamine can also be abbreviated as G1cN and glucosamine-6-phosphate can also be abbreviated as G1cN-6-P.
The novel method of the present invention for production of glucosamine by fermentation is inexpensive and can produce a yield of glucosamine that exceeds the yield per cost of glucosamine produced by current hydrolysis methods. In addition, by using a genetically modified microorganism as described herein, the method of the present invention can be easily modified to adapt to particular problems or changing needs relative to the production of glucosamine.
The amino sugars, N-acetylglucosamine (G1cNAc) and glucosamine (G1cN), are fundamentally important molecules in microorganisms, because they are the precursors 5 for the biosynthesis of major macromolecules, and in particular, glycoconjugates (i.e., macromolecules containing covalently bound oligosaccharide chains). For example, in Escherichia coli, N-acetylglucosamine and glucosamine are precursors for two macromolecules of the cell envelope, peptidoglycan and lipopolysaccharide.
Mutations that block the biosynthesis of peptidoglycan or lipopolysaccharide are lethal, resulting in
Fig. 14A is a line graph illustrating growth and glucosamine production by glucosamine-producing strain 2123-54 at 30 C.
Fig. 14B is a line graph illustrating growth and glucosamine production by glucosamine-producing strain 2123-54 at 37 C.
Fig. 15A is a line graph showing glucosamine production by strain 2123-49 at 30 C.
Fig. 15B is a line graph showing glucosamine production by strain 2123-124 at 30 C.
Fig. 16A is a line graph illustrating glucosamine production by a glucosamine-producing strain in a glucose limited fermentor at 37 C.
Fig. 16B is a line graph illustrating glucosamine production by a glucosamine-producing strain in a glucose limited fermentor at 30 C.
Fig. 16A is a line graph illustrating glucosamine production by a glucosamine-producing strain in a glucose excess fermentor at 30 C.
DETAILED DESCRIPTION OF THE INVENTION
The present invention relates to a biosynthetic method for producing glucosamine.
Such a method includes fermentation of a genetically modified microorganism to produce glucosamine. The present invention also relates to genetically modified microorganisms, such as strains ofEscherichia coli, useful for producing glucosamine. As used herein, the terms glucosamine and N-glucosamine can be used interchangeably. Similarly, the terms glucosamine-6-phosphate and N-glucosanune-6-phosphate can be used interchangeably.
Glucosamine can also be abbreviated as G1cN and glucosamine-6-phosphate can also be abbreviated as G1cN-6-P.
The novel method of the present invention for production of glucosamine by fermentation is inexpensive and can produce a yield of glucosamine that exceeds the yield per cost of glucosamine produced by current hydrolysis methods. In addition, by using a genetically modified microorganism as described herein, the method of the present invention can be easily modified to adapt to particular problems or changing needs relative to the production of glucosamine.
The amino sugars, N-acetylglucosamine (G1cNAc) and glucosamine (G1cN), are fundamentally important molecules in microorganisms, because they are the precursors 5 for the biosynthesis of major macromolecules, and in particular, glycoconjugates (i.e., macromolecules containing covalently bound oligosaccharide chains). For example, in Escherichia coli, N-acetylglucosamine and glucosamine are precursors for two macromolecules of the cell envelope, peptidoglycan and lipopolysaccharide.
Mutations that block the biosynthesis of peptidoglycan or lipopolysaccharide are lethal, resulting in
10 loss of integrity of the cell envelope and ultimately in cell lysis.
One embodiment of the present invention relates to a method to produce glucosamine by fermentation of a microorganism. This method includes the steps of (a) culturing in a fermentation medium a microorganism having a genetic modification in an amino sugar metabolic pathway which includes: a pathway for converting glucosamine-6-phosphate into another compound, a pathway for synthesizing glucosamine-6-phosphate, a pathway for transport of glucosamine or glucosamine-6-phosphate out of said microorganism, a pathway for transport of glucosamine into said microorganism, and a pathway which competes for substrates involved in the production of glucosamine-6-phosphate, to produce a product which can include intracellular glucosamine-6-phosphate and/or extracellular glucosamine from the microorganism; and (b) recovering the product by recovering intracellular glucosamine-6-phosphate from the microorganism and/or recovering extracellular glucosamine from the fermentation medium. The fermentation medium includes assimilable sources of carbon, nitrogen and phosphate.
Another embodiment of the present invention relates to a method to produce glucosamine by fermentation. Such method includes the steps of. (a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, an Escherichia coli transformed with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence;
and (b) recovering a product selected from the group of glucosamine-6-phosphate and glucosamine. The recombinant nucleic acid molecule increases expression of the glucosamine-6-phosphate synthase by the Escherichia coll. In a further embodiment, the
One embodiment of the present invention relates to a method to produce glucosamine by fermentation of a microorganism. This method includes the steps of (a) culturing in a fermentation medium a microorganism having a genetic modification in an amino sugar metabolic pathway which includes: a pathway for converting glucosamine-6-phosphate into another compound, a pathway for synthesizing glucosamine-6-phosphate, a pathway for transport of glucosamine or glucosamine-6-phosphate out of said microorganism, a pathway for transport of glucosamine into said microorganism, and a pathway which competes for substrates involved in the production of glucosamine-6-phosphate, to produce a product which can include intracellular glucosamine-6-phosphate and/or extracellular glucosamine from the microorganism; and (b) recovering the product by recovering intracellular glucosamine-6-phosphate from the microorganism and/or recovering extracellular glucosamine from the fermentation medium. The fermentation medium includes assimilable sources of carbon, nitrogen and phosphate.
Another embodiment of the present invention relates to a method to produce glucosamine by fermentation. Such method includes the steps of. (a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, an Escherichia coli transformed with a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence;
and (b) recovering a product selected from the group of glucosamine-6-phosphate and glucosamine. The recombinant nucleic acid molecule increases expression of the glucosamine-6-phosphate synthase by the Escherichia coll. In a further embodiment, the
11 recombinant nucleic acid molecule comprises a genetic modification which reduces glucosamine-6-phosphate product inhibition of the glucosam.ine-6-phosphate symbase.
In yet another embodiment, the=scherichia coil has an additional genetic modification in at least one gene selected from the group of nagA, nag8, nagC, nagD, nags. manXY2, glniW. pJkB .pjk4. g1m U. glntS. ptsG and/or a phosphatase gene.
To produce significantly high yields of glucosamine by the fermentation method of the present invention, a microorganism is genetically modified to enhance production of glucosatnine. As used herein, a genetically modified microorganism, such as Escherichia coil, has a genome which is modified (i.e., mutated or changed) from its normal (i.e., wild-type or naturally occurring) form. Genetic modification of a microorganism can be accomplished using classical strain development and/or molecular genetic techniques. Such techniques are generally disclosed, for example, in Sambrook et al, 1989, Molecular Cloning. A Laboratory Manual, Cold Spring Harbor Labs Press. Additionally, techniques for genetic modification of a microorganism are described in detail in the Examples section- A genetically modified microorganism can include a natural genetic variant as well as a microorganism in which nucleic acid molecules have been inserted, deleted or modified (i.e., mutated; e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such a manner that such modifications provide the desired effect within the microorganism. According to the present invention, a genetically modified microorganism includes a microorganism that has been modified using recombinant technology. As used herein, genetic modifications which result in a decrease in gene expression, in the function of the gene, or in the function of the gene product (i.e., the protein encoded by the gene) can be referred to as inactivation (complete or paral), deletion, interruption, blockage or down-regulation of a gene. For example, a genetic modification in a gene which results in a decrease in the ilmction of the protein encoded by such gene, can be the result of a complete deletion of the gene (i.e., the gene does not exist, and therefore the protein does not exist), a mutation in the gene which results in incomplete or no translation of the protein (e.g., the protein is not expressed), or a mutation in the gene which decreases or abolishes the natural function of the protein (e.g., a protein is expressed which has decreased or no enzymatic activity or action). Genetic {E5224320.DOC;t }
In yet another embodiment, the=scherichia coil has an additional genetic modification in at least one gene selected from the group of nagA, nag8, nagC, nagD, nags. manXY2, glniW. pJkB .pjk4. g1m U. glntS. ptsG and/or a phosphatase gene.
To produce significantly high yields of glucosamine by the fermentation method of the present invention, a microorganism is genetically modified to enhance production of glucosatnine. As used herein, a genetically modified microorganism, such as Escherichia coil, has a genome which is modified (i.e., mutated or changed) from its normal (i.e., wild-type or naturally occurring) form. Genetic modification of a microorganism can be accomplished using classical strain development and/or molecular genetic techniques. Such techniques are generally disclosed, for example, in Sambrook et al, 1989, Molecular Cloning. A Laboratory Manual, Cold Spring Harbor Labs Press. Additionally, techniques for genetic modification of a microorganism are described in detail in the Examples section- A genetically modified microorganism can include a natural genetic variant as well as a microorganism in which nucleic acid molecules have been inserted, deleted or modified (i.e., mutated; e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such a manner that such modifications provide the desired effect within the microorganism. According to the present invention, a genetically modified microorganism includes a microorganism that has been modified using recombinant technology. As used herein, genetic modifications which result in a decrease in gene expression, in the function of the gene, or in the function of the gene product (i.e., the protein encoded by the gene) can be referred to as inactivation (complete or paral), deletion, interruption, blockage or down-regulation of a gene. For example, a genetic modification in a gene which results in a decrease in the ilmction of the protein encoded by such gene, can be the result of a complete deletion of the gene (i.e., the gene does not exist, and therefore the protein does not exist), a mutation in the gene which results in incomplete or no translation of the protein (e.g., the protein is not expressed), or a mutation in the gene which decreases or abolishes the natural function of the protein (e.g., a protein is expressed which has decreased or no enzymatic activity or action). Genetic {E5224320.DOC;t }
12 modifications which result in an increase in gene expression or function can be referred to as amplification, overproduction, overexpression, activation, enhancement, addition, or up-regulation of a gene.
In one embodiment of the present invention, a genetic modification of a microorganism increases or decreases the action of a protein involved in an amino sugar metabolic pathway according to the present invention. Such a genetic modification includes any type of modification and specifically includes modifications made by recombinant technology and by classical mutagenesis. For example, in one embodiment, a microorganism of the present invention has a genetic modification that increases the action of glucosamine-6-phosphate synthase. It should be noted that reference to increasing the action (or activity) ofglucosamine-6-phosphate synthase and other enzymes discussed herein refers to any genetic modification in the microorganism in question which results in increased functionality of the enzymes and includes higher activity of the enzymes (e.g., specific activity or in vivo enzymatic activity), reduced inhibition or degradation of the enzymes and overexpression of the enzymes. For example, gene copy number can be increased, expression levels can be increased by use of a promoter that gives higher levels of expression than that ofthe native promoter, or a gene can be altered by genetic engineering or classical mutagenesis to increase the action of an enzyme.
Examples of nucleic acid molecules encoding glucosamine-6-phosphate synthase which have been genetically modified to increase the action of the glucosamine-6-phosphate synthase are described in the Examples section. Similarly, reference to decreasing the action of enzymes discussed herein refers to any genetic modification in the microorganism in question which results in decreased functionality of the enzymes and includes decreased activity of the enzymes (e.g., specific activity), increased inhibition or degradation of the enzymes and a reduction or elimination of expression of the enzymes.
For example, the action of an enzyme of the present invention can be decreased by blocking or reducing the production ofthe enzyme, reducing enzyme activity, or inhibiting the activity of the enzyme. Blocking or reducing the production of an enzyme can include placing the gene encoding the enzyme under the control of a promoter that requires the presence of an inducing compound in the growth medium. By establishing conditions such that the inducer becomes depleted from the medium, the expression of the gene
In one embodiment of the present invention, a genetic modification of a microorganism increases or decreases the action of a protein involved in an amino sugar metabolic pathway according to the present invention. Such a genetic modification includes any type of modification and specifically includes modifications made by recombinant technology and by classical mutagenesis. For example, in one embodiment, a microorganism of the present invention has a genetic modification that increases the action of glucosamine-6-phosphate synthase. It should be noted that reference to increasing the action (or activity) ofglucosamine-6-phosphate synthase and other enzymes discussed herein refers to any genetic modification in the microorganism in question which results in increased functionality of the enzymes and includes higher activity of the enzymes (e.g., specific activity or in vivo enzymatic activity), reduced inhibition or degradation of the enzymes and overexpression of the enzymes. For example, gene copy number can be increased, expression levels can be increased by use of a promoter that gives higher levels of expression than that ofthe native promoter, or a gene can be altered by genetic engineering or classical mutagenesis to increase the action of an enzyme.
Examples of nucleic acid molecules encoding glucosamine-6-phosphate synthase which have been genetically modified to increase the action of the glucosamine-6-phosphate synthase are described in the Examples section. Similarly, reference to decreasing the action of enzymes discussed herein refers to any genetic modification in the microorganism in question which results in decreased functionality of the enzymes and includes decreased activity of the enzymes (e.g., specific activity), increased inhibition or degradation of the enzymes and a reduction or elimination of expression of the enzymes.
For example, the action of an enzyme of the present invention can be decreased by blocking or reducing the production ofthe enzyme, reducing enzyme activity, or inhibiting the activity of the enzyme. Blocking or reducing the production of an enzyme can include placing the gene encoding the enzyme under the control of a promoter that requires the presence of an inducing compound in the growth medium. By establishing conditions such that the inducer becomes depleted from the medium, the expression of the gene
13 encoding the enzyme (and therefore, of enzyme synthesis) could be turned off.
Blocking or reducing the activity of an enzyme could also include using an excision technology approach similar to that described in U.S. Patent No- 4,743,546. To use this approach, the gene encoding the enzyme of interest is cloned between specific genetic sequences that allow specific, controlled excision of the gene from the genome. Excision could be prompted by, for example, a shift in the cultivation temperature of the culture, as in U.S_ Patent No.
4,743,546, or by some other physical or nutritional signal.
An amino sugar is an amino derivative of a saccharide (e.g., a saccharide having 10 an amino group in place of it hydroxyl group). According to the present invention, an amino sugar metabolic pathway is any biochemical pathway involved in, or affecting, the biosynthesis, anabolism or catabolism of an amino sugar. As used herein, amino sugar metabolic pathways include pathways involved in the transport of amino sugars and their precursors into and out of a cvi, and can also include biochemical pathways which compete for substrates involved in the biosynthesis or catabolism of an amino sugar. For example, the immediate precursor to one of the earliest formed amino sugars is fructose-6-phosphate (F-6-P), which, in a biochemical reaction with glutamine (Gln, the amino group donor), forms glucosamine-6-phosphate.
Fructose-6-phosphate is also an intermediate in the glycolysis pathway.
Therefore, the glycolytic pathway competes with the glucosamine-6-phosphate biosynthetic pathway by competing for a substrate, fructose-6-phosphate. In addition, glucosamine-6-phosphate can be converted to. other amino sugars and form constituents in various macromolecules by a series of biochemical reactions. As such, the fiuctose-6-phospbate/glucosamine-6-phosph4te pathway, the fructose-6-phosphate glycolysic pathway, to the extent that it affects the biosynthesis of glucosamine-6-phosphate, and the glucosamine-6-phosphate/rnacromolecule biosynthesis pathway are all considered to be amino sugar metabolic pathways in the present invention.
In general, a microorganism having a genetically modified amino sugar metabolic pathway has at least one genetic modification, as discussed above, which results in a change in one or more amino sugar metabolic pathways as described above as compared to a wild-type microorganism cultured under the same conditions. Such a modification in an amino sugar metabolic pathway changes the ability of the microorganism to produce {ES22432O.DOc:1 }
Blocking or reducing the activity of an enzyme could also include using an excision technology approach similar to that described in U.S. Patent No- 4,743,546. To use this approach, the gene encoding the enzyme of interest is cloned between specific genetic sequences that allow specific, controlled excision of the gene from the genome. Excision could be prompted by, for example, a shift in the cultivation temperature of the culture, as in U.S_ Patent No.
4,743,546, or by some other physical or nutritional signal.
An amino sugar is an amino derivative of a saccharide (e.g., a saccharide having 10 an amino group in place of it hydroxyl group). According to the present invention, an amino sugar metabolic pathway is any biochemical pathway involved in, or affecting, the biosynthesis, anabolism or catabolism of an amino sugar. As used herein, amino sugar metabolic pathways include pathways involved in the transport of amino sugars and their precursors into and out of a cvi, and can also include biochemical pathways which compete for substrates involved in the biosynthesis or catabolism of an amino sugar. For example, the immediate precursor to one of the earliest formed amino sugars is fructose-6-phosphate (F-6-P), which, in a biochemical reaction with glutamine (Gln, the amino group donor), forms glucosamine-6-phosphate.
Fructose-6-phosphate is also an intermediate in the glycolysis pathway.
Therefore, the glycolytic pathway competes with the glucosamine-6-phosphate biosynthetic pathway by competing for a substrate, fructose-6-phosphate. In addition, glucosamine-6-phosphate can be converted to. other amino sugars and form constituents in various macromolecules by a series of biochemical reactions. As such, the fiuctose-6-phospbate/glucosamine-6-phosph4te pathway, the fructose-6-phosphate glycolysic pathway, to the extent that it affects the biosynthesis of glucosamine-6-phosphate, and the glucosamine-6-phosphate/rnacromolecule biosynthesis pathway are all considered to be amino sugar metabolic pathways in the present invention.
In general, a microorganism having a genetically modified amino sugar metabolic pathway has at least one genetic modification, as discussed above, which results in a change in one or more amino sugar metabolic pathways as described above as compared to a wild-type microorganism cultured under the same conditions. Such a modification in an amino sugar metabolic pathway changes the ability of the microorganism to produce {ES22432O.DOc:1 }
14 an amino sugar. According to the present invention, a genetically modified microorganism preferably has an enhanced ability to produce glucosamine compared to a wild-type microorganism cultured under the same conditions. An amino sugar metabolic pathway which affects the production ofglucosamine can generally be categorized into at least one ofthe following kinds ofpathways: (a) pathways for converting glucosamine-6-phosphate into other compounds, (b) pathways for synthesizing glucosamine-6-phosphate, (c) pathways for transporting glucosamine into a cell, (d) pathways for transporting glucosamine or glucosamine-6-phosphate out of a cell, and (e) pathways which compete for substrates involved in the production of glucosamine-6-phosphate.
A genetically modified microorganism useful in a method of the present invention typically has at least one modified gene involved in at least one amino sugar metabolic pathway which results in (a) reduced ability to convert glucosamine-6-phosphate into other compounds (i.e., inhibition of glucosamine-6-phosphate catabolic or anabolic pathways), (b) an enhanced ability to produce (i.e., synthesize) glucosamine-6-phosphate, (c) a reduced ability to transport glucosamine into the cell, (d) an enhanced ability to transport glucosamine-6-phosphate or glucosamine out of the cell, and/or (e) a reduced ability to use substrates involved in the production of glucosamine-6-P for competing biochemical reactions.
It is to be understood that the present invention discloses a method comprising the use of a microorganism with an ability to produce commercially useful amounts of glucosamine in a fermentation process (i.e., preferably an enhanced ability to produce glucosamine compared to a wild-type microorganism cultured under the same conditions).
This method is achieved by the genetic modification of one or more genes encoding a protein involved in an amino sugar metabolic pathway which results in the production (expression) of a protein having an altered (e.g., increased or decreased) function as compared to the corresponding wild-type protein. Such an altered function enhances the ability of the genetically engineered microorganism to produce glucosamine. It will be appreciated by those of skill in the art that production of genetically modified microorganisms having a particular altered function as described elsewhere herein (e.g., an enhanced ability to produce glucosamine-6-phosphate) such as by the specific selection techniques described in the Examples, can produce many organisms meeting the given functional requirement, albeit by virtue of a variety of different genetic modifications. For example, different random nucleotide deletions and/or substitutions in a given nucleic acid sequence may all give rise to the same phenotypic result (e.g., decreased action of the protein encoded by the sequence). The present invention contemplates any such genetic 5 modification which results in the production of a microorganism having the characteristics set forth herein.
For a variety of microorganisms, many of the amino sugar metabolic pathways have been elucidated. In particular, pathways for the biosynthesis and catabolism of glucosamine and N-acetylglucosamine and their phosphorylated derivatives have been 10 elucidated in Escherichia coll. These pathways include the multiple transport systems for the utilization of these amino sugars as carbon sources. Genes encoding the enzymes and proteins directly related to the transport, catabolism and biosynthesis of amino sugars in Escherichia coli have been cloned and sequenced. In addition, mutant strains of Escherichia coli blocked in substantially every step of amino sugar metabolism have been
A genetically modified microorganism useful in a method of the present invention typically has at least one modified gene involved in at least one amino sugar metabolic pathway which results in (a) reduced ability to convert glucosamine-6-phosphate into other compounds (i.e., inhibition of glucosamine-6-phosphate catabolic or anabolic pathways), (b) an enhanced ability to produce (i.e., synthesize) glucosamine-6-phosphate, (c) a reduced ability to transport glucosamine into the cell, (d) an enhanced ability to transport glucosamine-6-phosphate or glucosamine out of the cell, and/or (e) a reduced ability to use substrates involved in the production of glucosamine-6-P for competing biochemical reactions.
It is to be understood that the present invention discloses a method comprising the use of a microorganism with an ability to produce commercially useful amounts of glucosamine in a fermentation process (i.e., preferably an enhanced ability to produce glucosamine compared to a wild-type microorganism cultured under the same conditions).
This method is achieved by the genetic modification of one or more genes encoding a protein involved in an amino sugar metabolic pathway which results in the production (expression) of a protein having an altered (e.g., increased or decreased) function as compared to the corresponding wild-type protein. Such an altered function enhances the ability of the genetically engineered microorganism to produce glucosamine. It will be appreciated by those of skill in the art that production of genetically modified microorganisms having a particular altered function as described elsewhere herein (e.g., an enhanced ability to produce glucosamine-6-phosphate) such as by the specific selection techniques described in the Examples, can produce many organisms meeting the given functional requirement, albeit by virtue of a variety of different genetic modifications. For example, different random nucleotide deletions and/or substitutions in a given nucleic acid sequence may all give rise to the same phenotypic result (e.g., decreased action of the protein encoded by the sequence). The present invention contemplates any such genetic 5 modification which results in the production of a microorganism having the characteristics set forth herein.
For a variety of microorganisms, many of the amino sugar metabolic pathways have been elucidated. In particular, pathways for the biosynthesis and catabolism of glucosamine and N-acetylglucosamine and their phosphorylated derivatives have been 10 elucidated in Escherichia coll. These pathways include the multiple transport systems for the utilization of these amino sugars as carbon sources. Genes encoding the enzymes and proteins directly related to the transport, catabolism and biosynthesis of amino sugars in Escherichia coli have been cloned and sequenced. In addition, mutant strains of Escherichia coli blocked in substantially every step of amino sugar metabolism have been
15 isolated. The known pathways for amino sugar metabolism for Escherichia coli are illustrated in Fig. 1.
As will be discussed in detail below, even though many of the pathways and genes involved in the amino sugar metabolic pathways have been elucidated, until the present invention, it was not known which of the many possible genetic modifications would be necessary to generate a microorganism that can produce commercially significant amounts of glucosamine. Indeed, the present inventors are the first to design and engineer a glucosamine-producing microorganism that has glucosamine production capabilities that far exceed the glucosamine production capability of any known wild-type or mutant microorganism. The present inventors are also the first to appreciate that such a genetically modified microorganism is useful in a method to produce glucosamine for commercial use.
A microorganism to be used in the fermentation method of the present invention is preferably a bacterium or a yeast. More preferably, such a microorganism is a bacterium of the genus Escherichia. Escherichia coli is the most preferred microorganism to use in the fermentation method of the present invention.
Particularly preferred strains ofEscherichia coil include K-12, B and W, and most preferably, K-12.
As will be discussed in detail below, even though many of the pathways and genes involved in the amino sugar metabolic pathways have been elucidated, until the present invention, it was not known which of the many possible genetic modifications would be necessary to generate a microorganism that can produce commercially significant amounts of glucosamine. Indeed, the present inventors are the first to design and engineer a glucosamine-producing microorganism that has glucosamine production capabilities that far exceed the glucosamine production capability of any known wild-type or mutant microorganism. The present inventors are also the first to appreciate that such a genetically modified microorganism is useful in a method to produce glucosamine for commercial use.
A microorganism to be used in the fermentation method of the present invention is preferably a bacterium or a yeast. More preferably, such a microorganism is a bacterium of the genus Escherichia. Escherichia coli is the most preferred microorganism to use in the fermentation method of the present invention.
Particularly preferred strains ofEscherichia coil include K-12, B and W, and most preferably, K-12.
16 Although Escherichia coil is most preferred, it is to be understood that any microorganism that produces glucosamine and can be genetically modified to enhance production of glucosamine can be used in the method of the present invention.
A
microorganism for use in the fermentation method of the present invention can also be referred to as a production organism.
The amino sugar metabolic pathways ofthe microorganism, Escherichia co1i, will be addressed as specific embodiments of the present invention are described below. It will be appreciated that other microorganisms and in particular, other bacteria, have similar amino sugar metabolic pathways and genes and proteins having similar structure and function within such pathways. As such, the principles discussed below with regard to Escherichia coli are applicable to other microorganisms.
In one embodiment ofthe present invention, a genetically modified microorganism includes a microorganism which has an enhanced ability to synthesize glucosamine-6-phosphate. According to the present invention, "an enhanced ability to synthesize" a product refers to any enhancement, or up-regulation, in an amino sugar metabolic pathway related to the synthesis of the product such that the microorganism produces an increased amount of the product compared to the wild-type microorganism cultured under the same conditions. In one embodiment of the present invention, enhancement of the ability of a microorganism to synthesize glucosamine-6-phosphate is accomplished by amplification of the expression of the glucose-6-phosphate synthase gene, which in Escherichia coil is the g1mS gene, the product of which is glucosamine-6-phosphate synthase.
Glucosamine-6-phosphate synthase catalyzes the reaction in which fructose-6-phosphate and glutamine form glucosamine-6-phosphate and glutamic acid. Amplification of the expression of glucosamine-6-phosphate synthase can be accomplished inEscherichia coli, for example, by introduction of a recombinant nucleic acid molecule encoding the glmS gene.
Overexpression of glmS is crucial for the intracellular accumulation of glucosamine-6-phosphate and ultimately for production of glucosamine, since the level of glucosamine-6-phosphate synthase in the cell will control the redirection of carbon flow away from glycolysis and into glucosamine-6-phosphate synthesis. The glmS gene is located at 84 min on the Escherichia coli chromosome, and sequence analysis of this region of the chromosome reveals that g1mS resides in an operon with the glmU
gene,
A
microorganism for use in the fermentation method of the present invention can also be referred to as a production organism.
The amino sugar metabolic pathways ofthe microorganism, Escherichia co1i, will be addressed as specific embodiments of the present invention are described below. It will be appreciated that other microorganisms and in particular, other bacteria, have similar amino sugar metabolic pathways and genes and proteins having similar structure and function within such pathways. As such, the principles discussed below with regard to Escherichia coli are applicable to other microorganisms.
In one embodiment ofthe present invention, a genetically modified microorganism includes a microorganism which has an enhanced ability to synthesize glucosamine-6-phosphate. According to the present invention, "an enhanced ability to synthesize" a product refers to any enhancement, or up-regulation, in an amino sugar metabolic pathway related to the synthesis of the product such that the microorganism produces an increased amount of the product compared to the wild-type microorganism cultured under the same conditions. In one embodiment of the present invention, enhancement of the ability of a microorganism to synthesize glucosamine-6-phosphate is accomplished by amplification of the expression of the glucose-6-phosphate synthase gene, which in Escherichia coil is the g1mS gene, the product of which is glucosamine-6-phosphate synthase.
Glucosamine-6-phosphate synthase catalyzes the reaction in which fructose-6-phosphate and glutamine form glucosamine-6-phosphate and glutamic acid. Amplification of the expression of glucosamine-6-phosphate synthase can be accomplished inEscherichia coli, for example, by introduction of a recombinant nucleic acid molecule encoding the glmS gene.
Overexpression of glmS is crucial for the intracellular accumulation of glucosamine-6-phosphate and ultimately for production of glucosamine, since the level of glucosamine-6-phosphate synthase in the cell will control the redirection of carbon flow away from glycolysis and into glucosamine-6-phosphate synthesis. The glmS gene is located at 84 min on the Escherichia coli chromosome, and sequence analysis of this region of the chromosome reveals that g1mS resides in an operon with the glmU
gene,
17 which encodes the bifunctional enzyme, glucosamine-1-phosphate acetyltransferase N
acetylglucosamine-1-phosphate uridyltransferase. Glucosamine-1-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase functions within the amino sugar metabolic pathway in which glucosamine-6-phosphate is incorporated, through a series of biochemical reactions, into macromolecules. No obvious promoter sequence is detected upstream ofglmS; transcription of theglmUSoperon is initiated from two promoter sequences upstream of glmU Thus, it is preferred that the glmS
gene be cloned under control of an artificial promoter. The promoter can be any suitable promoter that will provide a level of gimS expression required to maintain a sufficient level of glucosamine-6-phosphate synthase in the production organism. Preferred promoters are constitutive (rather than inducible) promoters, since the need for addition of expensive inducers is therefore obviated. Such promoters include normally inducible promoter systems that have been made functionally constitutive or "leaky" by genetic modification, such as by using a weaker, mutant repressor gene. Particularly preferred promoters to be used with gimS are lac, APL and T7. The gene dosage (copy number) of glmS can be varied according to the requirements for maximum product formation. In one embodiment, the recombinant glmS gene is integrated into the E. coli chromosome.
Therefore, it is an embodiment of the present invention to provide a microorganism, such as E. coil, which is transformed with a recombinant nucleic acid molecule comprising a nucleic acid sequence encoding a glucosamine-6-phosphate synthase, which in E. coil, for example, is encoded by the glmS gene.
Preferred recombinant nucleic acid molecules comprising such a nucleic acid sequence include recombinant nucleic acid molecules comprising a nucleic acid sequence which encodes a glucosamine-6-phosphate synthase comprising an amino acid sequence SEQ ID
NO:16.
Other preferred recombinant nucleic acid molecules of the present invention include nucleic acid molecules which comprise a nucleic acid sequence selected from the group of SEQ ID NO: 13, SEQ ID NO: 14 and/or SEQ ID NO: 15. Particularly preferred recombinant nucleic acid molecules ofthe present invention include nucleic acid molecules comprising nucleic acid molecules nglmS-282154 and/or nglmS-281830. One recombinant molecule of the present invention, referred to herein as plasmid pKLN23-28, includes SEQ ID NOs:13, 14 and 15 and is particularly useful for expressing glucosamine-
acetylglucosamine-1-phosphate uridyltransferase. Glucosamine-1-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase functions within the amino sugar metabolic pathway in which glucosamine-6-phosphate is incorporated, through a series of biochemical reactions, into macromolecules. No obvious promoter sequence is detected upstream ofglmS; transcription of theglmUSoperon is initiated from two promoter sequences upstream of glmU Thus, it is preferred that the glmS
gene be cloned under control of an artificial promoter. The promoter can be any suitable promoter that will provide a level of gimS expression required to maintain a sufficient level of glucosamine-6-phosphate synthase in the production organism. Preferred promoters are constitutive (rather than inducible) promoters, since the need for addition of expensive inducers is therefore obviated. Such promoters include normally inducible promoter systems that have been made functionally constitutive or "leaky" by genetic modification, such as by using a weaker, mutant repressor gene. Particularly preferred promoters to be used with gimS are lac, APL and T7. The gene dosage (copy number) of glmS can be varied according to the requirements for maximum product formation. In one embodiment, the recombinant glmS gene is integrated into the E. coli chromosome.
Therefore, it is an embodiment of the present invention to provide a microorganism, such as E. coil, which is transformed with a recombinant nucleic acid molecule comprising a nucleic acid sequence encoding a glucosamine-6-phosphate synthase, which in E. coil, for example, is encoded by the glmS gene.
Preferred recombinant nucleic acid molecules comprising such a nucleic acid sequence include recombinant nucleic acid molecules comprising a nucleic acid sequence which encodes a glucosamine-6-phosphate synthase comprising an amino acid sequence SEQ ID
NO:16.
Other preferred recombinant nucleic acid molecules of the present invention include nucleic acid molecules which comprise a nucleic acid sequence selected from the group of SEQ ID NO: 13, SEQ ID NO: 14 and/or SEQ ID NO: 15. Particularly preferred recombinant nucleic acid molecules ofthe present invention include nucleic acid molecules comprising nucleic acid molecules nglmS-282154 and/or nglmS-281830. One recombinant molecule of the present invention, referred to herein as plasmid pKLN23-28, includes SEQ ID NOs:13, 14 and 15 and is particularly useful for expressing glucosamine-
18 phosphate synthase in a microorganism, The above identified nucleic acid molecules represent nucleic acid molecules comprising wild-type (i.e., naturally occurring or endogenous) nucleic acid sequences encoding glucosamine-6-phosphate synthase proteins. Genetically modified nucleic acid molecules which include nucleic acid sequences encoding homologues (i.e., modified and/or mutated) glucosamine-6-phosphate synthase proteins are also encompassed by the present invention and are described in detail below.
The reported Km's of glucosamine-6-phosphate synthase from Rscherichia coil are 2mM and 0.4mM for fructose-6-phosphate and glutamine, respectively. These are relatively high values (i.e., the affinity of the enzyme for its substrates is rather weak).
It is therefore another embodiment of the present invention to provide a microorganism having a glucosamine-6-phosphate synthase with improved affinity for its substrates. A glucosamine-6-phosphate synthase with improved affinity for its substrates can be produced by any suitable method of genetic modification or protein engineering. For example, computer-based protein engineering can be used to design a glucosamine-6-phosphate syntbase protein with greater stability and bcttcr affinity for its substrate. See for example, Maulrk et al, 1997, Molecular Biotechnology:
Therapeutic Applications and Strategies, Wiley-Liss, Inc.
White (1968, Biochem. 1, 106:847-858) first demonstrated that glucosamine-6-phosphate synthase was inhibited by glucosamine-6-phosphate. The present inventors determined that this inhibition was a key factor which limits glucosamine accumulation in glucosamine production strains of the present invention, which have been designed for commercial use. Therefore, it is yet another embodiment of the present invention to provide a microorganism having a glucosamine-6-phosphate synthase with reduced glucosamine-6-phosphate product feedback inhibition. A glucosamine-6-phosphate synthase with reduced product inhibition can be a mutated (i.e., genetically modified) glucosamine-6-phosphate synthase gene, for example, and can be produced by any suitable method of genetic modification. For example, a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase can be modified by any method for inserting, deleting, and/or substituting nucleotides, such as by error-prone PCR. In this method, the gene is amplified under conditions that lead to a high frequency of misincorporation errors by the DNA polymerise used for the amplification. As a result, a high frequency of ( 224320.DOC.1)
The reported Km's of glucosamine-6-phosphate synthase from Rscherichia coil are 2mM and 0.4mM for fructose-6-phosphate and glutamine, respectively. These are relatively high values (i.e., the affinity of the enzyme for its substrates is rather weak).
It is therefore another embodiment of the present invention to provide a microorganism having a glucosamine-6-phosphate synthase with improved affinity for its substrates. A glucosamine-6-phosphate synthase with improved affinity for its substrates can be produced by any suitable method of genetic modification or protein engineering. For example, computer-based protein engineering can be used to design a glucosamine-6-phosphate syntbase protein with greater stability and bcttcr affinity for its substrate. See for example, Maulrk et al, 1997, Molecular Biotechnology:
Therapeutic Applications and Strategies, Wiley-Liss, Inc.
White (1968, Biochem. 1, 106:847-858) first demonstrated that glucosamine-6-phosphate synthase was inhibited by glucosamine-6-phosphate. The present inventors determined that this inhibition was a key factor which limits glucosamine accumulation in glucosamine production strains of the present invention, which have been designed for commercial use. Therefore, it is yet another embodiment of the present invention to provide a microorganism having a glucosamine-6-phosphate synthase with reduced glucosamine-6-phosphate product feedback inhibition. A glucosamine-6-phosphate synthase with reduced product inhibition can be a mutated (i.e., genetically modified) glucosamine-6-phosphate synthase gene, for example, and can be produced by any suitable method of genetic modification. For example, a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase can be modified by any method for inserting, deleting, and/or substituting nucleotides, such as by error-prone PCR. In this method, the gene is amplified under conditions that lead to a high frequency of misincorporation errors by the DNA polymerise used for the amplification. As a result, a high frequency of ( 224320.DOC.1)
19 mutations are obtained in the PCR products. This method is described in detail in Example 5. The resulting glucosamine-6-phosphate synthase gene mutants can then be screened for reduced product inhibition by testing the mutant genes for the ability to confer increased glucosamine production onto a test microorganism, as compared to a microorganism carrying the non-mutated recombinant glucosamine-6-phosphate synthase nucleic acid molecule. It should be noted that decreased product inhibition of glucosamine-6 -phosphate synthase typically results in a glucosamine-6-phosphate synthase with increased action, even when the specific activity of the enzyme is remains the same, or actually is decreased, relative to a naturally occurring glucosamine-6-phosphate enzyme. Therefore, it is an embodiment of the present invention to produce a genetically modified glucosamine-6-phosphate synthase with increased action and increased in vivo enzymatic activity, which has unmodified or even decreased specific activity as compared to a naturally occurring glucosamine-6-phosphate synthase. Also encompassed by the present invention are genetically modified glucosamine-6-phosphate syntheses with increased specific activity.
Therefore, it is an embodiment of the present invention to provide a microorganism, such as E. coil, which is transformed with a genetically modified recombinant nucleic acid molecule comprising a nucleic acid sequence encoding a mutant, or homologue, glucosamine-6-phosphate synthase protein. Such glucosamine-6-phosphate synthase proteins can be referred to herein as glucosamine-6-phosphate synthase homologues. Protein homologues are described in detail below.
Preferred recombinant nucleic acid molecules comprising such a nucleic acid sequence include recombinant nucleic acid molecules comprising a nucleic acid sequence which encodes a glucosamine-6-phosphate synthase comprising an amino acid sequence selected from the group of SEQ ID NO: 19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28 and/or SEQ
ID NO:3 1. Other preferred recombinant nucleic acid molecules comprise a nucleic acid sequence selected from the group of SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID
NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ
ID NO:29 and/or SEQ IDNO:30. Particularly preferred genetically modified recombinant nucleic acid molecules useful in the present invention include nucleic acid molecules comprising nucleic acid molecules selected from the group of nglmS-492184, nglmS-491830, nglmS-542184, nglmS-541830i nglmS-1242184, nglmS-1241,0i nglmS-1492184, nglmS-1491830, nglmS-1512184 and nglmS-1511830. Plasmids pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149 and pKLN23-151 are recombinant nucleic acid molecules of the present invention which are particularly useful for expressing glucosamine-6-phosphate synthase 5 homologues in a microorganism.
An adequate intracellular supply ofglutamine (Gln) is critical for the glucosamine-6-phosphate synthase reaction. Inspection of the synthetic and degradative pathways for glucosamine-6-phosphate reveals the presence of a potential futile cycle whereby continuous interconversion offiuctose-6-phosphate and glucosamine-6-phosphate results 10 in wasteful depletion of glutamine. In one embodiment of the present invention, the supply of glutamine can be increased either by genetic modification of the production organism to increase glutamine production in the cell, or by modifying the fermentation medium (i.e., adding glutamine to the fermentation medium), to ensure that the supply of glutamine will not limit the production of glucosamine-6-phosphate.
15 In another embodiment of the present invention, the potential futile cycling of fructose-6-phosphate and glucosamine-6-phosphate is addressed by inhibiting, or blocking, the reverse reaction in which glucosamine-6-phosphate is converted into fructose-6-phosphate. In this embodiment, a microorganism is genetically modified to have an inactivation or deletion of the gene which catalyzes this conversion, glucosamine-
Therefore, it is an embodiment of the present invention to provide a microorganism, such as E. coil, which is transformed with a genetically modified recombinant nucleic acid molecule comprising a nucleic acid sequence encoding a mutant, or homologue, glucosamine-6-phosphate synthase protein. Such glucosamine-6-phosphate synthase proteins can be referred to herein as glucosamine-6-phosphate synthase homologues. Protein homologues are described in detail below.
Preferred recombinant nucleic acid molecules comprising such a nucleic acid sequence include recombinant nucleic acid molecules comprising a nucleic acid sequence which encodes a glucosamine-6-phosphate synthase comprising an amino acid sequence selected from the group of SEQ ID NO: 19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28 and/or SEQ
ID NO:3 1. Other preferred recombinant nucleic acid molecules comprise a nucleic acid sequence selected from the group of SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID
NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ
ID NO:29 and/or SEQ IDNO:30. Particularly preferred genetically modified recombinant nucleic acid molecules useful in the present invention include nucleic acid molecules comprising nucleic acid molecules selected from the group of nglmS-492184, nglmS-491830, nglmS-542184, nglmS-541830i nglmS-1242184, nglmS-1241,0i nglmS-1492184, nglmS-1491830, nglmS-1512184 and nglmS-1511830. Plasmids pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149 and pKLN23-151 are recombinant nucleic acid molecules of the present invention which are particularly useful for expressing glucosamine-6-phosphate synthase 5 homologues in a microorganism.
An adequate intracellular supply ofglutamine (Gln) is critical for the glucosamine-6-phosphate synthase reaction. Inspection of the synthetic and degradative pathways for glucosamine-6-phosphate reveals the presence of a potential futile cycle whereby continuous interconversion offiuctose-6-phosphate and glucosamine-6-phosphate results 10 in wasteful depletion of glutamine. In one embodiment of the present invention, the supply of glutamine can be increased either by genetic modification of the production organism to increase glutamine production in the cell, or by modifying the fermentation medium (i.e., adding glutamine to the fermentation medium), to ensure that the supply of glutamine will not limit the production of glucosamine-6-phosphate.
15 In another embodiment of the present invention, the potential futile cycling of fructose-6-phosphate and glucosamine-6-phosphate is addressed by inhibiting, or blocking, the reverse reaction in which glucosamine-6-phosphate is converted into fructose-6-phosphate. In this embodiment, a microorganism is genetically modified to have an inactivation or deletion of the gene which catalyzes this conversion, glucosamine-
20 6-phosphate deaminase, which in Escherichia coil is the nagB gene. nagB is one of several nag genes which are part of the nag regulon. The nag genes involved in the degradation ofglucosamine and N-acetyl-glucosamine exist as a regulon located at 15 min on the Escherichia coli chromosome. In another embodiment, the entire nag regulon is inactivated or deleted. The advantages of deleting the entire nag regulon are discussed in detail below.
As discussed above, overproduction ofglucosamine-6-phosphate synthase results in diversion of fructose-6-phosphate synthesis to glucosamine-6-phosphate synthesis.
However, many other enzymes can compete for the substrate, fructose-6-phosphate.
Therefore, one embodiment of the present invention includes a microorganism in which these competitive side reactions are blocked. In a preferred embodiment, a microorganism having complete or partial inactivation of the gene encoding phosphofructokinase is
As discussed above, overproduction ofglucosamine-6-phosphate synthase results in diversion of fructose-6-phosphate synthesis to glucosamine-6-phosphate synthesis.
However, many other enzymes can compete for the substrate, fructose-6-phosphate.
Therefore, one embodiment of the present invention includes a microorganism in which these competitive side reactions are blocked. In a preferred embodiment, a microorganism having complete or partial inactivation of the gene encoding phosphofructokinase is
21 provided. The second step in the glycolytic pathway is the conversion of fructose-6-phosphate to fructose- 1,6-diphosphate by phosphofructokinase, which inEscherichia coil exists as two isozymes encoded by the pjM and pfkB genes. Complete or partial inactivation of either the pfkA or pfkB genes slows the entry of fructose-6-phosphate into the glycolytic pathway and enhances the conversion of fructose-6-phosphate to glucosamine-6-phosphate. As used herein, inactivation of a gene can refer to any modification of a gene which results in a decrease in the activity (i.e., expression or function) of such a gene, including attenuation of activity or complete deletion of activity.
In a further embodiment of the present invention, a genetically modified microorganism has a decreased ability to convert glucosamine-6-phosphate into other compounds. Inactivation of glucosamine-6-phosphate deaminase, as described above, represents one such modification, however, glucosamine-6-phosphate serves as a substrate for other biochemical reactions. The first committed step in the pathway leading to production of macromolecules such as lipopolysaccharide and peptidoglycan in Escherichia coil is the conversion of glucosamine-6-phosphate to glucosamine-l-phosphate by phosphoglucosamine mutase, which in Escherichia coil is the product of the glmMgene. The involvement of this enzyme activity in the pathway oflipopolysaccharide and peptidoglycan biosynthesis was recently confirmed with the cloning oftheglmMgene.
Consequently, the regulation ofglmMgene, and its cognate product, phosphoglucosamine mutase, has not been studied in detail. It has been shown, however, that the phosphoglucosamine mutase, like all other hexosephosphate mutase enzymes studied, is regulated by phosphorylation. This type of regulation at the enzyme level is typically exquisitely sensitive to levels of the pathway end products. Thus, carbon flow through phosphoglucosamine mutase can be self-regulating and may not be a problem as glucosamine-6-phosphate accumulates. Since the sequence of the glmMgene is known, however, it is a preferred embodiment ofthe present invention to provide a microorganism in which the gene encoding phosphoglucosamine mutase is interrupted or deleted. More preferably, the gene encoding phosphoglucosamine mutase is down-regulated, but not completely inactivated, by a mutation, so as not to completely block the biosynthesis of the critical cell envelope components.
In a further embodiment of the present invention, a genetically modified microorganism has a decreased ability to convert glucosamine-6-phosphate into other compounds. Inactivation of glucosamine-6-phosphate deaminase, as described above, represents one such modification, however, glucosamine-6-phosphate serves as a substrate for other biochemical reactions. The first committed step in the pathway leading to production of macromolecules such as lipopolysaccharide and peptidoglycan in Escherichia coil is the conversion of glucosamine-6-phosphate to glucosamine-l-phosphate by phosphoglucosamine mutase, which in Escherichia coil is the product of the glmMgene. The involvement of this enzyme activity in the pathway oflipopolysaccharide and peptidoglycan biosynthesis was recently confirmed with the cloning oftheglmMgene.
Consequently, the regulation ofglmMgene, and its cognate product, phosphoglucosamine mutase, has not been studied in detail. It has been shown, however, that the phosphoglucosamine mutase, like all other hexosephosphate mutase enzymes studied, is regulated by phosphorylation. This type of regulation at the enzyme level is typically exquisitely sensitive to levels of the pathway end products. Thus, carbon flow through phosphoglucosamine mutase can be self-regulating and may not be a problem as glucosamine-6-phosphate accumulates. Since the sequence of the glmMgene is known, however, it is a preferred embodiment ofthe present invention to provide a microorganism in which the gene encoding phosphoglucosamine mutase is interrupted or deleted. More preferably, the gene encoding phosphoglucosamine mutase is down-regulated, but not completely inactivated, by a mutation, so as not to completely block the biosynthesis of the critical cell envelope components.
22 Another pathway which results in the conversion of glucosamine-6-phosphate to another compound is catalyzed by the enzyme, N-acetylglucosamine-6-phosphate deacetylase. N-acetylglucosamine-6-phosphate deacetylase is capable of catalyzing the reverse reaction of converting glucosamine-6-phosphate (plus acetyl CoA) to N-acetyl-glucosamine-6-phosphate. This could result in futile cycling of glucosamine-6-phosphate and N-acetyl-glucosamine-6-phosphate and result in a product composed of a mixture of glucosamine and N-acetyl-glucosamine. Therefore, it is a further embodiment of the present invention to provide a genetically modified microorganism in which the gene encoding N-acetylglucosamine-6-phosphate deacetylase, which is the nagA gene in Escherichia coil, is inactivated.
It is a further embodiment of the present invention to inactivate the transport systems for glucosamine in a microorganism such that once the glucosamine is excreted by the cell it is not taken back up. This modification is helpful for avoiding a high intracellular level ofglucosamine which could be toxic to the cells, and facilitates recovery of the product, since the product remains extracellular. In a preferred embodiment of the present invention, the transportation systems for glucosamine are inactivated to keep glucosamine outside of the microorganism once it is excreted by the microorganism.
During growth of Escherichia coil on glucosamine as sole carbon source, glucosamine is transported into the cell by the PEP:mannose phosphotransferase (PTS) system, which is not only capable of transporting glucosamine into the cell, but is also induced by glucosamine. It is therefore an embodiment of the present invention to provide a microorganism lacking the ability to transport glucosamine into the cell. For example, a manXYZ mutant (i.e., an Escherichia coil lacking or having a mutation in the genes encoding EIIM,P/M ' of the PEP:mannose PTS) can not transport glucosamine into the cell by this mechanism. The PEP:glucose PTS ofEscherichia coli, on the other hand, is capable of transporting both glucose and glucosamine into the cell, but glucosamine cannot induce this system. Thus, in order to grow a manXYZ mutant on glucosamine, the cells must first be grown on glucose to induce expression of the (alternate) glucose transport system and allow glucose (the preferred carbon source) to be transported into the cell. These induced cells are then capable of transporting glucosamine into the cell via the glucose transporter. A similar situation exists for transport of glucosamine by the
It is a further embodiment of the present invention to inactivate the transport systems for glucosamine in a microorganism such that once the glucosamine is excreted by the cell it is not taken back up. This modification is helpful for avoiding a high intracellular level ofglucosamine which could be toxic to the cells, and facilitates recovery of the product, since the product remains extracellular. In a preferred embodiment of the present invention, the transportation systems for glucosamine are inactivated to keep glucosamine outside of the microorganism once it is excreted by the microorganism.
During growth of Escherichia coil on glucosamine as sole carbon source, glucosamine is transported into the cell by the PEP:mannose phosphotransferase (PTS) system, which is not only capable of transporting glucosamine into the cell, but is also induced by glucosamine. It is therefore an embodiment of the present invention to provide a microorganism lacking the ability to transport glucosamine into the cell. For example, a manXYZ mutant (i.e., an Escherichia coil lacking or having a mutation in the genes encoding EIIM,P/M ' of the PEP:mannose PTS) can not transport glucosamine into the cell by this mechanism. The PEP:glucose PTS ofEscherichia coli, on the other hand, is capable of transporting both glucose and glucosamine into the cell, but glucosamine cannot induce this system. Thus, in order to grow a manXYZ mutant on glucosamine, the cells must first be grown on glucose to induce expression of the (alternate) glucose transport system and allow glucose (the preferred carbon source) to be transported into the cell. These induced cells are then capable of transporting glucosamine into the cell via the glucose transporter. A similar situation exists for transport of glucosamine by the
23 PEP:fructose PTS, although in this case glucosamine transport by the enzyme 1T
" is poor.
Methods to inhibit these secondary glucosamine transport pathways are discussed below.
It is yet another embodiment of the present invention to provide a microorganism having a decreased function in the PEP:glucose PTS (described above). Such a modification may be necessary to avoid "reabsorption of glucosamine from the culture medium.
For example, aptsG mutant (i.e., an Escherichia coli lacking or having a mutation in the genes encoding enzyme 1r' of the PEP:glucose PTS). Since such microorganisms will have reduced ability to grow using glucose as a carbon source, such organisms can be further genetically modified to take up glucose by a PEP:glucose PTS-independent mechanism.
It is has been shown, for example, that mutant microorganisms can be selected which are defective in the PEP:glucose PTS and still have an ability to grow on glucose (Flores et al., 1996, Nature Biotechnology 14:620-623).
DNA sequencing of the nag regulon in Escherichia coli reveals that the nagE
gene, encoding the N-acetyl-glucosamine-specific enzyme 11N'9 protein of the PEP: sugar phosphotransferase (PTS) system, which is involved in glucosamine transport into the cell, resides on one arm of the regulon and is transcribed divergently from the other nag genes (nagBACD) located on the other arm of the regulon. Therefore, another genetic modification that would result in decreased ability of an Escherichia coli to transport glucosamine into the cell is an inactivation or deletion of the nagE gene, or a gene encoding a similar enzyme in any microorganism used in a method of the present invention.
As discussed above, in one embodiment of the present invention, a genetically modified Escherichia coli microorganism useful in a method of the present invention has a deletion of the entire nag regulon. Deletion of the entire chromosomal nag regulon is preferred, because many genes which are deleterious to the production ofglucosamine-6-phosphate are inactivated together. The genes, nagA, nagB and nagE, have been discussed in detail above. The nagC gene encodes a regulatory protein that acts as a repressor of the nag regulon as well as both an activator and repressor of the gImUS
operon. The glm genes are discussed in detail above. The function of the nagD
gene is not known, but is believed to be related to amino sugar metabolism as it resides within the
" is poor.
Methods to inhibit these secondary glucosamine transport pathways are discussed below.
It is yet another embodiment of the present invention to provide a microorganism having a decreased function in the PEP:glucose PTS (described above). Such a modification may be necessary to avoid "reabsorption of glucosamine from the culture medium.
For example, aptsG mutant (i.e., an Escherichia coli lacking or having a mutation in the genes encoding enzyme 1r' of the PEP:glucose PTS). Since such microorganisms will have reduced ability to grow using glucose as a carbon source, such organisms can be further genetically modified to take up glucose by a PEP:glucose PTS-independent mechanism.
It is has been shown, for example, that mutant microorganisms can be selected which are defective in the PEP:glucose PTS and still have an ability to grow on glucose (Flores et al., 1996, Nature Biotechnology 14:620-623).
DNA sequencing of the nag regulon in Escherichia coli reveals that the nagE
gene, encoding the N-acetyl-glucosamine-specific enzyme 11N'9 protein of the PEP: sugar phosphotransferase (PTS) system, which is involved in glucosamine transport into the cell, resides on one arm of the regulon and is transcribed divergently from the other nag genes (nagBACD) located on the other arm of the regulon. Therefore, another genetic modification that would result in decreased ability of an Escherichia coli to transport glucosamine into the cell is an inactivation or deletion of the nagE gene, or a gene encoding a similar enzyme in any microorganism used in a method of the present invention.
As discussed above, in one embodiment of the present invention, a genetically modified Escherichia coli microorganism useful in a method of the present invention has a deletion of the entire nag regulon. Deletion of the entire chromosomal nag regulon is preferred, because many genes which are deleterious to the production ofglucosamine-6-phosphate are inactivated together. The genes, nagA, nagB and nagE, have been discussed in detail above. The nagC gene encodes a regulatory protein that acts as a repressor of the nag regulon as well as both an activator and repressor of the gImUS
operon. The glm genes are discussed in detail above. The function of the nagD
gene is not known, but is believed to be related to amino sugar metabolism as it resides within the
24 nag regulon. Thus, in Escherichia coif, a complete deletion of the nag regulon avoids catabolism of the initial intracellular product (glucosamine-6-phosphate) in a strain of Escherichia coil designed to overproduce glucosamine. A preferred Escherichia coil mutant strain having a deletion of the nag regulon is an Escherichia coil having a iinagEBACD::tc deletion/insertion.
With regard to activation of the gImUS operon (a function of nagC), although activation of the glmS gene, encoding glucosamine-6-phosphate synthase, is desirable, an increase in the level oftheglmUgene product, glucosamine-l-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase could be deleterious to accumulation of glucosamine-6-phosphate as it could lead to siphoning off of carbon flow toward cell envelope components. It is therefore an embodiment ofthe present invention to inactivate glucosamine-l-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase in a microorganism useful in a method of the present invention. In a microorganism in which the gImUS operon, or its equivalent, has been inactivated or deleted, it is a further embodiment of the present invention to genetically modify the microorganism by recombinantly producing the gene encoding glucosamine-6-phosphate synthase under control of an artificial promoter in the microorganism.
The initial intracellular product in the genetically modified microorganism described herein is glucosamine-6-phosphate. In many microorganisms, including Escherichia coil, glucosamine-6-phosphate is typically dephosphorylated to glucosamine prior to transport out of the cell. Nonetheless, it is yet another embodiment of the present invention to provide a microorganism which is genetically modified to have a suitable phosphatase activity for the conversion of glucosamine-6-phosphate to glucosamine.
Such a phosphatase can include, but is not limited to, for example, alkaline phosphatase.
In a preferred embodiment, such an Escherichia coli has an enhanced (i.e., increased) level of phosphatase activity (i.e., phosphatase action).
As noted above, in the method for production of glucosamine of the present invention, a microorganism having a genetically modified amino sugar metabolic pathway is cultured in a fermentation medium for production of glucosamine. An appropriate, or effective, fermentation medium refers to any medium in which a genetically modified microorganism of the present invention, when cultured, is capable of producing glucosamine. Such a medium is typically an aqueous medium comprising assimilable carbon, nitrogen and phosphate sources. Such a medium can also include appropriate salts, minerals, metals and other nutrients. One advantage of the genetic modifications to a microorganism described herein is that although such genetic modifications significantly 5 alter the metabolism of amino sugars, they do not create any nutritional requirements for the production organism. Thus, a minimal-salts medium containing glucose as the sole carbon source is preferably used as the fermentation medium. The use of a minimal-salts-glucose medium for the glucosamine fermentation will also facilitate recovery and purification of the glucosamine product.
10 Microorganisms of the present invention can be cultured in conventional fermentation bioreactors. The microorganisms can be cultured by any fermentation process which includes, but is not limited to, batch, fed-batch, cell recycle, and continuous fermentation. Preferably, microorganisms of the present invention are grown by batch or fed-batch fermentation processes.
15 In one embodiment of the present invention, before inoculation, the fermentation medium is brought up to the desired temperature, typically from about 20 C to about 40 C, preferably from about 25 C to about 40 C, with temperatures of from about 28'C
to about 37 C, and in some embodiments, about 30'C or about 370C being more preferred. The present inventors have discovered that glucosamine production in 20 microorganisms of the present invention transfected with a nucleic acid molecule under control of the T7-lac promoter (see Examples section) continues after growth has ceased when the microorganisms are cultured at 30 C, while at 37 C, growth and glucosamine production occur in concert. Growth at 37 C is slightly better than at 30 C, but glucosamine production at 30 C is significantly better than at 37 C. It is noted that the
With regard to activation of the gImUS operon (a function of nagC), although activation of the glmS gene, encoding glucosamine-6-phosphate synthase, is desirable, an increase in the level oftheglmUgene product, glucosamine-l-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase could be deleterious to accumulation of glucosamine-6-phosphate as it could lead to siphoning off of carbon flow toward cell envelope components. It is therefore an embodiment ofthe present invention to inactivate glucosamine-l-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase in a microorganism useful in a method of the present invention. In a microorganism in which the gImUS operon, or its equivalent, has been inactivated or deleted, it is a further embodiment of the present invention to genetically modify the microorganism by recombinantly producing the gene encoding glucosamine-6-phosphate synthase under control of an artificial promoter in the microorganism.
The initial intracellular product in the genetically modified microorganism described herein is glucosamine-6-phosphate. In many microorganisms, including Escherichia coil, glucosamine-6-phosphate is typically dephosphorylated to glucosamine prior to transport out of the cell. Nonetheless, it is yet another embodiment of the present invention to provide a microorganism which is genetically modified to have a suitable phosphatase activity for the conversion of glucosamine-6-phosphate to glucosamine.
Such a phosphatase can include, but is not limited to, for example, alkaline phosphatase.
In a preferred embodiment, such an Escherichia coli has an enhanced (i.e., increased) level of phosphatase activity (i.e., phosphatase action).
As noted above, in the method for production of glucosamine of the present invention, a microorganism having a genetically modified amino sugar metabolic pathway is cultured in a fermentation medium for production of glucosamine. An appropriate, or effective, fermentation medium refers to any medium in which a genetically modified microorganism of the present invention, when cultured, is capable of producing glucosamine. Such a medium is typically an aqueous medium comprising assimilable carbon, nitrogen and phosphate sources. Such a medium can also include appropriate salts, minerals, metals and other nutrients. One advantage of the genetic modifications to a microorganism described herein is that although such genetic modifications significantly 5 alter the metabolism of amino sugars, they do not create any nutritional requirements for the production organism. Thus, a minimal-salts medium containing glucose as the sole carbon source is preferably used as the fermentation medium. The use of a minimal-salts-glucose medium for the glucosamine fermentation will also facilitate recovery and purification of the glucosamine product.
10 Microorganisms of the present invention can be cultured in conventional fermentation bioreactors. The microorganisms can be cultured by any fermentation process which includes, but is not limited to, batch, fed-batch, cell recycle, and continuous fermentation. Preferably, microorganisms of the present invention are grown by batch or fed-batch fermentation processes.
15 In one embodiment of the present invention, before inoculation, the fermentation medium is brought up to the desired temperature, typically from about 20 C to about 40 C, preferably from about 25 C to about 40 C, with temperatures of from about 28'C
to about 37 C, and in some embodiments, about 30'C or about 370C being more preferred. The present inventors have discovered that glucosamine production in 20 microorganisms of the present invention transfected with a nucleic acid molecule under control of the T7-lac promoter (see Examples section) continues after growth has ceased when the microorganisms are cultured at 30 C, while at 37 C, growth and glucosamine production occur in concert. Growth at 37 C is slightly better than at 30 C, but glucosamine production at 30 C is significantly better than at 37 C. It is noted that the
25 optimum temperature for growth and glucosamine production by a microorganism of the present invention can vary according to a variety of factors. For example, the selection of a particular promoter for expression of a recombinant nucleic acid molecule in the microorganism can affect the optimum culture temperature. One of ordinary skill in the art can readily determine the optimum growth and glucosamine production temperature for any microorganism of the present invention using standard techniques, such as those described in the Examples section for one microorganism of the present invention.
26 The medium is inoculated with an actively growing culture of the genetically modified microorganism in an amount sufficient to produce, after a reasonable growth period, a high cell density. The cells are grown to a cell density of at least about 10 g/l, preferably between about 10 g/l and about 40 g/l, and more preferably at least about 40 g/l. This process typically requires about 10-60 hours.
Sufficient oxygen must be added to the medium during the course of the fermentation to maintain cell growth during the initial cell growth and to maintain metabolism and glucosamine production. Oxygen is conveniently provided by agitation and aeration of the medium. Conventional methods, such as stirring or shaking, may be used to agitate and aerate the medium. Preferably the oxygen concentration in the medium is greater than about 15% of the saturation value (i.e., the solubility of oxygen in the medium at atmospheric pressure and about 30-40 C) and more preferably greater than about 20% of the saturation value, although excursions to lower concentrations may occur if fermentation is not adversely affected. The oxygen concentration of the medium can be monitored by conventional methods, such as with an oxygen electrode.
Other sources of oxygen, such as undiluted oxygen gas and oxygen gas diluted with inert gas other than nitrogen, can be used.
Since the production of glucosamine by fermentation is preferably based on using glucose as the sole carbon source, in a preferred embodiment, in Escherichia coil, the PEP:glucose PTS will be induced. Accordingly, even in the absence of a functional EIIM,P/III' of the PEP:mannose PTS (e.g., in an Escherichia coil having a manXYZ
mutation), the product, glucosamine, will still be taken up by the cells via the induced glucose transport system. In the presence of excess glucose, however, uptake of glucosamine is severely repressed. Thus, it is one embodiment of the present invention to prevent uptake of the glucosamine product by maintaining an excess of glucose in the fermentation bioreactor. As used herein, "an excess" of glucose refers to an amount of glucose above that which is required to maintain the growth of the microorganism under normal conditions, such as the culturing conditions described above.
Preferably, the glucose concentration is maintained at a concentration of from about 0.5% to about 5%
weight/volume of the fermentation medium. In another embodiment, the glucose concentration is maintained at a concentration of from about 5 g/L to about 50 g/L of the
Sufficient oxygen must be added to the medium during the course of the fermentation to maintain cell growth during the initial cell growth and to maintain metabolism and glucosamine production. Oxygen is conveniently provided by agitation and aeration of the medium. Conventional methods, such as stirring or shaking, may be used to agitate and aerate the medium. Preferably the oxygen concentration in the medium is greater than about 15% of the saturation value (i.e., the solubility of oxygen in the medium at atmospheric pressure and about 30-40 C) and more preferably greater than about 20% of the saturation value, although excursions to lower concentrations may occur if fermentation is not adversely affected. The oxygen concentration of the medium can be monitored by conventional methods, such as with an oxygen electrode.
Other sources of oxygen, such as undiluted oxygen gas and oxygen gas diluted with inert gas other than nitrogen, can be used.
Since the production of glucosamine by fermentation is preferably based on using glucose as the sole carbon source, in a preferred embodiment, in Escherichia coil, the PEP:glucose PTS will be induced. Accordingly, even in the absence of a functional EIIM,P/III' of the PEP:mannose PTS (e.g., in an Escherichia coil having a manXYZ
mutation), the product, glucosamine, will still be taken up by the cells via the induced glucose transport system. In the presence of excess glucose, however, uptake of glucosamine is severely repressed. Thus, it is one embodiment of the present invention to prevent uptake of the glucosamine product by maintaining an excess of glucose in the fermentation bioreactor. As used herein, "an excess" of glucose refers to an amount of glucose above that which is required to maintain the growth of the microorganism under normal conditions, such as the culturing conditions described above.
Preferably, the glucose concentration is maintained at a concentration of from about 0.5% to about 5%
weight/volume of the fermentation medium. In another embodiment, the glucose concentration is maintained at a concentration of from about 5 g/L to about 50 g/L of the
27 fermentation medium, and even more preferably, from about 5 g/L to about 20 g/L of the fermentation medium. In one embodiment, the glucose concentration of the fermentation medium is monitored by any suitable method (e.g., by using glucose test strips), and when the glucose concentration is at or near depletion, additional glucose can be added to the medium. In another embodiment, the glucose concentration is maintained by semi-continuous or continuous feeding of the fermentation medium. The parameters disclosed herein for glucose can be applied to any carbon source used in the fermentation medium of the present invention. It is further understood that the carbon source can be allowed to reach undetectable levels for any appropriate amount of time during the fermentation if it enhances the glucosamine production process.
It is a further embodiment of the present invention to supplement and/or control other components and parameters of the fermentation medium, as necessary to maintain and/or enhance the production of glucosamine by a production organism. For example, in one embodiment, the fermentation medium includes ammonium sulfate, and the ammonium sulfate concentration in the culture medium is supplemented by the addition of excess ammonium sulfate. Preferably, the amount of ammonium sulfate is maintained at a level of from about 0.1% to about 1% (weight/volume) in the fermentation medium, and preferably, at about 0.5%. In yet another embodiment, the pH of the fermentation medium is monitored for fluctuations in pH. In the fermentation method of the present invention, the pH is preferably maintained at a pH of from about pH 6.0 to about pH 8.0, and more preferably, at about pH 7Ø In the method of the present invention, if the starting pH of the fermentation medium is pH 7.0, the pH of the fermentation medium is monitored for significant variations from pH 7.0, and is adjusted accordingly, for example, by the addition of sodium hydroxide.
A further embodiment of the present invention is to redirect carbon flux from acetate production to the production of less toxic byproducts. By such methods, problems of toxicity associated with an excess of glucose in the fermentation medium can be avoided. Methods to redirect carbon flux from acetate production are known in the art.
In a batch fermentation process of the present invention, fermentation is continued until the formation of glucosamine, as evidenced by the accumulation of extracellular
It is a further embodiment of the present invention to supplement and/or control other components and parameters of the fermentation medium, as necessary to maintain and/or enhance the production of glucosamine by a production organism. For example, in one embodiment, the fermentation medium includes ammonium sulfate, and the ammonium sulfate concentration in the culture medium is supplemented by the addition of excess ammonium sulfate. Preferably, the amount of ammonium sulfate is maintained at a level of from about 0.1% to about 1% (weight/volume) in the fermentation medium, and preferably, at about 0.5%. In yet another embodiment, the pH of the fermentation medium is monitored for fluctuations in pH. In the fermentation method of the present invention, the pH is preferably maintained at a pH of from about pH 6.0 to about pH 8.0, and more preferably, at about pH 7Ø In the method of the present invention, if the starting pH of the fermentation medium is pH 7.0, the pH of the fermentation medium is monitored for significant variations from pH 7.0, and is adjusted accordingly, for example, by the addition of sodium hydroxide.
A further embodiment of the present invention is to redirect carbon flux from acetate production to the production of less toxic byproducts. By such methods, problems of toxicity associated with an excess of glucose in the fermentation medium can be avoided. Methods to redirect carbon flux from acetate production are known in the art.
In a batch fermentation process of the present invention, fermentation is continued until the formation of glucosamine, as evidenced by the accumulation of extracellular
28 glucosamine, essentially ceases. The total fermentation time is typically from about 40 to about 60 hours, and more preferably, about 48 hours. In a continuous fermentation process, glucosamine can be removed from the bioreactor as it accumulates in the medium. The method of the present invention results in production of a product which can include intracellular or extracellular glucosamine-6-phosphate and intracellular or extracellular glucosamine.
The method of the present invention further includes recovering the product, which can be intracellular glucosamine-6-phosphate or extracellular glucosamine. The phrase "recovering glucosamine" refers simply to collecting the product from the fermentation bioreactor and need not imply additional steps of separation or purification.
For example, the step of recovering can refer to removing the entire culture (i.e., the microorganism and the fermentation medium) from the bioreactor, removing the fermentation medium containing extracellular glucosamine from the bioreactor, and/or removing the microorganism containing intracellular glucosamine-6-phosphate from the bioreactor. These steps can be followed by further purification steps.
Glucosamine is preferably recovered in substantially pure form. As used herein, "substantially pure" refers to a purity that allows for the effective use of the glucosamine as a nutriceutical compound for commercial sale. In one embodiment, the glucosamine product is preferably separated from the production organism and other fermentation medium constituents.
Methods to accomplish such separation are described below.
Preferably, by the method of the present invention, at least about 1 g/L of product (i.e., glucosamine and/or glucosamine-6-phosphate) are recovered from the microorganism and/or fermentation medium. More preferably, by the method of the present invention, at least about 5 g/L, and even more preferably, at least about 10 g/L, and even more preferably, at least about 20 g/L and even more preferably, at least about 50 g/L of product are recovered. In one embodiment, glucosamine product is recovered in an amount from about 1 g/L to about 50 g/L.
Typically, most ofthe glucosamine produced in the present process is extracellular.
The microorganism can be removed from the fermentation medium by conventional methods, such as by filtration or centrifugation. In one embodiment, the step of recovering the product includes the purification of glucosamine from the fermentation
The method of the present invention further includes recovering the product, which can be intracellular glucosamine-6-phosphate or extracellular glucosamine. The phrase "recovering glucosamine" refers simply to collecting the product from the fermentation bioreactor and need not imply additional steps of separation or purification.
For example, the step of recovering can refer to removing the entire culture (i.e., the microorganism and the fermentation medium) from the bioreactor, removing the fermentation medium containing extracellular glucosamine from the bioreactor, and/or removing the microorganism containing intracellular glucosamine-6-phosphate from the bioreactor. These steps can be followed by further purification steps.
Glucosamine is preferably recovered in substantially pure form. As used herein, "substantially pure" refers to a purity that allows for the effective use of the glucosamine as a nutriceutical compound for commercial sale. In one embodiment, the glucosamine product is preferably separated from the production organism and other fermentation medium constituents.
Methods to accomplish such separation are described below.
Preferably, by the method of the present invention, at least about 1 g/L of product (i.e., glucosamine and/or glucosamine-6-phosphate) are recovered from the microorganism and/or fermentation medium. More preferably, by the method of the present invention, at least about 5 g/L, and even more preferably, at least about 10 g/L, and even more preferably, at least about 20 g/L and even more preferably, at least about 50 g/L of product are recovered. In one embodiment, glucosamine product is recovered in an amount from about 1 g/L to about 50 g/L.
Typically, most ofthe glucosamine produced in the present process is extracellular.
The microorganism can be removed from the fermentation medium by conventional methods, such as by filtration or centrifugation. In one embodiment, the step of recovering the product includes the purification of glucosamine from the fermentation
29 medium. Glucosamine can be recovered from the cell-free fermentation medium by conventional methods, such as chromatography, extraction, crystallization (e.g., evaporative crystallization), membrane separation, reverse osmosis and distillation. In a preferred embodiment, glucosamine is recovered from the cell-free fermentation medium by crystallization. In another embodiment, the step of recovering the product includes the step of concentrating the extracellular glucosamine.
In one embodiment, glucosamine-6-phosphate accumulates intracellularly, the step of recovering the product includes isolating glucosamine-6-phosphate from the microorganism. For example, the product can be recovered by lysing the microorganism cells by a method which does not degrade the glucosamine product, centrifuging the lysate to remove insoluble cellular debris, and then recovering the glucosamine and/or glucosamine-6-phosphate product by a conventional method as described above.
The initial intracellular product in the genetically modified microorganism described herein is glucosamine-6-phosphate. It is generally accepted that phosphorylated intermediates are dephosphorylated during export from the microorganism, most likely due to the presence ofalkaline phosphatase in the periplasmic space ofthe microorganism.
In one embodiment of the present invention, glucosamine-6-phosphate is dephosphorylated before or during export from the cell by naturally occurring phosphatases in order to facilitate the production of the desired product, glucosamine.
In this embodiment, the need for amplification of a recombinantly provided phosphatase activity in the cell or treatment of the fermentation medium with a phosphatase is obviated. In another embodiment, the level of phosphatase in the production organism is increased by a method including, but not limited to, genetic modification of an endogenous phosphatase gene or by recombinant modification of the microorganism to express a phosphatase gene. In yet another embodiment, the recovered fermentation medium is treated with a phosphatase after glucosamine-6-phosphate is released into the medium, such as when cells are lysed as described above.
As noted above, the process of the present invention produces significant amounts ofextracellular glucosamine. In particular, the process produces extracellular glucosamine such that greater than about 50% of total glucosamine is extracellular, more preferably greater than about 75% of total glucosamine is extracellular, and most preferably greater than about 90% of total glucosamine is extracellular. By the method of the present invention, production of an extracellular glucosamine concentration can be achieved which is greater than about 1 g/l, more preferably greater than about 5 g/l, even more preferably greater than about 10 g/l, and even more preferably greater than about 20 g/L
and even 5 more preferably greater than about 50 g/l.
One embodiment of the present invention relates to a method to produce glucosamine by fermentation which includes the steps of (a) culturing an Escherichia coil having a genetically modified amino sugar metabolic pathway in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate to produce a product, 10 and (b) recovering the product. The product includes intracellular glucosamine-6-phosphate which is recovered from the Escherichia coil and/or extracellular glucosamine which is recovered from the fermentation medium.
One embodiment of the present invention relates to a microorganism for producing glucosamine by a biosynthetic process. The microorganism is transformed with a 15 recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence. The recombinant nucleic acid molecule has a genetic modification which reduces glucosamine-6-phosphate product inhibition of the glucosamine-6-phosphate synthase. Expression of the recombinant nucleic acid molecule increases expression of the glucosamine-6-phosphate synthase by 20 the microorganism. In a preferred embodiment, the recombinant nucleic acid molecule is integrated into the genome of the microorganism. In a further embodiment, the microorganism has at least one additional genetic modification in a gene encoding a protein selected from the group of N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'g, 25 phosphoglucosamine mutase, glucosamine-1-phosphate acetyltransferase-N-acetylglucosamine-1-phosphate uridyltransferase, phosphofiuctokinase, Enzyme IIo" of the PEP:glucose PTS, EIIM,P/III' ofthe PEP:mannose PTS, and/or a phosphatase.
The genetic modification decreases the action of the protein, except in the case of the phosphatase, in which the action of the phosphatase is preferably increased.
In another
In one embodiment, glucosamine-6-phosphate accumulates intracellularly, the step of recovering the product includes isolating glucosamine-6-phosphate from the microorganism. For example, the product can be recovered by lysing the microorganism cells by a method which does not degrade the glucosamine product, centrifuging the lysate to remove insoluble cellular debris, and then recovering the glucosamine and/or glucosamine-6-phosphate product by a conventional method as described above.
The initial intracellular product in the genetically modified microorganism described herein is glucosamine-6-phosphate. It is generally accepted that phosphorylated intermediates are dephosphorylated during export from the microorganism, most likely due to the presence ofalkaline phosphatase in the periplasmic space ofthe microorganism.
In one embodiment of the present invention, glucosamine-6-phosphate is dephosphorylated before or during export from the cell by naturally occurring phosphatases in order to facilitate the production of the desired product, glucosamine.
In this embodiment, the need for amplification of a recombinantly provided phosphatase activity in the cell or treatment of the fermentation medium with a phosphatase is obviated. In another embodiment, the level of phosphatase in the production organism is increased by a method including, but not limited to, genetic modification of an endogenous phosphatase gene or by recombinant modification of the microorganism to express a phosphatase gene. In yet another embodiment, the recovered fermentation medium is treated with a phosphatase after glucosamine-6-phosphate is released into the medium, such as when cells are lysed as described above.
As noted above, the process of the present invention produces significant amounts ofextracellular glucosamine. In particular, the process produces extracellular glucosamine such that greater than about 50% of total glucosamine is extracellular, more preferably greater than about 75% of total glucosamine is extracellular, and most preferably greater than about 90% of total glucosamine is extracellular. By the method of the present invention, production of an extracellular glucosamine concentration can be achieved which is greater than about 1 g/l, more preferably greater than about 5 g/l, even more preferably greater than about 10 g/l, and even more preferably greater than about 20 g/L
and even 5 more preferably greater than about 50 g/l.
One embodiment of the present invention relates to a method to produce glucosamine by fermentation which includes the steps of (a) culturing an Escherichia coil having a genetically modified amino sugar metabolic pathway in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate to produce a product, 10 and (b) recovering the product. The product includes intracellular glucosamine-6-phosphate which is recovered from the Escherichia coil and/or extracellular glucosamine which is recovered from the fermentation medium.
One embodiment of the present invention relates to a microorganism for producing glucosamine by a biosynthetic process. The microorganism is transformed with a 15 recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence. The recombinant nucleic acid molecule has a genetic modification which reduces glucosamine-6-phosphate product inhibition of the glucosamine-6-phosphate synthase. Expression of the recombinant nucleic acid molecule increases expression of the glucosamine-6-phosphate synthase by 20 the microorganism. In a preferred embodiment, the recombinant nucleic acid molecule is integrated into the genome of the microorganism. In a further embodiment, the microorganism has at least one additional genetic modification in a gene encoding a protein selected from the group of N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme IIN'g, 25 phosphoglucosamine mutase, glucosamine-1-phosphate acetyltransferase-N-acetylglucosamine-1-phosphate uridyltransferase, phosphofiuctokinase, Enzyme IIo" of the PEP:glucose PTS, EIIM,P/III' ofthe PEP:mannose PTS, and/or a phosphatase.
The genetic modification decreases the action of the protein, except in the case of the phosphatase, in which the action of the phosphatase is preferably increased.
In another
30 preferred embodiment, the microorganism has a modification in genes encoding N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase and N-
31 acetyl-glucosamine-specific enzyme IIN'~ wherein the genetic modification decreases action of the protein. In one embodiment, the genetic modification is a deletion of at least a portion of the genes.
In a preferred embodiment, the genetically modified microorganism is a bacterium or a yeast, and more preferably, a bacterium of the genus Escherichia, and even more preferably, Escherichia coll. A genetically modified Escherichia coil preferably has a modification in a gene which includes, but is not limited to, nagA, nagB, nagC, nagD, nagE, manXYZ, glmM, pfkB, pf c4, g1mU, glmS, ptsG or a phosphatase gene. In another embodiment, such a genetically modified Escherichia coli has a deletion of nag regulon genes, and in yet another embodiment, a deletion of nag regulon genes and a genetic modification in manXYZ genes such that the proteins encoded by the manXYZ
genes have decreased action.
Yet another embodiment of the present invention relates to a microorganism for producing glucosamine by a biosynthetic process which has a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence; and at least one genetic modification in a gene encoding a protein selected from the group ofN-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme 11N'~
phosphoglucosamine mutase, glucosamine-1-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase, phosphofructokinase, Enzyme III" of the PEP:glucose PTS, and/or EIIM,P/Ill " of the PEP:mannose PTS. The genetic modification decreases action of said protein and expression ofthe recombinant nucleic acid molecule increases expression ofthe glucosamine-6-phosphate synthase by the microorganism. In another embodiment, the microorganism has at least one genetic modification in a phosphatase gene, such that the phosphatase encoded by such gene has increased action. In a preferred embodiment, the recombinant nucleic acid molecule is integrated into the genome ofthe microorganism.
Another embodiment ofthe present invention relates to any ofthe above-described microorganisms which produces at least about 1 g/L of glucosamine when cultured for about 24 hours at 37 C to a cell density of at least about 8 g2 by dry cell weight, in a pH
7.0 fermentation medium comprising: 14 g/L K2HPO4, 16 g/L KH2PO4, 1 g/L
In a preferred embodiment, the genetically modified microorganism is a bacterium or a yeast, and more preferably, a bacterium of the genus Escherichia, and even more preferably, Escherichia coll. A genetically modified Escherichia coil preferably has a modification in a gene which includes, but is not limited to, nagA, nagB, nagC, nagD, nagE, manXYZ, glmM, pfkB, pf c4, g1mU, glmS, ptsG or a phosphatase gene. In another embodiment, such a genetically modified Escherichia coli has a deletion of nag regulon genes, and in yet another embodiment, a deletion of nag regulon genes and a genetic modification in manXYZ genes such that the proteins encoded by the manXYZ
genes have decreased action.
Yet another embodiment of the present invention relates to a microorganism for producing glucosamine by a biosynthetic process which has a recombinant nucleic acid molecule encoding glucosamine-6-phosphate synthase operatively linked to a transcription control sequence; and at least one genetic modification in a gene encoding a protein selected from the group ofN-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme 11N'~
phosphoglucosamine mutase, glucosamine-1-phosphate acetyltransferase-N-acetylglucosamine-l-phosphate uridyltransferase, phosphofructokinase, Enzyme III" of the PEP:glucose PTS, and/or EIIM,P/Ill " of the PEP:mannose PTS. The genetic modification decreases action of said protein and expression ofthe recombinant nucleic acid molecule increases expression ofthe glucosamine-6-phosphate synthase by the microorganism. In another embodiment, the microorganism has at least one genetic modification in a phosphatase gene, such that the phosphatase encoded by such gene has increased action. In a preferred embodiment, the recombinant nucleic acid molecule is integrated into the genome ofthe microorganism.
Another embodiment ofthe present invention relates to any ofthe above-described microorganisms which produces at least about 1 g/L of glucosamine when cultured for about 24 hours at 37 C to a cell density of at least about 8 g2 by dry cell weight, in a pH
7.0 fermentation medium comprising: 14 g/L K2HPO4, 16 g/L KH2PO4, 1 g/L
32 Na3Citrate-2H2O, 5 gIL (NH4)2S04, 20 g/L glucose, 10 mM MgSO4i 1 mM CaCl2, and mM IPTG.
A more preferred embodiment ofthe present invention relates to any of the above-described microorganisms which produces at least about 1 g/L of glucosamine when cultured for about 10 to about 60 hours at from about 28 C to about 37 C to a cell density of at least about 8 g/L by dry cell weight, in a pH 7.0 fermentation medium comprising: 14 g/L K2HPO4i 16 g/L KH2PO4, 1 g/L Na3Citrate=2H2O, 5 g/L
(NH4)2S04, 20 g/L glucose, 10 mM MgSO4, 1 mM CaCl2, and from about 0.2mM to about 1 mM
IPTG. In a preferred embodiment, the amount of IPTG is about 0.2mM.
Yet another embodiment of the present invention relates to any of the above-described genetically modified microorganisms which produce at least about 1 g/L, and preferably at least about 5 g/L, and more preferably, at least about 10 g/L, and even more preferably, at least about 20 g/L, and even more preferably, at least about 50 g/L of glucosamine and/or glucosamine-6-phosphate when cultured under the culture conditions as described herein. Another embodiment of the present invention relates to any of the above-described genetically modified microorganisms which produce at least about 2-fold more glucosamine and/or glucosamine-6-phosphate, and preferably at least about 5-fold, and more preferably at least about 10-fold, and more preferably at least about 25-fold, and more preferably at least about 50-fold, and even more preferably at least about 100-fold, and even more preferably, at least about 200-fold more glucosamine and/or glucosamine-6-phosphate 5ynthase than a wild-type (i.e., non-modified, naturally occurring) microorganism cultured under the same conditions as the genetically modified microorganism. A number of specific microorganisms are identified in the Examples section. Additional embodiments of the present invention include these microorganisms and microorganisms having the identifying characteristics of the microorganisms specifically identified in the Examples. Such microorganisms are preferably yeast or bacteria, more preferably, are bacteria, and most preferably are E. coli. Such identifying characteristics can include any or all genotypic and/or phenotypic characteristics of the microorganisms in the Examples, including their abilities to produce glucosamine.
Preferred microorganisms of the present invention include strains ofEscherichia coli which have been transformed with a recombinant nucleic acid molecule encoding
A more preferred embodiment ofthe present invention relates to any of the above-described microorganisms which produces at least about 1 g/L of glucosamine when cultured for about 10 to about 60 hours at from about 28 C to about 37 C to a cell density of at least about 8 g/L by dry cell weight, in a pH 7.0 fermentation medium comprising: 14 g/L K2HPO4i 16 g/L KH2PO4, 1 g/L Na3Citrate=2H2O, 5 g/L
(NH4)2S04, 20 g/L glucose, 10 mM MgSO4, 1 mM CaCl2, and from about 0.2mM to about 1 mM
IPTG. In a preferred embodiment, the amount of IPTG is about 0.2mM.
Yet another embodiment of the present invention relates to any of the above-described genetically modified microorganisms which produce at least about 1 g/L, and preferably at least about 5 g/L, and more preferably, at least about 10 g/L, and even more preferably, at least about 20 g/L, and even more preferably, at least about 50 g/L of glucosamine and/or glucosamine-6-phosphate when cultured under the culture conditions as described herein. Another embodiment of the present invention relates to any of the above-described genetically modified microorganisms which produce at least about 2-fold more glucosamine and/or glucosamine-6-phosphate, and preferably at least about 5-fold, and more preferably at least about 10-fold, and more preferably at least about 25-fold, and more preferably at least about 50-fold, and even more preferably at least about 100-fold, and even more preferably, at least about 200-fold more glucosamine and/or glucosamine-6-phosphate 5ynthase than a wild-type (i.e., non-modified, naturally occurring) microorganism cultured under the same conditions as the genetically modified microorganism. A number of specific microorganisms are identified in the Examples section. Additional embodiments of the present invention include these microorganisms and microorganisms having the identifying characteristics of the microorganisms specifically identified in the Examples. Such microorganisms are preferably yeast or bacteria, more preferably, are bacteria, and most preferably are E. coli. Such identifying characteristics can include any or all genotypic and/or phenotypic characteristics of the microorganisms in the Examples, including their abilities to produce glucosamine.
Preferred microorganisms of the present invention include strains ofEscherichia coli which have been transformed with a recombinant nucleic acid molecule encoding
33 glucosamine-6-phosphate synthase. Preferably, such a nucleic acid molecule is integrated into the genome of the microorganism. A particularly preferred microorganism is Escherichia coli strain 2123-12. Strain 2123-12 has integrated into its genome a recombinant nucleic acid molecule comprising a nucleic acid sequence SEQ ID
NO: 15, which represents the coding region of a wild-type (i.e., normal, unmodified, or naturally occurring) glucosamine-6-phosphate synthase enzyme having amino acid sequence SEQ
ID NO: 16. Particularly preferred microorganisms of the present invention have been transformed with a nucleic acid molecule comprising a nucleic acid sequence encoding a glucosamine-6-phosphate synthase that has been genetically modified such that the synthase has increased action (described above). Most preferably, such genetic modification enhances the ability of the microorganism to produce glucosamine as compared to a microorganism which has not been transformed with such a nucleic acid molecule. Particularly preferred genetically modified microorganisms of the present invention are described in the Examples section, and include E. coli strains 2123-49, 2123-54, 2123-124, 2123-149 and 2123-151.
Development of a microorganism with enhanced ability to produce glucosamine by genetic modification can be accomplished using both classical strain development and molecular genetic techniques. In general, the strategy for creating a microorganism with enhanced glucosamine production is to (1) inactivate or delete at least one, and preferably more than one of the amino sugar metabolic pathways in which production of glucosamine-6-phosphate is negatively affected (e.g., inhibited), and (2) amplify at least one, and preferably more than one of the amino sugar metabolic pathways in which glucosamine-6-phosphate production is enhanced. As such, genetically modified microorganisms of the present invention have a (a) reduced ability to convert glucosamine-6-phosphate into other compounds (i.e., inhibition of glucosamine-phosphate catabolic or anabolic pathways), (b) an enhanced ability to produce (i.e., synthesize) glucosamine-6-phosphate, (c) a reduced ability to transport glucosamine into the cell, (d) an enhanced ability to transport glucosamine-6-phosphate or glucosamine out of the cell, and/or (e) a reduced ability to use substrates involved in the production of glucosamine-6-P for competing biochemical reactions.
NO: 15, which represents the coding region of a wild-type (i.e., normal, unmodified, or naturally occurring) glucosamine-6-phosphate synthase enzyme having amino acid sequence SEQ
ID NO: 16. Particularly preferred microorganisms of the present invention have been transformed with a nucleic acid molecule comprising a nucleic acid sequence encoding a glucosamine-6-phosphate synthase that has been genetically modified such that the synthase has increased action (described above). Most preferably, such genetic modification enhances the ability of the microorganism to produce glucosamine as compared to a microorganism which has not been transformed with such a nucleic acid molecule. Particularly preferred genetically modified microorganisms of the present invention are described in the Examples section, and include E. coli strains 2123-49, 2123-54, 2123-124, 2123-149 and 2123-151.
Development of a microorganism with enhanced ability to produce glucosamine by genetic modification can be accomplished using both classical strain development and molecular genetic techniques. In general, the strategy for creating a microorganism with enhanced glucosamine production is to (1) inactivate or delete at least one, and preferably more than one of the amino sugar metabolic pathways in which production of glucosamine-6-phosphate is negatively affected (e.g., inhibited), and (2) amplify at least one, and preferably more than one of the amino sugar metabolic pathways in which glucosamine-6-phosphate production is enhanced. As such, genetically modified microorganisms of the present invention have a (a) reduced ability to convert glucosamine-6-phosphate into other compounds (i.e., inhibition of glucosamine-phosphate catabolic or anabolic pathways), (b) an enhanced ability to produce (i.e., synthesize) glucosamine-6-phosphate, (c) a reduced ability to transport glucosamine into the cell, (d) an enhanced ability to transport glucosamine-6-phosphate or glucosamine out of the cell, and/or (e) a reduced ability to use substrates involved in the production of glucosamine-6-P for competing biochemical reactions.
34 As previously discussed herein, in one embodiment, a genetically modified microorganism can be a microorganism in which nucleic acid molecules have been deleted, inserted or modified, such as by insertion, deletion, substitution, and/or inversion of nucleotides, in such a manner that such modifications provide the desired effect within the microorganism. Such genetic modifications can, in some embodiments, be within the coding region for a protein encoded by the nucleic acid molecule which results in amino acid modifications such as insertions, deletions, substitutions in the amino acid sequence of the protein which provide the desired effect within the microorganisms. A
genetically modified microorganism can be modified by recombinant technology, such as by introduction of an isolated nucleic acid molecule into a microorganism. For example, a genetically modified microorganism can be transfected with a recombinant nucleic acid molecule encoding a protein of interest, such as a protein for which increased expression is desired. The transfected nucleic acid molecule can remain extrachromosomal or can integrate into one or more sites within a chromosome ofthe transfected (i.e., recombinant) host cell in such a manner that its ability to be expressed is retained.
Preferably, once a host cell of the present invention is transfected with a nucleic acid molecule, the nucleic acid molecule is integrated into the host cell genome. A significant advantage of integration is that the nucleic acid molecule is stably maintained in the cell. In a preferred embodiment, the integrated nucleic acid molecule is operatively linked to a transcription control sequence (described below) which can be induced to control expression of the nucleic acid molecule.
A nucleic acid molecule can be integrated into the genome of the host cell either by random or targeted integration. Such methods of integration are known in the art. For example, as described in detail in Example 2, E. coli strain ATCC 47002 (Table 1) contains mutations that confer upon it an inability to maintain plasmids which contain a ColEI origin of replication. When such plasmids are transferred to this strain, selection for genetic markers contained on the plasmid results in integration of the plasmid into the chromosome. This strain can be transformed, for example, with plasmids containing the gene of interest and a selectable marker flanked by the 5'- and 3'-termini of the E. coli lacZ gene. The lacZ sequences target the incoming DNA to the lacZ gene contained in the chromosome. Integration at the lacZ locus replaces the intact IacZ gene, which encodes the enzyme (3-galactosidase, with a partial lacZ gene interrupted by the gene of interest. Successful integrants can be selected for (3-galactosidase negativity. A
genetically modified microorganism can also be produced by introducing nucleic acid molecules into a recipient cell genome by a method such as by using a transducing 5 bacteriophage. The use of recombinant technology and transducing bacteriophage technology to produce several different genetically modified microorganism ofthe present invention is known in the art and is described in detail in the Examples section.
According to the present invention, a gene, for example the pstG gene, includes all nucleic acid sequences related to a natural pstG gene such as regulatory regions that control 10 production of the pstG protein (Enzyme II` of the PEP:glucose PTS) encoded by that gene (such as, but not limited to, transcription, translation or post-translation control regions) as well as the coding region itself. In another embodiment, a gene, for example the pstG gene, can be an allelic variant (i.e., a naturally occurring allelic variant) that includes a similar but not identical sequence to the nucleic acid sequence encoding a given 15 pstG gene. An allelic variant of a pstG gene which has a given nucleic acid sequence is a gene that occurs at essentially the same locus (or loci) in the genome as the gene having the given nucleic acid sequence, but which, due to natural variations caused by, for example, mutation or recombination, has a similar but not identical sequence.
Allelic variants typically encode proteins having similar activity to that of the protein encoded by 20 the gene to which they are being compared. Allelic variants can also comprise alterations in the 5' or 3' untranslated regions of the gene (e.g., in regulatory control regions). Allelic variants are well known to those skilled in the art and would be expected to be found within a given microorganism, such as an E. coil, and/or among a group of two or more microorganisms.
25 Although the phrase "nucleic acid molecule" primarily refers to the physical nucleic acid molecule and the phrase "nucleic acid sequence" primarily refers to the sequence of nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a gene involved in an amino sugar metabolic pathway. In addition, 30 the phrase "recombinant molecule" primarily refers to a nucleic acid molecule operatively linked to a transcription control sequence, but can be used interchangeably with the phrase "nucleic acid molecule" which is isolated and expressed in a host cell.
Knowing the nucleic acid sequences of certain nucleic acid molecules of the present invention, and particularly Escherichia coil nucleic acid molecules, allows one skilled in the art to, for example, (a) make copies of those nucleic acid molecules and/or (b) obtain nucleic acid molecules including at least a portion of such nucleic acid molecules (e.g., nucleic acid molecules including full-length genes, full-length coding regions, regulatory control sequences, truncated coding regions). Such nucleic acid molecules can be obtained in a variety of ways including traditional cloning techniques using oligonucleotide probes of to screen appropriate libraries or DNA and PCR
amplification of appropriate libraries or DNA using oligonucleotide primers.
Preferred libraries to screen or from which to amplify nucleic acid molecule include bacterial and yeast genomic DNA libraries, and in particular, Escherichia coil genomic DNA
libraries.
Techniques to clone and amplify genes are disclosed, for example, in Sambrook et al., ibid In accordance with the present invention, an isolated nucleic acid molecule is a nucleic acid molecule that has been removed from its natural milieu (i.e., that has been subject to human manipulation). As such, "isolated" does not reflect the extent to which the nucleic acid molecule has been purified. An isolated nucleic acid molecule can include DNA, RNA, or derivatives of either DNA or RNA. There is no limit, other than a practical limit, on the maximal size of a nucleic acid molecule in that the nucleic acid molecule can include a portion of a gene, an entire gene, or multiple genes, or portions thereof.
An isolated nucleic acid molecule of the present invention can be obtained from its natural source either as an entire (i.e., complete) gene or a portion thereof capable of forming a stable hybrid with that gene. An isolated nucleic acid molecule can also be produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include natural nucleic acid molecules and homologues thereof, including, but not limited to, natural allelic variants and modified nucleic acid molecules in which nucleotides have been inserted, deleted, substituted, and/or inverted in such a manner that such modifications provide the desired effect within the microorganism.
A nucleic acid molecule homologue can be produced using a number of methods known to those skilled in the art (see, for example, Sambrook et al., ibid. ).
For example, nucleic acid molecules can be modified using a variety of techniques including, but not limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site-directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, synthesis of oligonucleotide mixtures and ligation of mixture groups to "build" a mixture of nucleic acid molecules and combinations thereof. Nucleic acid molecule homologues can be selected from a mixture of modified nucleic acids by screening for the function of the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene.
Examples of such techniques are described in detail in the Examples section.
In one embodiment of the present invention, a nucleic acid homologue of a nucleic acid molecule of the present invention preferably comprises a genetic modification which results in an modification of the action of the protein encoded by the nucleic acid homologue. For example, in one embodiment of the present invention, a genetically modified recombinant nucleic acid molecule is provided which comprises a nucleic acid sequence encoding a glucosamine-6-phosphate synthase protein homologue, wherein the genetic modification increases the action of the glucosamine-6-phosphate synthase homologue, preferably as compared to a recombinant nucleic acid molecule encoding a naturally occurring glucosamine-6-phosphate synthase in the absence of such genetic modification. Such a genetic modification can increase the action of the glucosamine-6-phosphate synthase, for example, by encoding a glucosamine-6-phosphate synthase having reduced glucosamine-6-phosphate product inhibition and/or increased specific activity.
Such recombinant nucleic acid molecules having genetic modifications are referred to herein as nucleic acid homologues of wild-type nucleic acid molecules encoding glucosamine-6-phosphate synthase. According to the present invention, proteins having modifications as a result of genetic modifications in the nucleic acid molecules encoding the proteins are referred to herein as protein homologues, or homologues of the given protein.
Accordingly, a glucosamine-6-phosphate synthase protein, for example, which has glucosamine-6-phosphate synthase activity and is useful in the present invention, can be a full-length glucosamine-6-phosphate synthase protein, an enzymatically active portion of a full-length glucosamine-6-phosphate synthase protein, or any homologue of such proteins, such as a glucosamine-6-phosphate synthase protein having at least one or a few amino acid modifications in which amino acid residues have been deleted (e.g., a truncated version of the protein, such as a peptide), inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, amidation and/or addition of glycosyiphosphatidyl inositol).
A protein homologue of any of the proteins within the amino sugar metabolic pathways as described in the present invention is a protein having an amino acid sequence that is sufficiently similar to a natural protein amino acid sequence (i.e., naturally occurring, unmodified, or wild-type) that a nucleic acid sequence encoding the homologue is capable of hybridizing under stringent conditions to (i.e., with) a nucleic acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid strand encoding the natural protein amino acid sequence). A nucleic acid sequence complement of any nucleic acid sequence of the present invention refers to the nucleic acid sequence of the nucleic acid strand that is complementary to (i.e., can form a double helix with the entire molecule) the strand for which the sequence is cited. It is to be noted that a double-stranded nucleic acid molecule of the present invention for which a nucleic acid sequence has been determined for one strand that is represented by a SEQ ID NO also comprises a complementary strand having a sequence that is a complement of that SEQ ID
NO. As such, nucleic acid molecules ofthe present invention, which can be either double-stranded or single-stranded, include those nucleic acid molecules that form stable hybrids under stringent hybridization conditions with either a given SEQ ID NO denoted herein and/or with the complement of that SEQ ID NO, which may or may not be denoted herein.
Methods to deduce a complementary sequence are known to those skilled in the art.
The minimal size of a protein homologue of the present invention is a size sufficient to be encoded by a nucleic acid molecule capable offorming a stable hybrid with the complementary sequence of a nucleic acid molecule encoding the corresponding natural protein. Additionally, the minimal size of a protein homologue of the present invention is a size sufficient to have glucosamine-6-phosphate synthase action (e.g., a catalytically or enzymatically active portion), and preferably, increased glucosamine-6-phosphate synthase action. As such, the size of the nucleic acid molecule encoding such a protein homologue is dependent on nucleic acid composition and percent homology between the nucleic acid molecule and complementary sequence as well as upon hybridization conditions per se (e.g., temperature, salt concentration, and formamide concentration). There is no limit, other than a practical limit, on the maximal size of such a nucleic acid molecule in that the nucleic acid molecule can include a portion of a gene, an entire gene, or multiple genes, or portions thereof. Similarly, the minimal size of a protein homologue of the present invention is from about 4 to about 6 amino acids in length, with preferred sizes depending on whether a full-length, multivalent (i.e., fusion protein having more than one domain each of which has a function), or functional portions of such proteins are desired.
As used herein, stringent hybridization conditions refer to standard hybridization conditions under which nucleic acid molecules arc used to identify similar nucleic acid molecules. Such standard conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Labs Press, 1989. Sambrook et al.,. In addition, formulae to calculate the appropriate hybridization and wash conditions to achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, for example, in Meinkoth et al, 1984, Anal. Biochem. 138, 267-284; Meinkoth et al,.
More particularly, stringent hybridization conditions, as referred to herein, refer to conditions which permit isolanon of nucleic acid molecules having at least about 70% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction, more particularly at least about 75%, and most particularly at least about 80%. Such conditions will vary, depending on whether DNA:RNA or DNA:DNA hybrids are being formed.
Calculated melting temperatures for DNA:DNA hybrids are tE5224320.DOC.u 10 *C less than for DNA:RNA hybrids. In particular embodiments, stringent hybridization conditions for DNA:DNA hybrids include hybridization at an ionic strength of 0.1X SSC
(0.157 MNa) at a temperature ofbetween about 20 C and about 35 C, more preferably, between about 28 *C and about 40 C, and even more preferably, between about
genetically modified microorganism can be modified by recombinant technology, such as by introduction of an isolated nucleic acid molecule into a microorganism. For example, a genetically modified microorganism can be transfected with a recombinant nucleic acid molecule encoding a protein of interest, such as a protein for which increased expression is desired. The transfected nucleic acid molecule can remain extrachromosomal or can integrate into one or more sites within a chromosome ofthe transfected (i.e., recombinant) host cell in such a manner that its ability to be expressed is retained.
Preferably, once a host cell of the present invention is transfected with a nucleic acid molecule, the nucleic acid molecule is integrated into the host cell genome. A significant advantage of integration is that the nucleic acid molecule is stably maintained in the cell. In a preferred embodiment, the integrated nucleic acid molecule is operatively linked to a transcription control sequence (described below) which can be induced to control expression of the nucleic acid molecule.
A nucleic acid molecule can be integrated into the genome of the host cell either by random or targeted integration. Such methods of integration are known in the art. For example, as described in detail in Example 2, E. coli strain ATCC 47002 (Table 1) contains mutations that confer upon it an inability to maintain plasmids which contain a ColEI origin of replication. When such plasmids are transferred to this strain, selection for genetic markers contained on the plasmid results in integration of the plasmid into the chromosome. This strain can be transformed, for example, with plasmids containing the gene of interest and a selectable marker flanked by the 5'- and 3'-termini of the E. coli lacZ gene. The lacZ sequences target the incoming DNA to the lacZ gene contained in the chromosome. Integration at the lacZ locus replaces the intact IacZ gene, which encodes the enzyme (3-galactosidase, with a partial lacZ gene interrupted by the gene of interest. Successful integrants can be selected for (3-galactosidase negativity. A
genetically modified microorganism can also be produced by introducing nucleic acid molecules into a recipient cell genome by a method such as by using a transducing 5 bacteriophage. The use of recombinant technology and transducing bacteriophage technology to produce several different genetically modified microorganism ofthe present invention is known in the art and is described in detail in the Examples section.
According to the present invention, a gene, for example the pstG gene, includes all nucleic acid sequences related to a natural pstG gene such as regulatory regions that control 10 production of the pstG protein (Enzyme II` of the PEP:glucose PTS) encoded by that gene (such as, but not limited to, transcription, translation or post-translation control regions) as well as the coding region itself. In another embodiment, a gene, for example the pstG gene, can be an allelic variant (i.e., a naturally occurring allelic variant) that includes a similar but not identical sequence to the nucleic acid sequence encoding a given 15 pstG gene. An allelic variant of a pstG gene which has a given nucleic acid sequence is a gene that occurs at essentially the same locus (or loci) in the genome as the gene having the given nucleic acid sequence, but which, due to natural variations caused by, for example, mutation or recombination, has a similar but not identical sequence.
Allelic variants typically encode proteins having similar activity to that of the protein encoded by 20 the gene to which they are being compared. Allelic variants can also comprise alterations in the 5' or 3' untranslated regions of the gene (e.g., in regulatory control regions). Allelic variants are well known to those skilled in the art and would be expected to be found within a given microorganism, such as an E. coil, and/or among a group of two or more microorganisms.
25 Although the phrase "nucleic acid molecule" primarily refers to the physical nucleic acid molecule and the phrase "nucleic acid sequence" primarily refers to the sequence of nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a gene involved in an amino sugar metabolic pathway. In addition, 30 the phrase "recombinant molecule" primarily refers to a nucleic acid molecule operatively linked to a transcription control sequence, but can be used interchangeably with the phrase "nucleic acid molecule" which is isolated and expressed in a host cell.
Knowing the nucleic acid sequences of certain nucleic acid molecules of the present invention, and particularly Escherichia coil nucleic acid molecules, allows one skilled in the art to, for example, (a) make copies of those nucleic acid molecules and/or (b) obtain nucleic acid molecules including at least a portion of such nucleic acid molecules (e.g., nucleic acid molecules including full-length genes, full-length coding regions, regulatory control sequences, truncated coding regions). Such nucleic acid molecules can be obtained in a variety of ways including traditional cloning techniques using oligonucleotide probes of to screen appropriate libraries or DNA and PCR
amplification of appropriate libraries or DNA using oligonucleotide primers.
Preferred libraries to screen or from which to amplify nucleic acid molecule include bacterial and yeast genomic DNA libraries, and in particular, Escherichia coil genomic DNA
libraries.
Techniques to clone and amplify genes are disclosed, for example, in Sambrook et al., ibid In accordance with the present invention, an isolated nucleic acid molecule is a nucleic acid molecule that has been removed from its natural milieu (i.e., that has been subject to human manipulation). As such, "isolated" does not reflect the extent to which the nucleic acid molecule has been purified. An isolated nucleic acid molecule can include DNA, RNA, or derivatives of either DNA or RNA. There is no limit, other than a practical limit, on the maximal size of a nucleic acid molecule in that the nucleic acid molecule can include a portion of a gene, an entire gene, or multiple genes, or portions thereof.
An isolated nucleic acid molecule of the present invention can be obtained from its natural source either as an entire (i.e., complete) gene or a portion thereof capable of forming a stable hybrid with that gene. An isolated nucleic acid molecule can also be produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include natural nucleic acid molecules and homologues thereof, including, but not limited to, natural allelic variants and modified nucleic acid molecules in which nucleotides have been inserted, deleted, substituted, and/or inverted in such a manner that such modifications provide the desired effect within the microorganism.
A nucleic acid molecule homologue can be produced using a number of methods known to those skilled in the art (see, for example, Sambrook et al., ibid. ).
For example, nucleic acid molecules can be modified using a variety of techniques including, but not limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site-directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, synthesis of oligonucleotide mixtures and ligation of mixture groups to "build" a mixture of nucleic acid molecules and combinations thereof. Nucleic acid molecule homologues can be selected from a mixture of modified nucleic acids by screening for the function of the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene.
Examples of such techniques are described in detail in the Examples section.
In one embodiment of the present invention, a nucleic acid homologue of a nucleic acid molecule of the present invention preferably comprises a genetic modification which results in an modification of the action of the protein encoded by the nucleic acid homologue. For example, in one embodiment of the present invention, a genetically modified recombinant nucleic acid molecule is provided which comprises a nucleic acid sequence encoding a glucosamine-6-phosphate synthase protein homologue, wherein the genetic modification increases the action of the glucosamine-6-phosphate synthase homologue, preferably as compared to a recombinant nucleic acid molecule encoding a naturally occurring glucosamine-6-phosphate synthase in the absence of such genetic modification. Such a genetic modification can increase the action of the glucosamine-6-phosphate synthase, for example, by encoding a glucosamine-6-phosphate synthase having reduced glucosamine-6-phosphate product inhibition and/or increased specific activity.
Such recombinant nucleic acid molecules having genetic modifications are referred to herein as nucleic acid homologues of wild-type nucleic acid molecules encoding glucosamine-6-phosphate synthase. According to the present invention, proteins having modifications as a result of genetic modifications in the nucleic acid molecules encoding the proteins are referred to herein as protein homologues, or homologues of the given protein.
Accordingly, a glucosamine-6-phosphate synthase protein, for example, which has glucosamine-6-phosphate synthase activity and is useful in the present invention, can be a full-length glucosamine-6-phosphate synthase protein, an enzymatically active portion of a full-length glucosamine-6-phosphate synthase protein, or any homologue of such proteins, such as a glucosamine-6-phosphate synthase protein having at least one or a few amino acid modifications in which amino acid residues have been deleted (e.g., a truncated version of the protein, such as a peptide), inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, amidation and/or addition of glycosyiphosphatidyl inositol).
A protein homologue of any of the proteins within the amino sugar metabolic pathways as described in the present invention is a protein having an amino acid sequence that is sufficiently similar to a natural protein amino acid sequence (i.e., naturally occurring, unmodified, or wild-type) that a nucleic acid sequence encoding the homologue is capable of hybridizing under stringent conditions to (i.e., with) a nucleic acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid strand encoding the natural protein amino acid sequence). A nucleic acid sequence complement of any nucleic acid sequence of the present invention refers to the nucleic acid sequence of the nucleic acid strand that is complementary to (i.e., can form a double helix with the entire molecule) the strand for which the sequence is cited. It is to be noted that a double-stranded nucleic acid molecule of the present invention for which a nucleic acid sequence has been determined for one strand that is represented by a SEQ ID NO also comprises a complementary strand having a sequence that is a complement of that SEQ ID
NO. As such, nucleic acid molecules ofthe present invention, which can be either double-stranded or single-stranded, include those nucleic acid molecules that form stable hybrids under stringent hybridization conditions with either a given SEQ ID NO denoted herein and/or with the complement of that SEQ ID NO, which may or may not be denoted herein.
Methods to deduce a complementary sequence are known to those skilled in the art.
The minimal size of a protein homologue of the present invention is a size sufficient to be encoded by a nucleic acid molecule capable offorming a stable hybrid with the complementary sequence of a nucleic acid molecule encoding the corresponding natural protein. Additionally, the minimal size of a protein homologue of the present invention is a size sufficient to have glucosamine-6-phosphate synthase action (e.g., a catalytically or enzymatically active portion), and preferably, increased glucosamine-6-phosphate synthase action. As such, the size of the nucleic acid molecule encoding such a protein homologue is dependent on nucleic acid composition and percent homology between the nucleic acid molecule and complementary sequence as well as upon hybridization conditions per se (e.g., temperature, salt concentration, and formamide concentration). There is no limit, other than a practical limit, on the maximal size of such a nucleic acid molecule in that the nucleic acid molecule can include a portion of a gene, an entire gene, or multiple genes, or portions thereof. Similarly, the minimal size of a protein homologue of the present invention is from about 4 to about 6 amino acids in length, with preferred sizes depending on whether a full-length, multivalent (i.e., fusion protein having more than one domain each of which has a function), or functional portions of such proteins are desired.
As used herein, stringent hybridization conditions refer to standard hybridization conditions under which nucleic acid molecules arc used to identify similar nucleic acid molecules. Such standard conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Labs Press, 1989. Sambrook et al.,. In addition, formulae to calculate the appropriate hybridization and wash conditions to achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, for example, in Meinkoth et al, 1984, Anal. Biochem. 138, 267-284; Meinkoth et al,.
More particularly, stringent hybridization conditions, as referred to herein, refer to conditions which permit isolanon of nucleic acid molecules having at least about 70% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction, more particularly at least about 75%, and most particularly at least about 80%. Such conditions will vary, depending on whether DNA:RNA or DNA:DNA hybrids are being formed.
Calculated melting temperatures for DNA:DNA hybrids are tE5224320.DOC.u 10 *C less than for DNA:RNA hybrids. In particular embodiments, stringent hybridization conditions for DNA:DNA hybrids include hybridization at an ionic strength of 0.1X SSC
(0.157 MNa) at a temperature ofbetween about 20 C and about 35 C, more preferably, between about 28 *C and about 40 C, and even more preferably, between about
35'C and 5 about 45'C. In particular embodiments, stringent hybridization conditions for DNA:RNA
hybrids include hybridization at an ionic strength of 0.1X SSC (0.157 M Na+) at a temperature of between about 30 C and about 45 C, more preferably, between about 38 C and about 50 C, and even more preferably, between about 45 C and about 55 C.
These values are based on calculations of a melting temperature for molecules larger than 10 about 100 nucleotides, 0% formamide and a G + C content of about 50%.
Alternatively, T. can be calculated empirically as set forth in Sambrook et al., supra, pages 11.55 to 11.57.
Protein homologues of proteins involved in an amino sugar metabolic pathway according to the present invention can be the result of natural allelic variation or natural 15 mutation. Protein homologues of the present invention can also be produced using techniques known in the art including, but not limited to, direct modifications to the protein or modifications to the gene encoding the protein using, for example, classic or recombinant DNA techniques to effect random or targeted mutagenesis, as discussed above.
20 In one embodiment of the present invention, a genetic modification in a recombinant nucleic acid molecule ofthe present invention which encodes a glucosamine-6-phosphate synthase results in at least one amino acid modification (i.e., modification in the amino acid sequence of the encoded protein) selected from the group of an addition, substitution, deletion, and/or derivatization of an amino acid residue ofthe glucosamine-6-25 phosphate synthase. Such a modification in the amino acid sequence of the encoded protein can be determined as compared to a wild-type, or naturally occurring glucosamine-6-phosphate synthase, such as a glucosamine-6-phosphate synthase having an amino acid sequence SEQ ID NO:16. One or more of such amino acid modifications results in increased action of glucosamine-6-phosphate synthase as compared to the 30 naturally occurring glucosamine-6-phosphate synthase having amino acid sequence SEQ
ID NO:16. In one embodiment, such an amino acid modification is in an amino acid sequence position in the modified protein (i.e., homologue) which corresponds to one or more ofthe following amino acid positions in amino acid sequence SEQ ID NO:
16: Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469).
In another embodiment, such an amino acid modification is selected from the group of a substitution of (a) an amino acid residue having an aliphatic hydroxyl side group for Ile(4); (b) an amino acid residue having an aliphatic hydroxyl side group for lle(272); (c) an amino acid residue having an aliphatic side group for Ser(450); (d) an amino acid residue having an aliphatic hydroxyl side group for Ala(39); (e) an amino acid residue having a sulfur-containing side group for Arg(250); (f) an amino acid residue having an aliphatic hydroxyl side group for Gly(472); (g) an amino acid residue having an aliphatic side group for Leu(469); and, (h) combinations of (a)-(g). According to the present invention, amino acid residues having an aliphatic hydroxyl group include serine and threonine, and amino acid residues having aliphatic side groups include glycine, alanine, valine, leucine, isoleucine and proline.
In yet another embodiment of the present invention, an amino acid modification as described above is preferably a substitution selected from the group of Ile(4) to Thr, Ile(272) to Thr, Ser(450) to Pro, Ala(39) to '.hr, Arg(250) to Cys, Gly(472) to Ser, Leu(469) to Pro, and combinations thereof. Specific examples of recombinant nucleic acid molecules having genetic modifications resulting in such amino acid modifications are described in detail in the Examples section.
Preferred genetically modified recombinant nucleic acid molecules comprising a nucleic acid sequence encoding a glucosamine-6-phosphate synthase having increased action include recombinant nucleic acid molecules comprising a nucleic acid sequence which encodes a glucosamine-6-phosphate synthase comprising an amino acid sequence selected from the group of SEQ ID NO:19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID
NO:28 and/or SEQ ID NO:3 1. Other preferred genetically modified recombinant nucleic acid molecules comprise a nucleic acid sequence selected from the group of SEQ
ID
NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID
NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:29 and/or SEQ ID NO:30.
Particularly preferred genetically modified recombinant nucleic acid molecules useful in the present invention include nucleic acid molecules comprising nucleic acid molecules selected from the group of pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149, pKLN23-151, ngimS-492114, ngimS-491830, ngimS-542194, ngimS-541830, ngimS-1242184, ngimS-1241830, ngimS-1492184, ngimS-1491830, nglmS-1512184 and ngimS-1511830.
The present invention includes a recombinant vector, which includes at least one isolated nucleic acid molecule of the present invention, inserted into any vector capable of delivering the nucleic acid molecule into a bacterial cell. Such a vector can contain bacterial nucleic acid sequences that are not naturally found adjacent to the isolated nucleic acid molecules to be inserted into the vector. The vector can be either RNA or DNA and typically is a plasmid. Recombinant vectors can be used in the cloning, sequencing, and/or otherwise manipulating of nucleic acid: molecules. One type of recombinant vector, referred to herein as a recombinant nucleic acid molecule and described in more detail below, can be used in the expression of nucleic acid molecules.
Preferred recombinant vectors are capable ofreplicating in a transformed bacterial or yeast cell, and in particular, in an Escherichia coli cell.
Transformation of a nucleic acid molecule into a cell can be accomplished by any method by which a nucleic acid molecule can be inserted into the cell.
Transformation techniques include, but are not limited to, transfection, electroporation and microinjection.
Recombinant molecules of the present invention, which can be either DNA or RNA, can also contain additional regulatory sequences, such as translation regulatory sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell. One or more recombinant molecules of the present invention can be used to produce an encoded product (e.g., a glucosamine-6-phosphate synthase). In one embodiment, an encoded product is produced by expressing a nucleic acid molecule of the present invention under conditions effective to produce the protein.
Such conditions (i.e., culture conditions) have been described above and are further discussed in the Examples section. A preferred method to produce an encoded protein is by transfecting a host cell with one or more recombinant molecules of the present invention to form a recombinant cell.
As discussed above, preferred recombinant molecules of the present invention include, ngimS-282184, ngimS-281830, ngimS-492184, ngimS-491830, ngimS-542184, nglmS-541830, ngimS-1242184, ngimS-1241830, ngimS-1492184, ngimS-1491830, ngimS-1512184, ng1mS-1511230, pKLN23-28, pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149 and/or pKLN23-151.
A recombinant cell is preferably produced by transforming a bacterial cell (i.e., a host cell) with one or more recombinant molecules, each comprising one or more nucleic acid molecules operatively linked to an expression vector containing one or more transcription control sequences. The phrase, operatively linked, refers to insertion of a nucleic acid molecule into an expression vector in a manner such that the molecule is able to be expressed when transformed into a host cell. As used herein, an expression vector is a DNA or RNA vector that is capable of transforming a host cell and of effecting expression of a specified nucleic acid molecule. Preferably, the expression vector is also capable of replicating within the host cell. In the present invention, expression vectors are typically plasmids. Expression vectors of the present invention include any vectors that function (i.e., direct gene expression) in a yeast host cell or a bacterial host cell, preferably anEscherichia coli host cell. Preferred recombinant cells of the present invention are set forth in the Examples section.
Nucleic acid molecules of the present invention can be operatively linked to expression vectors containing regulatory sequences such as transcription control sequences, translation control sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell and that control the expression of nucleic acid molecules of the present invention. In particular, recombinant molecules of the present invention include transcription control sequences.
Transcription control sequences are sequences which control the initiation, elongation, and termination of transcription. Particularly important transcription control sequences are those which control transcription initiation, such as promoter, enhancer, operator and repressor sequences. Suitable transcription control sequences include any transcription control sequence that can function in yeast or bacterial cells and preferably, Escherichia coll. A
variety of such transcription control sequences are known to those skilled in the art.
It may be appreciated by one skilled in the art that use of recombinant DNA
technologies can improve expression of transformed nucleic acid molecules by manipulating, for example, the number of copies of the nucleic acid molecules within a host cell, the efficiency with which those nucleic acid molecules are transcribed, the efficiency with which the resultant transcripts are translated, and the efficiency of post-translational modifications. Recombinant techniques useful for increasing the expression of nucleic acid molecules of the present invention include, but are not limited to, operatively linking nucleic acid molecules to high-copy number plasmids, integration of the nucleic acid molecules into the host cell chromosome, addition of vector stability sequences to plasmids, substitutions or modifications of transcription control signals (e.g., promoters, operators, enhancers), substitutions or modifications of translational control signals, modification of nucleic acid molecules of the present invention to correspond to the codon usage of the host cell, deletion of sequences that destabilize transcripts, and use of control signals that temporally separate recombinant cell growth from recombinant enzyme production during fermentation. The activity of an expressed recombinant protein of the present invention may be improved by fragmenting, modifying, or derivarizing nucleic acid molecules encoding such a protein.
Such modifications are described in detail in the Examples section.
The following experimental results are provided for the purposes of illustration an are got intended to limit the scope of the invention.
EXAMPLES
Exam e I
The following example describes the production of mutant Escherichia cols strains which are blocked in amino acid sugar metabolic pathways involving degradation of glucosamine.
The starting strain for the construction of all glucosamine overproducing strains described herein was E_ coli W3 110 (publicly available from the American Type Culture Collection as ATCC No. 25947), a prototrophic, f A derivative of E. coil K-12 (Bachmann, 1987, "Escherichia coli and Salmonella iyphimunum ", Cellular W Molecular Biology. pp.
1190-1219 . A list of all strains used and produced in the following examples is provided in Table 1.
(E5224320 DOC, I) Table 1. Bacterial strains.
$ ... Aii G not :: Svu~re l fern .:.. > ::
W3110 F mcrA mcrB IN(rrnD-rrnE)1 X' ATCC
IBPC 522 thi-1 argG6 argE3 his-4 w/-1 xyl-S rpsL tsx- J. Plumbridge 29? AlacX74 manXYZ8 nagE47ptsG22 zcf-229::Th10 IBPC 566 thi-1 argG6 argE3 his-4 mtl-I xyl-5 rpsL tmc J. Plumbridge 29? AlacX74 manXYZ8 zdj-225:: Tn 10 IBPC 590 thi-1 argG6 argE3 his-4 mt1-1 xyl-5 rpsL tsx- J. Plumbridge 29? AlacX74 Mag::TcR
7101-6 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)I A' manXYZ8 W3110 x P1õ,(1BPC566) zdj-225::TnlO
7101-7 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)l A' manXYZ8 W3110 x P1,(IBPC566) zdj-225::Th10 7101-9 W31i0 Anag F mcrA mcrB IN(rrnD-rrnE)1 A' Anag::TcR W3110 x P1,,,,(IBPC590) 7101-13 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)1 A' manXYZ8 7101-6 selected on TCS
TcS zdj-225::Tn10? TcS medium 7101-14 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)1 A' manXYZ8 7101-7 selected on TCS
TcS zdj-22S::TnlO? TcS medium 7101-15 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)l ) manXYZ8 7101-14 x P1,(IBPC522) ptsG zdj-225:: Tn 10? ptsG22 zcf-229:: Tn10 7101-17 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)1 A' manXYZ8 7101-13 x PlAIBPC590) Anag zdj-225:: Tn 10? TcS nag::TcR
7101-22 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)I A' manXYZ8 7101-15 selected on TCS
ptsG TcS zdj-225:: Tn107 ptsG22 zcf-229::Tn 10? TcS medium 2123-4 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)l A' manXYZ8 7101-22 x Plõõ
(IBPC590) ptsG Anag zdj-225::Tn10? ptsG22 zcf-229::Tn10? TcS
Anag::TcR
W3110(D F mcrA mcrB IN(rrnD-rrnE)l ADE3 W3110 lysogenized with E3) ADE3 7101- F mcrA mcrB IN(rrnD-rrnE)1 ADE3 7101-9 lysogenized with 9(DE3) nag::TcR ADE3 7101- F mcrA mcrB IN(rrnD-rrnE)1 ADE3 7101-17 lysogenized with 17(DE3) manXYZ8 zdj-225::Tn10? TcS Anag::TcR ADE3 2123- F mcrA mcrB IN(rrnD-rrnE)l ADE3 2123-4 lysogenized with 4(DE3) manXYZ8 zdj-225::Tnl0? ptsG22 zcf- ADE3 229::Tn 10 TcS Anag::TcR
BL21(DE F ompT hsdSB gal dcm ADE3 Novagen, Inc.
3) ATCC JC7623 F recB21 recC22 sbcBi S leu-6 ara-14 his-4 ATCC
T-71 F recB2i recC22 sbcBiS leu-6 ara-14 his-4 Integration of pT7 glmS-A' IacZ::pT7 g/mS--Cm8H7 Cm into lacZ of ATCC47002 by transformation with pKLN23-28 T-81 F recB21 recC22 sbcB1 S leu-6 ara-14 his-4 Integration of pT7g/mS-) lacZ::pT7g1mS-Cm8H8 Cm into lacZ of ATCC47002 by transformation with pKLN23-28 2123-5 W3110(DE3) 1acZ::pT7 g/mS-Cm8H7 W3110(DE3) x Pl,(T-71) tr~gn orererice 2123-6 W3110(DE3) IacZ::pT7 g/mS-Cm8H8 W3110(DE3) x P1õõ(T-81) 2123-7 W3110(DE3) /acZ::pT7gImS-Cm8H7 W3110(DE3) x Plõõ (T-71) 2123-8 W3110(DE3) lacZ::pT7gImS-Cm8H8 W3110(DE3) x Pl,,,,(T-81) 2123-9 7101-9(DE3) lacZ::pT7g/mS--Cm8H7 7101-9(DE3)x P1,,,,(T-71) 2123-10 7101-9(DE3) lacZ::pT7 gImS-Cm8H8 7101-9(DE3)x P1,,,,(T-81) 2123-11 7101-17(DE3) lacZ::pT7 g/mS-Cm8H7 7101-17(DE3)x P1,,,,(T-71) 2123-12 7101-17(DE3) IacZ::pT7 glmS--Cm8H8 7101-17(DE3)x Pl,(T-81) 2123-13 2123-4(DE3) IacZ::pT7gImS--Cm8H7 2123-4(DE3)xP1,,,(T-71) 2123-14 2123-4(DE3) lacZ::pT7glmS--Cm8H8 2123-4(DE3)xP1,,,,(T-81) NovaBlue endAI hsdR17 supE44 thi-I recAl gyrA96 Novagen relAl lac [F' proA+B+ lacFZAM]5::Tn10) LE392 F e14' (McrA') hsdR514(rm+) supE44 Lab collection supFS8 lacY1 or Alac(IZY)6 ga1K2 gaIT22 metBi trpR55 2123-16 LE392 gImS13 NG mutagenesis of LE392 2123-49 7101-17(DE3) IacZ::pT7 gImS11C-Cm8H8 Error-prone PCR with pKLN23-28; integration of mutant glmS into ATCC47002; transfer to 7101-17(DE3) by PI
transduction 2123-51 7101-17(DE3) lacZ::pT7 gImS52B-Cm8H8 Error-prone PCR with pKLN23-28; integration of mutant glmS into ATCC47002; transfer to 7101-17(DE3) by P1 transduction 2123-54 7101-17(DE3) IacZ::pT7 gImS8A-Cm8H8 Error-prone PCR with pKLN23-28; integration of mutant glmS into ATCC47002; transfer to 7101-17(DE3) by PI
transduction 2123-124 7101-17(DE3) Error-prone PCR with lacZ::pT7gImS94A pKLN23-28; integration S-Cm8H8 of mutant gimS into ATCC47002; transfer to 7107-17(DE3) by P1 transduction 2123-149 7101-17(DE3) pKLN23-54 EcoR1-lacZ::pT7 glmSl49- HindllI (1.0 kb) x Cm8H8 pKLN23-28 EcoRl-Hindlll (6.4 kb);
integration of mutant gimS into ATCC47002;
transfer to 7101-17(DE3) PI transduction Strain Albs Genotype Source/Reference 2123-151 7101-17(DE3) pKLN23-54 EcoRI-Rindlll (1.0 kb) x lacZ::pT7- pKLN23-28 coRlHindIll (6.4 kb);
glmSl S 1-Cm8H8 integration of mutant g1mS into ATCC47002; transfer to 7101-17(DE3) by P1 transduction Host strains blocked for glucosamine uptake and degradation were constructed by introducing mutations in the nagE man= and ptsG genes, which block transport of glucosamine, and the nagA. -B, -C and D genes, which are involved in metabolism of glucosamine-6-phosphate. Each of these genes has been described in detail previously herein.
Mutations in these genes were introduced into strains using the transducing bacteriophage P l,, (as described in Miller, 1972, "Experiments in Molecular Genetics", Cold Spring Harbor Laboratory.
In this technique, genes or mutations from one strain (the donor strain) are transferred to a recipient strain using the bacteriophage P1,:,. When bacteriophage PL. is grown on the donor strain, a small portion of the phage particles that are produced contain chromosomal DNA from the donor rather than the normal complement of phage DNA. Upon infection of the recipient strain with bacteriophage grown on the donor strain, those bacteriophagge particles containing chromosomal ANA from the donor strain can transfer that DNA to the recipient strain. If there is a strong selection for the DNA from the donor strain, recipient strains containing the appropriate gene or mutation from the donor strain can be selected.
To grow PI,,,. on a donor strain, an existing bacteriophage stock was used to infect-a culture of that strain. The recipient strain was grown at 37 C in LBMC medium (10 g/L Bacto u-yptone, S g/L yeast extract, 10 g/L NaCl, 1 mM MgCl2, 5 mM CaC12) until the absorbance at 600 nm was approximately 1.0, corresponding to approximately 6 x 108 cells per mL of culture.
One mL of the culture was then infected with a dilution of the phage stock at a ratio of approximately one phage per 10 cells. The mixture was incubated without shaking for 20 minutes at 37 C, then transferred to 10 ml- of prewarmed LBMC broth in a 125 mL baffled Erlenmeyer flask. The resulting culture was shaken vigorously for 2-3 hours at 37 C. During this period, it was generally observed that the culture would become more turbid, indicating bacterial growth. Toward the end (E5224320.DOC,1) of this incubation period, the culture would become clear, indicating cell lysis due to bacteriophage growth. After lysis had occurred, the culture was cooled on ice, a few drops of chloroform were added, and the flask was shaken briefly. The contents of the flask were then centrifuged to remove the cell debris and chloroform, and the resulting supernatant generally contained between 10' and 10' bacteriophage per mL.
Mutations were transferred to recipient strains by transduction with P 1,,, grown on the appropriate donor strain as described above. For transduction with P
1",, a culture of the recipient strain was grown overnight at 37 C in LBMC broth. 0.1 mL of culture was mixed with 0.1 ml, of bacteriophage lysate or a serial dilution of the lysate in a sterile test tube and incubated at 37'C for 20 minutes. 0.2 mL of 1 M sodium citrate was added to the tube, and the mixture was plated to selective medium. For each transduction, controls containing uninfected cells and bacteriophage lysates without cells were performed as described above. For the production of strains blocked in glucosamine degradation, selections were for tetracycline resistance as described below.
Tetracycline resistant mutants were selected by plating to LB medium (10 g/L Bacto tryptone, 5 g/L
yeast extract, 10 g/L NaCl) containing 12.5 jig/mL tetracycline and 10 mM
sodium citrate.
The mutations in the nag genes were introduced simultaneously as a deletion mutation (znag::TcR). In strain IBPC590 (Plumbridge, Table 1), which contains this mutation, the nag genes have been replaced by a tetracycline-resistance (TcR) determinant.
As a result, the mutation which removes the nag functions was transferred to appropriate recipient hosts by selection for tetracycline resistance. In this case, since the Tc' determinant was contained within the mutation of interest, the Onag and TcR
mutations were 100% linked. That is, all of the recipient strains receiving the Tc' determinant from IBPC590 also received the Anag mutation. This was confirmed by examining the growth phenotype ofthe tetracycline resistant strains resulting from infection with P
ly,, grown on IBPC590. All such strains were unable to grow on media containing glucosamine or N-acetylglucosamine as carbon sources, indicating the presence of the Onag mutation.
Mutations in the manXYZ and ptsG genes were also introduced by P1,, transduction using phage grown on strains 1BPC566 and IBPC522 (Plumbridge, Table 1), respectively. These strains also contained tetracycline-resistance determinants linked to the mutations of interest (designated zdj-225:: Th70 and zcf-229:: Tn 10, respectively). In these strains, the TcR determinants were not within the gene but were linked to the gene. Accordingly, not all recipient strains receiving the TcR determinant contained the mutations of interest. The degree of linkage is indicative of the distance on the chromosome between the TcR determinant and the mutation of interest. As a result, it was necessary to screen tetracycline resistant strains for manXYZ and ptsG. The maijXYZ strains grew slowly on mannose and failed to grow on glucosamine as sole carbon sources for growth. The ptsG strains grew slowly op glucose as sole carbon source.
Because all of the selections for the mutations described above were for tetracycline resistance, it was necessary to render strains tetracycline sensitive between steps if multiple mutations were to be introduced. To accomplish this, tetracycline-resistant strains were plated to TCS medium (15 g/1. agar, 5 g(L Bacto tryptone, 5 g/L yeast extract, 50 mg/L
chlortetracycline hydrochloride, 10 g/1, NaCl, 10 g/L NaH2PO4H2O, 12 mg/L fusaric acid, and 0.1 mM ZnCI2) which selects for tetracycline sensitive mutants (described in Maloy and Nunn, 1981, J.
Bacreriol., 145:1110-1112. Colonies arising on this medium were purified by resireaking to the same medium, then checking individual colonies for tetracycline sensitivity by plating to LB
media with and without 12.5 pg/mL tetracycline.
The scheme described above for the production of strains containing combinations 20 of the manXYZ ptsG, and Anag mutations is presented schematically in Fig. 3.
Exam e 2 The following Example describes the cloning and overexpression of the gimS
gene and the integration of the T7 glmS gene cassette into the E. coil chromosome.
Clonin and Oyeres re 'an a the Elms Gene.
Using information obtained from the published sequence of the glmS gene (Walker er al, 1984, Bzochent J. 224:799-815, primers were synthesized to amplify the gene from genomic DNA isolated from strain W3110 (Table 1) using the polymerise chain reaction (PCR). The primers used for PCR amplification were designated Upl and Log and had the following sequences: LTpI: 5'-CGGTCTCCCATGTGTGGAATTGTTGGCGC-3' (SEQ ID NO:1) {W5224320.DOC;I }
Lo8: 5'-CTCTAGAGCGTTGATATTCAGTCAATTACAAACA-3' (SEQ ID NO:2) The Up 1 primer contained sequences corresponding to the first twenty nucleotides of the g1mS gene (represented in nucleotides 10-29 of SEQ ID NO: 1) preceded by a BsaI
restriction endonuclease recognition site (GGTCTC, represented in nucleotides 2-7 of 5 SEQ ID NO: 1). The Lo8 primer contained sequences corresponding to positions between 145 and 171 bases downstream ofthe g1mS gene (represented in nucleotides 8-34 of SEQ
ID NO:2) preceded by a XbaI restriction endonuclease site (TCTAGA, represented in nucleotides 2-7 of SEQ ID NO:2). PCR amplification was conducted using a standard protocol to generate a fragment of DNA containing the g1mS gene with 171 base pairs of 10 DNA downstream of the gene flanked by BsaI and Xbal sites. This DNA
fragment was cloned into the vector pCR-Scripfr SK(+) (Stratagene Cloning Systems, La Jolla, California) using materials and instructions supplied by the manufacturer. The resulting plasmid was designated pKLN23-20.
One consequence of this cloning was that it placed a unique Sacl restriction 15 endonuclease site downstream of the gene. This allowed excision of a fragment of DNA
containing the g1mS gene from pKLN23-20 using the restriction endonucleases BsaI and Sad. This fragment was then cloned between the Ncol and Sacl sites of the expression vector pET-24d(+) (Novagen, Inc., Madison, Wisconsin) to generate plasmid pKLN23-23. The pET-24d(+) expression vector is based on the T7 promoter system (Studier and 20 Moffatt, 1986, J. Mol. Biol., 189:113-130). Cloning in this manner resulted in placement of the g1mS gene behind the T7-lac promoter contained on pET-24d(+). The T7-lac promoter is specifically recognized by the T7 RNA polymerase and is only expressed in strains containing a cloned T7 gene 1, which encodes the T7 RNA polymerise.
The cloned T7 polymerase gene is contained on a defective bacteriophage I phage designated 25 11DE3. Strains in which the ADE3 element is integrated into the chromosome contain the T7 RNA polymerase gene driven by the lacUV5 promoter. In those strains, expression ofthe T7 RNA polymerase gene can be induced using the lactose analog isopropylthio-p-D-galactopyranoside (IPTG). Accordingly, addition of IPTG to such cultures results in induction of the T7 RNA polymerase gene and expression of any genes controlled by the 30 T7 or T7-lac promoter.
To verify that pKLN23-23 contained the glmS gene driven by the T7--lac promoter, the plasmid was transferred to strain B1.21(Dir3) (Novagen, Inc.) (Table 1)_ Strain BL21(13E3)/pKLN23-23 was grown in duplicate in LB medium containing 50 mg/L
kanamycin (kanamycm resistance is encoded by the plasmid). One of the duplicates was induced with 1 mM
IPTC); the other was not. When the total proteins were examined from these two cultures by sodium dodecyl sulfate polyacrylamide gel electrophoresis, it prominent protein of approximately 70,000 molecular weight, corresponding to the predicted size for the gIMS gene product, was observed in cells from the induced culture but not in cells from the uninduced culture. A preliminary enzyme assay from an induced culture indicated several hundred fold higher glucosamine-6-phosphate synthase activity in the induced culture than in what had typically been observed in a wild type strain.
fnte_ration of thLT7-glmS Gene Casserte Arty the R. coil Chromosome.
The glmS gene driven by the 77-lac (the T7--glmS gene cassette) promoter was transferred to the chromosome of E. coil strains by a multistep process. First. The cassette was cloned into plasmid pBRINT-Cm (Balbas et al, 1996, Gene 96:65-6).. The gene cassette was then integrated into the chromosome of strain ATCC47002 (Table 1) by the techniques described by 13albas et al, 1996, supra, to generate strains 1-71 and T-8 1 (Table 1). Finally, the gene cassette was transferred to strains of interest by transduction with PI,,, as described below.
The T7-glnS cassette was excised from pKLN23 -23 by performing a partial digest of the plasmid with restriction endonuclease Bg/II and a complete digest with restriction ondonuclease HinD1l1. Plastuid pKLN23 -23 contains a Bg/II site approximately 20 base pairs upstream of the T7 promoter. The glmS gene also contains two Bg/II sites, A partial- digest with this enzyme was necessary to cut the plasmid upstream of the T7 promoter while avoiding the two internal sites.
Plasmid pKLN23-23 also contains a unique HinDUI site downstream of the ghnS
gene. The excised T7-glmS cassette was theta cloned between the unique BamHl and HinDill sites of pBRJNT-Cm. This resulted in the production of plasmids designated pKLN23-27 and pKLN23-28. Plasmids pKT.N23-27 and pKLN23-28 contain the T7i'lmS cassette in addition to a {55224320 DOC:I }
chloramphenicol resistance determinant flanked by the 5'- and 3'-termini of the E. coli lacZ gene.
Strain ATCC 47002 (Table 1) contains mutations that confer upon it an inability to maintain plasmids such as pBRINT-Cm which contain a ColE 1 origin of replication.
When such plasmids are transferred to this strain, selection for genetic markers contained on the plasmid results in integration of the plasmid into the chromosome (Balbas et al., 1996, supra). As mentioned above, plasmids pKLN23-27 and -28 contain the T7-glmS
cassette and a chloramphenicol resistance determinant flanked by the 5'- and 3'-termini of the E. coli lacZ gene. The lacZ sequences target the incoming DNA to the lacZ gene contained in the chromosome. Integration at the IacZ locus replaces the intact lacZ gene, which encodes the enzyme P-galactosidase, with a partial lacZ gene interrupted by the T7-glmS-Cm cassette. As a result, integration at lacZ results in the strain becoming f3-galactosidase negative. The plasmid also contains an ampicillin resistance determinant remote from the 5'-1acZ--T7 g1mS-Cm-lacZ-3' cassette. Integration at IacZ and plasmid loss also results in ampicillin sensitivity.
Plasmids pKLN23-27 and -28 were transferred to strain ATCC 47002, and cells were plated to LB medium containing 10 pg/mL chloramphenicol, 1 mM IPTG, and pg/mL 5-bromo-4-chloro-3-indolyl-(3-D-galactopyranoside (X-gal). The X-gal contained in the medium is a chromogenic (3-galactosidase substrate. Hydrolysis of X-gal by (3-galactosidase results in a blue derivative. Inclusion of X-gal and IPTG, which induces the native lacZ gene, results in blue lacZ-positive colonies and white lacZ-negative colonies.
White (lacZ-negative) chloramphenicol resistant colonies were then selected and purified.
The strains were then verified for sensitivity to ampicillin by plating to LB
media with and without 100 g/mL ampicillin. DNA integration was further confirmed using a PCR
scheme as described by Balbas et al., 1996, supra. Integrants T-71 and T-81 (Table 1) resulted from the integration of the desired segments of plasmids pKLN23-27 and pKLN23-28, respectively, into the chromosome of ATCC 47002.
The T7-gimS-Cm cassette was then transferred to strains W3110(DE3), 7101-9(DE3), 7101-17(DE3), and 2123-4(DE3) byP1r;,transduction, as described in Example 1, using lysates prepared on strains T-71 and T-81. These strains contain various combinations ofthe A nag, manXYZ, and ptsG mutations in addition to the ADE3 element necessary for expression from the T7-lac promoter. The ?.DE3 element was introduced to these strains using the W1=3 lysogenization kit produced by Novagen, Inc. (Madison, Wisconsin).
Transductants were selected on L8 agar plates containing 30 14g/mL
chloramphenicol and 10 mM sodium citrate. Loss of fI-galactosidase activity was verified on plates containing X-gal and 1PTG and DNA integration was further confirmed using a PCR scheme as described by Balbas er al, 1996, supra Glucosamine-6-phosphate synthase activity was measured in production strains containing integrated T7 glmS cassettes after growth in LB medium with and without IPTG
(Table 2). Glucosamine-6-phosphate synthase was assayed in crude cell extracts using either colorimetric or spectrophotometric assays (8adet et at, 1987, Biochemistry 26:1940-1948) as described below. The extracts used for those assays were prepared by suspending washed cell pellets in 5 mL of 0.1 M KH2P04/K2HPO4, pH 7.5 per grain of wet cell paste, passing the suspension through a French press at 16,000 psi, and centrifuging the disrupted cell suspension at 35,000-40,000 xg for 15 to 20 minutes. The supernatant was used as the source of enzyme for the assay.
For a colorimetric assay, 1 mL reactions were prepared containing 45 mM
KH2PO4/K2HPO4, 20 mM fructose-6-phosphate, 15 mML-glutamine, 2.5 mMEDTA, pH
7,5, and cell extract. The reactions were incubated at 37 C for 20 minutes and stopped by boiling for 4 minutes. The resulting precipitate was removed by centrifugation and the supernatant was assayed for gucosamine-6-phosphate by a modification of the method of Elson and Morgan (1933, Biochem_ J_ 27:1824-1828) essentially as described by Zalldu (1985.Meth..Enzymol.
113:278-281) . To 100 L of the above supernatant was added 12.5 L of saturated NaHC03 and 12.5 pL of cold, freshly prepared 5% aqueous acetic anhydride. After incubating for 3 minutes at room temperature, the mixture was boiled for 3 minutes to drive off excess acetic anhydride. After cooling to room temperature, 150 aj4. of 0.8 M potassium borate, pH 9.2(0.8 MH38O3 adjusted to pH 9.2 with KOH) was added and the mixture was boiled for 3 minutes.
After cooling to room temperature, 1.25 mL Ehrlich's reagent (1% p-dimethylaminobenzaldehyde in glacial acetic acid containing 0.125 N HCI) was added to each tube. After incubating at 37 C for 30 minutes, the absorbance at 585 nm was measured and the amount of gucosamine-6-phosphate formed {E522g32o DOC;1 }
was determined using a standard curve. In the absence of the substrate, fructose-6-phosphate, or when boiled extract was used in the assay, no significant absorbance at 585 nm was observed.
In the spectrophotometric assay, 1 mL reactions containing 50 mM
KH2PO4/K2HPO4i10 mM fructose-6-phosphate, 6 mM L-glutamine,10 mM KCI, 0.6 mM
acetylpyridine adenine dinucleotide (APAD), and 50-60 Units of L-glutamic dehydrogenase (Sigma Type H from bovine liver) were run at room temperature.
The activity was followed by monitoring the absorbance at 3 65 nm after the addition of extract and corrected for the small absorbance increase observed in the absence of extract. The activity was calculated using a molar extinction coefficient for APAD of 9100.
Table 2 Glucosamine 6-Phosphate Synthase Activity in Production Strains Containing Integrated T7 glmS
Cassettes Activity, ( ole per minute per mL of extract) Strain Host Geno -1PTG + IPTG
2123-9 DE3 Anag 26 58 2123-10 DE3 dnag 33 67 2123-11 DZ 3 Anag manXYZ 32 59 2123-12 DE3 Anag manXYZ 17 67 2123-13 DE3 dnag manXYZ ptsG 21 68 2123-14 DE3 dna manXYZ ptsG 20 88 Table 2 shows that, on average, the activity ofglucosamine-6-phosphate synthase in production strains containing integrated T7-glmS cassettes was about three-to four-fold higher with IPTG induction than without. The activities were substantially higher than those obtained with a wild type glmS strain driven by its native promoter, which typically were in the range of 0.05-0.1 pmole per minute per mL of extract.
One of the strains, 2123-6, apparently suffered an aberrant integration event since the activity was lower than that observed in the other strains and was not influenced by the presence of IPTG in the medium.
Example 3 The following example shows the effect of strain genotype on glucosamine accumulation.
Strains with T7-glmS integrants, produced as described in Example 2, as well as 5 the corresponding parent strains without integrated DNA, were grown in shake flasks containing M9A medium (14 g/L K2HP04i 16 g/L KH2PO4, 1 g/L Na3Citrate-2H20, 5 g/L
(NH4)2SO4i pH 7.0) supplemented with 20 g/L glucose, 10 mM MgSO4, 1 mM CaC12, and 1 mM IPTG. Samples were taken periodically over the course of two days, and the glucosamine concentration in the culture supernatant was measured using the modified 10 Elson-Morgan assay as described in Example 2. Samples were assayed with and without acetic anhydride treatment, and the amount of glucosamine present was determined from the net absorbance using a standard curve.
Glucosamine concentrations after 24 hours of cultivation, at which time the concentration peaked, are indicated in Table 3. The results shown in Table 3 indicate that 15 for significant glucosamine production to occur, the T7-glmS gene cassette must be present. The data also indicate that the presence of the Anag mutation in the host results in a significant increase in glucosamine accumulation compared with its absence. Little effect of the manXYZ mutation was observed in this experiment. In further shake flask experiments, however, strain 2123-12 accumulated slightly higher glucosamine 20 concentrations on a consistent basis.
Table 3 Glucosamine in Culture Supernatants of Production Strains Strain Genotype Glucosamine Concentration, mgfL
(24 hours) 2123-5 DE3, T-71 integrant 21 25 2123-7 DE3, T-71 integrant 23 2123-9 DE3 Anag, T-71 integrant 67 2123-10 DE3 Anag, T-81 integrant 80 2123-11 DE3 Anag manXYZ, T-71 integrant 65 2123-12 DE3 Anag manXYZ, T-81 integrant 63 30 W3110(DE3) DE3, no integrant 4 7101-9(DE3) DE3 Anag, no integrant 0 7101-17(DE3) DE3 Anag manXYZ, no integrant 0 Example 4 The following example demonstrates the effect feeding nutrients to the cultures has on glucosamine accumulation.
In early experiments, it was observed that glucosamine accumulation ceased when glucose was depleted from cultures. In the experiment summarized by Table 4 and Fig.
4, it was found that increased glucosamine accumulation could be accomplished by feeding additional glucose and ammonium sulfate as they became depleted. For this experiment, strain 2123-12 was grown in M9A medium supplemented with 10 mM
MgSO4, 1 mM CaCl2, and 1 mM IPTG. Initial glucose concentrations and feeding regimens were varied as, indicated in Table 4. In one of the flasks, the initial ammonium sulfate concentration was 10 g/L rather than the 5 g/L normally used in M9A
medium.
Glucose concentration was monitored in shake flasks during cultivation using Diastix glucose test strips (Bayer Corporation Diagnostics Division, Elkhart, Indiana). When the glucose concentration was at or near depletion (<5 g/L remaining), glucose and/or ammonium sulfate were supplemented as indicated in Table 4. pH was also monitored during cultivation. When the pH varied significantly from the initial pH of 7.0, it was adjusted to 7.0 with sodium hydroxide.
Table 4 Shake Flask Experiment to Examine the Effect of Glucose Feeding Flask Initial Glucose, g/L Initial Ammonium Sulfate, Feed No.
1 20 5 None 2 50 5 None 3 50 10 None 4 20 5 20g/L Glucose 5 20 5 20 gtL Glucose + 5 g/L AmSO4 As Fig. 4 indicates, increasing the supply of glucose had a positive effect on glucosamine accumulation. By periodically feeding with glucose and ammonium sulfate (20 g/L and 5 g/L additions, respectively), a maximum accumulation of 0.4 g/L
of glucosamine was observed, approximately four-fold higher than what was observed in the absence of feeding.
Example 5 The following example describes the isolation of mutant g1mS genes encoding glucosamine-6-phosphate synthase enzymes with decreased sensitivity to glucosamine-6-phosphate product inhibition.
White (1968, Biochem. J., 106:847-858) first demonstrated that glucosamine-6-phosphate synthase was inhibited by glucosamine-6-phosphate. Using the spectrophotonietric assay for glucosamine-6-phosphate synthase as described in Example 2, the effects ofglucosamine-6-phosphate and glucosamine on glucosamine-6-phosphate synthase were measured. For determination of product inhibition, assays were run in the presence of various concentrations of added glucosamine-6-phosphate.
As indicated in Fig. 5, the enzyme is significantly inhibited by glucosamine-6-phosphate and slightly inhibited by glucosamine. These results are similar to those obtained by White, 1968, supra. This inhibition may be a key factor in limiting glucosamine accumulation in the glucosamine production strains.
To further increase glucosamine synthesis in production strains, efforts were made to isolate mutants of the g1mS gene encoding glucosamine-6-phosphate synthase variants with reduced product inhibition. To accomplish this, the cloned gene was amplified using the technique of error-prone PCR. In this method, the gene is amplified under conditions that lead to a high frequency of misincorporation errors by the DNA polymerase used for the amplification. As a result, a high frequency of mutations are obtained in the PCR
products.
Plasmid pKLN23-28 contains a unique SpeI restriction endonuclease site 25 base pairs upstream of the T7 promoter and 111 base pairs upstream of the start of the g1mS
gene. The plasmid also contains a unique HinDIII site 177 base pairs downstream of the glmS gene. PCR primers of the following sequences were synthesized to correspond to regions just upstream of the SpeI and downstream of the HinDIII sites, respectively:
5'-ATGGATGAGCAGACGATGGT-3' (SEQ ID NO: 3) 5'-CCTCGAGGTCGACGGTATC-3' (SEQ ID NO:4) Amplification with these primers (SEQ ID NO:3 and SEQ ID NO:4) allowed mutagenesis of a 2247 base pair region that included the entire g1mS gene. PCR
conditions were as described by Moore and Arnold, 1996, Nature Biotechnology 14:458-467. Briefly, a 100 L solution was prepared containing I mM each of the four deoxynucleotide triphosphates, 16.6 mM ammonium sulfate, 67 mMTtis-HC1, pH 8.8, 6.1 mM MgC12 6.7 gM
EDTA, 10 mM -mercaptoethanol, 10 I. DMSO, 30 ng each of the primers (SEQ 1D
NO:3 and SEQ ID NO:4), either 7 or 35 ng of plasmid pKLN23-28 linearized with Kpn I, and 2,5 Units of TagDNApolyinera_se(Perldn Elmer-Cetus, Emeryville, California). The reaction mixture was covered with 70 4 of mineral oil and placed in a thermoeycler, where the following steps were repeated for 25 cycles:
1 minute at 94 C
1 minute at 42 C
2 minutes at 720C
Under these conditions, an error frequency of approximately one mutation per 1000 base pairs has been reported (Moore and Arnold, 1996, supra). The product of the reaction was recovered, purified, and digested with Spel and HinD11 and cloned into the Spel-HinDUI
backbone fragment of pKLN23 -28, which effectively substitutes for the wild We glmS gene on the Spel-HinDIII fragment of pKLN23-28. The cloned DNA was used to transform strain NovaBlue (Novagen, Inc., Madison, Wisconsin), and the transformed calls were plated to LB
agar containing ampicillin. A total of 9000 plasmid-containing colonies were pooled from the anipicillin plates and plasmid DNA was prepared from the pooled cells to generate a library of pKLN23-28 derivative plasmids containing mutations in the cloned gImS gene.
The mutant plastids generated by error-prone PCR were screened for their ability to confer increased glucosamine production in a Anag manXYZ DE3 host background.
This screen was in the form of a bioassay in which the ability of plasmid-containing strains to crossfeed glucosamine-requiring strains of F coil was assessed-Strains of E. call (Sarves, 1971, J Baueriol. 105:467-471; Wu and Wu, 1971, J.
Bacteria! 105:455-466) and Bacillus subtilis (Freese at a!., 1970, J.
Bacterial. 101:1046-1062) which are defective for glucosamine-6-phosphate synthase require glucosamine or N-acetylglucosamine for growth. A glucosamine-requiring strain of E. coli was isolated after mutagenesis with N-methyl-N'-nitro-N-nitrosoguanidine (NG). Strain LE392 (Table 1) was grown in LB medium to a cell density of 6 x FoE cells per mL. 50 JEi1-4320 PQC,1) L of 2.5 mg/mL NG dissolved in methanol was added to 2 mL of this culture and the mixture was incubated at 37 C for 20 minutes. This treatment resulted in about 10%
survival of the strain. The mutagenized cells were harvested by centrifugation, and the cells were washed twice by suspension in 0.9% NaCl and recentrifugation. The washed cells were diluted and plated to nutrient agar medium (NA; 5 g/L Bacto peptone, 3 g/L
beef extract, 15 g/L agar) containing 0.2 g/L N-acetylglucosamine at a density of between 50 and 200 colony forming units per plate. Approximately 13,000 colonies were plated.
These colonies were replica-plated to NA agar with and without 0.2 g/L N-acetylglucosamine. Twenty-two colonies grew on NA with 0.2 g/L N-acetylglucosamine but not on NA without 0.2 g/L N-acetylglucosamine. These colonies were purified by streaking to NA with 0.2 g/L N-acetylglucosamine, and their growth phenotype was rechecked. Of the original 22 colonies selected, five had the phenotype expected of a glmS mutant of LE3 92. They failed to grow on NA but grew on NA supplemented with 0.2 g/L of glucosamine or 0.2 g/L N-acetylglucosamine. They also failed to grow on glucose minimal agar, but grew on glucose minimal agar supplemented with 0.2 g/L N-acetylglucosamine. One of these mutants was designated 2123-16 (Table 1).
For the cross-feeding assay, agar plates containing either glycerol or fructose as the principle carbon source for growth were overlaid with cells from a culture of strain 2123-16, the glucosamine-requiring strain isolated as described above.
Glucosamine-producing strains were stabbed into the agar and the ability to produce glucosamine was assessed based on the size of the "halo" of growth of the indicator strain surrounding the stab. Those stabs surrounded by larger halos were considered to produce greater amounts of glucosamine.
The media used for the cross-feeding assays consisted of M9 minimal medium (6 g/L Na2HPO4i 3 g/L KH2PO4, 0.5 g/L NaCl, 1 g/L NI-14C1, 1 mM MgSO4, 0.1 mM
CaCI2) supplemented with 40 mg/L of L-methionine (required for growth of strains LE392 or 2123-16) and 2 g/L of either glycerol or fructose. These plates were overlaid with strain 2123-16 as follows. A culture of strain 2123-16 was grown overnight at 37 C in LB
medium containing 1 g/L N-acetylglucosamine. The culture was harvested by centrifugation, and the cells were washed twice by suspension in 0.9% NaCI and recentrifugation. The washed cells were suspended in the original volume of 0.9% NaCl.
For each plate to be overlaid, 0.1 mL of washed cell suspension was mixed with 3 mL of molten (50 C) F-top agar (8 g/L NaCl, 8 g/L agar) and poured onto the plate.
The library of pKLN23-28 mutant plasmids was transferred to strain 7101-17(DE3) and transformed cells were plated to LB agar containing 100 gg/mL
ampicillin.
5 Each colony arising on these plates contained an individual member of the mutant plasmid library. The colonies were screened by picking them from the LB + ampicillin plates and stabbing them sequentially into:
(1) LB agar + ampicillin;
(2) glycerol minimal agar overlaid with strain 2123-16; and, 10 (3) fructose minimal agar overlaid with strain 2123-16 All plates were incubated for about 24 hours at 37 C. After this incubation period, halos of growth of the 2123-16 indicator strain could be observed surrounding the stabs in the overlaid plates. Those colonies giving rise to the larger halos were picked from the corresponding LB + ampicillin plate and streaked for purification. In an initial 15 screen, 4368 mutant candidates were screened, and 96 initial candidates were identified.
Upon rescreening those, 30 appeared to be superior to the rest, i.e. resulted in larger halos of the indicator strain.
Enzyme assays performed with six of the plasmid-containing strains isolated as described above indicated that three of the strains were less sensitive to inhibition by 20 glucosamine-6-phosphate than the enzyme from the control strain 7101-17(DE3)/pKLN23-28. The strains were grown overnight in LB broth containing 100 gg/mL ampicillin and 1 mM IPTG. Extracts prepared from cells harvested from those cultures were assayed for glucosamine-6-phosphate synthase using the spectrophotometric assay (described in Example 2) in the presence and absence of added glucosamine-6-25 phosphate. The mutant clones designated 11 C, 65A, and 8A were significantly less sensitive to glucosamine-6-phosphate than the control strain (Fig. 6). Other mutants were not distinguishable from the control by this assay.
Example 6 The following example describes the construction and characterization of 30 glucosamine production strains with mutations in gimS which result in reduced product inhibition.
Plasmid DNA isolated from clones 11 C, 52B, and 8A described above were transferred to strain ATCC 47002, which had been used previously to integrate the cloned T7 glmS construct into the E. coil chromosome. Integration was readily accomplished using the methods described in Example 2, and the integrated DNA was transferred to strain 7101-17(DE3) by P1 transduction as described in Example 1. These procedures produced strains that have the same genotype as strain 2123-12 except for the presence of mutations in the gimS gene generated by PCR. These new mutant production strains were designated 2123-49, 2323-5 1, and 2123-54, respectively. A summary of the strain construction strategy is presented in Fig. 7.
Strains 2123-12, 2123-49, 2123-51, and 2123-54 were grown overnight in LB
broth containing 1 mM IPTG. Extracts prepared from cells harvested from those cultures were assayed for glucosamine-6-phosphate synthase using the spectrophotometric assay described in Example 2 in the presence and absence of added glucosamine-6-phosphate.
The results of these assays are shown in Fig. 8.
Glucosamine production in these mutants was significantly elevated compared to that in 2123-12. When glucosamine production was assayed in shake flask cultures grown using the glucose and ammonium sulfate feeding protocol previously described in Example 4, when the cultures were grown to a cell density (measured as O.D.WO) of about 14 (about 8.4 g/L by dry cell weight), strains 212349,2123-5 1, and 2123-54 produced 1.5, 2.4, and 5.8 g/L glucosamine, respectively (Fig. 9) compared with 0.3 g/L for 2123-12.
Example 7 The following example describes the production of yet another strain with a mutation in glmS which results in reduced product inhibition.
An additional 6,344 colonies containing mutant plasmids generated by error-prone PCR as described in Example 5 were screened using the cross-feeding assay, also described in Example 5. Fifty four colonies resulted in larger halos than the rest of the colonies. DNA was isolated from all 54 colonies and strains isogenic to 2123-12 except for the mutations in gimS were constructed as described in Example 6.
Glucosamine production in most of these mutants was significantly elevated compared to strain 2123-12. Among the newly isolated mutants, the strain that produced the most glucosamine was a strain designated 2123-124. This strain produced 3.6 g/L of glucosamine when production was assayed in shake flasks using the glucose and ammonium sulfate feeding protocol described in Example 4 compared with 4.3 g/L
for strain 2123-54 in a side-by-side experiment.
Example 8 The following example describes the sequencing ofthe cloned wild typegIMS gene present in plasmid pKLN23-28. In addition, the sequences present in plasmids pKLN23-49, pKLN23-54, and pKLN23-124, containing the mutant glmS genes used to construct strains 2123-49, 2123-54, and 2123-124, respectively were sequenced and are described.
DNA sequencing reactions were performed using the Applied Biosystems (ABI) Prism Dye Terminator Cycle sequencing method with AmpliTaq DNA polymerase. The extended products were separated by gel electrophoresis on an ABI DNA
sequencer 373A
or 377. Sequences were analyzed using ABI Sequencing Analysis 3.0 software from ABI
and Sequencher 3.1 from Gene Codes.
The primers used for sequencing were as follows:
PK-1: 5'-TGGATGAGCAGACGATGG-3' (SEQ ID NO:5) PK-2: 5'-TCCGTCACAGGTATTTATTC-3' (SEQ IDNO:6) PK-3: 5'-AGCTGCGTGGTGCGTAC-3' (SEQ ID NO:7) PK-4: 5'-GGACCGTGTTTCAGTTCG-3' (SEQ ID NO:8) PK-5A: 5'-GCCGTGGCGATCAGTAC-3' (SEQ ID NO:9) PK-6A: 5'-GCCAATCACCAGCGGAC-3' (SEQ IDNO:10) PK-7: 5'-ATGGTTTCCCGCTACTGG-3' (SEQ IDNO:11) PK-8: 5'-GAACCAAGGTAACCCAGC-3' (SEQ IDNO:12) The nucleotide sequence of plasmid pKLN23-28, containing the wild-type g1mS
gene, was determined to be a 7408 bp nucleic acid sequence represented herein as SEQ
ID NO:13. The 2184 base pairs between positions 1130 and 3313 of SEQ ID NO:13 were determined using the primers described above. The nucleic acid molecule representing positions 1130-3313 of SEQ ID NO: 13 is referred to herein as nglmS-282184 and is further identified as SEQ ID NO: 14. nglmS-282184 was shown to include the Spel and HinDIlI sites used to construct the mutant plasmids as described in Example 5. The remaining DNA sequence of SEQ ID NO: 13 is based on the known sequences of the vectors used for the construction of pKLN23-28. The same 2184 base pair region was sequenced in plasmids pKLN23-49, pKLN23-54, and pKLN23-124. It is noted that for the discussion of the mutant glmS genes of these plasmids (Table 5), the specific nucleotide position ofmutations in the nucleotide sequence ofthe mutant gIMS-containing plasmids will be described using SEQ ID NO: 13 as a reference.
SEQ ID NOs: 13 and 14 contain an open reading frame that encodes the glmSgene product (i.e., GIcN6P synthase enzyme) which is a nucleic acid molecule referred to herein as nglmS-281830, the nucleic acid sequence of which is represented by SEQ ID
NO: 15.
SEQ IDNO:15 spans nucleotides 1253 to 3082 of SEQ ID NO: 13, with an initiation codon spanning from nucleotides 1253-1255 and a termination codon spanning from nucleotides 3080-3082. SEQ ID NO:15 encodes a protein of 609 amino acids referred to herein as G1cN6P-S-28, the deduced amino acid sequence of which is represented herein as SEQ ID NO: 16. It is noted that for the discussion of the mutant glmS gene products produced by the mutant strains described herein, specific mutations in the amino acid sequence of the mutant glmS gene products will be described using SEQ ID
NO:16 as a reference.
The primers described above correspond to the following nucleotide positions of SEQIDNO:13:
PK-1 (SEO ID NO:5): positions 1087-1104 of SEQ ID NO: 13;
PK-2 (SEQ ID NO:6): positions 3378-3359 of a nucleic acid sequence complementary to SEQ ID NO:13;
PK-3 (SEO ID NO:7): positions 1707-1723 of SEQ ID NO: 13;
PK-4 (SEQ ID NO:8): positions 2772-2755 of a nucleic acid sequence complementary to SEQ ID NO:13;
PK-5A SEO ID NO:9): positions 2667-2683 of SEQ ID NO: 13;
PK-6A (SEO IDNO:1 ): positions 1798-1782 of a nucleic acid sequence complementary to SEQ ID NO: 13;
PK-7 (SEQ ID NO: 11): positions 2177-2194 of SEQ ID NO:13;
PK-8 SEO ID NO: 12): positions 2364-2347 of a nucleic acid sequence complementary to SEQ ID NO: 13.
The nucleic acid sequence of nucleic acid molecule nglmS-281830(SEQ ID NO: 15, or positions 1253-3082 of SEQ ID NO: 13) from pKLN23-28, differs from the published sequence (Walker, 7.E, et at., 1984, "DNA sequence around the Escherichia coil unc operon", Biochem. J. 224:799-815) at positions 2509 and 2510 (with reference to SEQ
ID NO:13). The nucleotides for pKLN23-28 at these positions as determined in this example were G and C, respectively, while those reported in the published sequence were C and G. Otherwise, the published and determined sequences of the gimS gene were identical. The sequences determined upstream and downstream from the gimS gene were those expected based on the known sequences of the vectors used for the construction of pKLN23-28 and the methods used to construct the plasmid.
The nucleotide sequences for the mutant gimS genes for plasmids pKLN23-49, pKLN23-54, and pKLN23-124 were determined as described above for pKLN23-28.
Mutations were found in each ofthose plasmids. The mutations and the predicted amino acid changes in the corresponding mutant gimS gene products, as compared to the wild-type sequence determined for pKLN23-28 (SEQ ID NO: 13) are summarized in Table 5.
Table 5 Mutations in gimS Genes of Glucosamine-Overproducing strains.
Plasmid Position* Base Change Amino Acid Change (Position**) pKLN23-49 1263 T to C Ile to Thr (4) 2067 T to C Ile to Thr (272) 2600 T to C Ser to Pro (450) pKLN23-54 1367 G to A Ala to Thr (39) 2000 C to T Arg to Cys (250) 2239 T to C Silent (329) 2666 G to A Gly to Ser (472) 3264 A to G Outside gene pKLN23-124 1525 T to C Silent (91) 2658 T to C Leu to Pro (469) 3280 G to A Outside gene *Refers to nucleic acid position as indicated in the sequence of pKLN23-28 (SEQ ID NO: 13) **TheglmS gene (nglmS-281.; SEQ ID NO: 15) encodes a protein of 609 amino acids in length (SEQ ID NO: 16); the methionine residue at position 1 is removed by a hydrolase.
Plasmid pKLN23-49 contains a 2184 bp nucleic acid molecule referred to herein 5 as nglmS-492184, which comprises a mutant glmS gene. The nucleic acid sequence of ngImS-492184 is represented herein as SEQ ID NO: 17. A nucleic acid molecule spanning from nucleotide 124 through 1953 of SEQ ID NO: 17, referred to herein as nglmS-491830, represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase ofthe present invention, with an initiation codon spanning from nucleotides 124-126 and 10 a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO: 17.
The nucleic acid sequence of nglmS-491830 is represented herein as SEQ ID NO: 18.
SEQ ID
NO:18 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-49, the deduced amino acid sequence of which is represented herein as SEQ ID NO: 19. SEQ ID NO: 17 has a nucleic acid sequence that 15 is identical to positions 1130 through 3313 of SEQ ID NO:13 (i.e., SEQ ID
NO:14), except for the mutations as indicated for plasmid pKLN23-49 in Table S. SEQ ID
NO:18 has a nucleic acid sequence that is identical to positions 1253 through 3082 of SEQ ID
NO:13 (i.e., SEQ ID NO: 15), except for the mutations as indicated for plasmid pKLN23-49 in Table 5.
20 Plasmid pKLN23-54 contains a 2184 bp nucleic acid molecule referred to herein as nglmS-542184, which comprises a mutant gimS gene. The nucleic acid sequence of nglmS-542184 is represented herein as SEQ ID NO:20. A nucleic acid molecule spanning from nucleotide 124 through 1953 of SEQ ID NO:20, referred to herein as ngimS-541830, represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase 25 of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:20. The nucleic acid sequence of nglmS-541830 is represented herein as SEQ ID NO:21.
SEQ ID
NO:21 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as GIcN6P-S-54, the deduced amino acid sequence of which is 30 represented herein as SEQ ID NO:22. SEQ ID NO:20 has a nucleic acid sequence that is identical to positions 1130 through 3313 of SEQ ID NO:13 (i.e., SEQ ID
NO:14), except for the mutations as indicated for plasmid pKLN23-54 in Table 5. SEQ ID
NO:21 has a nucleic acid sequence that is identical to positions 1253 through 3082 of SEQ ID
NO: 13 (i.e., SEQ ID NO: 15), except for the mutations as indicated for plasmid pKLN23-54 in Table 5.
Plasmid pKLN23-124 contains a 2184 bp nucleic acid molecule referred to herein as nglmS-1242184, which comprises a mutant glnS gene. The nucleic acid sequence of nglmS-12421k is represented herein as SEQ ID NO:23. A nucleic acid molecule spanning from nucleotide 124 through 1953 of SEQ ID NO:23, referred to herein as nghnS-1241830, represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:23. The nucleic acid sequence ofnglmS-1241830 is represented herein as SEQ ID NO:24.
SEQ ID
NO:24 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-124, the deduced amino acid sequence of which is represented herein as SEQ ID NO:25. SEQ ID NO:23 has a nucleic acid sequence that is identical to positions 1130 through 3313 of SEQ ID NO: 13 (i.e., SEQ ID NO:
14), except for the mutations as indicated for plasmid pKLN23-124 in Table 5. SEQ
ID
NO:24 has a nucleic acid sequence that is identical to positions 1253 through 3082 of SEQ ID NO:13 (i.e., SEQ ID NO:15), except for the mutations as indicated for plasmid pKLN23-124 in Table 5.
To verify that the same mutations were present in the strains into which the mutant glmS genes were integrated into the chromosome, PCR products were generated from genomic DNA isolated from strains 2123-49, 2123-54, and 2123-124. For PCR
amplification, the primers listed in Example 3 for the mutagenesis of the gene (SEQ ID
NO:3 and SEQ ID NO:4) were used. PCR reactions were carried out in 50 L
reactions consisting of 20 mM Tris'HCI (pH 8.8), 10 mM KCI, 10 mM (NH4)2SO4i 20 mM
MgSO4, 0.1% Triton X-100, 0.1 mg/mL nuclease-free bovine serum albumin, 0.05 mM each deoxynucleotide triphosphate, 2 M each primer, 1.25 U cloned Pfu DNA
polymerase (Stratagene), and 160 ng of genomic DNA. The complete reactions were placed in a RoboCycler Gradient 96 Temperature Cycler (Stratagene). After 3 minutes at 94 C, the following three steps were repeated for 30 cycles: (1) 30 seconds at 94 C; (2) 30 seconds at 47 C; and (3) 2 minutes at 72 . This was followed with a 7 minute incubation at 72 C.
The resulting DNA contained the expected amplification product in addition to extraneous products. The product containing the glmS gene was purified using a QlAquick PCR purification kit, followed by electrophoresis of the purified product on an agarose gel, isolation of the correct band using a QlAquick gel extraction kit, and reamplification using this isolated DNA as a template. The reactions with the isolated DNA were amplified in a similar fashion as the original amplification described above, except that 40 ng of DNA was used as template, and only 20 cycles of amplification were performed. The product from this second amplification reaction was recovered as described above.
The presence of mutations in the genomic DNA was verified using primers specific for the DNA regions containing the mutations identified in the plasmids. For 2123-49, these included primers PK-1 (SEQ ID NO:5), PK-3 (SEQ ID NO:7), PK-4 (SEQ ID
NO:8), and PK-5A (SEQ IDNO:9). For 2123-124, primers PK-1 (SEQ IDNO:5), PK-4 (SEQ ID NO:8) and PK-5 (SEQ ID NO:9) were used. For 2123-54, the entire PCR
product was sequenced using all eight primers described earlier (SEQ ID NOs:5-12).
Sequencing of the PCR products confirmed the presence of the mutations identified from the plasmids and listed in Table 5.
Example 9 This example describes the construction of strains containing a mutant glmS
gene encoding a product containing only the glycine to serine alteration at position 472 (SEQ
ID NO:22) from strain 2123-54.
As indicated in Table 5, the only amino acid change in the GIcN6P synthase enzyme for strain 2123-124 (G1cN6P-S-124) is a leucine to proline alteration at position 469 (SEQ ID NO:25), unambiguously defining this mutation as being responsible for the overproduction of glucosamine by strain 2123-124. This would suggest the possibility that the glycine to serine alteration at position 472 (gly-ser472; SEQ ID
NO:22) of G1cN6P-S-54 in strain 2123-54 was likewise responsible for the glucosamine overproduction phenotype for this strain. In an effort to demonstrate this, the alteration was isolated away from the other two amino acid alterations in the G1cN6P-S-54 amino acid sequence (SEQ ID NO:22) of strain 2123-54 (i.e., Ala-'Thr39 and Arg--Cys250) by digesting plasmid pKLN23-54 with EcoRl and HinDM. These enzymes each have unique cleavage sites on the plasmid and cut at positions 2241 and 3305, respectively (positions indicated with respect to the equivalent positions in SEQ ID NO: 13 for pKLN23-28), resulting in fragments of 1064 and 6344 base pairs. The smaller fragment contains mutations in which the gly- ser472 alteration is the only amino acid change in this portion of GIcN6P-S-54. This smaller fragment was ligated to the corresponding larger fragment from pKLN23-28 containing the wild type glmS gene.
Two plasmids resulting from this ligation were designated pKLN23-149 and pKLN23-151. Sequencing the DNA from these plasmids using primers PK-1 (SEQ ID
NO:5), PK-3 (SEQ ID NO:7), and PK-4 (SEQ ID NO:8) verified that these plasmids contained the mutation at position 2666 present in plasmid pKLN23-54 but not the mutations at positions 1367 and 2000 (Table 5 with reference to SEQ ID NO:13).
The nucleic acid sequence of the 2184 base pairs between positions 1130 and of plasmid pKLN23-149 (these positions being determined relative to the equivalent positions in SEQ ID NO:13) are referred to herein as nucleic acid molecule nglmS-1492,84, the nucleic acid sequence of which is represented by SEQ ID NO:26. SEQ ID
NO:26 contains a nucleic acid sequence spanning nucleotides 124 through 1953, referred to herein as nglmS-1491830i which represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:26. The nucleic acid sequence of nglmS-1491830 is represented herein as SEQ ID NO:27. SEQ ID NO:27 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-149, the deduced amino acid sequence of which is represented herein as SEQ ID NO:28.
The nucleic acid sequence of the 2184 base pairs between positions 1130 and of plasmid pKLN23-151 (these positions being determined relative to the equivalent positions in SEQ ID NO: 13) are referred to herein as nucleic acid molecule nglmS-15121843 the nucleic acid sequence of which is represented by SEQ ID NO:29. SEQ ID
NO:29 contains a nucleic acid sequence spanning nucleotides 124 to 1953, referred to herein as nglmS-1511830, which represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:29. The nucleic acid sequence of nglmS-1511s30 is represented herein as SEQ
IDNO:30. SEQ ID NO:30 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-151, the deduced amino acid sequence of which is represented herein as SEQ ID NO:31.
Strains isogenic to strain 2123-12 except for mutations conferring the gly-.
ser472 alteration were constructed using the scheme indicated in Fig. 7. Strains 2123-149 and 2123-151 were generated from plasmids pKLN23-149 and pKLN23-151, respectively.
The presence of the mutation at position 2666 (SEQ ID NO: 13) and the absence of mutations at positions 1367 and 2000 were verified by sequencing of PCR
products from genomic DNA of these strains using the methods described in Example 8.
Example 10 This example compares properties of G1cN6P synthase enzymes from strains 2123-12, 2123-49, 2123-54, 2123-124, 2123-149, and 2123-151.
Strains 2123-12, 2123-49, 2123-54, 2123-124, 2123-149, and 2123-151, described in the examples above, were grown overnight in LB broth at 37 C then transferred to fresh LB broth. Cultures were grown to an absorbance at 600 nm of 0.8 to 0.9, then induced for G1cN6P synthase production by the addition of I mM
IPTG. The cultures were grown for an additional three hours at 37 and harvested.
Extracts were prepared from cells harvested from those cultures as described in Example 2 and were assayed for glucosamine-6-phosphate synthase using the spectrophotometric assay as described in Example 2 except that a fiuctose-6-phosphate concentration of 20 mM was used. The enzyme was assayed in the presence and absence of added glucosamine-6-phosphate. In the absence of glucosamine-6-phosphate, the specific activities measured for these enzymes were similar except for that from strain 2123-124.
The data from Table 6 suggests that the latter strain encodes a less active variant of the enzyme.
Table 6 Specific Activities of GIcN6P Synthase from Glucosamine-Producing Strains Strain Specific Activity, mol min-' mg-' 2123-12 0.385 2123-49 0.375 2123-54 0.416 2123-124 0.0076 2123-149 0.494 2123-151 0.515 Fig. 10 shows that the G1cN6P synthase enzymes from strains 2123-49, 2123-54, and 2123-124 are significantly less inhibited by G1cN6P than the enzyme from strain 5 2123-12. Enzymes from strains 2123-149 and 2123-151 are slightly less inhibited by G1cN6P than the enzyme from 2123-12.
Thermal stability of the enzymes was also examined using these extracts. The extracts were incubated at 45 C (Fig. 11A) or 50 C (Fig. 11B) for various periods then assayed using the spectrophotometric assay. Figs. 11A and 11B show that the enzymes from 10 2123-49 and 2123-54 are much less stable than the wild type enzyme from strain 2123-12.
The enzyme from strain 2123-124 is comparable in stability to the wild type enzyme, and the enzymes from 2123-149 and 2123-151 are slightly less stable under the incubation conditions described here.
Example 11 15 The following example illustrates the effects of isopropylthio-p-D-galactoside (IPTG) concentration and temperature on glucosamine production.
Cultures of strains 2123-54 and 2123-124 were grown for 20 hours at 37 C on M9A medium (14 g/L K2HPO4, 16 g/L KH2PO4, 1 g/L Na3Citrate2H20, 5 g/L
(NH4)2SO4i pH 7.0) supplemented with 20 g/L glucose, 1 mM MgSO4, 0.1 mM CaCI2, 20 and varying amounts of IPTG. At the end of the growth period, a sample was taken and the glucosamine concentration in the culture supernatant was assayed using the Elson-Morgan assay described in Example 2. The results shown in Fig. 12 indicate that the optimum IPTG concentration for production is about 0.2 mM.
Subsequently, strain 2123-54 was grown in the same medium as described above in shake flasks with either 0.2 or 1 mM IPTG and at 30 C or 37 C. These flask cultures were also fed glucose and ammonium sulfate as described in Example 4. At various intervals, samples were taken and the glucosamine concentrations in culture supernatants were assayed using the Elson-Morgan assay described in Example 2. Fig. 13 shows that under the conditions of this experiment, there was little difference in glucosamine production associated with the differences in IPTG concentration. However, growth at 30 C resulted in higher glucosamine production than did growth at 37 C.
Results shown in Figs. 14A and 14B further indicated that at 30 C (Fig. 14A), glucosamine production continued after growth had ceased, while at 37 C (Fig. 14B), growth and glucosamine production occurred in concert.
When strains 2123-49 and 2123-124 were grown with 0.2 mM IPTG at 30 C, glucosamine production also occurred after growth had ceased, as shown in Figs. 15A
(2123-49) and 15B (2123-124). As observed at 37 C, the highest concentrations of glucosamine were obtained with strain 2123-54, followed by 2123-124 and 2123-49.
Also tested were strains 2123-149 and 2123-151 which produced negligibly higher concentrations of glucosamine than did 2123-12 (Table 7).
Table 7 Production of Glucosamine at 30 Strain Maximum Glucosamine Production, gIL
2123-12 0.3 2123-49 4.6 2123-54 7.2 2123-124 5.3 2123-149 0.6 2123-151 0.6 Example 12 The following example illustrates that glucosamine can be produced at higher concentrations in fermentor cultures of strain 2123-54 as compared to shake flasks. This example also illustrates that in fermentors, strain 2123-54 produces more glucosamine at 30 C than at 37 C.
Fermentation cultures of strain 2123-54 were cultivated in the medium shown in Table 8. Fermentations were run using NaOH for pH control to pH 6.7 and were fed a mixture of 33% glucose, 8% ammonium sulfate. Aeration and agitation were adjusted to maintain a dissolved oxygen concentration of greater than 20% of air saturation.
Table 8 Fermentation Medium Component Amount, g/L
Na3Citrate-2H20 1 (NH4)2SO4 5 MgSO4 0.12 CaC12 0.011 Mazu 204 Antifoam 0.5 mL/L
IPTG 0.048 Glucose 20 Trace Metals *Trace metal composition is 0.7 mg/L CoC12, 1.7 mg/L H3803i 0.6 mg/L
CuC12.21120, 10.5 mg/L
FeC13.6H20, 12 mg/L MnC12.4H20, 1.5 mg/L Na2Mo4.2H20, 1.5 mg/L ZnC12.
In the following experiment, three fermentations were run in one-liter vessels containing an initial volume of 600 mL. Variables tested were as follows.
Fermentor #1: The mixture of 33% glucose and 8% ammonium sulfate was fed at such a rate that no glucose accumulated in the fermentor. Growth was at 37 .
Fermentor #2: As with fermentation #1 except that growth was at 30 .
Fermentor #3: As with fermentation #2 except that the feed rate was increased to maintain a constant glucose concentration in the fermentor of 5 to 10 g/L.
Results from these fermentations are shown in Figs. 16A, 16B and 16C.
Comparison of the results from fermentors 1 (Fig. 16A) and 2 (Fig. 16B) shows that glucosamine titers are markedly higher at 30 C than they are at 37 C, as observed in shake flasks. The maximum glucosamine concentration observed was in the glucose-excess fermentor 3 grown at 30 C (Fig. 16C), at 10.9 g/L. At 30 C, growth and glucosamine concentration appeared to coincide, and there appeared to be a slight advantage to growth under glucose-excess. In subsequent fermentation experiments, run under conditions similar to fermentor #3, glucosamine concentrations in excess of 12 g/L have been obtained (data not shown).
In summary, the present inventors have described herein the use of metabolic engineering to create the first glucosamine overproducing strains ofE. coll.
The concept, proven here, will be generally applicable to any microorganism having a pathway for the production of amino sugars, or to any recombinant microorganism into which a pathway for the production of amino sugars has been introduced. In addition to the present strategy for creating a glucosamine-producing strain (i.e., eliminating glucosamine degradation and uptake and increasing expression of the glmS gene), the present inventors have also established that reducing product inhibition of glucosamine-6-phosphate synthase by glucosamine-6-phosphate improves glucosamine production.
While various embodiments of the present invention have been described in detail, it is apparent that modifications and adaptations of those embodiments will occur to those skilled in the art. However, it is to be expressly understood that such modifications and adaptations are within the spirit and scope of the present invention, as set forth in the following claims.
SEQUENCE LISTING
<110> Berry, Alan Burlingame, Richard P.
Millis, James R.
<120> PROCESS AND MATERIALS FOR PRODUCTION OF GLUCOSAMINE
<130> 3161-18-C1-PCT
<140>
<141>
<150> PCT/US98/00800 <151> 1998-01-14 <150> 60/035,494 <151> 1997-01-14 <160> 31 <170> Patentln Ver. 2.0 <210> 1 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 1 cggtctccca tgtgtggaat tgttggcgc 29 <210> 2 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 2 ctctagagcg ttgatattca gtcaattaca aaca 34 <210> 3 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 3 atggatgagc agacgatggt 20 <210> 4 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 4 cctcgaggtc gacggtatc 19 <210> 5 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 5 tggatgagca gacgatgg 18 <210> 6 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 6 tccgtcacag gtatttattc 20 <210> 7 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 7 agctgcgtgg tgcgtac 17 <210> 8 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 8 ggaccgtgtt tcagttcg 18 <210> 9 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 9 gccgtggcga tcagtac 17 <210> 10 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 10 gccaatcacc agcggac 17 <210> 11 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 11 atggtttccc gctactgg 18 <210> 12 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 12 gaaccaaggt aacccagc 18 <210> 13 <211> 7408 <212> DNA
<213> Escherichia coli <220>
<221> RBS
<222> (1240)..(1245) <220>
<221> promoter <222> (1165)..(1181) <220>
<221> conflict <222> (2509)..(2510) <400> 13 gaattgatcc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 60 atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 120 atcgcccttc ccaacagttg cgcagcctga atggcgaatg gcgctttgcc tggtttccgg 180 caccagaagc ggtgccggaa agctggctgg agtgcgatct tcctgaggcc gatactgtcg 240 tcgtcccctc aaactggcag atgcacggtt acgatgcgcc catctacacc aacgtaacct 300 atcccattac ggtcaatccg ccgtttgttc ccacggagaa tccgacgggt tgttactcgc 360 tcacatttaa tgttgatgaa agctggctac aggaaggcca gacgcgaatt atttttgatg 420 gcgttaactc ggcgtttcat ctgtggtgca acgggcgctg ggtcggttac ggccaggaca 480 gtcgtttgcc gtctgaattt gacctgagcg catttttacg cgccggagaa aaccgcctcg 540 cggtgatggt gctgcgttgg agtgacggca gttatctgga agatcaggat atgtggcgga 600 tgagcggcat tttccgtgac gtctcgttgc tgcataaacc gactacacaa atcagcgatt 660 tccatgttgc cactcgcttt aatgatgatt tcagccgcgc tgtactggag gctgaagttc 720 agatgtgcgg cgagttgcgt gactacctac gggtaacagt ttctttatgg cagggtgaaa 780 cgcaggtcgc cagcggcacc gcgcctttcg gcggtgaaat tatcgatgag cgtggtggtt 840 atgccgatcg cgtcacacta cgtctgaacg tcgaaaaccc gaaactgtgg agcgccgaaa 900 tcccgaatct ctatcgtgcg gtggttgaac tgcacaccgc cgacggcacg ctgattgaag 960 cagaagcctg cgatgtcggt ttccgcgagg tgcggattga aaatggtctg ctgctgctga 1020 acggcaagcc gttgctgatt cgaggcgtta accgtcacga gcatcatcct ctgcatggtc 1080 aggtcatgga tgagcagacg atggtgcagg atctccaccg cggtggcggc cgctctagaa 1140 ctagtggatc tcgatcccgc gaaattaata cgactcacta taggggaatt gtgagcggat 1200 aacaattccc ctctagaaat aattttgttt aactttaaga aggagatata ccatgtgtgg 1260 aattgttggc gcgatcgcgc aacgtgatgt agcagaaatc cttcttgaag gtttacgtcg 1320 tctggaatac cgcggatatg actctgccgg tctggccgtt gttgatgcag aaggtcatat 1380 gacccgcctg cgtcgcctcg gtaaagtcca gatgctggca caggcagcgg aagaacatcc 1440 tctgcatggc ggcactggta ttgctcacac tcgctgggcg acccacggtg aaccttcaga 1500 agtgaatgcg catccgcatg tttctgaaca cattgtggtg gtgcataacg gcatcatcga 1560 aaaccatgaa ccgctgcgtg aagagctaaa agcgcgtggc tataccttcg tttctgaaac 1620 cgacaccgaa gtgattgccc atctggtgaa ctgggagctg aaacaaggcg ggactctgcg 1680 tgaggccgtt ctgcgtgcta tcccgcagct gcgtggtgcg tacggtacag tgatcatgga 1740 ctcccgtcac ccggataccc tgctggcggc acgttctggt agtccgctgg tgattggcct 1800 ggggatgggc gaaaacttta tcgcttctga ccagctggcg ctgttgccgg tgacccgtcg 1860 ctttatcttc cttgaagagg gcgatattgc ggaaatcact cgccgttcgg taaacatctt 1920 cgataaaact ggcgcggaag taaaacgtca ggatatcgaa tccaatctgc aatatgacgc 1980 gggcgataaa ggcatttacc gtcactacat gcagaaagag atctacgaac agccgaacgc 2040 gatcaaaaac acccttaccg gacgcatcag ccacggtcag gttgatttaa gcgagctggg 2100 accgaacgcc gacgaactgc tgtcgaaggt tgagcatatt cagatcctcg cctgtggtac 2160 ttcttataac tccggtatgg tttcccgcta ctggtttgaa tcgctagcag gtattccgtg 2220 cgacgtcgaa atcgcctctg aattccgcta tcgcaaatct gccgtgcgtc gtaacagcct 2280 gatgatcacc ttgtcacagt ctggcgaaac cgcggatacc ctggctggcc tgcgtctgtc 2340 gaaagagctg ggttaccttg gttcactggc aatctgtaac gttccgggtt cttctctggt 2400 gcgcgaatcc gatctggcgc taatgaccaa cgcgggtaca gaaatcggcg tggcatccac 2460 taaagcattc accactcagt taactgtgct gttgatgctg gtggcgaagc tgtctcgcct 2520 gaaaggtctg gatgcctcca ttgaacatga catcgtgcat ggtctgcagg cgctgccgag 2580 ccgtattgag cagatgctgt ctcaggacaa acgcattgaa gcgctggcag aagatttctc 2640 tgacaaacat cacgcgctgt tcctgggccg tggcgatcag tacccaatcg cgctggaagg 2700 cgcattgaag ttgaaagaga tctcttacat tcacgctgaa gcctacgctg ctggcgaact 2760 gaaacacggt ccgctggcgc taattgatgc cgatatgccg gttattgttg ttgcaccgaa 2820 caacgaattg ctggaaaaac tgaaatccaa cattgaagaa gttcgcgcgc gtggcggtca 2880 gttgtatgtc ttcgccgatc aggatgcggg ttttgtaagt agcgataaca tgcacatcat 2940 cgagatgccg catgtggaag aggtgattgc accgatcttc tacaccgttc cgctgcagct 3000 gctggcttac catgtcgcgc tgatcaaagg caccgacgtt gaccagccgc gtaacctggc 3060 aaaatcggtt acggttgagt aataaatgga tgccctgcgt aagcggggca tttttcttcc 3120 tgttatgttt ttaatcaaac atcctgccaa ctccatgtga caaaccgtca tcttcggcta 3180 ctttttctct gtcacagaat gaaaattttt ctgtcatctc ttcgttatta atgtttgtaa 3240 ttgactgaat atcaacgctc tagaggggct agagcggccg ccaccgcggt ggagctccgt 3300 cgacaagctt atcgataccg tcgacctcga gggggggccc ggtaccgagg acgcgttcga 3360 ataaatacct gtgacggaag atcacttcgc agaataaata aatcctggtg tccctgttga 3420 taccgggaag ccctgggcca acttttggcg aaaatgagac gttgatcggc acgtaagagg 3480 ttccaacttt caccataatg aaataagatc actaccgggc gtattttttg agttatcgag 3540 attttcagga gctaaggaag ctaaaatgga gaaaaaaatc actggatata ccaccgttga 3600 tatatcccaa tggcatcgta aagaacattt tgaggcattt cagtcagttg ctcaatgtac 3660 ctataaccag accgttcagc tggatattac ggccttttta aagaccgtaa agaaaaataa 3720 gcacaagttt tatccggcct ttattcacat tcttgcccgc ctgatgaatg ctcatccgaa 3780 attccgtatg gcaatgaaag acggtgagct ggtgatatgg gatagtgttc acccttgtta 3840 caccgttttc catgagcaaa ctgaaacgtt ttcatcgctc tggagtgaat accacgacga 3900 tttccggcag tttctacaca tatattcgca agatgtggcg tgttacggtg aaaacctggc 3960 ctatttccct aaagggttta ttgagaatat gtttttcgtc tcagccaatc cctgggtgag 4020 tttcaccagt tttgatttaa acgtggccaa tatggacaac ttcttcgccc ccgttttcac 4080 catgggcaaa tattatacgc aaggcgacaa ggtgctgatg ccgctggcga ttcaggttca 4140 tcatgccgtt tgtgatggct tccatgtcgg cagaatgctt aatgaattac aacagtactg 4200 cgatgagtgg cagggcgggg cgtaattttt ttaaggcagt tattggtgcc cttaaacgcc 4260 tggtgctacg cctgaataag tgataataag cggatgaatg gcagaaattc ggacgcgtca 4320 attcgagctc ctgcactgga tggtggcgct ggatggtaag ccgctggcaa gcggtgaagt 4380 gcctctggat gtcgctccac aaggtaaaca gttgattgaa ctgcctgaac taccgcagcc 4440 ggagagcgcc gggcaactct ggctcacagt acgcgtagtg caaccgaacg cgaccgcatg 4500 gtcagaagcc gggcacatca gcgcctggca gcagtggcgt ctggcggaaa acctcagtgt 4560 gacgctcccc gccgcgtccc acgccatccc gcatctgacc accagcgaaa tggatttttg 4620 catcgagctg ggtaataagc gttggcaatt taaccgccag tcaggctttc tttcacagat 4680 gtggattggc gataaaaaac aactgctgac gccgctgcgc gatcagttca cccgtgcacc 4740 gctggataac gacattggcg taagtgaagc gacccgcatt gaccctaacg cctgggtcga 4800 acgctggaag gcggcgggcc attaccaggc cgaagcagcg ttgttgcagt gcacggcaga 4860 tacacttgct gatgcggtgc tgattacgac cgctcacgcg tggcagcatc aggggaaaac 4920 cttatttatc agccggaaaa cctaccggat tgatggtagt ggtcaaatgg cgattaccgt 4980 tgatgttgaa gtggcgagcg atacaccgca tccggcgcgg attggcctga actgccagct 5040 ggcgcaggta gcagagcggg taaactggct cggattaggg ccgcaagaaa actatcccga 5100 ccgccttact gccgcctgtt ttgaccgctg ggatctgcca ttgtcagaca tgtatacccc 5160 gtacgtcttc ccgagcgaaa acggtctgcg ctgcgggacg cgcgaattga attatggccc 5220 acaccagtgg cgcggcgact tccagttcaa catcagccgc tacagtcaac agcaactgat 5280 ggaaaccagc catcgccatc tgctgcacgc ggaagaaggc acatggctga atatcgacgg 5340 tttccatatg cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc gcatcaggcg 5400 ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt 5460 atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa 5520 gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc 5580 gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag 5640 gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt 5700 gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg 5760 aagcgtggcg ctttctcaat gctcacgctg taggtatctc agttcggtgt aggtcgttcg 5820 ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg 5880 taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac 5940 tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg 6000 gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt 6060 taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg 6120 tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc 6180 tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt 6240 ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt 6300 taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag 6360 tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt 6420 cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg caatgatacc 6480 gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc 6540 cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg 6600 ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac 6660 aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg 6720 atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc 6780 tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact 6840 gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc 6900 aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat 6960 acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc 7020 ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac 7080 tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa 7140 aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact 7200 catactcttc ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg 7260 atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg 7320 aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag 7380 gcgtatcacg aggccctttc gtcttcaa 7408 <210> 14 <211> 2184 <212> DNA
<213> Escherichia coli <400> 14 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgggcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctagaggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 15 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 15 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser. Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gin Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg ggc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Giu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gin Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 16 <211> 609 <212> PRT
<213> Escherichia coli <400> 16 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr. Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Giy Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 17 <211> 2184 <212> DNA
<213> Escherichia coli <400> 17 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaactgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcacca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg cctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgggcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctagaggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 18 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 18 atg tgt gga act gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Thr Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca.gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc acc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Thr agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gin Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn. Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg cct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg ggc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 19 <211> 609 <212> PRT
<213> Escherichia coli <400> 19 Met Cys Gly Thr Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Thr Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Giu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp 450. 455 460 Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 20 <211> 2184 <212> DNA
<213> Escherichia coli <400> 20 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgataca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac tgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctcc gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgagcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctaggggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 21 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 21 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat aca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Thr Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gin Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag.ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac tgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Cys His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Giu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gin Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tcc gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gin Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala.Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg agc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 22 <211> 609 <212> PRT
<213> Escherichia coli <400> 22 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Thr Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Cys His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser. Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser. Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 23 <211> 2184 <212> DNA
<213> Escherichia coli <400> 23 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttccgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgccg ttcctgggcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctagaggggc tagagcggcc accaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 24 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 24 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tcc gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac. tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ccg ttc ctg ggc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Pro Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu. Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 25 <211> 609 <212> PRT
<213> Escherichia coli <400> 25 Met Cys Gly Ile Val Gly Ala Ile Ala Gin Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val.Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Pro Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 26 <211> 2184 <212> DNA
<213> Escherichia coli <400> 26 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgagcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctaggggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 27 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 27 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gin Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528
hybrids include hybridization at an ionic strength of 0.1X SSC (0.157 M Na+) at a temperature of between about 30 C and about 45 C, more preferably, between about 38 C and about 50 C, and even more preferably, between about 45 C and about 55 C.
These values are based on calculations of a melting temperature for molecules larger than 10 about 100 nucleotides, 0% formamide and a G + C content of about 50%.
Alternatively, T. can be calculated empirically as set forth in Sambrook et al., supra, pages 11.55 to 11.57.
Protein homologues of proteins involved in an amino sugar metabolic pathway according to the present invention can be the result of natural allelic variation or natural 15 mutation. Protein homologues of the present invention can also be produced using techniques known in the art including, but not limited to, direct modifications to the protein or modifications to the gene encoding the protein using, for example, classic or recombinant DNA techniques to effect random or targeted mutagenesis, as discussed above.
20 In one embodiment of the present invention, a genetic modification in a recombinant nucleic acid molecule ofthe present invention which encodes a glucosamine-6-phosphate synthase results in at least one amino acid modification (i.e., modification in the amino acid sequence of the encoded protein) selected from the group of an addition, substitution, deletion, and/or derivatization of an amino acid residue ofthe glucosamine-6-25 phosphate synthase. Such a modification in the amino acid sequence of the encoded protein can be determined as compared to a wild-type, or naturally occurring glucosamine-6-phosphate synthase, such as a glucosamine-6-phosphate synthase having an amino acid sequence SEQ ID NO:16. One or more of such amino acid modifications results in increased action of glucosamine-6-phosphate synthase as compared to the 30 naturally occurring glucosamine-6-phosphate synthase having amino acid sequence SEQ
ID NO:16. In one embodiment, such an amino acid modification is in an amino acid sequence position in the modified protein (i.e., homologue) which corresponds to one or more ofthe following amino acid positions in amino acid sequence SEQ ID NO:
16: Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469).
In another embodiment, such an amino acid modification is selected from the group of a substitution of (a) an amino acid residue having an aliphatic hydroxyl side group for Ile(4); (b) an amino acid residue having an aliphatic hydroxyl side group for lle(272); (c) an amino acid residue having an aliphatic side group for Ser(450); (d) an amino acid residue having an aliphatic hydroxyl side group for Ala(39); (e) an amino acid residue having a sulfur-containing side group for Arg(250); (f) an amino acid residue having an aliphatic hydroxyl side group for Gly(472); (g) an amino acid residue having an aliphatic side group for Leu(469); and, (h) combinations of (a)-(g). According to the present invention, amino acid residues having an aliphatic hydroxyl group include serine and threonine, and amino acid residues having aliphatic side groups include glycine, alanine, valine, leucine, isoleucine and proline.
In yet another embodiment of the present invention, an amino acid modification as described above is preferably a substitution selected from the group of Ile(4) to Thr, Ile(272) to Thr, Ser(450) to Pro, Ala(39) to '.hr, Arg(250) to Cys, Gly(472) to Ser, Leu(469) to Pro, and combinations thereof. Specific examples of recombinant nucleic acid molecules having genetic modifications resulting in such amino acid modifications are described in detail in the Examples section.
Preferred genetically modified recombinant nucleic acid molecules comprising a nucleic acid sequence encoding a glucosamine-6-phosphate synthase having increased action include recombinant nucleic acid molecules comprising a nucleic acid sequence which encodes a glucosamine-6-phosphate synthase comprising an amino acid sequence selected from the group of SEQ ID NO:19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID
NO:28 and/or SEQ ID NO:3 1. Other preferred genetically modified recombinant nucleic acid molecules comprise a nucleic acid sequence selected from the group of SEQ
ID
NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID
NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:29 and/or SEQ ID NO:30.
Particularly preferred genetically modified recombinant nucleic acid molecules useful in the present invention include nucleic acid molecules comprising nucleic acid molecules selected from the group of pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149, pKLN23-151, ngimS-492114, ngimS-491830, ngimS-542194, ngimS-541830, ngimS-1242184, ngimS-1241830, ngimS-1492184, ngimS-1491830, nglmS-1512184 and ngimS-1511830.
The present invention includes a recombinant vector, which includes at least one isolated nucleic acid molecule of the present invention, inserted into any vector capable of delivering the nucleic acid molecule into a bacterial cell. Such a vector can contain bacterial nucleic acid sequences that are not naturally found adjacent to the isolated nucleic acid molecules to be inserted into the vector. The vector can be either RNA or DNA and typically is a plasmid. Recombinant vectors can be used in the cloning, sequencing, and/or otherwise manipulating of nucleic acid: molecules. One type of recombinant vector, referred to herein as a recombinant nucleic acid molecule and described in more detail below, can be used in the expression of nucleic acid molecules.
Preferred recombinant vectors are capable ofreplicating in a transformed bacterial or yeast cell, and in particular, in an Escherichia coli cell.
Transformation of a nucleic acid molecule into a cell can be accomplished by any method by which a nucleic acid molecule can be inserted into the cell.
Transformation techniques include, but are not limited to, transfection, electroporation and microinjection.
Recombinant molecules of the present invention, which can be either DNA or RNA, can also contain additional regulatory sequences, such as translation regulatory sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell. One or more recombinant molecules of the present invention can be used to produce an encoded product (e.g., a glucosamine-6-phosphate synthase). In one embodiment, an encoded product is produced by expressing a nucleic acid molecule of the present invention under conditions effective to produce the protein.
Such conditions (i.e., culture conditions) have been described above and are further discussed in the Examples section. A preferred method to produce an encoded protein is by transfecting a host cell with one or more recombinant molecules of the present invention to form a recombinant cell.
As discussed above, preferred recombinant molecules of the present invention include, ngimS-282184, ngimS-281830, ngimS-492184, ngimS-491830, ngimS-542184, nglmS-541830, ngimS-1242184, ngimS-1241830, ngimS-1492184, ngimS-1491830, ngimS-1512184, ng1mS-1511230, pKLN23-28, pKLN23-49, pKLN23-54, pKLN23-124, pKLN23-149 and/or pKLN23-151.
A recombinant cell is preferably produced by transforming a bacterial cell (i.e., a host cell) with one or more recombinant molecules, each comprising one or more nucleic acid molecules operatively linked to an expression vector containing one or more transcription control sequences. The phrase, operatively linked, refers to insertion of a nucleic acid molecule into an expression vector in a manner such that the molecule is able to be expressed when transformed into a host cell. As used herein, an expression vector is a DNA or RNA vector that is capable of transforming a host cell and of effecting expression of a specified nucleic acid molecule. Preferably, the expression vector is also capable of replicating within the host cell. In the present invention, expression vectors are typically plasmids. Expression vectors of the present invention include any vectors that function (i.e., direct gene expression) in a yeast host cell or a bacterial host cell, preferably anEscherichia coli host cell. Preferred recombinant cells of the present invention are set forth in the Examples section.
Nucleic acid molecules of the present invention can be operatively linked to expression vectors containing regulatory sequences such as transcription control sequences, translation control sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell and that control the expression of nucleic acid molecules of the present invention. In particular, recombinant molecules of the present invention include transcription control sequences.
Transcription control sequences are sequences which control the initiation, elongation, and termination of transcription. Particularly important transcription control sequences are those which control transcription initiation, such as promoter, enhancer, operator and repressor sequences. Suitable transcription control sequences include any transcription control sequence that can function in yeast or bacterial cells and preferably, Escherichia coll. A
variety of such transcription control sequences are known to those skilled in the art.
It may be appreciated by one skilled in the art that use of recombinant DNA
technologies can improve expression of transformed nucleic acid molecules by manipulating, for example, the number of copies of the nucleic acid molecules within a host cell, the efficiency with which those nucleic acid molecules are transcribed, the efficiency with which the resultant transcripts are translated, and the efficiency of post-translational modifications. Recombinant techniques useful for increasing the expression of nucleic acid molecules of the present invention include, but are not limited to, operatively linking nucleic acid molecules to high-copy number plasmids, integration of the nucleic acid molecules into the host cell chromosome, addition of vector stability sequences to plasmids, substitutions or modifications of transcription control signals (e.g., promoters, operators, enhancers), substitutions or modifications of translational control signals, modification of nucleic acid molecules of the present invention to correspond to the codon usage of the host cell, deletion of sequences that destabilize transcripts, and use of control signals that temporally separate recombinant cell growth from recombinant enzyme production during fermentation. The activity of an expressed recombinant protein of the present invention may be improved by fragmenting, modifying, or derivarizing nucleic acid molecules encoding such a protein.
Such modifications are described in detail in the Examples section.
The following experimental results are provided for the purposes of illustration an are got intended to limit the scope of the invention.
EXAMPLES
Exam e I
The following example describes the production of mutant Escherichia cols strains which are blocked in amino acid sugar metabolic pathways involving degradation of glucosamine.
The starting strain for the construction of all glucosamine overproducing strains described herein was E_ coli W3 110 (publicly available from the American Type Culture Collection as ATCC No. 25947), a prototrophic, f A derivative of E. coil K-12 (Bachmann, 1987, "Escherichia coli and Salmonella iyphimunum ", Cellular W Molecular Biology. pp.
1190-1219 . A list of all strains used and produced in the following examples is provided in Table 1.
(E5224320 DOC, I) Table 1. Bacterial strains.
$ ... Aii G not :: Svu~re l fern .:.. > ::
W3110 F mcrA mcrB IN(rrnD-rrnE)1 X' ATCC
IBPC 522 thi-1 argG6 argE3 his-4 w/-1 xyl-S rpsL tsx- J. Plumbridge 29? AlacX74 manXYZ8 nagE47ptsG22 zcf-229::Th10 IBPC 566 thi-1 argG6 argE3 his-4 mtl-I xyl-5 rpsL tmc J. Plumbridge 29? AlacX74 manXYZ8 zdj-225:: Tn 10 IBPC 590 thi-1 argG6 argE3 his-4 mt1-1 xyl-5 rpsL tsx- J. Plumbridge 29? AlacX74 Mag::TcR
7101-6 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)I A' manXYZ8 W3110 x P1õ,(1BPC566) zdj-225::TnlO
7101-7 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)l A' manXYZ8 W3110 x P1,(IBPC566) zdj-225::Th10 7101-9 W31i0 Anag F mcrA mcrB IN(rrnD-rrnE)1 A' Anag::TcR W3110 x P1,,,,(IBPC590) 7101-13 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)1 A' manXYZ8 7101-6 selected on TCS
TcS zdj-225::Tn10? TcS medium 7101-14 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)1 A' manXYZ8 7101-7 selected on TCS
TcS zdj-22S::TnlO? TcS medium 7101-15 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)l ) manXYZ8 7101-14 x P1,(IBPC522) ptsG zdj-225:: Tn 10? ptsG22 zcf-229:: Tn10 7101-17 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)1 A' manXYZ8 7101-13 x PlAIBPC590) Anag zdj-225:: Tn 10? TcS nag::TcR
7101-22 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)I A' manXYZ8 7101-15 selected on TCS
ptsG TcS zdj-225:: Tn107 ptsG22 zcf-229::Tn 10? TcS medium 2123-4 W3110 ptsM F mcrA mcrB IN(rrnD-rrnE)l A' manXYZ8 7101-22 x Plõõ
(IBPC590) ptsG Anag zdj-225::Tn10? ptsG22 zcf-229::Tn10? TcS
Anag::TcR
W3110(D F mcrA mcrB IN(rrnD-rrnE)l ADE3 W3110 lysogenized with E3) ADE3 7101- F mcrA mcrB IN(rrnD-rrnE)1 ADE3 7101-9 lysogenized with 9(DE3) nag::TcR ADE3 7101- F mcrA mcrB IN(rrnD-rrnE)1 ADE3 7101-17 lysogenized with 17(DE3) manXYZ8 zdj-225::Tn10? TcS Anag::TcR ADE3 2123- F mcrA mcrB IN(rrnD-rrnE)l ADE3 2123-4 lysogenized with 4(DE3) manXYZ8 zdj-225::Tnl0? ptsG22 zcf- ADE3 229::Tn 10 TcS Anag::TcR
BL21(DE F ompT hsdSB gal dcm ADE3 Novagen, Inc.
3) ATCC JC7623 F recB21 recC22 sbcBi S leu-6 ara-14 his-4 ATCC
T-71 F recB2i recC22 sbcBiS leu-6 ara-14 his-4 Integration of pT7 glmS-A' IacZ::pT7 g/mS--Cm8H7 Cm into lacZ of ATCC47002 by transformation with pKLN23-28 T-81 F recB21 recC22 sbcB1 S leu-6 ara-14 his-4 Integration of pT7g/mS-) lacZ::pT7g1mS-Cm8H8 Cm into lacZ of ATCC47002 by transformation with pKLN23-28 2123-5 W3110(DE3) 1acZ::pT7 g/mS-Cm8H7 W3110(DE3) x Pl,(T-71) tr~gn orererice 2123-6 W3110(DE3) IacZ::pT7 g/mS-Cm8H8 W3110(DE3) x P1õõ(T-81) 2123-7 W3110(DE3) /acZ::pT7gImS-Cm8H7 W3110(DE3) x Plõõ (T-71) 2123-8 W3110(DE3) lacZ::pT7gImS-Cm8H8 W3110(DE3) x Pl,,,,(T-81) 2123-9 7101-9(DE3) lacZ::pT7g/mS--Cm8H7 7101-9(DE3)x P1,,,,(T-71) 2123-10 7101-9(DE3) lacZ::pT7 gImS-Cm8H8 7101-9(DE3)x P1,,,,(T-81) 2123-11 7101-17(DE3) lacZ::pT7 g/mS-Cm8H7 7101-17(DE3)x P1,,,,(T-71) 2123-12 7101-17(DE3) IacZ::pT7 glmS--Cm8H8 7101-17(DE3)x Pl,(T-81) 2123-13 2123-4(DE3) IacZ::pT7gImS--Cm8H7 2123-4(DE3)xP1,,,(T-71) 2123-14 2123-4(DE3) lacZ::pT7glmS--Cm8H8 2123-4(DE3)xP1,,,,(T-81) NovaBlue endAI hsdR17 supE44 thi-I recAl gyrA96 Novagen relAl lac [F' proA+B+ lacFZAM]5::Tn10) LE392 F e14' (McrA') hsdR514(rm+) supE44 Lab collection supFS8 lacY1 or Alac(IZY)6 ga1K2 gaIT22 metBi trpR55 2123-16 LE392 gImS13 NG mutagenesis of LE392 2123-49 7101-17(DE3) IacZ::pT7 gImS11C-Cm8H8 Error-prone PCR with pKLN23-28; integration of mutant glmS into ATCC47002; transfer to 7101-17(DE3) by PI
transduction 2123-51 7101-17(DE3) lacZ::pT7 gImS52B-Cm8H8 Error-prone PCR with pKLN23-28; integration of mutant glmS into ATCC47002; transfer to 7101-17(DE3) by P1 transduction 2123-54 7101-17(DE3) IacZ::pT7 gImS8A-Cm8H8 Error-prone PCR with pKLN23-28; integration of mutant glmS into ATCC47002; transfer to 7101-17(DE3) by PI
transduction 2123-124 7101-17(DE3) Error-prone PCR with lacZ::pT7gImS94A pKLN23-28; integration S-Cm8H8 of mutant gimS into ATCC47002; transfer to 7107-17(DE3) by P1 transduction 2123-149 7101-17(DE3) pKLN23-54 EcoR1-lacZ::pT7 glmSl49- HindllI (1.0 kb) x Cm8H8 pKLN23-28 EcoRl-Hindlll (6.4 kb);
integration of mutant gimS into ATCC47002;
transfer to 7101-17(DE3) PI transduction Strain Albs Genotype Source/Reference 2123-151 7101-17(DE3) pKLN23-54 EcoRI-Rindlll (1.0 kb) x lacZ::pT7- pKLN23-28 coRlHindIll (6.4 kb);
glmSl S 1-Cm8H8 integration of mutant g1mS into ATCC47002; transfer to 7101-17(DE3) by P1 transduction Host strains blocked for glucosamine uptake and degradation were constructed by introducing mutations in the nagE man= and ptsG genes, which block transport of glucosamine, and the nagA. -B, -C and D genes, which are involved in metabolism of glucosamine-6-phosphate. Each of these genes has been described in detail previously herein.
Mutations in these genes were introduced into strains using the transducing bacteriophage P l,, (as described in Miller, 1972, "Experiments in Molecular Genetics", Cold Spring Harbor Laboratory.
In this technique, genes or mutations from one strain (the donor strain) are transferred to a recipient strain using the bacteriophage P1,:,. When bacteriophage PL. is grown on the donor strain, a small portion of the phage particles that are produced contain chromosomal DNA from the donor rather than the normal complement of phage DNA. Upon infection of the recipient strain with bacteriophage grown on the donor strain, those bacteriophagge particles containing chromosomal ANA from the donor strain can transfer that DNA to the recipient strain. If there is a strong selection for the DNA from the donor strain, recipient strains containing the appropriate gene or mutation from the donor strain can be selected.
To grow PI,,,. on a donor strain, an existing bacteriophage stock was used to infect-a culture of that strain. The recipient strain was grown at 37 C in LBMC medium (10 g/L Bacto u-yptone, S g/L yeast extract, 10 g/L NaCl, 1 mM MgCl2, 5 mM CaC12) until the absorbance at 600 nm was approximately 1.0, corresponding to approximately 6 x 108 cells per mL of culture.
One mL of the culture was then infected with a dilution of the phage stock at a ratio of approximately one phage per 10 cells. The mixture was incubated without shaking for 20 minutes at 37 C, then transferred to 10 ml- of prewarmed LBMC broth in a 125 mL baffled Erlenmeyer flask. The resulting culture was shaken vigorously for 2-3 hours at 37 C. During this period, it was generally observed that the culture would become more turbid, indicating bacterial growth. Toward the end (E5224320.DOC,1) of this incubation period, the culture would become clear, indicating cell lysis due to bacteriophage growth. After lysis had occurred, the culture was cooled on ice, a few drops of chloroform were added, and the flask was shaken briefly. The contents of the flask were then centrifuged to remove the cell debris and chloroform, and the resulting supernatant generally contained between 10' and 10' bacteriophage per mL.
Mutations were transferred to recipient strains by transduction with P 1,,, grown on the appropriate donor strain as described above. For transduction with P
1",, a culture of the recipient strain was grown overnight at 37 C in LBMC broth. 0.1 mL of culture was mixed with 0.1 ml, of bacteriophage lysate or a serial dilution of the lysate in a sterile test tube and incubated at 37'C for 20 minutes. 0.2 mL of 1 M sodium citrate was added to the tube, and the mixture was plated to selective medium. For each transduction, controls containing uninfected cells and bacteriophage lysates without cells were performed as described above. For the production of strains blocked in glucosamine degradation, selections were for tetracycline resistance as described below.
Tetracycline resistant mutants were selected by plating to LB medium (10 g/L Bacto tryptone, 5 g/L
yeast extract, 10 g/L NaCl) containing 12.5 jig/mL tetracycline and 10 mM
sodium citrate.
The mutations in the nag genes were introduced simultaneously as a deletion mutation (znag::TcR). In strain IBPC590 (Plumbridge, Table 1), which contains this mutation, the nag genes have been replaced by a tetracycline-resistance (TcR) determinant.
As a result, the mutation which removes the nag functions was transferred to appropriate recipient hosts by selection for tetracycline resistance. In this case, since the Tc' determinant was contained within the mutation of interest, the Onag and TcR
mutations were 100% linked. That is, all of the recipient strains receiving the Tc' determinant from IBPC590 also received the Anag mutation. This was confirmed by examining the growth phenotype ofthe tetracycline resistant strains resulting from infection with P
ly,, grown on IBPC590. All such strains were unable to grow on media containing glucosamine or N-acetylglucosamine as carbon sources, indicating the presence of the Onag mutation.
Mutations in the manXYZ and ptsG genes were also introduced by P1,, transduction using phage grown on strains 1BPC566 and IBPC522 (Plumbridge, Table 1), respectively. These strains also contained tetracycline-resistance determinants linked to the mutations of interest (designated zdj-225:: Th70 and zcf-229:: Tn 10, respectively). In these strains, the TcR determinants were not within the gene but were linked to the gene. Accordingly, not all recipient strains receiving the TcR determinant contained the mutations of interest. The degree of linkage is indicative of the distance on the chromosome between the TcR determinant and the mutation of interest. As a result, it was necessary to screen tetracycline resistant strains for manXYZ and ptsG. The maijXYZ strains grew slowly on mannose and failed to grow on glucosamine as sole carbon sources for growth. The ptsG strains grew slowly op glucose as sole carbon source.
Because all of the selections for the mutations described above were for tetracycline resistance, it was necessary to render strains tetracycline sensitive between steps if multiple mutations were to be introduced. To accomplish this, tetracycline-resistant strains were plated to TCS medium (15 g/1. agar, 5 g(L Bacto tryptone, 5 g/L yeast extract, 50 mg/L
chlortetracycline hydrochloride, 10 g/1, NaCl, 10 g/L NaH2PO4H2O, 12 mg/L fusaric acid, and 0.1 mM ZnCI2) which selects for tetracycline sensitive mutants (described in Maloy and Nunn, 1981, J.
Bacreriol., 145:1110-1112. Colonies arising on this medium were purified by resireaking to the same medium, then checking individual colonies for tetracycline sensitivity by plating to LB
media with and without 12.5 pg/mL tetracycline.
The scheme described above for the production of strains containing combinations 20 of the manXYZ ptsG, and Anag mutations is presented schematically in Fig. 3.
Exam e 2 The following Example describes the cloning and overexpression of the gimS
gene and the integration of the T7 glmS gene cassette into the E. coil chromosome.
Clonin and Oyeres re 'an a the Elms Gene.
Using information obtained from the published sequence of the glmS gene (Walker er al, 1984, Bzochent J. 224:799-815, primers were synthesized to amplify the gene from genomic DNA isolated from strain W3110 (Table 1) using the polymerise chain reaction (PCR). The primers used for PCR amplification were designated Upl and Log and had the following sequences: LTpI: 5'-CGGTCTCCCATGTGTGGAATTGTTGGCGC-3' (SEQ ID NO:1) {W5224320.DOC;I }
Lo8: 5'-CTCTAGAGCGTTGATATTCAGTCAATTACAAACA-3' (SEQ ID NO:2) The Up 1 primer contained sequences corresponding to the first twenty nucleotides of the g1mS gene (represented in nucleotides 10-29 of SEQ ID NO: 1) preceded by a BsaI
restriction endonuclease recognition site (GGTCTC, represented in nucleotides 2-7 of 5 SEQ ID NO: 1). The Lo8 primer contained sequences corresponding to positions between 145 and 171 bases downstream ofthe g1mS gene (represented in nucleotides 8-34 of SEQ
ID NO:2) preceded by a XbaI restriction endonuclease site (TCTAGA, represented in nucleotides 2-7 of SEQ ID NO:2). PCR amplification was conducted using a standard protocol to generate a fragment of DNA containing the g1mS gene with 171 base pairs of 10 DNA downstream of the gene flanked by BsaI and Xbal sites. This DNA
fragment was cloned into the vector pCR-Scripfr SK(+) (Stratagene Cloning Systems, La Jolla, California) using materials and instructions supplied by the manufacturer. The resulting plasmid was designated pKLN23-20.
One consequence of this cloning was that it placed a unique Sacl restriction 15 endonuclease site downstream of the gene. This allowed excision of a fragment of DNA
containing the g1mS gene from pKLN23-20 using the restriction endonucleases BsaI and Sad. This fragment was then cloned between the Ncol and Sacl sites of the expression vector pET-24d(+) (Novagen, Inc., Madison, Wisconsin) to generate plasmid pKLN23-23. The pET-24d(+) expression vector is based on the T7 promoter system (Studier and 20 Moffatt, 1986, J. Mol. Biol., 189:113-130). Cloning in this manner resulted in placement of the g1mS gene behind the T7-lac promoter contained on pET-24d(+). The T7-lac promoter is specifically recognized by the T7 RNA polymerase and is only expressed in strains containing a cloned T7 gene 1, which encodes the T7 RNA polymerise.
The cloned T7 polymerase gene is contained on a defective bacteriophage I phage designated 25 11DE3. Strains in which the ADE3 element is integrated into the chromosome contain the T7 RNA polymerase gene driven by the lacUV5 promoter. In those strains, expression ofthe T7 RNA polymerase gene can be induced using the lactose analog isopropylthio-p-D-galactopyranoside (IPTG). Accordingly, addition of IPTG to such cultures results in induction of the T7 RNA polymerase gene and expression of any genes controlled by the 30 T7 or T7-lac promoter.
To verify that pKLN23-23 contained the glmS gene driven by the T7--lac promoter, the plasmid was transferred to strain B1.21(Dir3) (Novagen, Inc.) (Table 1)_ Strain BL21(13E3)/pKLN23-23 was grown in duplicate in LB medium containing 50 mg/L
kanamycin (kanamycm resistance is encoded by the plasmid). One of the duplicates was induced with 1 mM
IPTC); the other was not. When the total proteins were examined from these two cultures by sodium dodecyl sulfate polyacrylamide gel electrophoresis, it prominent protein of approximately 70,000 molecular weight, corresponding to the predicted size for the gIMS gene product, was observed in cells from the induced culture but not in cells from the uninduced culture. A preliminary enzyme assay from an induced culture indicated several hundred fold higher glucosamine-6-phosphate synthase activity in the induced culture than in what had typically been observed in a wild type strain.
fnte_ration of thLT7-glmS Gene Casserte Arty the R. coil Chromosome.
The glmS gene driven by the 77-lac (the T7--glmS gene cassette) promoter was transferred to the chromosome of E. coil strains by a multistep process. First. The cassette was cloned into plasmid pBRINT-Cm (Balbas et al, 1996, Gene 96:65-6).. The gene cassette was then integrated into the chromosome of strain ATCC47002 (Table 1) by the techniques described by 13albas et al, 1996, supra, to generate strains 1-71 and T-8 1 (Table 1). Finally, the gene cassette was transferred to strains of interest by transduction with PI,,, as described below.
The T7-glnS cassette was excised from pKLN23 -23 by performing a partial digest of the plasmid with restriction endonuclease Bg/II and a complete digest with restriction ondonuclease HinD1l1. Plastuid pKLN23 -23 contains a Bg/II site approximately 20 base pairs upstream of the T7 promoter. The glmS gene also contains two Bg/II sites, A partial- digest with this enzyme was necessary to cut the plasmid upstream of the T7 promoter while avoiding the two internal sites.
Plasmid pKLN23-23 also contains a unique HinDUI site downstream of the ghnS
gene. The excised T7-glmS cassette was theta cloned between the unique BamHl and HinDill sites of pBRJNT-Cm. This resulted in the production of plasmids designated pKLN23-27 and pKLN23-28. Plasmids pKT.N23-27 and pKLN23-28 contain the T7i'lmS cassette in addition to a {55224320 DOC:I }
chloramphenicol resistance determinant flanked by the 5'- and 3'-termini of the E. coli lacZ gene.
Strain ATCC 47002 (Table 1) contains mutations that confer upon it an inability to maintain plasmids such as pBRINT-Cm which contain a ColE 1 origin of replication.
When such plasmids are transferred to this strain, selection for genetic markers contained on the plasmid results in integration of the plasmid into the chromosome (Balbas et al., 1996, supra). As mentioned above, plasmids pKLN23-27 and -28 contain the T7-glmS
cassette and a chloramphenicol resistance determinant flanked by the 5'- and 3'-termini of the E. coli lacZ gene. The lacZ sequences target the incoming DNA to the lacZ gene contained in the chromosome. Integration at the IacZ locus replaces the intact lacZ gene, which encodes the enzyme P-galactosidase, with a partial lacZ gene interrupted by the T7-glmS-Cm cassette. As a result, integration at lacZ results in the strain becoming f3-galactosidase negative. The plasmid also contains an ampicillin resistance determinant remote from the 5'-1acZ--T7 g1mS-Cm-lacZ-3' cassette. Integration at IacZ and plasmid loss also results in ampicillin sensitivity.
Plasmids pKLN23-27 and -28 were transferred to strain ATCC 47002, and cells were plated to LB medium containing 10 pg/mL chloramphenicol, 1 mM IPTG, and pg/mL 5-bromo-4-chloro-3-indolyl-(3-D-galactopyranoside (X-gal). The X-gal contained in the medium is a chromogenic (3-galactosidase substrate. Hydrolysis of X-gal by (3-galactosidase results in a blue derivative. Inclusion of X-gal and IPTG, which induces the native lacZ gene, results in blue lacZ-positive colonies and white lacZ-negative colonies.
White (lacZ-negative) chloramphenicol resistant colonies were then selected and purified.
The strains were then verified for sensitivity to ampicillin by plating to LB
media with and without 100 g/mL ampicillin. DNA integration was further confirmed using a PCR
scheme as described by Balbas et al., 1996, supra. Integrants T-71 and T-81 (Table 1) resulted from the integration of the desired segments of plasmids pKLN23-27 and pKLN23-28, respectively, into the chromosome of ATCC 47002.
The T7-gimS-Cm cassette was then transferred to strains W3110(DE3), 7101-9(DE3), 7101-17(DE3), and 2123-4(DE3) byP1r;,transduction, as described in Example 1, using lysates prepared on strains T-71 and T-81. These strains contain various combinations ofthe A nag, manXYZ, and ptsG mutations in addition to the ADE3 element necessary for expression from the T7-lac promoter. The ?.DE3 element was introduced to these strains using the W1=3 lysogenization kit produced by Novagen, Inc. (Madison, Wisconsin).
Transductants were selected on L8 agar plates containing 30 14g/mL
chloramphenicol and 10 mM sodium citrate. Loss of fI-galactosidase activity was verified on plates containing X-gal and 1PTG and DNA integration was further confirmed using a PCR scheme as described by Balbas er al, 1996, supra Glucosamine-6-phosphate synthase activity was measured in production strains containing integrated T7 glmS cassettes after growth in LB medium with and without IPTG
(Table 2). Glucosamine-6-phosphate synthase was assayed in crude cell extracts using either colorimetric or spectrophotometric assays (8adet et at, 1987, Biochemistry 26:1940-1948) as described below. The extracts used for those assays were prepared by suspending washed cell pellets in 5 mL of 0.1 M KH2P04/K2HPO4, pH 7.5 per grain of wet cell paste, passing the suspension through a French press at 16,000 psi, and centrifuging the disrupted cell suspension at 35,000-40,000 xg for 15 to 20 minutes. The supernatant was used as the source of enzyme for the assay.
For a colorimetric assay, 1 mL reactions were prepared containing 45 mM
KH2PO4/K2HPO4, 20 mM fructose-6-phosphate, 15 mML-glutamine, 2.5 mMEDTA, pH
7,5, and cell extract. The reactions were incubated at 37 C for 20 minutes and stopped by boiling for 4 minutes. The resulting precipitate was removed by centrifugation and the supernatant was assayed for gucosamine-6-phosphate by a modification of the method of Elson and Morgan (1933, Biochem_ J_ 27:1824-1828) essentially as described by Zalldu (1985.Meth..Enzymol.
113:278-281) . To 100 L of the above supernatant was added 12.5 L of saturated NaHC03 and 12.5 pL of cold, freshly prepared 5% aqueous acetic anhydride. After incubating for 3 minutes at room temperature, the mixture was boiled for 3 minutes to drive off excess acetic anhydride. After cooling to room temperature, 150 aj4. of 0.8 M potassium borate, pH 9.2(0.8 MH38O3 adjusted to pH 9.2 with KOH) was added and the mixture was boiled for 3 minutes.
After cooling to room temperature, 1.25 mL Ehrlich's reagent (1% p-dimethylaminobenzaldehyde in glacial acetic acid containing 0.125 N HCI) was added to each tube. After incubating at 37 C for 30 minutes, the absorbance at 585 nm was measured and the amount of gucosamine-6-phosphate formed {E522g32o DOC;1 }
was determined using a standard curve. In the absence of the substrate, fructose-6-phosphate, or when boiled extract was used in the assay, no significant absorbance at 585 nm was observed.
In the spectrophotometric assay, 1 mL reactions containing 50 mM
KH2PO4/K2HPO4i10 mM fructose-6-phosphate, 6 mM L-glutamine,10 mM KCI, 0.6 mM
acetylpyridine adenine dinucleotide (APAD), and 50-60 Units of L-glutamic dehydrogenase (Sigma Type H from bovine liver) were run at room temperature.
The activity was followed by monitoring the absorbance at 3 65 nm after the addition of extract and corrected for the small absorbance increase observed in the absence of extract. The activity was calculated using a molar extinction coefficient for APAD of 9100.
Table 2 Glucosamine 6-Phosphate Synthase Activity in Production Strains Containing Integrated T7 glmS
Cassettes Activity, ( ole per minute per mL of extract) Strain Host Geno -1PTG + IPTG
2123-9 DE3 Anag 26 58 2123-10 DE3 dnag 33 67 2123-11 DZ 3 Anag manXYZ 32 59 2123-12 DE3 Anag manXYZ 17 67 2123-13 DE3 dnag manXYZ ptsG 21 68 2123-14 DE3 dna manXYZ ptsG 20 88 Table 2 shows that, on average, the activity ofglucosamine-6-phosphate synthase in production strains containing integrated T7-glmS cassettes was about three-to four-fold higher with IPTG induction than without. The activities were substantially higher than those obtained with a wild type glmS strain driven by its native promoter, which typically were in the range of 0.05-0.1 pmole per minute per mL of extract.
One of the strains, 2123-6, apparently suffered an aberrant integration event since the activity was lower than that observed in the other strains and was not influenced by the presence of IPTG in the medium.
Example 3 The following example shows the effect of strain genotype on glucosamine accumulation.
Strains with T7-glmS integrants, produced as described in Example 2, as well as 5 the corresponding parent strains without integrated DNA, were grown in shake flasks containing M9A medium (14 g/L K2HP04i 16 g/L KH2PO4, 1 g/L Na3Citrate-2H20, 5 g/L
(NH4)2SO4i pH 7.0) supplemented with 20 g/L glucose, 10 mM MgSO4, 1 mM CaC12, and 1 mM IPTG. Samples were taken periodically over the course of two days, and the glucosamine concentration in the culture supernatant was measured using the modified 10 Elson-Morgan assay as described in Example 2. Samples were assayed with and without acetic anhydride treatment, and the amount of glucosamine present was determined from the net absorbance using a standard curve.
Glucosamine concentrations after 24 hours of cultivation, at which time the concentration peaked, are indicated in Table 3. The results shown in Table 3 indicate that 15 for significant glucosamine production to occur, the T7-glmS gene cassette must be present. The data also indicate that the presence of the Anag mutation in the host results in a significant increase in glucosamine accumulation compared with its absence. Little effect of the manXYZ mutation was observed in this experiment. In further shake flask experiments, however, strain 2123-12 accumulated slightly higher glucosamine 20 concentrations on a consistent basis.
Table 3 Glucosamine in Culture Supernatants of Production Strains Strain Genotype Glucosamine Concentration, mgfL
(24 hours) 2123-5 DE3, T-71 integrant 21 25 2123-7 DE3, T-71 integrant 23 2123-9 DE3 Anag, T-71 integrant 67 2123-10 DE3 Anag, T-81 integrant 80 2123-11 DE3 Anag manXYZ, T-71 integrant 65 2123-12 DE3 Anag manXYZ, T-81 integrant 63 30 W3110(DE3) DE3, no integrant 4 7101-9(DE3) DE3 Anag, no integrant 0 7101-17(DE3) DE3 Anag manXYZ, no integrant 0 Example 4 The following example demonstrates the effect feeding nutrients to the cultures has on glucosamine accumulation.
In early experiments, it was observed that glucosamine accumulation ceased when glucose was depleted from cultures. In the experiment summarized by Table 4 and Fig.
4, it was found that increased glucosamine accumulation could be accomplished by feeding additional glucose and ammonium sulfate as they became depleted. For this experiment, strain 2123-12 was grown in M9A medium supplemented with 10 mM
MgSO4, 1 mM CaCl2, and 1 mM IPTG. Initial glucose concentrations and feeding regimens were varied as, indicated in Table 4. In one of the flasks, the initial ammonium sulfate concentration was 10 g/L rather than the 5 g/L normally used in M9A
medium.
Glucose concentration was monitored in shake flasks during cultivation using Diastix glucose test strips (Bayer Corporation Diagnostics Division, Elkhart, Indiana). When the glucose concentration was at or near depletion (<5 g/L remaining), glucose and/or ammonium sulfate were supplemented as indicated in Table 4. pH was also monitored during cultivation. When the pH varied significantly from the initial pH of 7.0, it was adjusted to 7.0 with sodium hydroxide.
Table 4 Shake Flask Experiment to Examine the Effect of Glucose Feeding Flask Initial Glucose, g/L Initial Ammonium Sulfate, Feed No.
1 20 5 None 2 50 5 None 3 50 10 None 4 20 5 20g/L Glucose 5 20 5 20 gtL Glucose + 5 g/L AmSO4 As Fig. 4 indicates, increasing the supply of glucose had a positive effect on glucosamine accumulation. By periodically feeding with glucose and ammonium sulfate (20 g/L and 5 g/L additions, respectively), a maximum accumulation of 0.4 g/L
of glucosamine was observed, approximately four-fold higher than what was observed in the absence of feeding.
Example 5 The following example describes the isolation of mutant g1mS genes encoding glucosamine-6-phosphate synthase enzymes with decreased sensitivity to glucosamine-6-phosphate product inhibition.
White (1968, Biochem. J., 106:847-858) first demonstrated that glucosamine-6-phosphate synthase was inhibited by glucosamine-6-phosphate. Using the spectrophotonietric assay for glucosamine-6-phosphate synthase as described in Example 2, the effects ofglucosamine-6-phosphate and glucosamine on glucosamine-6-phosphate synthase were measured. For determination of product inhibition, assays were run in the presence of various concentrations of added glucosamine-6-phosphate.
As indicated in Fig. 5, the enzyme is significantly inhibited by glucosamine-6-phosphate and slightly inhibited by glucosamine. These results are similar to those obtained by White, 1968, supra. This inhibition may be a key factor in limiting glucosamine accumulation in the glucosamine production strains.
To further increase glucosamine synthesis in production strains, efforts were made to isolate mutants of the g1mS gene encoding glucosamine-6-phosphate synthase variants with reduced product inhibition. To accomplish this, the cloned gene was amplified using the technique of error-prone PCR. In this method, the gene is amplified under conditions that lead to a high frequency of misincorporation errors by the DNA polymerase used for the amplification. As a result, a high frequency of mutations are obtained in the PCR
products.
Plasmid pKLN23-28 contains a unique SpeI restriction endonuclease site 25 base pairs upstream of the T7 promoter and 111 base pairs upstream of the start of the g1mS
gene. The plasmid also contains a unique HinDIII site 177 base pairs downstream of the glmS gene. PCR primers of the following sequences were synthesized to correspond to regions just upstream of the SpeI and downstream of the HinDIII sites, respectively:
5'-ATGGATGAGCAGACGATGGT-3' (SEQ ID NO: 3) 5'-CCTCGAGGTCGACGGTATC-3' (SEQ ID NO:4) Amplification with these primers (SEQ ID NO:3 and SEQ ID NO:4) allowed mutagenesis of a 2247 base pair region that included the entire g1mS gene. PCR
conditions were as described by Moore and Arnold, 1996, Nature Biotechnology 14:458-467. Briefly, a 100 L solution was prepared containing I mM each of the four deoxynucleotide triphosphates, 16.6 mM ammonium sulfate, 67 mMTtis-HC1, pH 8.8, 6.1 mM MgC12 6.7 gM
EDTA, 10 mM -mercaptoethanol, 10 I. DMSO, 30 ng each of the primers (SEQ 1D
NO:3 and SEQ ID NO:4), either 7 or 35 ng of plasmid pKLN23-28 linearized with Kpn I, and 2,5 Units of TagDNApolyinera_se(Perldn Elmer-Cetus, Emeryville, California). The reaction mixture was covered with 70 4 of mineral oil and placed in a thermoeycler, where the following steps were repeated for 25 cycles:
1 minute at 94 C
1 minute at 42 C
2 minutes at 720C
Under these conditions, an error frequency of approximately one mutation per 1000 base pairs has been reported (Moore and Arnold, 1996, supra). The product of the reaction was recovered, purified, and digested with Spel and HinD11 and cloned into the Spel-HinDUI
backbone fragment of pKLN23 -28, which effectively substitutes for the wild We glmS gene on the Spel-HinDIII fragment of pKLN23-28. The cloned DNA was used to transform strain NovaBlue (Novagen, Inc., Madison, Wisconsin), and the transformed calls were plated to LB
agar containing ampicillin. A total of 9000 plasmid-containing colonies were pooled from the anipicillin plates and plasmid DNA was prepared from the pooled cells to generate a library of pKLN23-28 derivative plasmids containing mutations in the cloned gImS gene.
The mutant plastids generated by error-prone PCR were screened for their ability to confer increased glucosamine production in a Anag manXYZ DE3 host background.
This screen was in the form of a bioassay in which the ability of plasmid-containing strains to crossfeed glucosamine-requiring strains of F coil was assessed-Strains of E. call (Sarves, 1971, J Baueriol. 105:467-471; Wu and Wu, 1971, J.
Bacteria! 105:455-466) and Bacillus subtilis (Freese at a!., 1970, J.
Bacterial. 101:1046-1062) which are defective for glucosamine-6-phosphate synthase require glucosamine or N-acetylglucosamine for growth. A glucosamine-requiring strain of E. coli was isolated after mutagenesis with N-methyl-N'-nitro-N-nitrosoguanidine (NG). Strain LE392 (Table 1) was grown in LB medium to a cell density of 6 x FoE cells per mL. 50 JEi1-4320 PQC,1) L of 2.5 mg/mL NG dissolved in methanol was added to 2 mL of this culture and the mixture was incubated at 37 C for 20 minutes. This treatment resulted in about 10%
survival of the strain. The mutagenized cells were harvested by centrifugation, and the cells were washed twice by suspension in 0.9% NaCl and recentrifugation. The washed cells were diluted and plated to nutrient agar medium (NA; 5 g/L Bacto peptone, 3 g/L
beef extract, 15 g/L agar) containing 0.2 g/L N-acetylglucosamine at a density of between 50 and 200 colony forming units per plate. Approximately 13,000 colonies were plated.
These colonies were replica-plated to NA agar with and without 0.2 g/L N-acetylglucosamine. Twenty-two colonies grew on NA with 0.2 g/L N-acetylglucosamine but not on NA without 0.2 g/L N-acetylglucosamine. These colonies were purified by streaking to NA with 0.2 g/L N-acetylglucosamine, and their growth phenotype was rechecked. Of the original 22 colonies selected, five had the phenotype expected of a glmS mutant of LE3 92. They failed to grow on NA but grew on NA supplemented with 0.2 g/L of glucosamine or 0.2 g/L N-acetylglucosamine. They also failed to grow on glucose minimal agar, but grew on glucose minimal agar supplemented with 0.2 g/L N-acetylglucosamine. One of these mutants was designated 2123-16 (Table 1).
For the cross-feeding assay, agar plates containing either glycerol or fructose as the principle carbon source for growth were overlaid with cells from a culture of strain 2123-16, the glucosamine-requiring strain isolated as described above.
Glucosamine-producing strains were stabbed into the agar and the ability to produce glucosamine was assessed based on the size of the "halo" of growth of the indicator strain surrounding the stab. Those stabs surrounded by larger halos were considered to produce greater amounts of glucosamine.
The media used for the cross-feeding assays consisted of M9 minimal medium (6 g/L Na2HPO4i 3 g/L KH2PO4, 0.5 g/L NaCl, 1 g/L NI-14C1, 1 mM MgSO4, 0.1 mM
CaCI2) supplemented with 40 mg/L of L-methionine (required for growth of strains LE392 or 2123-16) and 2 g/L of either glycerol or fructose. These plates were overlaid with strain 2123-16 as follows. A culture of strain 2123-16 was grown overnight at 37 C in LB
medium containing 1 g/L N-acetylglucosamine. The culture was harvested by centrifugation, and the cells were washed twice by suspension in 0.9% NaCI and recentrifugation. The washed cells were suspended in the original volume of 0.9% NaCl.
For each plate to be overlaid, 0.1 mL of washed cell suspension was mixed with 3 mL of molten (50 C) F-top agar (8 g/L NaCl, 8 g/L agar) and poured onto the plate.
The library of pKLN23-28 mutant plasmids was transferred to strain 7101-17(DE3) and transformed cells were plated to LB agar containing 100 gg/mL
ampicillin.
5 Each colony arising on these plates contained an individual member of the mutant plasmid library. The colonies were screened by picking them from the LB + ampicillin plates and stabbing them sequentially into:
(1) LB agar + ampicillin;
(2) glycerol minimal agar overlaid with strain 2123-16; and, 10 (3) fructose minimal agar overlaid with strain 2123-16 All plates were incubated for about 24 hours at 37 C. After this incubation period, halos of growth of the 2123-16 indicator strain could be observed surrounding the stabs in the overlaid plates. Those colonies giving rise to the larger halos were picked from the corresponding LB + ampicillin plate and streaked for purification. In an initial 15 screen, 4368 mutant candidates were screened, and 96 initial candidates were identified.
Upon rescreening those, 30 appeared to be superior to the rest, i.e. resulted in larger halos of the indicator strain.
Enzyme assays performed with six of the plasmid-containing strains isolated as described above indicated that three of the strains were less sensitive to inhibition by 20 glucosamine-6-phosphate than the enzyme from the control strain 7101-17(DE3)/pKLN23-28. The strains were grown overnight in LB broth containing 100 gg/mL ampicillin and 1 mM IPTG. Extracts prepared from cells harvested from those cultures were assayed for glucosamine-6-phosphate synthase using the spectrophotometric assay (described in Example 2) in the presence and absence of added glucosamine-6-25 phosphate. The mutant clones designated 11 C, 65A, and 8A were significantly less sensitive to glucosamine-6-phosphate than the control strain (Fig. 6). Other mutants were not distinguishable from the control by this assay.
Example 6 The following example describes the construction and characterization of 30 glucosamine production strains with mutations in gimS which result in reduced product inhibition.
Plasmid DNA isolated from clones 11 C, 52B, and 8A described above were transferred to strain ATCC 47002, which had been used previously to integrate the cloned T7 glmS construct into the E. coil chromosome. Integration was readily accomplished using the methods described in Example 2, and the integrated DNA was transferred to strain 7101-17(DE3) by P1 transduction as described in Example 1. These procedures produced strains that have the same genotype as strain 2123-12 except for the presence of mutations in the gimS gene generated by PCR. These new mutant production strains were designated 2123-49, 2323-5 1, and 2123-54, respectively. A summary of the strain construction strategy is presented in Fig. 7.
Strains 2123-12, 2123-49, 2123-51, and 2123-54 were grown overnight in LB
broth containing 1 mM IPTG. Extracts prepared from cells harvested from those cultures were assayed for glucosamine-6-phosphate synthase using the spectrophotometric assay described in Example 2 in the presence and absence of added glucosamine-6-phosphate.
The results of these assays are shown in Fig. 8.
Glucosamine production in these mutants was significantly elevated compared to that in 2123-12. When glucosamine production was assayed in shake flask cultures grown using the glucose and ammonium sulfate feeding protocol previously described in Example 4, when the cultures were grown to a cell density (measured as O.D.WO) of about 14 (about 8.4 g/L by dry cell weight), strains 212349,2123-5 1, and 2123-54 produced 1.5, 2.4, and 5.8 g/L glucosamine, respectively (Fig. 9) compared with 0.3 g/L for 2123-12.
Example 7 The following example describes the production of yet another strain with a mutation in glmS which results in reduced product inhibition.
An additional 6,344 colonies containing mutant plasmids generated by error-prone PCR as described in Example 5 were screened using the cross-feeding assay, also described in Example 5. Fifty four colonies resulted in larger halos than the rest of the colonies. DNA was isolated from all 54 colonies and strains isogenic to 2123-12 except for the mutations in gimS were constructed as described in Example 6.
Glucosamine production in most of these mutants was significantly elevated compared to strain 2123-12. Among the newly isolated mutants, the strain that produced the most glucosamine was a strain designated 2123-124. This strain produced 3.6 g/L of glucosamine when production was assayed in shake flasks using the glucose and ammonium sulfate feeding protocol described in Example 4 compared with 4.3 g/L
for strain 2123-54 in a side-by-side experiment.
Example 8 The following example describes the sequencing ofthe cloned wild typegIMS gene present in plasmid pKLN23-28. In addition, the sequences present in plasmids pKLN23-49, pKLN23-54, and pKLN23-124, containing the mutant glmS genes used to construct strains 2123-49, 2123-54, and 2123-124, respectively were sequenced and are described.
DNA sequencing reactions were performed using the Applied Biosystems (ABI) Prism Dye Terminator Cycle sequencing method with AmpliTaq DNA polymerase. The extended products were separated by gel electrophoresis on an ABI DNA
sequencer 373A
or 377. Sequences were analyzed using ABI Sequencing Analysis 3.0 software from ABI
and Sequencher 3.1 from Gene Codes.
The primers used for sequencing were as follows:
PK-1: 5'-TGGATGAGCAGACGATGG-3' (SEQ ID NO:5) PK-2: 5'-TCCGTCACAGGTATTTATTC-3' (SEQ IDNO:6) PK-3: 5'-AGCTGCGTGGTGCGTAC-3' (SEQ ID NO:7) PK-4: 5'-GGACCGTGTTTCAGTTCG-3' (SEQ ID NO:8) PK-5A: 5'-GCCGTGGCGATCAGTAC-3' (SEQ ID NO:9) PK-6A: 5'-GCCAATCACCAGCGGAC-3' (SEQ IDNO:10) PK-7: 5'-ATGGTTTCCCGCTACTGG-3' (SEQ IDNO:11) PK-8: 5'-GAACCAAGGTAACCCAGC-3' (SEQ IDNO:12) The nucleotide sequence of plasmid pKLN23-28, containing the wild-type g1mS
gene, was determined to be a 7408 bp nucleic acid sequence represented herein as SEQ
ID NO:13. The 2184 base pairs between positions 1130 and 3313 of SEQ ID NO:13 were determined using the primers described above. The nucleic acid molecule representing positions 1130-3313 of SEQ ID NO: 13 is referred to herein as nglmS-282184 and is further identified as SEQ ID NO: 14. nglmS-282184 was shown to include the Spel and HinDIlI sites used to construct the mutant plasmids as described in Example 5. The remaining DNA sequence of SEQ ID NO: 13 is based on the known sequences of the vectors used for the construction of pKLN23-28. The same 2184 base pair region was sequenced in plasmids pKLN23-49, pKLN23-54, and pKLN23-124. It is noted that for the discussion of the mutant glmS genes of these plasmids (Table 5), the specific nucleotide position ofmutations in the nucleotide sequence ofthe mutant gIMS-containing plasmids will be described using SEQ ID NO: 13 as a reference.
SEQ ID NOs: 13 and 14 contain an open reading frame that encodes the glmSgene product (i.e., GIcN6P synthase enzyme) which is a nucleic acid molecule referred to herein as nglmS-281830, the nucleic acid sequence of which is represented by SEQ ID
NO: 15.
SEQ IDNO:15 spans nucleotides 1253 to 3082 of SEQ ID NO: 13, with an initiation codon spanning from nucleotides 1253-1255 and a termination codon spanning from nucleotides 3080-3082. SEQ ID NO:15 encodes a protein of 609 amino acids referred to herein as G1cN6P-S-28, the deduced amino acid sequence of which is represented herein as SEQ ID NO: 16. It is noted that for the discussion of the mutant glmS gene products produced by the mutant strains described herein, specific mutations in the amino acid sequence of the mutant glmS gene products will be described using SEQ ID
NO:16 as a reference.
The primers described above correspond to the following nucleotide positions of SEQIDNO:13:
PK-1 (SEO ID NO:5): positions 1087-1104 of SEQ ID NO: 13;
PK-2 (SEQ ID NO:6): positions 3378-3359 of a nucleic acid sequence complementary to SEQ ID NO:13;
PK-3 (SEO ID NO:7): positions 1707-1723 of SEQ ID NO: 13;
PK-4 (SEQ ID NO:8): positions 2772-2755 of a nucleic acid sequence complementary to SEQ ID NO:13;
PK-5A SEO ID NO:9): positions 2667-2683 of SEQ ID NO: 13;
PK-6A (SEO IDNO:1 ): positions 1798-1782 of a nucleic acid sequence complementary to SEQ ID NO: 13;
PK-7 (SEQ ID NO: 11): positions 2177-2194 of SEQ ID NO:13;
PK-8 SEO ID NO: 12): positions 2364-2347 of a nucleic acid sequence complementary to SEQ ID NO: 13.
The nucleic acid sequence of nucleic acid molecule nglmS-281830(SEQ ID NO: 15, or positions 1253-3082 of SEQ ID NO: 13) from pKLN23-28, differs from the published sequence (Walker, 7.E, et at., 1984, "DNA sequence around the Escherichia coil unc operon", Biochem. J. 224:799-815) at positions 2509 and 2510 (with reference to SEQ
ID NO:13). The nucleotides for pKLN23-28 at these positions as determined in this example were G and C, respectively, while those reported in the published sequence were C and G. Otherwise, the published and determined sequences of the gimS gene were identical. The sequences determined upstream and downstream from the gimS gene were those expected based on the known sequences of the vectors used for the construction of pKLN23-28 and the methods used to construct the plasmid.
The nucleotide sequences for the mutant gimS genes for plasmids pKLN23-49, pKLN23-54, and pKLN23-124 were determined as described above for pKLN23-28.
Mutations were found in each ofthose plasmids. The mutations and the predicted amino acid changes in the corresponding mutant gimS gene products, as compared to the wild-type sequence determined for pKLN23-28 (SEQ ID NO: 13) are summarized in Table 5.
Table 5 Mutations in gimS Genes of Glucosamine-Overproducing strains.
Plasmid Position* Base Change Amino Acid Change (Position**) pKLN23-49 1263 T to C Ile to Thr (4) 2067 T to C Ile to Thr (272) 2600 T to C Ser to Pro (450) pKLN23-54 1367 G to A Ala to Thr (39) 2000 C to T Arg to Cys (250) 2239 T to C Silent (329) 2666 G to A Gly to Ser (472) 3264 A to G Outside gene pKLN23-124 1525 T to C Silent (91) 2658 T to C Leu to Pro (469) 3280 G to A Outside gene *Refers to nucleic acid position as indicated in the sequence of pKLN23-28 (SEQ ID NO: 13) **TheglmS gene (nglmS-281.; SEQ ID NO: 15) encodes a protein of 609 amino acids in length (SEQ ID NO: 16); the methionine residue at position 1 is removed by a hydrolase.
Plasmid pKLN23-49 contains a 2184 bp nucleic acid molecule referred to herein 5 as nglmS-492184, which comprises a mutant glmS gene. The nucleic acid sequence of ngImS-492184 is represented herein as SEQ ID NO: 17. A nucleic acid molecule spanning from nucleotide 124 through 1953 of SEQ ID NO: 17, referred to herein as nglmS-491830, represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase ofthe present invention, with an initiation codon spanning from nucleotides 124-126 and 10 a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO: 17.
The nucleic acid sequence of nglmS-491830 is represented herein as SEQ ID NO: 18.
SEQ ID
NO:18 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-49, the deduced amino acid sequence of which is represented herein as SEQ ID NO: 19. SEQ ID NO: 17 has a nucleic acid sequence that 15 is identical to positions 1130 through 3313 of SEQ ID NO:13 (i.e., SEQ ID
NO:14), except for the mutations as indicated for plasmid pKLN23-49 in Table S. SEQ ID
NO:18 has a nucleic acid sequence that is identical to positions 1253 through 3082 of SEQ ID
NO:13 (i.e., SEQ ID NO: 15), except for the mutations as indicated for plasmid pKLN23-49 in Table 5.
20 Plasmid pKLN23-54 contains a 2184 bp nucleic acid molecule referred to herein as nglmS-542184, which comprises a mutant gimS gene. The nucleic acid sequence of nglmS-542184 is represented herein as SEQ ID NO:20. A nucleic acid molecule spanning from nucleotide 124 through 1953 of SEQ ID NO:20, referred to herein as ngimS-541830, represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase 25 of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:20. The nucleic acid sequence of nglmS-541830 is represented herein as SEQ ID NO:21.
SEQ ID
NO:21 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as GIcN6P-S-54, the deduced amino acid sequence of which is 30 represented herein as SEQ ID NO:22. SEQ ID NO:20 has a nucleic acid sequence that is identical to positions 1130 through 3313 of SEQ ID NO:13 (i.e., SEQ ID
NO:14), except for the mutations as indicated for plasmid pKLN23-54 in Table 5. SEQ ID
NO:21 has a nucleic acid sequence that is identical to positions 1253 through 3082 of SEQ ID
NO: 13 (i.e., SEQ ID NO: 15), except for the mutations as indicated for plasmid pKLN23-54 in Table 5.
Plasmid pKLN23-124 contains a 2184 bp nucleic acid molecule referred to herein as nglmS-1242184, which comprises a mutant glnS gene. The nucleic acid sequence of nglmS-12421k is represented herein as SEQ ID NO:23. A nucleic acid molecule spanning from nucleotide 124 through 1953 of SEQ ID NO:23, referred to herein as nghnS-1241830, represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:23. The nucleic acid sequence ofnglmS-1241830 is represented herein as SEQ ID NO:24.
SEQ ID
NO:24 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-124, the deduced amino acid sequence of which is represented herein as SEQ ID NO:25. SEQ ID NO:23 has a nucleic acid sequence that is identical to positions 1130 through 3313 of SEQ ID NO: 13 (i.e., SEQ ID NO:
14), except for the mutations as indicated for plasmid pKLN23-124 in Table 5. SEQ
ID
NO:24 has a nucleic acid sequence that is identical to positions 1253 through 3082 of SEQ ID NO:13 (i.e., SEQ ID NO:15), except for the mutations as indicated for plasmid pKLN23-124 in Table 5.
To verify that the same mutations were present in the strains into which the mutant glmS genes were integrated into the chromosome, PCR products were generated from genomic DNA isolated from strains 2123-49, 2123-54, and 2123-124. For PCR
amplification, the primers listed in Example 3 for the mutagenesis of the gene (SEQ ID
NO:3 and SEQ ID NO:4) were used. PCR reactions were carried out in 50 L
reactions consisting of 20 mM Tris'HCI (pH 8.8), 10 mM KCI, 10 mM (NH4)2SO4i 20 mM
MgSO4, 0.1% Triton X-100, 0.1 mg/mL nuclease-free bovine serum albumin, 0.05 mM each deoxynucleotide triphosphate, 2 M each primer, 1.25 U cloned Pfu DNA
polymerase (Stratagene), and 160 ng of genomic DNA. The complete reactions were placed in a RoboCycler Gradient 96 Temperature Cycler (Stratagene). After 3 minutes at 94 C, the following three steps were repeated for 30 cycles: (1) 30 seconds at 94 C; (2) 30 seconds at 47 C; and (3) 2 minutes at 72 . This was followed with a 7 minute incubation at 72 C.
The resulting DNA contained the expected amplification product in addition to extraneous products. The product containing the glmS gene was purified using a QlAquick PCR purification kit, followed by electrophoresis of the purified product on an agarose gel, isolation of the correct band using a QlAquick gel extraction kit, and reamplification using this isolated DNA as a template. The reactions with the isolated DNA were amplified in a similar fashion as the original amplification described above, except that 40 ng of DNA was used as template, and only 20 cycles of amplification were performed. The product from this second amplification reaction was recovered as described above.
The presence of mutations in the genomic DNA was verified using primers specific for the DNA regions containing the mutations identified in the plasmids. For 2123-49, these included primers PK-1 (SEQ ID NO:5), PK-3 (SEQ ID NO:7), PK-4 (SEQ ID
NO:8), and PK-5A (SEQ IDNO:9). For 2123-124, primers PK-1 (SEQ IDNO:5), PK-4 (SEQ ID NO:8) and PK-5 (SEQ ID NO:9) were used. For 2123-54, the entire PCR
product was sequenced using all eight primers described earlier (SEQ ID NOs:5-12).
Sequencing of the PCR products confirmed the presence of the mutations identified from the plasmids and listed in Table 5.
Example 9 This example describes the construction of strains containing a mutant glmS
gene encoding a product containing only the glycine to serine alteration at position 472 (SEQ
ID NO:22) from strain 2123-54.
As indicated in Table 5, the only amino acid change in the GIcN6P synthase enzyme for strain 2123-124 (G1cN6P-S-124) is a leucine to proline alteration at position 469 (SEQ ID NO:25), unambiguously defining this mutation as being responsible for the overproduction of glucosamine by strain 2123-124. This would suggest the possibility that the glycine to serine alteration at position 472 (gly-ser472; SEQ ID
NO:22) of G1cN6P-S-54 in strain 2123-54 was likewise responsible for the glucosamine overproduction phenotype for this strain. In an effort to demonstrate this, the alteration was isolated away from the other two amino acid alterations in the G1cN6P-S-54 amino acid sequence (SEQ ID NO:22) of strain 2123-54 (i.e., Ala-'Thr39 and Arg--Cys250) by digesting plasmid pKLN23-54 with EcoRl and HinDM. These enzymes each have unique cleavage sites on the plasmid and cut at positions 2241 and 3305, respectively (positions indicated with respect to the equivalent positions in SEQ ID NO: 13 for pKLN23-28), resulting in fragments of 1064 and 6344 base pairs. The smaller fragment contains mutations in which the gly- ser472 alteration is the only amino acid change in this portion of GIcN6P-S-54. This smaller fragment was ligated to the corresponding larger fragment from pKLN23-28 containing the wild type glmS gene.
Two plasmids resulting from this ligation were designated pKLN23-149 and pKLN23-151. Sequencing the DNA from these plasmids using primers PK-1 (SEQ ID
NO:5), PK-3 (SEQ ID NO:7), and PK-4 (SEQ ID NO:8) verified that these plasmids contained the mutation at position 2666 present in plasmid pKLN23-54 but not the mutations at positions 1367 and 2000 (Table 5 with reference to SEQ ID NO:13).
The nucleic acid sequence of the 2184 base pairs between positions 1130 and of plasmid pKLN23-149 (these positions being determined relative to the equivalent positions in SEQ ID NO:13) are referred to herein as nucleic acid molecule nglmS-1492,84, the nucleic acid sequence of which is represented by SEQ ID NO:26. SEQ ID
NO:26 contains a nucleic acid sequence spanning nucleotides 124 through 1953, referred to herein as nglmS-1491830i which represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:26. The nucleic acid sequence of nglmS-1491830 is represented herein as SEQ ID NO:27. SEQ ID NO:27 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-149, the deduced amino acid sequence of which is represented herein as SEQ ID NO:28.
The nucleic acid sequence of the 2184 base pairs between positions 1130 and of plasmid pKLN23-151 (these positions being determined relative to the equivalent positions in SEQ ID NO: 13) are referred to herein as nucleic acid molecule nglmS-15121843 the nucleic acid sequence of which is represented by SEQ ID NO:29. SEQ ID
NO:29 contains a nucleic acid sequence spanning nucleotides 124 to 1953, referred to herein as nglmS-1511830, which represents an open reading frame encoding a mutant glucosamine-6-phosphate synthase of the present invention, with an initiation codon spanning from nucleotides 124-126 and a termination codon spanning from nucleotides 1951-1953 of SEQ ID NO:29. The nucleic acid sequence of nglmS-1511s30 is represented herein as SEQ
IDNO:30. SEQ ID NO:30 encodes a mutant glucosamine-6-phosphate synthase protein of 609 amino acids referred to herein as G1cN6P-S-151, the deduced amino acid sequence of which is represented herein as SEQ ID NO:31.
Strains isogenic to strain 2123-12 except for mutations conferring the gly-.
ser472 alteration were constructed using the scheme indicated in Fig. 7. Strains 2123-149 and 2123-151 were generated from plasmids pKLN23-149 and pKLN23-151, respectively.
The presence of the mutation at position 2666 (SEQ ID NO: 13) and the absence of mutations at positions 1367 and 2000 were verified by sequencing of PCR
products from genomic DNA of these strains using the methods described in Example 8.
Example 10 This example compares properties of G1cN6P synthase enzymes from strains 2123-12, 2123-49, 2123-54, 2123-124, 2123-149, and 2123-151.
Strains 2123-12, 2123-49, 2123-54, 2123-124, 2123-149, and 2123-151, described in the examples above, were grown overnight in LB broth at 37 C then transferred to fresh LB broth. Cultures were grown to an absorbance at 600 nm of 0.8 to 0.9, then induced for G1cN6P synthase production by the addition of I mM
IPTG. The cultures were grown for an additional three hours at 37 and harvested.
Extracts were prepared from cells harvested from those cultures as described in Example 2 and were assayed for glucosamine-6-phosphate synthase using the spectrophotometric assay as described in Example 2 except that a fiuctose-6-phosphate concentration of 20 mM was used. The enzyme was assayed in the presence and absence of added glucosamine-6-phosphate. In the absence of glucosamine-6-phosphate, the specific activities measured for these enzymes were similar except for that from strain 2123-124.
The data from Table 6 suggests that the latter strain encodes a less active variant of the enzyme.
Table 6 Specific Activities of GIcN6P Synthase from Glucosamine-Producing Strains Strain Specific Activity, mol min-' mg-' 2123-12 0.385 2123-49 0.375 2123-54 0.416 2123-124 0.0076 2123-149 0.494 2123-151 0.515 Fig. 10 shows that the G1cN6P synthase enzymes from strains 2123-49, 2123-54, and 2123-124 are significantly less inhibited by G1cN6P than the enzyme from strain 5 2123-12. Enzymes from strains 2123-149 and 2123-151 are slightly less inhibited by G1cN6P than the enzyme from 2123-12.
Thermal stability of the enzymes was also examined using these extracts. The extracts were incubated at 45 C (Fig. 11A) or 50 C (Fig. 11B) for various periods then assayed using the spectrophotometric assay. Figs. 11A and 11B show that the enzymes from 10 2123-49 and 2123-54 are much less stable than the wild type enzyme from strain 2123-12.
The enzyme from strain 2123-124 is comparable in stability to the wild type enzyme, and the enzymes from 2123-149 and 2123-151 are slightly less stable under the incubation conditions described here.
Example 11 15 The following example illustrates the effects of isopropylthio-p-D-galactoside (IPTG) concentration and temperature on glucosamine production.
Cultures of strains 2123-54 and 2123-124 were grown for 20 hours at 37 C on M9A medium (14 g/L K2HPO4, 16 g/L KH2PO4, 1 g/L Na3Citrate2H20, 5 g/L
(NH4)2SO4i pH 7.0) supplemented with 20 g/L glucose, 1 mM MgSO4, 0.1 mM CaCI2, 20 and varying amounts of IPTG. At the end of the growth period, a sample was taken and the glucosamine concentration in the culture supernatant was assayed using the Elson-Morgan assay described in Example 2. The results shown in Fig. 12 indicate that the optimum IPTG concentration for production is about 0.2 mM.
Subsequently, strain 2123-54 was grown in the same medium as described above in shake flasks with either 0.2 or 1 mM IPTG and at 30 C or 37 C. These flask cultures were also fed glucose and ammonium sulfate as described in Example 4. At various intervals, samples were taken and the glucosamine concentrations in culture supernatants were assayed using the Elson-Morgan assay described in Example 2. Fig. 13 shows that under the conditions of this experiment, there was little difference in glucosamine production associated with the differences in IPTG concentration. However, growth at 30 C resulted in higher glucosamine production than did growth at 37 C.
Results shown in Figs. 14A and 14B further indicated that at 30 C (Fig. 14A), glucosamine production continued after growth had ceased, while at 37 C (Fig. 14B), growth and glucosamine production occurred in concert.
When strains 2123-49 and 2123-124 were grown with 0.2 mM IPTG at 30 C, glucosamine production also occurred after growth had ceased, as shown in Figs. 15A
(2123-49) and 15B (2123-124). As observed at 37 C, the highest concentrations of glucosamine were obtained with strain 2123-54, followed by 2123-124 and 2123-49.
Also tested were strains 2123-149 and 2123-151 which produced negligibly higher concentrations of glucosamine than did 2123-12 (Table 7).
Table 7 Production of Glucosamine at 30 Strain Maximum Glucosamine Production, gIL
2123-12 0.3 2123-49 4.6 2123-54 7.2 2123-124 5.3 2123-149 0.6 2123-151 0.6 Example 12 The following example illustrates that glucosamine can be produced at higher concentrations in fermentor cultures of strain 2123-54 as compared to shake flasks. This example also illustrates that in fermentors, strain 2123-54 produces more glucosamine at 30 C than at 37 C.
Fermentation cultures of strain 2123-54 were cultivated in the medium shown in Table 8. Fermentations were run using NaOH for pH control to pH 6.7 and were fed a mixture of 33% glucose, 8% ammonium sulfate. Aeration and agitation were adjusted to maintain a dissolved oxygen concentration of greater than 20% of air saturation.
Table 8 Fermentation Medium Component Amount, g/L
Na3Citrate-2H20 1 (NH4)2SO4 5 MgSO4 0.12 CaC12 0.011 Mazu 204 Antifoam 0.5 mL/L
IPTG 0.048 Glucose 20 Trace Metals *Trace metal composition is 0.7 mg/L CoC12, 1.7 mg/L H3803i 0.6 mg/L
CuC12.21120, 10.5 mg/L
FeC13.6H20, 12 mg/L MnC12.4H20, 1.5 mg/L Na2Mo4.2H20, 1.5 mg/L ZnC12.
In the following experiment, three fermentations were run in one-liter vessels containing an initial volume of 600 mL. Variables tested were as follows.
Fermentor #1: The mixture of 33% glucose and 8% ammonium sulfate was fed at such a rate that no glucose accumulated in the fermentor. Growth was at 37 .
Fermentor #2: As with fermentation #1 except that growth was at 30 .
Fermentor #3: As with fermentation #2 except that the feed rate was increased to maintain a constant glucose concentration in the fermentor of 5 to 10 g/L.
Results from these fermentations are shown in Figs. 16A, 16B and 16C.
Comparison of the results from fermentors 1 (Fig. 16A) and 2 (Fig. 16B) shows that glucosamine titers are markedly higher at 30 C than they are at 37 C, as observed in shake flasks. The maximum glucosamine concentration observed was in the glucose-excess fermentor 3 grown at 30 C (Fig. 16C), at 10.9 g/L. At 30 C, growth and glucosamine concentration appeared to coincide, and there appeared to be a slight advantage to growth under glucose-excess. In subsequent fermentation experiments, run under conditions similar to fermentor #3, glucosamine concentrations in excess of 12 g/L have been obtained (data not shown).
In summary, the present inventors have described herein the use of metabolic engineering to create the first glucosamine overproducing strains ofE. coll.
The concept, proven here, will be generally applicable to any microorganism having a pathway for the production of amino sugars, or to any recombinant microorganism into which a pathway for the production of amino sugars has been introduced. In addition to the present strategy for creating a glucosamine-producing strain (i.e., eliminating glucosamine degradation and uptake and increasing expression of the glmS gene), the present inventors have also established that reducing product inhibition of glucosamine-6-phosphate synthase by glucosamine-6-phosphate improves glucosamine production.
While various embodiments of the present invention have been described in detail, it is apparent that modifications and adaptations of those embodiments will occur to those skilled in the art. However, it is to be expressly understood that such modifications and adaptations are within the spirit and scope of the present invention, as set forth in the following claims.
SEQUENCE LISTING
<110> Berry, Alan Burlingame, Richard P.
Millis, James R.
<120> PROCESS AND MATERIALS FOR PRODUCTION OF GLUCOSAMINE
<130> 3161-18-C1-PCT
<140>
<141>
<150> PCT/US98/00800 <151> 1998-01-14 <150> 60/035,494 <151> 1997-01-14 <160> 31 <170> Patentln Ver. 2.0 <210> 1 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 1 cggtctccca tgtgtggaat tgttggcgc 29 <210> 2 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 2 ctctagagcg ttgatattca gtcaattaca aaca 34 <210> 3 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 3 atggatgagc agacgatggt 20 <210> 4 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 4 cctcgaggtc gacggtatc 19 <210> 5 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 5 tggatgagca gacgatgg 18 <210> 6 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 6 tccgtcacag gtatttattc 20 <210> 7 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 7 agctgcgtgg tgcgtac 17 <210> 8 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 8 ggaccgtgtt tcagttcg 18 <210> 9 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 9 gccgtggcga tcagtac 17 <210> 10 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 10 gccaatcacc agcggac 17 <210> 11 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 11 atggtttccc gctactgg 18 <210> 12 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:primer <400> 12 gaaccaaggt aacccagc 18 <210> 13 <211> 7408 <212> DNA
<213> Escherichia coli <220>
<221> RBS
<222> (1240)..(1245) <220>
<221> promoter <222> (1165)..(1181) <220>
<221> conflict <222> (2509)..(2510) <400> 13 gaattgatcc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 60 atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 120 atcgcccttc ccaacagttg cgcagcctga atggcgaatg gcgctttgcc tggtttccgg 180 caccagaagc ggtgccggaa agctggctgg agtgcgatct tcctgaggcc gatactgtcg 240 tcgtcccctc aaactggcag atgcacggtt acgatgcgcc catctacacc aacgtaacct 300 atcccattac ggtcaatccg ccgtttgttc ccacggagaa tccgacgggt tgttactcgc 360 tcacatttaa tgttgatgaa agctggctac aggaaggcca gacgcgaatt atttttgatg 420 gcgttaactc ggcgtttcat ctgtggtgca acgggcgctg ggtcggttac ggccaggaca 480 gtcgtttgcc gtctgaattt gacctgagcg catttttacg cgccggagaa aaccgcctcg 540 cggtgatggt gctgcgttgg agtgacggca gttatctgga agatcaggat atgtggcgga 600 tgagcggcat tttccgtgac gtctcgttgc tgcataaacc gactacacaa atcagcgatt 660 tccatgttgc cactcgcttt aatgatgatt tcagccgcgc tgtactggag gctgaagttc 720 agatgtgcgg cgagttgcgt gactacctac gggtaacagt ttctttatgg cagggtgaaa 780 cgcaggtcgc cagcggcacc gcgcctttcg gcggtgaaat tatcgatgag cgtggtggtt 840 atgccgatcg cgtcacacta cgtctgaacg tcgaaaaccc gaaactgtgg agcgccgaaa 900 tcccgaatct ctatcgtgcg gtggttgaac tgcacaccgc cgacggcacg ctgattgaag 960 cagaagcctg cgatgtcggt ttccgcgagg tgcggattga aaatggtctg ctgctgctga 1020 acggcaagcc gttgctgatt cgaggcgtta accgtcacga gcatcatcct ctgcatggtc 1080 aggtcatgga tgagcagacg atggtgcagg atctccaccg cggtggcggc cgctctagaa 1140 ctagtggatc tcgatcccgc gaaattaata cgactcacta taggggaatt gtgagcggat 1200 aacaattccc ctctagaaat aattttgttt aactttaaga aggagatata ccatgtgtgg 1260 aattgttggc gcgatcgcgc aacgtgatgt agcagaaatc cttcttgaag gtttacgtcg 1320 tctggaatac cgcggatatg actctgccgg tctggccgtt gttgatgcag aaggtcatat 1380 gacccgcctg cgtcgcctcg gtaaagtcca gatgctggca caggcagcgg aagaacatcc 1440 tctgcatggc ggcactggta ttgctcacac tcgctgggcg acccacggtg aaccttcaga 1500 agtgaatgcg catccgcatg tttctgaaca cattgtggtg gtgcataacg gcatcatcga 1560 aaaccatgaa ccgctgcgtg aagagctaaa agcgcgtggc tataccttcg tttctgaaac 1620 cgacaccgaa gtgattgccc atctggtgaa ctgggagctg aaacaaggcg ggactctgcg 1680 tgaggccgtt ctgcgtgcta tcccgcagct gcgtggtgcg tacggtacag tgatcatgga 1740 ctcccgtcac ccggataccc tgctggcggc acgttctggt agtccgctgg tgattggcct 1800 ggggatgggc gaaaacttta tcgcttctga ccagctggcg ctgttgccgg tgacccgtcg 1860 ctttatcttc cttgaagagg gcgatattgc ggaaatcact cgccgttcgg taaacatctt 1920 cgataaaact ggcgcggaag taaaacgtca ggatatcgaa tccaatctgc aatatgacgc 1980 gggcgataaa ggcatttacc gtcactacat gcagaaagag atctacgaac agccgaacgc 2040 gatcaaaaac acccttaccg gacgcatcag ccacggtcag gttgatttaa gcgagctggg 2100 accgaacgcc gacgaactgc tgtcgaaggt tgagcatatt cagatcctcg cctgtggtac 2160 ttcttataac tccggtatgg tttcccgcta ctggtttgaa tcgctagcag gtattccgtg 2220 cgacgtcgaa atcgcctctg aattccgcta tcgcaaatct gccgtgcgtc gtaacagcct 2280 gatgatcacc ttgtcacagt ctggcgaaac cgcggatacc ctggctggcc tgcgtctgtc 2340 gaaagagctg ggttaccttg gttcactggc aatctgtaac gttccgggtt cttctctggt 2400 gcgcgaatcc gatctggcgc taatgaccaa cgcgggtaca gaaatcggcg tggcatccac 2460 taaagcattc accactcagt taactgtgct gttgatgctg gtggcgaagc tgtctcgcct 2520 gaaaggtctg gatgcctcca ttgaacatga catcgtgcat ggtctgcagg cgctgccgag 2580 ccgtattgag cagatgctgt ctcaggacaa acgcattgaa gcgctggcag aagatttctc 2640 tgacaaacat cacgcgctgt tcctgggccg tggcgatcag tacccaatcg cgctggaagg 2700 cgcattgaag ttgaaagaga tctcttacat tcacgctgaa gcctacgctg ctggcgaact 2760 gaaacacggt ccgctggcgc taattgatgc cgatatgccg gttattgttg ttgcaccgaa 2820 caacgaattg ctggaaaaac tgaaatccaa cattgaagaa gttcgcgcgc gtggcggtca 2880 gttgtatgtc ttcgccgatc aggatgcggg ttttgtaagt agcgataaca tgcacatcat 2940 cgagatgccg catgtggaag aggtgattgc accgatcttc tacaccgttc cgctgcagct 3000 gctggcttac catgtcgcgc tgatcaaagg caccgacgtt gaccagccgc gtaacctggc 3060 aaaatcggtt acggttgagt aataaatgga tgccctgcgt aagcggggca tttttcttcc 3120 tgttatgttt ttaatcaaac atcctgccaa ctccatgtga caaaccgtca tcttcggcta 3180 ctttttctct gtcacagaat gaaaattttt ctgtcatctc ttcgttatta atgtttgtaa 3240 ttgactgaat atcaacgctc tagaggggct agagcggccg ccaccgcggt ggagctccgt 3300 cgacaagctt atcgataccg tcgacctcga gggggggccc ggtaccgagg acgcgttcga 3360 ataaatacct gtgacggaag atcacttcgc agaataaata aatcctggtg tccctgttga 3420 taccgggaag ccctgggcca acttttggcg aaaatgagac gttgatcggc acgtaagagg 3480 ttccaacttt caccataatg aaataagatc actaccgggc gtattttttg agttatcgag 3540 attttcagga gctaaggaag ctaaaatgga gaaaaaaatc actggatata ccaccgttga 3600 tatatcccaa tggcatcgta aagaacattt tgaggcattt cagtcagttg ctcaatgtac 3660 ctataaccag accgttcagc tggatattac ggccttttta aagaccgtaa agaaaaataa 3720 gcacaagttt tatccggcct ttattcacat tcttgcccgc ctgatgaatg ctcatccgaa 3780 attccgtatg gcaatgaaag acggtgagct ggtgatatgg gatagtgttc acccttgtta 3840 caccgttttc catgagcaaa ctgaaacgtt ttcatcgctc tggagtgaat accacgacga 3900 tttccggcag tttctacaca tatattcgca agatgtggcg tgttacggtg aaaacctggc 3960 ctatttccct aaagggttta ttgagaatat gtttttcgtc tcagccaatc cctgggtgag 4020 tttcaccagt tttgatttaa acgtggccaa tatggacaac ttcttcgccc ccgttttcac 4080 catgggcaaa tattatacgc aaggcgacaa ggtgctgatg ccgctggcga ttcaggttca 4140 tcatgccgtt tgtgatggct tccatgtcgg cagaatgctt aatgaattac aacagtactg 4200 cgatgagtgg cagggcgggg cgtaattttt ttaaggcagt tattggtgcc cttaaacgcc 4260 tggtgctacg cctgaataag tgataataag cggatgaatg gcagaaattc ggacgcgtca 4320 attcgagctc ctgcactgga tggtggcgct ggatggtaag ccgctggcaa gcggtgaagt 4380 gcctctggat gtcgctccac aaggtaaaca gttgattgaa ctgcctgaac taccgcagcc 4440 ggagagcgcc gggcaactct ggctcacagt acgcgtagtg caaccgaacg cgaccgcatg 4500 gtcagaagcc gggcacatca gcgcctggca gcagtggcgt ctggcggaaa acctcagtgt 4560 gacgctcccc gccgcgtccc acgccatccc gcatctgacc accagcgaaa tggatttttg 4620 catcgagctg ggtaataagc gttggcaatt taaccgccag tcaggctttc tttcacagat 4680 gtggattggc gataaaaaac aactgctgac gccgctgcgc gatcagttca cccgtgcacc 4740 gctggataac gacattggcg taagtgaagc gacccgcatt gaccctaacg cctgggtcga 4800 acgctggaag gcggcgggcc attaccaggc cgaagcagcg ttgttgcagt gcacggcaga 4860 tacacttgct gatgcggtgc tgattacgac cgctcacgcg tggcagcatc aggggaaaac 4920 cttatttatc agccggaaaa cctaccggat tgatggtagt ggtcaaatgg cgattaccgt 4980 tgatgttgaa gtggcgagcg atacaccgca tccggcgcgg attggcctga actgccagct 5040 ggcgcaggta gcagagcggg taaactggct cggattaggg ccgcaagaaa actatcccga 5100 ccgccttact gccgcctgtt ttgaccgctg ggatctgcca ttgtcagaca tgtatacccc 5160 gtacgtcttc ccgagcgaaa acggtctgcg ctgcgggacg cgcgaattga attatggccc 5220 acaccagtgg cgcggcgact tccagttcaa catcagccgc tacagtcaac agcaactgat 5280 ggaaaccagc catcgccatc tgctgcacgc ggaagaaggc acatggctga atatcgacgg 5340 tttccatatg cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc gcatcaggcg 5400 ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt 5460 atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa 5520 gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc 5580 gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag 5640 gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt 5700 gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg 5760 aagcgtggcg ctttctcaat gctcacgctg taggtatctc agttcggtgt aggtcgttcg 5820 ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg 5880 taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac 5940 tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg 6000 gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt 6060 taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg 6120 tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc 6180 tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt 6240 ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt 6300 taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag 6360 tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt 6420 cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg caatgatacc 6480 gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc 6540 cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg 6600 ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac 6660 aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg 6720 atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc 6780 tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact 6840 gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc 6900 aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat 6960 acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc 7020 ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac 7080 tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa 7140 aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact 7200 catactcttc ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg 7260 atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg 7320 aaaagtgcca cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag 7380 gcgtatcacg aggccctttc gtcttcaa 7408 <210> 14 <211> 2184 <212> DNA
<213> Escherichia coli <400> 14 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgggcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctagaggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 15 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 15 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser. Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gin Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg ggc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Giu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gin Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 16 <211> 609 <212> PRT
<213> Escherichia coli <400> 16 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr. Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Giy Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 17 <211> 2184 <212> DNA
<213> Escherichia coli <400> 17 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaactgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcacca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg cctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgggcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctagaggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 18 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 18 atg tgt gga act gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Thr Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca.gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc acc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Thr agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gin Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn. Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg cct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg ggc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 19 <211> 609 <212> PRT
<213> Escherichia coli <400> 19 Met Cys Gly Thr Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Thr Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Giu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Pro Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp 450. 455 460 Lys His His Ala Leu Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 20 <211> 2184 <212> DNA
<213> Escherichia coli <400> 20 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgataca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac tgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctcc gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgagcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctaggggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 21 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 21 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat aca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Thr Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gin Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag.ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac tgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Cys His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Giu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gin Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tcc gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gin Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala.Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg agc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 22 <211> 609 <212> PRT
<213> Escherichia coli <400> 22 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Thr Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Cys His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser. Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser. Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 23 <211> 2184 <212> DNA
<213> Escherichia coli <400> 23 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttccgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgccg ttcctgggcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctagaggggc tagagcggcc accaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 24 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 24 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tcc gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac. tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ccg ttc ctg ggc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Pro Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu. Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 25 <211> 609 <212> PRT
<213> Escherichia coli <400> 25 Met Cys Gly Ile Val Gly Ala Ile Ala Gin Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val.Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Pro Phe Leu Gly Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 26 <211> 2184 <212> DNA
<213> Escherichia coli <400> 26 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgagcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctaggggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 27 <211> 1830 <212> DNA
<213> Escherichia coli <220>
<221> CDS
<222> (1)..(1830) <400> 27 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gin Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528
36 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser. Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val
37 gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg agc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser. Val Thr Val gag taa 1830 Glu
38 <210> 28 <211> 609 <212> PRT
<213> Escherichia coli <400> 28 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile
<213> Escherichia coli <400> 28 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile
39 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Giu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gin Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gin Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu <210> 29 <211> 2184 <212> DNA
<213> Escherichia coli <400> 29 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgagcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctaggggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 30 <211> 1830 <212> DNA
<213> Escherichia cola <220>
<221> CDS
<222> (1)..(1830) <400> 30 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg agc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gin Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 31 <211> 609 <212> PRT
<213> Escherichia coli <400> 31 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gin Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gin Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser. Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu
<213> Escherichia coli <400> 29 ccgctctaga actagtggat ctcgatcccg cgaaattaat acgactcact ataggggaat 60 tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag aaggagatat 120 accatgtgtg gaattgttgg cgcgatcgcg caacgtgatg tagcagaaat ccttcttgaa 180 ggtttacgtc gtctggaata ccgcggatat gactctgccg gtctggccgt tgttgatgca 240 gaaggtcata tgacccgcct gcgtcgcctc ggtaaagtcc agatgctggc acaggcagcg 300 gaagaacatc ctctgcatgg cggcactggt attgctcaca ctcgctgggc gacccacggt 360 gaaccttcag aagtgaatgc gcatccgcat gtttctgaac acattgtggt ggtgcataac 420 ggcatcatcg aaaaccatga accgctgcgt gaagagctaa aagcgcgtgg ctataccttc 480 gtttctgaaa ccgacaccga agtgattgcc catctggtga actgggagct gaaacaaggc 540 gggactctgc gtgaggccgt tctgcgtgct atcccgcagc tgcgtggtgc gtacggtaca 600 gtgatcatgg actcccgtca cccggatacc ctgctggcgg cacgttctgg tagtccgctg 660 gtgattggcc tggggatggg cgaaaacttt atcgcttctg accagctggc gctgttgccg 720 gtgacccgtc gctttatctt ccttgaagag ggcgatattg cggaaatcac tcgccgttcg 780 gtaaacatct tcgataaaac tggcgcggaa gtaaaacgtc aggatatcga atccaatctg 840 caatatgacg cgggcgataa aggcatttac cgtcactaca tgcagaaaga gatctacgaa 900 cagccgaacg cgatcaaaaa cacccttacc ggacgcatca gccacggtca ggttgattta 960 agcgagctgg gaccgaacgc cgacgaactg ctgtcgaagg ttgagcatat tcagatcctc 1020 gcctgtggta cttcttataa ctccggtatg gtttcccgct actggtttga atcgctagca 1080 ggtattccgt gcgacgtcga aatcgcctct gaattccgct atcgcaaatc tgccgtgcgt 1140 cgtaacagcc tgatgatcac cttgtcacag tctggcgaaa ccgcggatac cctggctggc 1200 ctgcgtctgt cgaaagagct gggttacctt ggttcactgg caatctgtaa cgttccgggt 1260 tcttctctgg tgcgcgaatc cgatctggcg ctaatgacca acgcgggtac agaaatcggc 1320 gtggcatcca ctaaagcatt caccactcag ttaactgtgc tgttgatgct ggtggcgaag 1380 ctgtctcgcc tgaaaggtct ggatgcctcc attgaacatg acatcgtgca tggtctgcag 1440 gcgctgccga gccgtattga gcagatgctg tctcaggaca aacgcattga agcgctggca 1500 gaagatttct ctgacaaaca tcacgcgctg ttcctgagcc gtggcgatca gtacccaatc 1560 gcgctggaag gcgcattgaa gttgaaagag atctcttaca ttcacgctga agcctacgct 1620 gctggcgaac tgaaacacgg tccgctggcg ctaattgatg ccgatatgcc ggttattgtt 1680 gttgcaccga acaacgaatt gctggaaaaa ctgaaatcca acattgaaga agttcgcgcg 1740 cgtggcggtc agttgtatgt cttcgccgat caggatgcgg gttttgtaag tagcgataac 1800 atgcacatca tcgagatgcc gcatgtggaa gaggtgattg caccgatctt ctacaccgtt 1860 ccgctgcagc tgctggctta ccatgtcgcg ctgatcaaag gcaccgacgt tgaccagccg 1920 cgtaacctgg caaaatcggt tacggttgag taataaatgg atgccctgcg taagcggggc 1980 atttttcttc ctgttatgtt tttaatcaaa catcctgcca actccatgtg acaaaccgtc 2040 atcttcggct actttttctc tgtcacagaa tgaaaatttt tctgtcatct cttcgttatt 2100 aatgtttgta attgactgaa tatcaacgct ctaggggggc tagagcggcc gccaccgcgg 2160 tggagctccg tcgacaagct tatc 2184 <210> 30 <211> 1830 <212> DNA
<213> Escherichia cola <220>
<221> CDS
<222> (1)..(1830) <400> 30 atg tgt gga att gtt ggc gcg atc gcg caa cgt gat gta gca gaa atc 48 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile ctt ctt gaa ggt tta cgt cgt ctg gaa tac cgc gga tat gac tct gcc 96 Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala ggt ctg gcc gtt gtt gat gca gaa ggt cat atg acc cgc ctg cgt cgc 144 Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg ctc ggt aaa gtc cag atg ctg gca cag gca gcg gaa gaa cat cct ctg 192 Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu cat ggc ggc act ggt att get cac act cgc tgg gcg acc cac ggt gaa 240 His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu cct tca gaa gtg aat gcg cat ccg cat gtt tct gaa cac att gtg gtg 288 Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val gtg cat aac ggc atc atc gaa aac cat gaa ccg ctg cgt gaa gag cta 336 Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu aaa gcg cgt ggc tat acc ttc gtt tct gaa acc gac acc gaa gtg att 384 Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile gcc cat ctg gtg aac tgg gag ctg aaa caa ggc ggg act ctg cgt gag 432 Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu gcc gtt ctg cgt get atc ccg cag ctg cgt ggt gcg tac ggt aca gtg 480 Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val atc atg gac tcc cgt cac ccg gat acc ctg ctg gcg gca cgt tct ggt 528 Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly agt ccg ctg gtg att ggc ctg ggg atg ggc gaa aac ttt atc get tct 576 Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser gac cag ctg gcg ctg ttg ccg gtg acc cgt cgc ttt atc ttc ctt gaa 624 Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu gag ggc gat att gcg gaa atc act cgc cgt tcg gta aac atc ttc gat 672 Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp aaa act ggc gcg gaa gta aaa cgt cag gat atc gaa tcc aat ctg caa 720 Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln tat gac gcg ggc gat aaa ggc att tac cgt cac tac atg cag aaa gag 768 Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu atc tac gaa cag ccg aac gcg atc aaa aac acc ctt acc gga cgc atc 816 Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile agc cac ggt cag gtt gat tta agc gag ctg gga ccg aac gcc gac gaa 864 Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu ctg ctg tcg aag gtt gag cat att cag atc ctc gcc tgt ggt act tct 912 Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser tat aac tcc ggt atg gtt tcc cgc tac tgg ttt gaa tcg cta gca ggt 960 Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly att ccg tgc gac gtc gaa atc gcc tct gaa ttc cgc tat cgc aaa tct 1008 Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser gcc gtg cgt cgt aac agc ctg atg atc acc ttg tca cag tct ggc gaa 1056 Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu acc gcg gat acc ctg get ggc ctg cgt ctg tcg aaa gag ctg ggt tac 1104 Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr ctt ggt tca ctg gca atc tgt aac gtt ccg ggt tct tct ctg gtg cgc 1152 Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg gaa tcc gat ctg gcg cta atg acc aac gcg ggt aca gaa atc ggc gtg 1200 Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val gca tcc act aaa gca ttc acc act cag tta act gtg ctg ttg atg ctg 1248 Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu gtg gcg aag ctg tct cgc ctg aaa ggt ctg gat gcc tcc att gaa cat 1296 Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His gac atc gtg cat ggt ctg cag gcg ctg ccg agc cgt att gag cag atg 1344 Asp Ile Val His Gly Leu Gln Ala Leu Pro Ser Arg Ile Glu Gln Met ctg tct cag gac aaa cgc att gaa gcg ctg gca gaa gat ttc tct gac 1392 Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp aaa cat cac gcg ctg ttc ctg agc cgt ggc gat cag tac cca atc gcg 1440 Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gln Tyr Pro Ile Ala ctg gaa ggc gca ttg aag ttg aaa gag atc tct tac att cac get gaa 1488 Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu gcc tac get get ggc gaa ctg aaa cac ggt ccg ctg gcg cta att gat 1536 Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp gcc gat atg ccg gtt att gtt gtt gca ccg aac aac gaa ttg ctg gaa 1584 Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu aaa ctg aaa tcc aac att gaa gaa gtt cgc gcg cgt ggc ggt cag ttg 1632 Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu tat gtc ttc gcc gat cag gat gcg ggt ttt gta agt agc gat aac atg 1680 Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser Asp Asn Met cac atc atc gag atg ccg cat gtg gaa gag gtg att gca ccg atc ttc 1728 His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe tac acc gtt ccg ctg cag ctg ctg get tac cat gtc gcg ctg atc aaa 1776 Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys ggc acc gac gtt gac cag ccg cgt aac ctg gca aaa tcg gtt acg gtt 1824 Gly Thr Asp Val Asp Gin Pro Arg Asn Leu Ala Lys Ser Val Thr Val gag taa 1830 Glu <210> 31 <211> 609 <212> PRT
<213> Escherichia coli <400> 31 Met Cys Gly Ile Val Gly Ala Ile Ala Gln Arg Asp Val Ala Glu Ile Leu Leu Glu Gly Leu Arg Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala Gly Leu Ala Val Val Asp Ala Glu Gly His Met Thr Arg Leu Arg Arg Leu Gly Lys Val Gln Met Leu Ala Gln Ala Ala Glu Glu His Pro Leu His Gly Gly Thr Gly Ile Ala His Thr Arg Trp Ala Thr His Gly Glu Pro Ser Glu Val Asn Ala His Pro His Val Ser Glu His Ile Val Val Val His Asn Gly Ile Ile Glu Asn His Glu Pro Leu Arg Glu Glu Leu Lys Ala Arg Gly Tyr Thr Phe Val Ser Glu Thr Asp Thr Glu Val Ile Ala His Leu Val Asn Trp Glu Leu Lys Gln Gly Gly Thr Leu Arg Glu Ala Val Leu Arg Ala Ile Pro Gln Leu Arg Gly Ala Tyr Gly Thr Val Ile Met Asp Ser Arg His Pro Asp Thr Leu Leu Ala Ala Arg Ser Gly Ser Pro Leu Val Ile Gly Leu Gly Met Gly Glu Asn Phe Ile Ala Ser Asp Gln Leu Ala Leu Leu Pro Val Thr Arg Arg Phe Ile Phe Leu Glu Glu Gly Asp Ile Ala Glu Ile Thr Arg Arg Ser Val Asn Ile Phe Asp Lys Thr Gly Ala Glu Val Lys Arg Gln Asp Ile Glu Ser Asn Leu Gln Tyr Asp Ala Gly Asp Lys Gly Ile Tyr Arg His Tyr Met Gln Lys Glu Ile Tyr Glu Gln Pro Asn Ala Ile Lys Asn Thr Leu Thr Gly Arg Ile Ser His Gly Gln Val Asp Leu Ser Glu Leu Gly Pro Asn Ala Asp Glu Leu Leu Ser Lys Val Glu His Ile Gln Ile Leu Ala Cys Gly Thr Ser Tyr Asn Ser Gly Met Val Ser Arg Tyr Trp Phe Glu Ser Leu Ala Gly Ile Pro Cys Asp Val Glu Ile Ala Ser Glu Phe Arg Tyr Arg Lys Ser Ala Val Arg Arg Asn Ser Leu Met Ile Thr Leu Ser Gln Ser Gly Glu Thr Ala Asp Thr Leu Ala Gly Leu Arg Leu Ser Lys Glu Leu Gly Tyr Leu Gly Ser Leu Ala Ile Cys Asn Val Pro Gly Ser Ser Leu Val Arg Glu Ser Asp Leu Ala Leu Met Thr Asn Ala Gly Thr Glu Ile Gly Val Ala Ser Thr Lys Ala Phe Thr Thr Gln Leu Thr Val Leu Leu Met Leu Val Ala Lys Leu Ser Arg Leu Lys Gly Leu Asp Ala Ser Ile Glu His Asp Ile Val His Gly Leu Gin Ala Leu Pro Ser Arg Ile Glu Gln Met Leu Ser Gln Asp Lys Arg Ile Glu Ala Leu Ala Glu Asp Phe Ser Asp Lys His His Ala Leu Phe Leu Ser Arg Gly Asp Gin Tyr Pro Ile Ala Leu Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Ala Tyr Ala Ala Gly Glu Leu Lys His Gly Pro Leu Ala Leu Ile Asp Ala Asp Met Pro Val Ile Val Val Ala Pro Asn Asn Glu Leu Leu Glu Lys Leu Lys Ser Asn Ile Glu Glu Val Arg Ala Arg Gly Gly Gln Leu Tyr Val Phe Ala Asp Gln Asp Ala Gly Phe Val Ser Ser. Asp Asn Met His Ile Ile Glu Met Pro His Val Glu Glu Val Ile Ala Pro Ile Phe Tyr Thr Val Pro Leu Gln Leu Leu Ala Tyr His Val Ala Leu Ile Lys Gly Thr Asp Val Asp Gln Pro Arg Asn Leu Ala Lys Ser Val Thr Val Glu
Claims (40)
1. An isolated microorganism for producing glucosamine by a biosynthetic process, the microorganism having a genetic modification that increases the activity of glucosamine-6-phosphate synthase, wherein the genetic modification is selected from the group consisting of:
a) transformation with a recombinant nucleic acid molecule comprising a nucleic acid sequence encoding a modified glucosamine-6-phosphate synthase, wherein the amino acid sequence of the modified glucosamine-6-phosphate synthase differs from the amino acid sequence of a non-modified glucosamine-6-phosphate synthase by a deletion or substitution at a position, with respect to amino acid sequence SEQ ID NO:16, selected from the group consisting of Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469), and combinations thereof;
b) a genetic modification in the coding region of a gene encoding glucosamine-6-phosphate synthase, wherein the genetic modification is a deletion or substitution at a position, with respect to amino acid sequence SEQ
ID NO:16, selected from the group consisting of Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469), and combinations thereof;
wherein the genetic modification increases production of glucosamine by the microorganism as compared to in the absence of the genetic modification.
a) transformation with a recombinant nucleic acid molecule comprising a nucleic acid sequence encoding a modified glucosamine-6-phosphate synthase, wherein the amino acid sequence of the modified glucosamine-6-phosphate synthase differs from the amino acid sequence of a non-modified glucosamine-6-phosphate synthase by a deletion or substitution at a position, with respect to amino acid sequence SEQ ID NO:16, selected from the group consisting of Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469), and combinations thereof;
b) a genetic modification in the coding region of a gene encoding glucosamine-6-phosphate synthase, wherein the genetic modification is a deletion or substitution at a position, with respect to amino acid sequence SEQ
ID NO:16, selected from the group consisting of Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469), and combinations thereof;
wherein the genetic modification increases production of glucosamine by the microorganism as compared to in the absence of the genetic modification.
2. The isolated microorganism of Claim 1, wherein the recombinant nucleic acid molecule comprises a lac promoter; wherein expression of the glucosamine-6-phosphate synthase is regulated through induction of the lac promoter.
3. The isolated microorganism of Claim 1, wherein the recombinant nucleic acid molecule comprises a nucleic acid sequence that is at least 75%
identical to SEQ ID NO:15 and encodes a protein having glucosamine-6-phosphate synthase activity.
identical to SEQ ID NO:15 and encodes a protein having glucosamine-6-phosphate synthase activity.
4. The isolated microorganism of Claim 1, wherein the glucosamine-6-phosphate synthase has an amino acid sequence that differs from a wild-type glucosamine-6-phosphate by a substitution, with respect to SEQ ID NO:16, selected from the group consisting of:
a) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(4);
b) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(272);
c) a substitution of an amino acid residue having an aliphatic side group for Ser(450);
d) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ala(39);
e) a substitution of an amino acid residue having a sulfur-containing side group for Arg(250);
f) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Gly(472);
g) a substitution of an amino acid residue having an aliphatic side group for Leu(469);
h) and combinations of (a)-(g).
a) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(4);
b) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(272);
c) a substitution of an amino acid residue having an aliphatic side group for Ser(450);
d) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ala(39);
e) a substitution of an amino acid residue having a sulfur-containing side group for Arg(250);
f) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Gly(472);
g) a substitution of an amino acid residue having an aliphatic side group for Leu(469);
h) and combinations of (a)-(g).
5. The isolated microorganism of Claim 1, wherein the glucosamine-6-phosphate synthase has an amino acid sequence that differs from a wild-type glucosamine-6-phosphate by a substitution, with respect to SEQ ID NO:16, selected from the group consisting of: Ile(4) to Thr, Ile(272) to Thr, Ser(450) to Pro, Ala(39) to Thr, Arg(250) to Cys, Gly(472) to Ser, Leu(469) to Pro, and combinations thereof.
6. The isolated microorganism of Claim 1, wherein the glucosamine-6-phosphate synthase has an amino acid sequence that differs from a wild-type glucosamine-6-phosphate by a substitution of a proline residue for a leucine residue at amino acid sequence position Leu(469), with respect to SEQ ID NO: 16.
7. The isolated microorganism of Claim 1, wherein the glucosamine-6-phosphate synthase has an amino acid sequence that differs from a wild-type glucosamine-6-phosphate by a substitution, with respect to SEQ ID NO: 16, selected from the group consisting of:
a) a threonine residue for an alanine residue at position Ala(39);
b) a cysteine residue for an arginine residue at position Arg(250);
c) a serine residue for a glycine residue at position Gly(472); and d) any combination of (a), (b), or (c).
a) a threonine residue for an alanine residue at position Ala(39);
b) a cysteine residue for an arginine residue at position Arg(250);
c) a serine residue for a glycine residue at position Gly(472); and d) any combination of (a), (b), or (c).
8. The isolated microorganism of Claim 1, wherein the glucosamine-6-phosphate synthase has an amino acid sequence that differs from a wild-type glucosamine-6-phosphate by a substitution, with respect to SEQ ID NO:16, selected from the group consisting of:
a) a threonine residue for an isoleucine residue at position Ile(4);
b) a threonine residue for an isoleucine residue at position Ile(272);
c) a proline residue for a serine residue at position Ser(450); and d) any combination of (a), (b), or (c).
a) a threonine residue for an isoleucine residue at position Ile(4);
b) a threonine residue for an isoleucine residue at position Ile(272);
c) a proline residue for a serine residue at position Ser(450); and d) any combination of (a), (b), or (c).
9. The isolated microorganism of Claim 1, wherein the glucosamine-6-phosphate synthase comprises an amino acid sequence selected from the group consisting of SEQ ID NO:19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28 and SEQ ID NO:31.
10. The isolated microorganism of Claim 1, wherein the glucosamine-6-phosphate synthase is encoded by a nucleic acid sequence selected from the group consisting of SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ
ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:29 and SEQ
ID NO:30.
ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:29 and SEQ
ID NO:30.
11. The isolated microorganism of Claim 1, wherein the recombinant nucleic acid molecule is integrated into the genome of said microorganism.
12. The isolated microorganism of Claim 1, wherein the microorganism has a deletion of at least a portion of at least one nucleic acid molecule encoding a protein selected from the group consisting of N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase, N-acetyl-glucosamine-specific enzyme II Nag, phosphoglucosamine mutase, glucosamine-1-phosphate acetyltransferase-N-acetylglucosamine-1-phosphate uridyltransferase, phosphofructokinase, Enzyme II Glc of the PEP:glucose PTS, and EIIM,P/III Man of the PEP:mannose PTS.
13. The isolated microorganism of Claim 1, wherein the microorganism expresses a recombinant nucleic acid molecule encoding a phosphatase.
14. The isolated microorganism of Claim 1, wherein the microorganism has a deletion of at least a portion of the genes encoding the following proteins: N-acetylglucosamine-6-phosphate deacetylase, glucosamine-6-phosphate deaminase and N-acetyl-glucosamine-specific enzyme II Nag;
wherein the deletions decrease the activity of the proteins as compared to in the absence of the deletions.
wherein the deletions decrease the activity of the proteins as compared to in the absence of the deletions.
15. The isolated microorganism of Claim 1, wherein the microorganism is selected from the group consisting of bacteria and yeast.
16. The isolated microorganism of Claim 1, wherein the microorganism is Escherichia coli.
17. The isolated microorganism of Claim 1, wherein the microorganism is a yeast.
18. The isolated microorganism of Claim 1, wherein the microorganism is Escherichia coli and wherein the microorganism has a deletion of at least a portion of at least one Escherichia coli gene selected from the group consisting of nagA, nagB, nagC, nagD, nagE, manXYZ, glmM, pfkB, pfkA, glmU, and ptsG.
19. The isolated microorganism of Claim 1, wherein the microorganism is Escherichia coli which has a deletion of nag regulon genes and a genetic modification in manXYZ genes such that proteins encoded by the manXYZ genes have decreased activity.
20. The isolated microorganism of Claim 1, wherein said microorganism produces at least 1 g/L of glucosamine when cultured from about 10 to about 60 hours at from about 28°C to about 37°C to a cell density of at least about 8 g/L by dry cell weight, in a pH 7.0 fermentation medium comprising: 14 g/L K2HPO4, 16 g/L
KH2PO4, 1 g/L Na3Citrate 2H2O, 5 g/L (NH4)2SO4, 20 g/L glucose, 10 mM MgSO4, 1 mM CaCl2, and from about 0.2mM to about 1 mM IPTG.
KH2PO4, 1 g/L Na3Citrate 2H2O, 5 g/L (NH4)2SO4, 20 g/L glucose, 10 mM MgSO4, 1 mM CaCl2, and from about 0.2mM to about 1 mM IPTG.
21. A method to produce a product selected from the group consisting of glucosamine-6-phosphate and glucosamine by fermentation, comprising:
a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, the isolated microorganism of any one of Claims 1 to 20, wherein the step of culturing produces and accumulates a product selected from the group consisting of glucosamine-6-phosphate and glucosamine from the microorganism; and b) recovering and purifying the product.
a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, the isolated microorganism of any one of Claims 1 to 20, wherein the step of culturing produces and accumulates a product selected from the group consisting of glucosamine-6-phosphate and glucosamine from the microorganism; and b) recovering and purifying the product.
22. The method of Claim 21, wherein the glucosamine-6-phosphate is intracellular and the glucosamine is extracellular, wherein the step of recovering comprises a recovering step selected from the group consisting of recovering the glucosamine-6-phosphate from the microorganism, recovering the glucosamine from the fermentation medium, and a combination thereof.
23. The method of Claim 21, wherein the product is intracellular glucosamine-6-phosphate and the step of recovering further comprises dephosphorylating the glucosamine-6-phosphate to produce glucosamine.
24. The method of Claim 21, wherein the step of culturing is performed at a temperature from about 20°C to about 40°C.
25. The method of Claim 21, wherein said step of culturing comprises maintaining said source of carbon at a concentration of from about 0.5% to about 5%
in said fermentation medium.
in said fermentation medium.
26. The method of Claim 21, wherein said step of culturing is performed in a fermentor.
27. The method of Claim 21, wherein the step of culturing is performed so that an excess of glucose is maintained.
28. The method of Claim 21, wherein the step of culturing produces and accumulates at least about 1 g/L of the product.
29. An isolated glucosamine-6-phosphate synthase encoded by a nucleic acid sequence that is at least 75% identical to SEQ ID NO: 15, wherein the glucosamine-6-phosphate synthase has an amino acid sequence that differs from SEQ
ID NO: 16 by a modification of a deletion or a substitution at a position selected from the group consisting of Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469), and combinations thereof, and wherein the synthase has glucosamine-6-phosphate synthase activity and has reduced product inhibition as compared to SEQ
ID NO:16.
ID NO: 16 by a modification of a deletion or a substitution at a position selected from the group consisting of Ile(4), Ile(272), Ser(450), Ala(39), Arg(250), Gly(472), Leu(469), and combinations thereof, and wherein the synthase has glucosamine-6-phosphate synthase activity and has reduced product inhibition as compared to SEQ
ID NO:16.
30. The isolated glucosamine-6-phosphate synthase of Claim 29, wherein the amino acid sequence differs from SEQ ID NO:16 by a substitution selected from the group consisting of:
a) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(4);
b) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(272);
c) a substitution of an amino acid residue having an aliphatic side group for Ser(450);
d) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ala(39);
e) a substitution of an amino acid residue having a sulfur-containing side group for Arg(250);
f) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Gly(472);
g) a substitution of an amino acid residue having an aliphatic side group for Leu(469);
h) and combinations of (a)-(g).
a) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(4);
b) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ile(272);
c) a substitution of an amino acid residue having an aliphatic side group for Ser(450);
d) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Ala(39);
e) a substitution of an amino acid residue having a sulfur-containing side group for Arg(250);
f) a substitution of an amino acid residue having an aliphatic hydroxyl side group for Gly(472);
g) a substitution of an amino acid residue having an aliphatic side group for Leu(469);
h) and combinations of (a)-(g).
31. The isolated glucosamine-6-phosphate synthase of Claim 29, wherein the amino acid sequence differs from SEQ ID NO: 16 by a substitution selected from the group consisting of: Ile(4) to Thr, Ile(272) to Thr, Ser(450) to Pro, Ala(39) to Thr, Arg(250) to Cys, Gly(472) to Ser, Leu(469) to Pro, and combinations thereof.
32. The isolated glucosamine-6-phosphate synthase of Claim 29, wherein the amino acid sequence differs from SEQ ID NO:16 by a substitution of a proline residue for a leucine residue at amino acid sequence position Leu(469).
33. The isolated glucosamine-6-phosphate synthase of Claim 29, wherein the amino acid sequence differs from SEQ ID NO:16 by substitution selected from the group consisting of:
a) a threonine residue for an alanine residue at position Ala(39);
b) a cysteine residue for an arginine residue at position Arg(250);
c) a serine residue for a glycine residue at position Gly(472); and d) any combination of (a), (b), or (c).
a) a threonine residue for an alanine residue at position Ala(39);
b) a cysteine residue for an arginine residue at position Arg(250);
c) a serine residue for a glycine residue at position Gly(472); and d) any combination of (a), (b), or (c).
34. The isolated glucosamine-6-phosphate synthase of Claim 29, wherein the amino acid sequence differs from SEQ ID NO:16 by substitution selected from the group consisting of:
a) a threonine residue for an isoleucine residue at position Ile(4);
b) a threonine residue for an isoleucine residue at position Ile(272);
c) a proline residue for a serine residue at position Ser(450); and d) any combination of (a), (b), or (c).
a) a threonine residue for an isoleucine residue at position Ile(4);
b) a threonine residue for an isoleucine residue at position Ile(272);
c) a proline residue for a serine residue at position Ser(450); and d) any combination of (a), (b), or (c).
35. The isolated glucosamine-6-phosphate synthase of Claim 29, wherein amino acid sequence comprises an amino acid sequence selected from the group consisting of SEQ ID NO:19, SEQ ID NO:22, SEQ ID NO:25, SEQ ID NO:28 and SEQ ID NO:31.
36. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding the glucosamine-6-phosphate synthase of any one of Claims 29 to 35.
37. The isolated nucleic acid molecule of Claim 36, wherein the nucleic acid molecule comprises a nucleic acid sequence selected from the group consisting of SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:29 and SEQ ID NO:30.
38. A recombinant nucleic acid molecule comprising the nucleic acid molecule of Claim 36.
39. A recombinant cell comprising the recombinant nucleic acid molecule of Claim 38.
40. A method to produce glucosamine by fermentation, comprising:
a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, a genetically modified microorganism having increased glucosamine-6-phosphate synthase activity and reduced glucosamine-6-phosphate product inhibition, wherein said genetically modified microorganism is produced by a process comprising the steps of:
i) generating modifications in an isolated nucleic acid molecule comprising a nucleic acid sequence encoding glucosamine-6-phosphate synthase to create a plurality of modified nucleic acid sequences;
ii) transforming microorganisms with said modified nucleic acid sequences to produce genetically modified microorganisms;
iii) screening said genetically modified microorganisms for glucosamine-6-phosphate synthase activity and glucosamine-6-phosphate product inhibition; and, iv) selecting said genetically modified microorganisms which have increased glucosamine-6-phosphate synthase activity and reduced glucosamine-6-phosphate product inhibition;
wherein said step of culturing produces a product selected from the group consisting of glucosamine-6-phosphate and glucosamine from said microorganism; and, b) recovering said product.
a) culturing in a fermentation medium comprising assimilable sources of carbon, nitrogen and phosphate, a genetically modified microorganism having increased glucosamine-6-phosphate synthase activity and reduced glucosamine-6-phosphate product inhibition, wherein said genetically modified microorganism is produced by a process comprising the steps of:
i) generating modifications in an isolated nucleic acid molecule comprising a nucleic acid sequence encoding glucosamine-6-phosphate synthase to create a plurality of modified nucleic acid sequences;
ii) transforming microorganisms with said modified nucleic acid sequences to produce genetically modified microorganisms;
iii) screening said genetically modified microorganisms for glucosamine-6-phosphate synthase activity and glucosamine-6-phosphate product inhibition; and, iv) selecting said genetically modified microorganisms which have increased glucosamine-6-phosphate synthase activity and reduced glucosamine-6-phosphate product inhibition;
wherein said step of culturing produces a product selected from the group consisting of glucosamine-6-phosphate and glucosamine from said microorganism; and, b) recovering said product.
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/115,475 | 1998-07-15 | ||
| US09/115,475 US6372457B1 (en) | 1997-01-14 | 1998-07-15 | Process and materials for production of glucosamine |
| PCT/US1999/015976 WO2000004182A1 (en) | 1998-07-15 | 1999-07-15 | Process and materials for production of glucosamine |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CA2332380A1 CA2332380A1 (en) | 2000-01-27 |
| CA2332380C true CA2332380C (en) | 2010-11-02 |
Family
ID=22361646
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA2332380A Expired - Lifetime CA2332380C (en) | 1998-07-15 | 1999-07-15 | Process and materials for production of glucosamine |
Country Status (9)
| Country | Link |
|---|---|
| US (4) | US6372457B1 (en) |
| EP (1) | EP1095158B1 (en) |
| JP (2) | JP4567887B2 (en) |
| CN (1) | CN100491524C (en) |
| AT (1) | ATE414162T1 (en) |
| AU (1) | AU5102899A (en) |
| CA (1) | CA2332380C (en) |
| DE (1) | DE69939901D1 (en) |
| WO (1) | WO2000004182A1 (en) |
Families Citing this family (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6372457B1 (en) * | 1997-01-14 | 2002-04-16 | Arkion Life Sciences Llc | Process and materials for production of glucosamine |
| US20020160459A1 (en) * | 1997-01-14 | 2002-10-31 | Alan Berry | Process for production of n-glucosamine |
| CA2313836C (en) | 2000-03-15 | 2009-06-09 | Cargill, Incorporated | Chitosan and method of preparing chitosan |
| US6693188B2 (en) | 2001-08-08 | 2004-02-17 | Cargill Incorporated | N-acetyl-D-glucosamine and process for producing N-acetyl-D-glucosamine |
| US7816514B2 (en) * | 2001-02-16 | 2010-10-19 | Cargill, Incorporated | Glucosamine and method of making glucosamine from microbial biomass |
| US7923437B2 (en) * | 2001-02-16 | 2011-04-12 | Cargill, Incorporated | Water soluble β-glucan, glucosamine, and N-acetylglucosamine compositions and methods for making the same |
| US20020115639A1 (en) | 2001-02-16 | 2002-08-22 | Weiyu Fan | Glucosamine and method of making glucosamine from microbial blomass |
| US8222232B2 (en) * | 2001-02-16 | 2012-07-17 | Cargill, Incorporated | Glucosamine and N-acetylglucosamine compositions and methods of making the same fungal biomass |
| ES2336655T3 (en) * | 2001-07-06 | 2010-04-15 | Evonik Degussa Gmbh | PROCESS FOR THE PREPARATION OF L-AMINO ACIDS THAT USES ENVIRONMENTAL FAMILY STRAPS. |
| AU2003221828A1 (en) | 2002-04-02 | 2003-10-27 | Cargill, Inc. | Chitosan production |
| JP2006508643A (en) * | 2002-07-01 | 2006-03-16 | アーキオン ライフ サイエンシーズ エルエルシー ディー/ビー/エー バイオ−テクニカル リソーセズ ディビジョン | Processes and materials for the production of glucosamine and N-acetylglucosamine |
| PA8576201A1 (en) | 2002-07-10 | 2004-05-26 | Pfizer Prod Inc | PHARMACEUTICAL COMPOSITION THAT HAS A DISTRIBUTION AND UNIFORM POWER OF FARMACO |
| MXPA05003334A (en) * | 2002-09-30 | 2005-07-05 | Archer Daniels Midland Co | Cell-free production of glucosamine. |
| US20050239173A1 (en) * | 2003-12-08 | 2005-10-27 | Cargill, Inc. | Production of amino sugars |
| US20070212764A1 (en) * | 2005-01-19 | 2007-09-13 | Ptitsyn Leonid R | Method for producing l-amino acids using bacterium of the enterobacteriaceae family |
| JP5188960B2 (en) * | 2005-05-13 | 2013-04-24 | ネステク ソシエテ アノニム | Production of glucosamine from plant species |
| PT1890709E (en) * | 2005-05-13 | 2012-02-06 | Nestec Sa | PRODUCTION OF GLUCOSAMINE FROM PLANT SPECIES |
| WO2008111507A1 (en) * | 2007-03-09 | 2008-09-18 | Kyowa Hakko Bio Co., Ltd. | Modified glucosamine-6-phosphate synthase |
| WO2008119009A2 (en) * | 2007-03-27 | 2008-10-02 | University Of Florida Research Foundation, Inc. | Materials and methods for efficient alanine production |
| BRPI0811747A8 (en) | 2007-05-17 | 2014-10-14 | Univ Illinois | METHODS AND COMPOSITIONS FOR THE PRODUCTION OF SOLVENTS |
| JP5424604B2 (en) * | 2008-09-30 | 2014-02-26 | 協和発酵バイオ株式会社 | Glucosamine production method |
| CN103429731B (en) | 2008-10-20 | 2017-12-26 | Cj第一制糖株式会社 | Corynebacteria microorganism belonging to genus with production N acetylglucosamine abilities and the method with its production N acetylglucosamine or aminoglucose |
| US9267141B2 (en) | 2009-04-24 | 2016-02-23 | The Johns Hopkins University | Conversion of chitin into N-acetylglucosamine, glucosamine and bioethanol |
| US20120329135A1 (en) | 2011-06-23 | 2012-12-27 | Agrinos AS | Process for Making Chitin Derivatives |
| CN105683207B (en) | 2013-11-05 | 2019-07-23 | 雀巢产品有限公司 | Glucosamine production from plant material |
| CN104988196A (en) * | 2015-07-30 | 2015-10-21 | 安徽丰原发酵技术工程研究有限公司 | Fermentation production method for N-acetylglucosamine |
| CN107267579B (en) * | 2016-04-05 | 2020-08-21 | 孙镧 | Method for producing N-acetyl-D-glucosamine and/or D-glucosamine salt by microbial fermentation |
| WO2017174038A1 (en) * | 2016-04-05 | 2017-10-12 | 孙镧 | Method for producing n-acetyl-d-glucosamine and/or d-glucosamine salt by microbial fermentation |
| WO2017174040A1 (en) * | 2016-04-05 | 2017-10-12 | 孙镧 | Method for producing n-acetyl-d-glucosamine and/or d-glucosamine salt by microbial fermentation |
| CN108070628B (en) * | 2016-11-14 | 2021-08-24 | 中国科学院微生物研究所 | A method of producing glucosamine |
| EP3441473A4 (en) * | 2017-04-05 | 2019-10-09 | Sun, Lan | PROCESS FOR THE PRODUCTION OF N-ACETYL-D-GLUCOSAMINE AND / OR D-GLUCOSAMINE HYDROCHLORIDE BY MICROBIAL FERMENTATION |
| EP3473644A1 (en) * | 2017-10-17 | 2019-04-24 | Jennewein Biotechnologie GmbH | Fermentative production of n-acetylneuraminic acid |
| WO2019158541A1 (en) | 2018-02-14 | 2019-08-22 | Frieslandcampina Nederland B.V. | Nutritional compositions for musculoskeletal support for athletes |
| CN108410783B (en) * | 2018-05-29 | 2020-11-03 | 成都本则生科技有限公司 | Method for producing glucosamine by culturing escherichia coli and fermenting |
| CN110714042A (en) | 2018-07-13 | 2020-01-21 | 中国科学院天津工业生物技术研究所 | Enzymatic preparation of glucosamine |
| CN110577977A (en) * | 2019-10-09 | 2019-12-17 | 山东润德生物科技有限公司 | Method for producing glucosamine by fermenting escherichia coli |
| IT202100030935A1 (en) | 2021-12-09 | 2023-06-09 | Fidia Farm Spa | TABLETS INCLUDING CHONDROITIN SULFATE, GLUCOSAMINE HYDROCHLORIDE AND VITAMIN C FOR THE TREATMENT OF CARTILAGE, TENDON AND BONE DAMAGE |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0653434A1 (en) * | 1993-11-12 | 1995-05-17 | Tanabe Seiyaku Co., Ltd. | Glucosamine derivative, process for preparing the same and synthetic intermediate thereof |
| US6005099A (en) * | 1993-11-17 | 1999-12-21 | Laboratoires Om S.A. | Glucosamine disaccharides, method for their preparation, pharmaceutical composition comprising same, and their use |
| JP2769992B2 (en) * | 1995-04-25 | 1998-06-25 | 工業技術院長 | Glucosamine-6-phosphate deaminase |
| CA2187386C (en) * | 1995-10-27 | 2007-01-23 | Kaoru Takegawa | Gene encoding endo-.beta.-n-acetylglucosaminidase a |
| US5998173A (en) * | 1996-02-20 | 1999-12-07 | The University Of Bristish Columbia | Process for producing N-acetyl-D-glucosamine |
| JP2824508B2 (en) * | 1996-03-04 | 1998-11-11 | 工業技術院長 | N-acetylglucosamine 6-phosphate deacetylase |
| US6372457B1 (en) * | 1997-01-14 | 2002-04-16 | Arkion Life Sciences Llc | Process and materials for production of glucosamine |
| US20020160459A1 (en) * | 1997-01-14 | 2002-10-31 | Alan Berry | Process for production of n-glucosamine |
-
1998
- 1998-07-15 US US09/115,475 patent/US6372457B1/en not_active Expired - Lifetime
-
1999
- 1999-07-15 CA CA2332380A patent/CA2332380C/en not_active Expired - Lifetime
- 1999-07-15 JP JP2000560279A patent/JP4567887B2/en not_active Expired - Lifetime
- 1999-07-15 AU AU51028/99A patent/AU5102899A/en not_active Abandoned
- 1999-07-15 AT AT99935577T patent/ATE414162T1/en not_active IP Right Cessation
- 1999-07-15 EP EP99935577A patent/EP1095158B1/en not_active Expired - Lifetime
- 1999-07-15 CN CNB998100870A patent/CN100491524C/en not_active Expired - Lifetime
- 1999-07-15 DE DE69939901T patent/DE69939901D1/en not_active Expired - Lifetime
- 1999-07-15 WO PCT/US1999/015976 patent/WO2000004182A1/en not_active Ceased
-
2001
- 2001-12-17 US US10/024,460 patent/US20030044939A1/en not_active Abandoned
-
2005
- 2005-10-05 US US11/245,473 patent/US20060094085A1/en not_active Abandoned
-
2007
- 2007-06-22 US US11/767,394 patent/US20090017520A1/en not_active Abandoned
-
2010
- 2010-05-12 JP JP2010110574A patent/JP2010166934A/en not_active Withdrawn
Also Published As
| Publication number | Publication date |
|---|---|
| JP2010166934A (en) | 2010-08-05 |
| EP1095158B1 (en) | 2008-11-12 |
| US6372457B1 (en) | 2002-04-16 |
| US20030044939A1 (en) | 2003-03-06 |
| JP2002520067A (en) | 2002-07-09 |
| ATE414162T1 (en) | 2008-11-15 |
| WO2000004182A1 (en) | 2000-01-27 |
| CN100491524C (en) | 2009-05-27 |
| US20090017520A1 (en) | 2009-01-15 |
| JP4567887B2 (en) | 2010-10-20 |
| DE69939901D1 (en) | 2008-12-24 |
| CA2332380A1 (en) | 2000-01-27 |
| EP1095158A4 (en) | 2005-03-02 |
| US20060094085A1 (en) | 2006-05-04 |
| EP1095158A1 (en) | 2001-05-02 |
| AU5102899A (en) | 2000-02-07 |
| CN1314951A (en) | 2001-09-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA2332380C (en) | Process and materials for production of glucosamine | |
| KR102823919B1 (en) | Fermentative production of sialylated saccharides | |
| Alting-Mees et al. | [42] pBluescriptII: Multifunctional cloning and mapping vectors | |
| CN109777761B (en) | Construction and application of an engineered bacterium that secretes and expresses chitobiose deacetylase | |
| US20020160459A1 (en) | Process for production of n-glucosamine | |
| CN113862235B (en) | Chimeric enzyme and application and method thereof in one-step reaction synthesis of Cap0 mRNA in vitro | |
| CN107604004A (en) | Tracer target practice plasmid for vaccinia virus Tiantan strain TK genes and preparation method thereof | |
| CN110607267B (en) | A balanced lethal system, construction method and application of Listeria ovum | |
| CN102002509B (en) | Escherichia coli-bacillus subtilis shuttle expression vector and application thereof | |
| CN108718529B (en) | Mutant microorganism for producing L-cysteine and method for producing L-cysteine using the same | |
| WO1992017581A1 (en) | Mammalian expression vector | |
| US5700665A (en) | Method for the extraction of periplasmic proteins from prokaryotic microorganisms in the presence of arginine | |
| CN114907997B (en) | Construction and application of diosgenin synthetic strain | |
| CN114164225B (en) | High-throughput screening tool for enabling escherichia coli to obtain effective NHEJ system and application of high-throughput screening tool | |
| DK3022290T3 (en) | MODIFIED ALGE STREAMS AND PROCEDURE FOR TRIACYLGYCLEROL ACCUMULATION WITH USING THESE STRAINS | |
| MXPA01000490A (en) | Process and materials for production of glucosamine | |
| CN114277047B (en) | Application of high-throughput screening tool for obtaining effective NHEJ system from escherichia coli in escherichia coli gene editing | |
| KR102454110B1 (en) | Recombinant plasmids and mutant strains for screening inhibitors of ppGpp biosynthesis-related gene expression | |
| CN114921394A (en) | Recombinant sheep listeria and use method thereof | |
| CN100429309C (en) | A 100bp gradient ribonucleic acid molecular weight marker and its preparation | |
| CN111206024B (en) | Engineering bacterium for expressing pectate endo-hydrolase and application thereof | |
| CN114574467B (en) | Gene expression regulation and control system and application thereof | |
| RU2819876C2 (en) | Enzymatic production of sialylated saccharides | |
| CN107557383A (en) | A kind of Cruciferae endogenous gene silencing methods of TYMV virus inductions and its application | |
| CN102653739A (en) | Method for producing inulin fructose transferase by use of yeast expression system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| EEER | Examination request | ||
| MKEX | Expiry |
Effective date: 20190715 |