MXPA00005478A - Method for the production of glycerol by recombinant organisms - Google Patents
Method for the production of glycerol by recombinant organismsInfo
- Publication number
- MXPA00005478A MXPA00005478A MXPA/A/2000/005478A MXPA00005478A MXPA00005478A MX PA00005478 A MXPA00005478 A MX PA00005478A MX PA00005478 A MXPA00005478 A MX PA00005478A MX PA00005478 A MXPA00005478 A MX PA00005478A
- Authority
- MX
- Mexico
- Prior art keywords
- glycerol
- leu
- gene
- gly
- val
- Prior art date
Links
- PEDCQBHIVMGVHV-UHFFFAOYSA-N glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 title claims abstract description 544
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 92
- 230000000694 effects Effects 0.000 claims abstract description 80
- 108010041921 Glycerolphosphate Dehydrogenase Proteins 0.000 claims abstract description 56
- 102000000587 Glycerolphosphate Dehydrogenase Human genes 0.000 claims abstract description 56
- OKTJSMMVPCPJKN-UHFFFAOYSA-N carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims abstract description 44
- 229910052799 carbon Inorganic materials 0.000 claims abstract description 43
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 43
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 43
- 239000000758 substrate Substances 0.000 claims abstract description 36
- 108091000071 Glycerol Kinase Proteins 0.000 claims abstract description 33
- 102000002387 Glycerol kinases Human genes 0.000 claims abstract description 33
- 230000014509 gene expression Effects 0.000 claims description 55
- YPFDHNVEDLHUCE-UHFFFAOYSA-N 1,3-Propanediol Chemical compound OCCCO YPFDHNVEDLHUCE-UHFFFAOYSA-N 0.000 claims description 54
- 241000588724 Escherichia coli Species 0.000 claims description 48
- 102000004190 Enzymes Human genes 0.000 claims description 43
- 108090000790 Enzymes Proteins 0.000 claims description 43
- 239000008103 glucose Substances 0.000 claims description 40
- 150000001413 amino acids Chemical class 0.000 claims description 39
- WQZGKKKJIJFFOK-GASJEMHNSA-N D-Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 37
- AWUCVROLDVIAJX-UHFFFAOYSA-N Glycerol 3-phosphate Chemical compound OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 claims description 29
- 229920000166 polytrimethylene carbonate Polymers 0.000 claims description 27
- 239000010452 phosphate Substances 0.000 claims description 21
- 102000004316 Oxidoreductases Human genes 0.000 claims description 19
- 108090000854 Oxidoreductases Proteins 0.000 claims description 19
- 229920001184 polypeptide Polymers 0.000 claims description 16
- 230000001131 transforming Effects 0.000 claims description 15
- 244000005700 microbiome Species 0.000 claims description 14
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 12
- 150000004676 glycans Polymers 0.000 claims description 9
- 150000002772 monosaccharides Chemical class 0.000 claims description 9
- 229920001282 polysaccharide Polymers 0.000 claims description 9
- 239000005017 polysaccharide Substances 0.000 claims description 9
- 150000004804 polysaccharides Polymers 0.000 claims description 9
- 241000235070 Saccharomyces Species 0.000 claims description 8
- NBIIXXVUZAFLBC-UHFFFAOYSA-K [O-]P([O-])([O-])=O Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 claims description 8
- 102000030951 Phosphotransferases Human genes 0.000 claims description 7
- 108091000081 Phosphotransferases Proteins 0.000 claims description 7
- 229920001542 oligosaccharide Polymers 0.000 claims description 7
- 150000002482 oligosaccharides Polymers 0.000 claims description 7
- 241000186660 Lactobacillus Species 0.000 claims description 6
- 229940039696 Lactobacillus Drugs 0.000 claims description 6
- 238000006467 substitution reaction Methods 0.000 claims description 6
- 241000222120 Candida <Saccharomycetales> Species 0.000 claims description 5
- 241000588914 Enterobacter Species 0.000 claims description 5
- 241000235648 Pichia Species 0.000 claims description 5
- 241000193403 Clostridium Species 0.000 claims description 4
- 241000588722 Escherichia Species 0.000 claims description 4
- 241000588748 Klebsiella Species 0.000 claims description 4
- 229910052684 Cerium Inorganic materials 0.000 claims description 3
- 101700060831 PGP Proteins 0.000 claims description 3
- 102100001725 PGP Human genes 0.000 claims description 3
- 108010065027 Propanediol Dehydratase Proteins 0.000 claims description 3
- 241000607142 Salmonella Species 0.000 claims description 3
- 206010039447 Salmonellosis Diseases 0.000 claims description 3
- GWXLDORMOJMVQZ-UHFFFAOYSA-N cerium Chemical compound [Ce] GWXLDORMOJMVQZ-UHFFFAOYSA-N 0.000 claims description 3
- 150000002009 diols Chemical class 0.000 claims description 3
- 125000003976 glyceryl group Chemical group [H]C([*])([H])C(O[H])([H])C(O[H])([H])[H] 0.000 claims description 3
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 2
- 241000235035 Debaryomyces Species 0.000 claims description 2
- 241000589350 Methylobacter Species 0.000 claims description 2
- 241000235395 Mucor Species 0.000 claims description 2
- 241000863392 Pelobacter Species 0.000 claims description 2
- 241000589516 Pseudomonas Species 0.000 claims description 2
- 241000235346 Schizosaccharomyces Species 0.000 claims description 2
- 241000187747 Streptomyces Species 0.000 claims description 2
- 241000235017 Zygosaccharomyces Species 0.000 claims description 2
- 238000003780 insertion Methods 0.000 claims description 2
- 235000011187 glycerol Nutrition 0.000 claims 8
- 125000003275 alpha amino acid group Chemical group 0.000 claims 4
- 241000228212 Aspergillus Species 0.000 claims 1
- 241000233866 Fungi Species 0.000 claims 1
- 241000411968 Ilyobacter Species 0.000 claims 1
- 230000029087 digestion Effects 0.000 claims 1
- 108091022084 Glycerol dehydrogenases Proteins 0.000 abstract description 44
- 210000004027 cells Anatomy 0.000 description 73
- 150000007523 nucleic acids Chemical class 0.000 description 60
- 108020004707 nucleic acids Proteins 0.000 description 59
- 239000002585 base Substances 0.000 description 51
- 239000002609 media Substances 0.000 description 46
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 42
- 238000000034 method Methods 0.000 description 39
- 229960001031 Glucose Drugs 0.000 description 38
- 238000006243 chemical reaction Methods 0.000 description 38
- WQZGKKKJIJFFOK-VFUOTHLCSA-N β-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 38
- 101700033219 GPD1 Proteins 0.000 description 31
- 235000001014 amino acid Nutrition 0.000 description 29
- 235000018102 proteins Nutrition 0.000 description 28
- 101710025050 MK0970 Proteins 0.000 description 25
- 108010034529 leucyl-lysine Proteins 0.000 description 25
- 229940035437 1,3-propanediol Drugs 0.000 description 24
- 101700059607 DAR1 Proteins 0.000 description 23
- 238000000855 fermentation Methods 0.000 description 23
- 230000004151 fermentation Effects 0.000 description 23
- 239000000047 product Substances 0.000 description 21
- 101700013246 GK Proteins 0.000 description 19
- MTCFGRXMJLQNBG-REOHCLBHSA-N L-serine Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 19
- BAWFJGJZGIEFAR-NNYOXOHSSA-N Nicotinamide adenine dinucleotide Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-N 0.000 description 19
- 125000000267 glycino group Chemical group [H]N([*])C([H])([H])C(=O)O[H] 0.000 description 19
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 18
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 18
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 18
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 17
- 101710004291 egsA Proteins 0.000 description 17
- 101700066745 gldA Proteins 0.000 description 17
- 108010050848 glycylleucine Proteins 0.000 description 17
- 101710042253 GK2 Proteins 0.000 description 16
- 229940052665 NADH Drugs 0.000 description 16
- 108010073969 valyllysine Proteins 0.000 description 16
- 229960005091 Chloramphenicol Drugs 0.000 description 15
- WIIZWVCIJKGZOK-RKDXNWHRSA-N Chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 15
- 108010057821 leucylproline Proteins 0.000 description 15
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 14
- 239000006137 Luria-Bertani broth Substances 0.000 description 14
- 238000011161 development Methods 0.000 description 14
- 230000018109 developmental process Effects 0.000 description 14
- -1 for example Substances 0.000 description 13
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 12
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 12
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 12
- 230000002255 enzymatic Effects 0.000 description 12
- OKKJLVBELUTLKV-UHFFFAOYSA-N methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 12
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 11
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 11
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 11
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 11
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 11
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 11
- 108010093581 aspartyl-proline Proteins 0.000 description 11
- 230000001419 dependent Effects 0.000 description 11
- 108010015792 glycyllysine Proteins 0.000 description 11
- 235000021317 phosphate Nutrition 0.000 description 11
- 108010061238 threonyl-glycine Proteins 0.000 description 11
- GNGACRATGGDKBX-UHFFFAOYSA-N Dihydroxyacetone phosphate Chemical compound OCC(=O)COP(O)(O)=O GNGACRATGGDKBX-UHFFFAOYSA-N 0.000 description 10
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 10
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 10
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 10
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 10
- 239000003795 chemical substances by application Substances 0.000 description 10
- 150000001875 compounds Chemical class 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- ZKHQWZAMYRWXGA-KQYNXXCUSA-N Adenosine triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-N 0.000 description 9
- HXWUJJADFMXNKA-UHFFFAOYSA-N Asparaginyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CC(N)=O HXWUJJADFMXNKA-UHFFFAOYSA-N 0.000 description 9
- 101700013235 GPD2 Proteins 0.000 description 9
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 9
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 9
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 9
- VBKBDLMWICBSCY-IMJSIDKUSA-N Ser-Asp Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O VBKBDLMWICBSCY-IMJSIDKUSA-N 0.000 description 9
- ZSXJENBJGRHKIG-UHFFFAOYSA-N Tyrosyl-Serine Chemical compound OCC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UHFFFAOYSA-N 0.000 description 9
- 238000004166 bioassay Methods 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108010037850 glycylvaline Proteins 0.000 description 9
- 239000000203 mixture Substances 0.000 description 9
- 108010090894 prolylleucine Proteins 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- GJSURZIOUXUGAL-UHFFFAOYSA-N 2-((2,6-Dichlorophenyl)imino)imidazolidine Chemical compound ClC1=CC=CC(Cl)=C1NC1=NCCN1 GJSURZIOUXUGAL-UHFFFAOYSA-N 0.000 description 8
- 210000000349 Chromosomes Anatomy 0.000 description 8
- BJHIKXHVCXFQLS-UYFOZJQFSA-N Fructose Natural products OC[C@@H](O)[C@@H](O)[C@H](O)C(=O)CO BJHIKXHVCXFQLS-UYFOZJQFSA-N 0.000 description 8
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 8
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 8
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 8
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 8
- HWMGTNOVUDIKRE-UWVGGRQHSA-N Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 HWMGTNOVUDIKRE-UWVGGRQHSA-N 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 238000010276 construction Methods 0.000 description 8
- 238000003379 elimination reaction Methods 0.000 description 8
- TUTIHHSZKFBMHM-UHFFFAOYSA-N 4-amino-5-[(3-amino-1-carboxy-3-oxopropyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(CC(N)=O)C(O)=O TUTIHHSZKFBMHM-UHFFFAOYSA-N 0.000 description 7
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 7
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 7
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 7
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 7
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 7
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 7
- WCRFXRIWBFRZBR-GGVZMXCHSA-N Thr-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WCRFXRIWBFRZBR-GGVZMXCHSA-N 0.000 description 7
- CKHWEVXPLJBEOZ-UHFFFAOYSA-N Threoninyl-Valine Chemical compound CC(C)C(C(O)=O)NC(=O)C(N)C(C)O CKHWEVXPLJBEOZ-UHFFFAOYSA-N 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 239000000499 gel Substances 0.000 description 7
- STKYPAFSDFAEPH-LURJTMIESA-N gly-val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 7
- 238000004128 high performance liquid chromatography Methods 0.000 description 7
- 108010009298 lysylglutamic acid Proteins 0.000 description 7
- 230000001105 regulatory Effects 0.000 description 7
- IOUPEELXVYPCPG-UHFFFAOYSA-N val-gly Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 7
- SITLTJHOQZFJGG-XPUUQOCRSA-N α-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 7
- XNSKSTRGQIPTSE-UHFFFAOYSA-N Arginyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CCCNC(N)=N XNSKSTRGQIPTSE-UHFFFAOYSA-N 0.000 description 6
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 6
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 6
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 6
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 6
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 6
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 6
- UKGGPJNBONZZCM-WDSKDSINSA-N Aspartyl-L-proline Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- 108020004705 Codon Proteins 0.000 description 6
- 101700077246 EHMT1 Proteins 0.000 description 6
- 101700001004 GPP1 Proteins 0.000 description 6
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 6
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 6
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 6
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 6
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 6
- OAPNERBWQWUPTI-YUMQZZPRSA-N Lys-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O OAPNERBWQWUPTI-YUMQZZPRSA-N 0.000 description 6
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 6
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 6
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 6
- 229920001850 Nucleic acid sequence Polymers 0.000 description 6
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 6
- GVUVRRPYYDHHGK-UHFFFAOYSA-N Prolyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C1CCCN1 GVUVRRPYYDHHGK-UHFFFAOYSA-N 0.000 description 6
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 6
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 6
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 6
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 6
- YKRQRPFODDJQTC-UHFFFAOYSA-N Threoninyl-Lysine Chemical compound CC(O)C(N)C(=O)NC(C(O)=O)CCCCN YKRQRPFODDJQTC-UHFFFAOYSA-N 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 230000002759 chromosomal Effects 0.000 description 6
- 108010016616 cysteinylglycine Proteins 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- 101700050947 glpQ Proteins 0.000 description 6
- 239000000543 intermediate Substances 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 239000000463 material Substances 0.000 description 6
- 108010079317 prolyl-tyrosine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N 4-amino-5-[(1-carboxy-2-phenylethyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 5
- SIFXMYAHXJGAFC-WDSKDSINSA-N Arg-Asp Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O SIFXMYAHXJGAFC-WDSKDSINSA-N 0.000 description 5
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 5
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 5
- JSLGXODUIAFWCF-UHFFFAOYSA-N Arginyl-Asparagine Chemical compound NC(N)=NCCCC(N)C(=O)NC(CC(N)=O)C(O)=O JSLGXODUIAFWCF-UHFFFAOYSA-N 0.000 description 5
- NALWOULWGHTVDA-UWVGGRQHSA-N Asp-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NALWOULWGHTVDA-UWVGGRQHSA-N 0.000 description 5
- RXKJFZQQPQGTFL-UHFFFAOYSA-N Dihydroxyacetone Chemical compound OCC(=O)CO RXKJFZQQPQGTFL-UHFFFAOYSA-N 0.000 description 5
- 229940120503 Dihydroxyacetone Drugs 0.000 description 5
- 101710008404 GAPDH Proteins 0.000 description 5
- 102100016606 GPD1 Human genes 0.000 description 5
- 101710010461 Gapdh1 Proteins 0.000 description 5
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 5
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 5
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 5
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 5
- ZUKPVRWZDMRIEO-VKHMYHEASA-N L-cysteinylglycine zwitterion Chemical compound SC[C@H]([NH3+])C(=O)NCC([O-])=O ZUKPVRWZDMRIEO-VKHMYHEASA-N 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 5
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 5
- RVKIPWVMZANZLI-ZFWWWQNUSA-N Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-ZFWWWQNUSA-N 0.000 description 5
- JXWLMUIXUXLIJR-QWRGUYRKSA-N Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JXWLMUIXUXLIJR-QWRGUYRKSA-N 0.000 description 5
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 5
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 5
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 5
- RZEQTVHJZCIUBT-UHFFFAOYSA-N Serinyl-Arginine Chemical compound OCC(N)C(=O)NC(C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-UHFFFAOYSA-N 0.000 description 5
- CUTPSEKWUPZFLV-UHFFFAOYSA-N Threoninyl-Cysteine Chemical compound CC(O)C(N)C(=O)NC(CS)C(O)=O CUTPSEKWUPZFLV-UHFFFAOYSA-N 0.000 description 5
- UBAQSAUDKMIEQZ-QWRGUYRKSA-N Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBAQSAUDKMIEQZ-QWRGUYRKSA-N 0.000 description 5
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 5
- STTYIMSDIYISRG-WDSKDSINSA-N Val-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(O)=O STTYIMSDIYISRG-WDSKDSINSA-N 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 230000003115 biocidal Effects 0.000 description 5
- 230000001086 cytosolic Effects 0.000 description 5
- 101710025070 gapdh-2 Proteins 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- 108010081551 glycylphenylalanine Proteins 0.000 description 5
- 108010092114 histidylphenylalanine Proteins 0.000 description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl β-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 5
- 230000004060 metabolic process Effects 0.000 description 5
- 108010005942 methionylglycine Proteins 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- QLROSWPKSBORFJ-BQBZGAKWSA-N pro glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 108010078580 tyrosylleucine Proteins 0.000 description 5
- LXJXRIRHZLFYRP-VKHMYHEASA-L (R)-2-Hydroxy-3-(phosphonooxy)-propanal Natural products O=C[C@H](O)COP([O-])([O-])=O LXJXRIRHZLFYRP-VKHMYHEASA-L 0.000 description 4
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 4
- VBKIFHUVGLOJKT-UHFFFAOYSA-N Asparaginyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CC(N)=O VBKIFHUVGLOJKT-UHFFFAOYSA-N 0.000 description 4
- VGRHZPNRCLAHQA-UHFFFAOYSA-N Aspartyl-Asparagine Chemical compound OC(=O)CC(N)C(=O)NC(CC(N)=O)C(O)=O VGRHZPNRCLAHQA-UHFFFAOYSA-N 0.000 description 4
- ZVDPYSVOZFINEE-UHFFFAOYSA-N Aspartyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CC(O)=O ZVDPYSVOZFINEE-UHFFFAOYSA-N 0.000 description 4
- SBJKKFFYIZUCET-JLAZNSOCSA-N Dehydro-L-ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(=O)C1=O SBJKKFFYIZUCET-JLAZNSOCSA-N 0.000 description 4
- 239000005715 Fructose Substances 0.000 description 4
- 102100006961 GPD2 Human genes 0.000 description 4
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 4
- FYYSIASRLDJUNP-WHFBIAKZSA-N Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FYYSIASRLDJUNP-WHFBIAKZSA-N 0.000 description 4
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 4
- SSHIXEILTLPAQT-UHFFFAOYSA-N Glutaminyl-Aspartate Chemical compound NC(=O)CCC(N)C(=O)NC(CC(O)=O)C(O)=O SSHIXEILTLPAQT-UHFFFAOYSA-N 0.000 description 4
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 4
- AJHCSUXXECOXOY-NSHDSACASA-N Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-NSHDSACASA-N 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 4
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 4
- DVCSNHXRZUVYAM-BQBZGAKWSA-N Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 4
- HIZYETOZLYFUFF-BQBZGAKWSA-N Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O HIZYETOZLYFUFF-BQBZGAKWSA-N 0.000 description 4
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 4
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 4
- 230000036740 Metabolism Effects 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- OHUXOEXBXPZKPT-STQMWFEESA-N Phe-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 OHUXOEXBXPZKPT-STQMWFEESA-N 0.000 description 4
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 4
- JMCOUWKXLXDERB-WMZOPIPTSA-N Phe-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 JMCOUWKXLXDERB-WMZOPIPTSA-N 0.000 description 4
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 4
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 description 4
- UJTZHGHXJKIAOS-WHFBIAKZSA-N Ser-Gln Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O UJTZHGHXJKIAOS-WHFBIAKZSA-N 0.000 description 4
- SBMNPABNWKXNBJ-UHFFFAOYSA-N Serinyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CO SBMNPABNWKXNBJ-UHFFFAOYSA-N 0.000 description 4
- LDEBVRIURYMKQS-UHFFFAOYSA-N Serinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CO LDEBVRIURYMKQS-UHFFFAOYSA-N 0.000 description 4
- LSNNMFCWUKXFEE-UHFFFAOYSA-L Sulphite Chemical class [O-]S([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-L 0.000 description 4
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 4
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 4
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 4
- UYKREHOKELZSPB-JTQLQIEISA-N Trp-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(O)=O)=CNC2=C1 UYKREHOKELZSPB-JTQLQIEISA-N 0.000 description 4
- LYMVXFSTACVOLP-ZFWWWQNUSA-N Trp-Leu Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 LYMVXFSTACVOLP-ZFWWWQNUSA-N 0.000 description 4
- WITCOKQIPFWQQD-FSPLSTOPSA-N Val-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O WITCOKQIPFWQQD-FSPLSTOPSA-N 0.000 description 4
- GVRKWABULJAONN-UHFFFAOYSA-N Valyl-Threonine Chemical compound CC(C)C(N)C(=O)NC(C(C)O)C(O)=O GVRKWABULJAONN-UHFFFAOYSA-N 0.000 description 4
- IKHGUXGNUITLKF-UHFFFAOYSA-N acetaldehyde Chemical compound CC=O IKHGUXGNUITLKF-UHFFFAOYSA-N 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 235000020669 docosahexaenoic acid Nutrition 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 238000007429 general method Methods 0.000 description 4
- 230000002068 genetic Effects 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010084389 glycyltryptophan Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 238000011081 inoculation Methods 0.000 description 4
- 229910052816 inorganic phosphate Inorganic materials 0.000 description 4
- 150000002500 ions Chemical class 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 230000035786 metabolism Effects 0.000 description 4
- 108010068488 methionylphenylalanine Proteins 0.000 description 4
- 108010018625 phenylalanylarginine Proteins 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 4
- 101700054060 phoA Proteins 0.000 description 4
- 101710016966 phoA2 Proteins 0.000 description 4
- BWHMMNNQKKPAPP-UHFFFAOYSA-L potassium carbonate Chemical compound [K+].[K+].[O-]C([O-])=O BWHMMNNQKKPAPP-UHFFFAOYSA-L 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010038745 tryptophylglycine Proteins 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- UKKNTTCNGZLJEX-UHFFFAOYSA-N γ-glutamyl-Serine Chemical compound NC(=O)CCC(N)C(=O)NC(CO)C(O)=O UKKNTTCNGZLJEX-UHFFFAOYSA-N 0.000 description 4
- LQJAALCCPOTJGB-YUMQZZPRSA-N (2S)-1-[(2S)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O LQJAALCCPOTJGB-YUMQZZPRSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]-3-hydroxypropanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N (2S)-4-amino-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]-4-oxobutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- 238000001644 13C nuclear magnetic resonance spectroscopy Methods 0.000 description 3
- BUXAPSQPMALTOY-UHFFFAOYSA-N 2-[(2-amino-3-sulfanylpropanoyl)amino]pentanedioic acid Chemical compound SCC(N)C(=O)NC(C(O)=O)CCC(O)=O BUXAPSQPMALTOY-UHFFFAOYSA-N 0.000 description 3
- XUUXCWCKKCZEAW-YFKPBYRVSA-N 2-[[(2S)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 3
- MPZWMIIOPAPAKE-UHFFFAOYSA-N 4-amino-5-[[1-carboxy-4-(diaminomethylideneamino)butyl]amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(C(O)=O)CCCN=C(N)N MPZWMIIOPAPAKE-UHFFFAOYSA-N 0.000 description 3
- FRHBOQMZUOWXQL-UHFFFAOYSA-L Ammonium ferric citrate Chemical compound [NH4+].[Fe+3].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O FRHBOQMZUOWXQL-UHFFFAOYSA-L 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N Ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- XTWSWDJMIKUJDQ-RYUDHWBXSA-N Arg-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XTWSWDJMIKUJDQ-RYUDHWBXSA-N 0.000 description 3
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 3
- QCWJKJLNCFEVPQ-WHFBIAKZSA-N Asn-Gln Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O QCWJKJLNCFEVPQ-WHFBIAKZSA-N 0.000 description 3
- CPMKYMGGYUFOHS-FSPLSTOPSA-N Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O CPMKYMGGYUFOHS-FSPLSTOPSA-N 0.000 description 3
- NTQDELBZOMWXRS-UHFFFAOYSA-N Aspartyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CC(O)=O NTQDELBZOMWXRS-UHFFFAOYSA-N 0.000 description 3
- 241001489124 Boletus edulis Species 0.000 description 3
- YXQDRIRSAHTJKM-IMJSIDKUSA-N Cys-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YXQDRIRSAHTJKM-IMJSIDKUSA-N 0.000 description 3
- OELDIVRKHTYFNG-UHFFFAOYSA-N Cysteinyl-Valine Chemical compound CC(C)C(C(O)=O)NC(=O)C(N)CS OELDIVRKHTYFNG-UHFFFAOYSA-N 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N D-sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 3
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 3
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 3
- LLEUXCDZPQOJMY-AAEUAGOBSA-N Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 LLEUXCDZPQOJMY-AAEUAGOBSA-N 0.000 description 3
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 3
- MNQZXJOMYWMBOU-UHFFFAOYSA-N Glyceraldehyde Chemical compound OCC(O)C=O MNQZXJOMYWMBOU-UHFFFAOYSA-N 0.000 description 3
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 3
- 101700004979 IRX10 Proteins 0.000 description 3
- 101710010416 IRX10L Proteins 0.000 description 3
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 3
- 125000000998 L-alanino group Chemical group [H]N([*])[C@](C([H])([H])[H])([H])C(=O)O[H] 0.000 description 3
- VYZAGTDAHUIRQA-WHFBIAKZSA-N L-alanyl-L-glutamic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O VYZAGTDAHUIRQA-WHFBIAKZSA-N 0.000 description 3
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 3
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 3
- BQVUABVGYYSDCJ-ZFWWWQNUSA-N Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-ZFWWWQNUSA-N 0.000 description 3
- JYOAXOMPIXKMKK-UHFFFAOYSA-N Leucyl-Glutamine Chemical compound CC(C)CC(N)C(=O)NC(C(O)=O)CCC(N)=O JYOAXOMPIXKMKK-UHFFFAOYSA-N 0.000 description 3
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 3
- QTZXSYBVOSXBEJ-WDSKDSINSA-N Met-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O QTZXSYBVOSXBEJ-WDSKDSINSA-N 0.000 description 3
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 3
- HGCNKOLVKRAVHD-RYUDHWBXSA-N Met-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-RYUDHWBXSA-N 0.000 description 3
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 3
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 3
- XJLXINKUBYWONI-NNYOXOHSSA-N Nicotinamide adenine dinucleotide phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-N 0.000 description 3
- AQSJGOWTSHOLKH-UHFFFAOYSA-N Phosphite Chemical compound [O-]P([O-])[O-] AQSJGOWTSHOLKH-UHFFFAOYSA-N 0.000 description 3
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 3
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 3
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Tris Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- DZHDVYLBNKMLMB-ZFWWWQNUSA-N Trp-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 DZHDVYLBNKMLMB-ZFWWWQNUSA-N 0.000 description 3
- PDSLRCZINIDLMU-QWRGUYRKSA-N Tyr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PDSLRCZINIDLMU-QWRGUYRKSA-N 0.000 description 3
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 3
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- QGZKDVFQNNGYKY-UHFFFAOYSA-O ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 230000001580 bacterial Effects 0.000 description 3
- CURLTUGMZLYLDI-UHFFFAOYSA-N carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 3
- 239000001569 carbon dioxide Substances 0.000 description 3
- 229910002092 carbon dioxide Inorganic materials 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000001816 cooling Methods 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 238000004817 gas chromatography Methods 0.000 description 3
- 108010025885 glycerol dehydratase Proteins 0.000 description 3
- 229910000041 hydrogen chloride Inorganic materials 0.000 description 3
- 230000000977 initiatory Effects 0.000 description 3
- 239000004313 iron ammonium citrate Substances 0.000 description 3
- 235000000011 iron ammonium citrate Nutrition 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010044655 lysylproline Proteins 0.000 description 3
- BDAGIHXWWSANSR-UHFFFAOYSA-M methanoate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 229950006238 nadide Drugs 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 230000003204 osmotic Effects 0.000 description 3
- 230000036961 partial Effects 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 230000003362 replicative Effects 0.000 description 3
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 235000013619 trace mineral Nutrition 0.000 description 3
- 239000011573 trace mineral Substances 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- VNYDHJARLHNEGA-RYUDHWBXSA-N (2S)-1-[(2S)-2-azaniumyl-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carboxylate Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 2
- LZDNBBYBDGBADK-KBPBESRZSA-N (2S)-2-[[(2S)-2-amino-3-methylbutanoyl]amino]-3-(1H-indol-3-yl)propanoic acid Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-KBPBESRZSA-N 0.000 description 2
- BVZABQIRMYTKCF-JSGCOSHPSA-N (2S)-2-[[(2S)-2-azaniumyl-3-(1H-indol-3-yl)propanoyl]amino]-4-methylsulfanylbutanoate Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 BVZABQIRMYTKCF-JSGCOSHPSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N (2S)-2-[[(2S,3R)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-3-hydroxybutanoyl]amino]propanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- PEDCQBHIVMGVHV-YHPVEMKJSA-N 1,1,2,3,3-pentadeuterio-1,2,3-trideuteriooxypropane Chemical compound [2H]OC([2H])([2H])C([2H])(O[2H])C([2H])([2H])O[2H] PEDCQBHIVMGVHV-YHPVEMKJSA-N 0.000 description 2
- AKXKFZDCRYJKTF-UHFFFAOYSA-N 3-hydroxypropanal Chemical compound OCCC=O AKXKFZDCRYJKTF-UHFFFAOYSA-N 0.000 description 2
- PABVKUJVLNMOJP-UHFFFAOYSA-N 4-amino-5-[(1-carboxy-2-sulfanylethyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(CS)C(O)=O PABVKUJVLNMOJP-UHFFFAOYSA-N 0.000 description 2
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 2
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 2
- RDIKFPRVLJLMER-BQBZGAKWSA-N Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)N RDIKFPRVLJLMER-BQBZGAKWSA-N 0.000 description 2
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 2
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 2
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N Ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 2
- PQBHGSGQZSOLIR-RYUDHWBXSA-N Arg-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PQBHGSGQZSOLIR-RYUDHWBXSA-N 0.000 description 2
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 2
- FYRVDDJMNISIKJ-UWVGGRQHSA-N Asn-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FYRVDDJMNISIKJ-UWVGGRQHSA-N 0.000 description 2
- DVUFTQLHHHJEMK-IMJSIDKUSA-N Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O DVUFTQLHHHJEMK-IMJSIDKUSA-N 0.000 description 2
- PSZNHSNIGMJYOZ-WDSKDSINSA-N Asp-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PSZNHSNIGMJYOZ-WDSKDSINSA-N 0.000 description 2
- HSPSXROIMXIJQW-BQBZGAKWSA-N Asp-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 HSPSXROIMXIJQW-BQBZGAKWSA-N 0.000 description 2
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 2
- SJUXYGVRSGTPMC-UHFFFAOYSA-N Asparaginyl-Alanine Chemical compound OC(=O)C(C)NC(=O)C(N)CC(N)=O SJUXYGVRSGTPMC-UHFFFAOYSA-N 0.000 description 2
- NPDLYUOYAGBHFB-UHFFFAOYSA-N Asparaginyl-Arginine Chemical compound NC(=O)CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N NPDLYUOYAGBHFB-UHFFFAOYSA-N 0.000 description 2
- IQTUDDBANZYMAR-UHFFFAOYSA-N Asparaginyl-Methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)CC(N)=O IQTUDDBANZYMAR-UHFFFAOYSA-N 0.000 description 2
- OMSMPWHEGLNQOD-UHFFFAOYSA-N Asparaginyl-Phenylalanine Chemical compound NC(=O)CC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 OMSMPWHEGLNQOD-UHFFFAOYSA-N 0.000 description 2
- DYDKXJWQCIVTMR-UHFFFAOYSA-N Aspartyl-Methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)CC(O)=O DYDKXJWQCIVTMR-UHFFFAOYSA-N 0.000 description 2
- OWMVSZAMULFTJU-UHFFFAOYSA-N Bis-tris methane Chemical compound OCCN(CCO)C(CO)(CO)CO OWMVSZAMULFTJU-UHFFFAOYSA-N 0.000 description 2
- 229940041514 Candida albicans extract Drugs 0.000 description 2
- 108010051219 Cre recombinase Proteins 0.000 description 2
- RGTVXXNMOGHRAY-UHFFFAOYSA-N Cysteinyl-Arginine Chemical compound SCC(N)C(=O)NC(C(O)=O)CCCNC(N)=N RGTVXXNMOGHRAY-UHFFFAOYSA-N 0.000 description 2
- AYKQJQVWUYEZNU-UHFFFAOYSA-N Cysteinyl-Asparagine Chemical compound SCC(N)C(=O)NC(C(O)=O)CC(N)=O AYKQJQVWUYEZNU-UHFFFAOYSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N DEOXYTHYMIDINE Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- 101710044675 DHAPRD Proteins 0.000 description 2
- 241000721047 Danaus plexippus Species 0.000 description 2
- 229960002737 Fructose Drugs 0.000 description 2
- OWOFCNWTMWOOJJ-WDSKDSINSA-N Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OWOFCNWTMWOOJJ-WDSKDSINSA-N 0.000 description 2
- SXGAGTVDWKQYCX-BQBZGAKWSA-N Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SXGAGTVDWKQYCX-BQBZGAKWSA-N 0.000 description 2
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 description 2
- 102000003638 Glucose-6-Phosphatase Human genes 0.000 description 2
- 108010086800 Glucose-6-Phosphatase Proteins 0.000 description 2
- ARPVSMCNIDAQBO-UHFFFAOYSA-N Glutaminyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CCC(N)=O ARPVSMCNIDAQBO-UHFFFAOYSA-N 0.000 description 2
- VHLZDSUANXBJHW-UHFFFAOYSA-N Glutaminyl-Phenylalanine Chemical compound NC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 VHLZDSUANXBJHW-UHFFFAOYSA-N 0.000 description 2
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 2
- MFBYPDKTAJXHNI-VKHMYHEASA-N Gly-Cys Chemical compound [NH3+]CC(=O)N[C@@H](CS)C([O-])=O MFBYPDKTAJXHNI-VKHMYHEASA-N 0.000 description 2
- PNMUAGGSDZXTHX-BYPYZUCNSA-N Gly-Gln Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(N)=O PNMUAGGSDZXTHX-BYPYZUCNSA-N 0.000 description 2
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N HCl Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- VHOLZZKNEBBHTH-YUMQZZPRSA-N His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 VHOLZZKNEBBHTH-YUMQZZPRSA-N 0.000 description 2
- LYCVKHSJGDMDLM-LURJTMIESA-N His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 LYCVKHSJGDMDLM-LURJTMIESA-N 0.000 description 2
- VLDVBZICYBVQHB-IUCAKERBSA-N His-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 VLDVBZICYBVQHB-IUCAKERBSA-N 0.000 description 2
- 108010020056 Hydrogenase Proteins 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- JAQGKXUEKGKTKX-HOTGVXAUSA-N L-tyrosyl-L-tyrosine Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 JAQGKXUEKGKTKX-HOTGVXAUSA-N 0.000 description 2
- GUBGYTABKSRVRQ-UUNJERMWSA-N Lactose Natural products O([C@@H]1[C@H](O)[C@H](O)[C@H](O)O[C@@H]1CO)[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@H](CO)O1 GUBGYTABKSRVRQ-UUNJERMWSA-N 0.000 description 2
- 229960001375 Lactose Drugs 0.000 description 2
- NTISAKGPIGTIJJ-IUCAKERBSA-N Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C NTISAKGPIGTIJJ-IUCAKERBSA-N 0.000 description 2
- LHSGPCFBGJHPCY-STQMWFEESA-N Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-STQMWFEESA-N 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- XBZOQGHZGQLEQO-IUCAKERBSA-N Lys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN XBZOQGHZGQLEQO-IUCAKERBSA-N 0.000 description 2
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 2
- GUBGYTABKSRVRQ-YOLKTULGSA-N Maltose Natural products O([C@@H]1[C@H](O)[C@@H](O)[C@H](O)O[C@H]1CO)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 GUBGYTABKSRVRQ-YOLKTULGSA-N 0.000 description 2
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 2
- DZMGFGQBRYWJOR-YUMQZZPRSA-N Met-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O DZMGFGQBRYWJOR-YUMQZZPRSA-N 0.000 description 2
- XYVRXLDSCKEYES-JSGCOSHPSA-N Met-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 XYVRXLDSCKEYES-JSGCOSHPSA-N 0.000 description 2
- 230000035633 Metabolized Effects 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L MgCl2 Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N Oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 2
- 101710010904 PCBD2 Proteins 0.000 description 2
- OZILORBBPKKGRI-RYUDHWBXSA-N Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 OZILORBBPKKGRI-RYUDHWBXSA-N 0.000 description 2
- BXNGIHFNNNSEOS-UWVGGRQHSA-N Phe-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 BXNGIHFNNNSEOS-UWVGGRQHSA-N 0.000 description 2
- PYOHODCEOHCZBM-RYUDHWBXSA-N Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 PYOHODCEOHCZBM-RYUDHWBXSA-N 0.000 description 2
- GKZIWHRNKRBEOH-HOTGVXAUSA-N Phe-Phe Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C=CC=CC=1)C([O-])=O)C1=CC=CC=C1 GKZIWHRNKRBEOH-HOTGVXAUSA-N 0.000 description 2
- WEQJQNWXCSUVMA-RYUDHWBXSA-N Phe-Pro Chemical compound C([C@H]([NH3+])C(=O)N1[C@@H](CCC1)C([O-])=O)C1=CC=CC=C1 WEQJQNWXCSUVMA-RYUDHWBXSA-N 0.000 description 2
- KLAONOISLHWJEE-UHFFFAOYSA-N Phenylalanyl-Glutamine Chemical compound NC(=O)CCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 KLAONOISLHWJEE-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N Phenylalanyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- 206010036618 Premenstrual syndrome Diseases 0.000 description 2
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 2
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 2
- BEPSGCXDIVACBU-UHFFFAOYSA-N Prolyl-Histidine Chemical compound C1CCNC1C(=O)NC(C(=O)O)CC1=CN=CN1 BEPSGCXDIVACBU-UHFFFAOYSA-N 0.000 description 2
- 101710042047 SELENOO Proteins 0.000 description 2
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 description 2
- 229940081969 Saccharomyces cerevisiae Drugs 0.000 description 2
- PBUXMVYWOSKHMF-WDSKDSINSA-N Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO PBUXMVYWOSKHMF-WDSKDSINSA-N 0.000 description 2
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 2
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 2
- FFOKMZOAVHEWET-UHFFFAOYSA-N Serinyl-Cysteine Chemical compound OCC(N)C(=O)NC(CS)C(O)=O FFOKMZOAVHEWET-UHFFFAOYSA-N 0.000 description 2
- 241000588717 Shimwellia blattae Species 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 229920000978 Start codon Polymers 0.000 description 2
- 229960004793 Sucrose Drugs 0.000 description 2
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfizole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 101700003317 TPI1 Proteins 0.000 description 2
- IOWJRKAVLALBQB-IWGUZYHVSA-N Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O IOWJRKAVLALBQB-IWGUZYHVSA-N 0.000 description 2
- WXVIGTAUZBUDPZ-DTLFHODZSA-N Thr-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 WXVIGTAUZBUDPZ-DTLFHODZSA-N 0.000 description 2
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 2
- KAFKKRJQHOECGW-JCOFBHIZSA-N Thr-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(O)=O)=CNC2=C1 KAFKKRJQHOECGW-JCOFBHIZSA-N 0.000 description 2
- OHKOGUYZJXTSFX-KZFFXBSXSA-N Ticarcillin Chemical compound C=1([C@@H](C(O)=O)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)C=CSC=1 OHKOGUYZJXTSFX-KZFFXBSXSA-N 0.000 description 2
- 108020003073 Triose-phosphate isomerases Proteins 0.000 description 2
- 102000005924 Triose-phosphate isomerases Human genes 0.000 description 2
- IMMPMHKLUUZKAZ-WMZOPIPTSA-N Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 IMMPMHKLUUZKAZ-WMZOPIPTSA-N 0.000 description 2
- LWFWZRANSFAJDR-JSGCOSHPSA-N Trp-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 LWFWZRANSFAJDR-JSGCOSHPSA-N 0.000 description 2
- NLKUJNGEGZDXGO-XVKPBYJWSA-N Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLKUJNGEGZDXGO-XVKPBYJWSA-N 0.000 description 2
- ZQOOYCZQENFIMC-STQMWFEESA-N Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=C(O)C=C1 ZQOOYCZQENFIMC-STQMWFEESA-N 0.000 description 2
- AUEJLPRZGVVDNU-STQMWFEESA-N Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-STQMWFEESA-N 0.000 description 2
- CGWAPUBOXJWXMS-HOTGVXAUSA-N Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 CGWAPUBOXJWXMS-HOTGVXAUSA-N 0.000 description 2
- ONWMQORSVZYVNH-UHFFFAOYSA-N Tyrosyl-Asparagine Chemical compound NC(=O)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 ONWMQORSVZYVNH-UHFFFAOYSA-N 0.000 description 2
- MFEVVAXTBZELLL-UHFFFAOYSA-N Tyrosyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 MFEVVAXTBZELLL-UHFFFAOYSA-N 0.000 description 2
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 description 2
- WPSXZFTVLIAPCN-UHFFFAOYSA-N Valyl-Cysteine Chemical compound CC(C)C(N)C(=O)NC(CS)C(O)=O WPSXZFTVLIAPCN-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- VBJHPXDIVMXHJU-UHFFFAOYSA-N Zeocin Chemical compound N=1C(C=2SC=C(N=2)C(=O)NCCCCN=C(N)N)CSC=1CCNC(=O)C(C(O)C)NC(=O)C(C)C(O)C(C)NC(=O)C(C(OC1C(C(O)C(O)C(CO)O1)OC1C(C(OC(N)=O)C(O)C(CO)O1)O)C=1[N]C=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C VBJHPXDIVMXHJU-UHFFFAOYSA-N 0.000 description 2
- 108010084455 Zeocin Proteins 0.000 description 2
- GYQWVHROHDZXRS-UHFFFAOYSA-N [1,3-dihydroxy-2-(hydroxymethyl)propan-2-yl]azanium;carbonate Chemical compound OC(O)=O.OCC(N)(CO)CO.OCC(N)(CO)CO GYQWVHROHDZXRS-UHFFFAOYSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 101710025952 adhT Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- VHUUQVKOLVNVRT-UHFFFAOYSA-N ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 2
- 235000011114 ammonium hydroxide Nutrition 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 230000003466 anti-cipated Effects 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 230000001851 biosynthetic Effects 0.000 description 2
- RYYVLZVUVIJVGH-UHFFFAOYSA-N caffeine Chemical compound CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 2
- 230000001413 cellular Effects 0.000 description 2
- 239000003245 coal Substances 0.000 description 2
- 230000000875 corresponding Effects 0.000 description 2
- 239000002537 cosmetic Substances 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 235000019162 flavin adenine dinucleotide Nutrition 0.000 description 2
- WSFSSNUMVMOOMR-UHFFFAOYSA-N formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 2
- 230000002538 fungal Effects 0.000 description 2
- 101700037537 glpD Proteins 0.000 description 2
- 101700057982 glpD1 Proteins 0.000 description 2
- 101700083505 glpO Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010000361 glycerol 2-dehydrogenase (NADP+) Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 239000001963 growth media Substances 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 125000001165 hydrophobic group Chemical group 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- BAVYZALUXZFZLV-UHFFFAOYSA-N methylamine Chemical compound NC BAVYZALUXZFZLV-UHFFFAOYSA-N 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000006011 modification reaction Methods 0.000 description 2
- 230000000051 modifying Effects 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 239000003471 mutagenic agent Substances 0.000 description 2
- MYMOFIZGZYHOMD-UHFFFAOYSA-N oxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229920000314 poly p-methyl styrene Polymers 0.000 description 2
- 239000001184 potassium carbonate Substances 0.000 description 2
- 229910000027 potassium carbonate Inorganic materials 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- MTTKOBXFUGIESR-UHFFFAOYSA-N prop-1-ene-1,3-diol Chemical compound OCC=CO MTTKOBXFUGIESR-UHFFFAOYSA-N 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 101710023907 selenoo1 Proteins 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 230000002103 transcriptional Effects 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- OPINTGHFESTVAX-UHFFFAOYSA-N γ-glutamyl-Arginine Chemical compound NC(=O)CCC(N)C(=O)NC(C(O)=O)CCCNC(N)=N OPINTGHFESTVAX-UHFFFAOYSA-N 0.000 description 2
- SIGGQAHUPUBWNF-UHFFFAOYSA-N γ-glutamyl-Methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)CCC(N)=O SIGGQAHUPUBWNF-UHFFFAOYSA-N 0.000 description 2
- DNIAPMSPPWPWGF-GSVOUGTGSA-N (-)-propylene glycol Chemical compound C[C@@H](O)CO DNIAPMSPPWPWGF-GSVOUGTGSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N (2S)-1-[(2S)-1-[(2S)-pyrrolidin-1-ium-2-carbonyl]pyrrolidine-2-carbonyl]pyrrolidine-2-carboxylate Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N (2S)-1-[2-[[(2S)-2-azaniumyl-4-methylpentanoyl]amino]acetyl]pyrrolidine-2-carboxylate Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- FAQVCWVVIYYWRR-WHFBIAKZSA-N (2S)-2-[[(2S)-2,5-diamino-5-oxopentanoyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O FAQVCWVVIYYWRR-WHFBIAKZSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]butanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]butanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-3-phenylpropanoyl]amino]butanedioic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]butanedioic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]-3-hydroxypropanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N (2S)-2-[[(2S)-2-[[(2S)-2-aminopropanoyl]amino]propanoyl]amino]-3-hydroxypropanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- KYPMKDGKAYQCHO-RYUDHWBXSA-N (2S)-2-[[(2S)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-4-methylsulfanylbutanoic acid Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KYPMKDGKAYQCHO-RYUDHWBXSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N (2S)-5-amino-2-[[(2S,3R)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-3-hydroxybutanoyl]amino]-5-oxopentanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- QXRNAOYBCYVZCD-BQBZGAKWSA-N (2S)-6-amino-2-[[(2S)-2-aminopropanoyl]amino]hexanoic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN QXRNAOYBCYVZCD-BQBZGAKWSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N (3S)-3-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-(carboxymethylamino)-4-oxobutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N (3S)-4-[[(1S)-1-carboxy-2-(4-hydroxyphenyl)ethyl]amino]-3-[[(2S)-2,6-diaminohexanoyl]amino]-4-oxobutanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- UOQFZGVGGMHGEE-UHFFFAOYSA-N 1,1-dihydroxypropan-2-one Chemical compound CC(=O)C(O)O UOQFZGVGGMHGEE-UHFFFAOYSA-N 0.000 description 1
- AIVLSBRNZPHKEG-UHFFFAOYSA-N 2,3-dihydroxypropyl dihydrogen phosphite Chemical compound OCC(O)COP(O)O AIVLSBRNZPHKEG-UHFFFAOYSA-N 0.000 description 1
- ZKHFYORNAYYOTM-UHFFFAOYSA-N 2-(2,5-dichlorophenyl)cyclohexa-2,5-diene-1,4-dione Chemical compound ClC1=CC=C(Cl)C(C=2C(C=CC(=O)C=2)=O)=C1 ZKHFYORNAYYOTM-UHFFFAOYSA-N 0.000 description 1
- RFCVXVPWSPOMFJ-UHFFFAOYSA-N 2-[(2-azaniumyl-3-phenylpropanoyl)amino]-4-methylpentanoate Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 RFCVXVPWSPOMFJ-UHFFFAOYSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N 2-[[(2R)-2-[[(2S)-2-aminopropanoyl]amino]-3-sulfanylpropanoyl]amino]acetic acid Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N 2-[[(2S)-2-[[(2S)-2-azaniumyl-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]acetate Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N 2-[[2-[[(2S)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- MSWZFWKMSRAUBD-IVMDWMLBSA-N 2-amino-2-deoxy-D-glucopyranose Chemical compound N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O MSWZFWKMSRAUBD-IVMDWMLBSA-N 0.000 description 1
- AZKSAVLVSZKNRD-UHFFFAOYSA-M 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide Chemical compound [Br-].S1C(C)=C(C)N=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=CC=C1 AZKSAVLVSZKNRD-UHFFFAOYSA-M 0.000 description 1
- MGHKSHCBDXNTHX-UHFFFAOYSA-N 4-amino-5-[(4-amino-1-carboxy-4-oxobutyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(CCC(N)=O)C(O)=O MGHKSHCBDXNTHX-UHFFFAOYSA-N 0.000 description 1
- XTWYTFMLZFPYCI-KQYNXXCUSA-N 5'-adenylphosphoric acid Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XTWYTFMLZFPYCI-KQYNXXCUSA-N 0.000 description 1
- QYNUQALWYRSVHF-ABLWVSNPSA-N 5,10-Methylenetetrahydrofolate Chemical compound C1N2C=3C(=O)NC(N)=NC=3NCC2CN1C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 QYNUQALWYRSVHF-ABLWVSNPSA-N 0.000 description 1
- OIRDTQYFTABQOQ-GAWUUDPSSA-N 9-β-D-XYLOFURANOSYL-ADENINE Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@H](O)[C@H]1O OIRDTQYFTABQOQ-GAWUUDPSSA-N 0.000 description 1
- 102100000079 ACP4 Human genes 0.000 description 1
- 101700007462 ADH1A Proteins 0.000 description 1
- 102100000356 ADH1A Human genes 0.000 description 1
- 101700080270 ADH2 Proteins 0.000 description 1
- 102100006669 ALPP Human genes 0.000 description 1
- DZBUGLKDJFMEHC-UHFFFAOYSA-N Acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Natural products NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229960000643 Adenine Drugs 0.000 description 1
- OIRDTQYFTABQOQ-SXVXDFOESA-N Adenosine Natural products Nc1ncnc2c1ncn2[C@@H]3O[C@@H](CO)[C@H](O)[C@@H]3O OIRDTQYFTABQOQ-SXVXDFOESA-N 0.000 description 1
- HJCMDXDYPOUFDY-WHFBIAKZSA-N Ala-Gln Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O HJCMDXDYPOUFDY-WHFBIAKZSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N Alanyl-Arginine Chemical compound CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- YVBOZGOAVJZITM-UHFFFAOYSA-P Ammonium phosphomolybdate Chemical compound [NH4+].[NH4+].[NH4+].[NH4+].[O-]P([O-])=O.[O-][Mo]([O-])(=O)=O YVBOZGOAVJZITM-UHFFFAOYSA-P 0.000 description 1
- 235000019737 Animal fat Nutrition 0.000 description 1
- PMGDADKJMCOXHX-BQBZGAKWSA-N Arg-Gln Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PMGDADKJMCOXHX-BQBZGAKWSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-IUCAKERBSA-N Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N WYBVBIHNJWOLCJ-IUCAKERBSA-N 0.000 description 1
- DAQIJMOLTMGJLO-YUMQZZPRSA-N Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N DAQIJMOLTMGJLO-YUMQZZPRSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- BNODVYXZAAXSHW-UHFFFAOYSA-N Arginyl-Histidine Chemical compound NC(=N)NCCCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 BNODVYXZAAXSHW-UHFFFAOYSA-N 0.000 description 1
- FFMIYIMKQIMDPK-BQBZGAKWSA-N Asn-His Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 FFMIYIMKQIMDPK-BQBZGAKWSA-N 0.000 description 1
- 229960005261 Aspartic Acid Drugs 0.000 description 1
- FKBFDTRILNZGAI-UHFFFAOYSA-N Aspartyl-Cysteine Chemical compound OC(=O)CC(N)C(=O)NC(CS)C(O)=O FKBFDTRILNZGAI-UHFFFAOYSA-N 0.000 description 1
- 241000196312 Asteromonas gracilis Species 0.000 description 1
- 206010003664 Atrial septal defect Diseases 0.000 description 1
- 102100015648 BACE2 Human genes 0.000 description 1
- 101700051947 BACE2 Proteins 0.000 description 1
- 241000590572 Bia <butterfly> Species 0.000 description 1
- 241001474374 Blennius Species 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 101700016165 CTRC Proteins 0.000 description 1
- 241001342895 Chorus Species 0.000 description 1
- 229920002759 Circular DNA Polymers 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 241000588919 Citrobacter freundii Species 0.000 description 1
- 229920001405 Coding region Polymers 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- HAYVTMHUNMMXCV-UHFFFAOYSA-N Cysteinyl-Alanine Chemical compound OC(=O)C(C)NC(=O)C(N)CS HAYVTMHUNMMXCV-UHFFFAOYSA-N 0.000 description 1
- LVNMAAGSAUGNIC-UHFFFAOYSA-N Cysteinyl-Histidine Chemical compound SCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 LVNMAAGSAUGNIC-UHFFFAOYSA-N 0.000 description 1
- NXTYATMDWQYLGJ-UHFFFAOYSA-N Cysteinyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CS NXTYATMDWQYLGJ-UHFFFAOYSA-N 0.000 description 1
- WXOFKRKAHJQKLT-UHFFFAOYSA-N Cysteinyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CS WXOFKRKAHJQKLT-UHFFFAOYSA-N 0.000 description 1
- OOULJWDSSVOMHX-UHFFFAOYSA-N Cysteinyl-Methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)CS OOULJWDSSVOMHX-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N Cysteinyl-Phenylalanine Chemical compound SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- WYVKPHCYMTWUCW-UHFFFAOYSA-N Cysteinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CS WYVKPHCYMTWUCW-UHFFFAOYSA-N 0.000 description 1
- 229940104302 Cytosine Drugs 0.000 description 1
- OPTASPLRGRRNAP-UHFFFAOYSA-N Cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 1
- FNZLKVNUWIIPSJ-UHNVWZDZSA-N D-ribulose 5-phosphate Chemical compound OCC(=O)[C@H](O)[C@H](O)COP(O)(O)=O FNZLKVNUWIIPSJ-UHNVWZDZSA-N 0.000 description 1
- PPQRONHOSHZGFQ-VPENINKCSA-N D-xylose 5-phosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@H](O)[C@@H](O)C=O PPQRONHOSHZGFQ-VPENINKCSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-WUJLRWPWSA-N D-xylulose Chemical compound OC[C@@H](O)[C@H](O)C(=O)CO ZAQJHHRNXZUBTE-WUJLRWPWSA-N 0.000 description 1
- 108009000097 DNA Replication Proteins 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 101700011961 DPOM Proteins 0.000 description 1
- OJIYIVCMRYCWSE-UHFFFAOYSA-M Domiphen bromide Chemical compound [Br-].CCCCCCCCCCCC[N+](C)(C)CCOC1=CC=CC=C1 OJIYIVCMRYCWSE-UHFFFAOYSA-M 0.000 description 1
- 241001560459 Dunaliella sp. Species 0.000 description 1
- 108010015895 EC 2.7.1.29 Proteins 0.000 description 1
- 102000005731 EC 5.3.1.9 Human genes 0.000 description 1
- 108010070600 EC 5.3.1.9 Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000701867 Enterobacteria phage T7 Species 0.000 description 1
- 210000003414 Extremities Anatomy 0.000 description 1
- 102100006546 FBP2 Human genes 0.000 description 1
- FUHQFAMVYDIUKL-UHFFFAOYSA-N FOX-7 Chemical compound NC(N)=C([N+]([O-])=O)[N+]([O-])=O FUHQFAMVYDIUKL-UHFFFAOYSA-N 0.000 description 1
- 102000012195 Fructose-1,6-bisphosphatase Human genes 0.000 description 1
- 108010017464 Fructose-Bisphosphatase Proteins 0.000 description 1
- 102100001506 G6PC3 Human genes 0.000 description 1
- 102100012694 GAL Human genes 0.000 description 1
- 101700002119 GAL Proteins 0.000 description 1
- 101700014301 GALK1 Proteins 0.000 description 1
- 101710037135 GAPC2 Proteins 0.000 description 1
- 101710037116 GAPC3 Proteins 0.000 description 1
- 101710025049 GAPDG Proteins 0.000 description 1
- 102100006425 GAPDH Human genes 0.000 description 1
- 101710007252 GG11003 Proteins 0.000 description 1
- 101700000203 GPP2 Proteins 0.000 description 1
- GYHNNYVSQQEPJS-UHFFFAOYSA-N Gallium Chemical compound [Ga] GYHNNYVSQQEPJS-UHFFFAOYSA-N 0.000 description 1
- 210000004907 Glands Anatomy 0.000 description 1
- 229960002442 Glucosamine Drugs 0.000 description 1
- 229960002989 Glutamic Acid Drugs 0.000 description 1
- XIPZDANNDPMZGQ-UHFFFAOYSA-N Glutaminyl-Cysteine Chemical compound NC(=O)CCC(N)C(=O)NC(CS)C(O)=O XIPZDANNDPMZGQ-UHFFFAOYSA-N 0.000 description 1
- CLSDNFWKGFJIBZ-UHFFFAOYSA-N Glutaminyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CCC(N)=O CLSDNFWKGFJIBZ-UHFFFAOYSA-N 0.000 description 1
- HHSJMSCOLJVTCX-UHFFFAOYSA-N Glutaminyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CCC(N)=O HHSJMSCOLJVTCX-UHFFFAOYSA-N 0.000 description 1
- MRVYVEQPNDSWLH-UHFFFAOYSA-N Glutaminyl-Valine Chemical compound CC(C)C(C(O)=O)NC(=O)C(N)CCC(N)=O MRVYVEQPNDSWLH-UHFFFAOYSA-N 0.000 description 1
- YIWFXZNIBQBFHR-LURJTMIESA-N Gly-His Chemical compound [NH3+]CC(=O)N[C@H](C([O-])=O)CC1=CN=CN1 YIWFXZNIBQBFHR-LURJTMIESA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 101710007246 H3-3A Proteins 0.000 description 1
- 101710017531 H4C15 Proteins 0.000 description 1
- 101700009603 HIS3 Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- MDCTVRUPVLZSPG-BQBZGAKWSA-N His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 MDCTVRUPVLZSPG-BQBZGAKWSA-N 0.000 description 1
- CZVQSYNVUHAILZ-UWVGGRQHSA-N His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 CZVQSYNVUHAILZ-UWVGGRQHSA-N 0.000 description 1
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 1
- FBTYOQIYBULKEH-ZFWWWQNUSA-N His-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 FBTYOQIYBULKEH-ZFWWWQNUSA-N 0.000 description 1
- WSDOHRLQDGAOGU-UHFFFAOYSA-N Histidinyl-Asparagine Chemical compound NC(=O)CC(C(O)=O)NC(=O)C(N)CC1=CN=CN1 WSDOHRLQDGAOGU-UHFFFAOYSA-N 0.000 description 1
- MAJYPBAJPNUFPV-UHFFFAOYSA-N Histidinyl-Cysteine Chemical compound SCC(C(O)=O)NC(=O)C(N)CC1=CN=CN1 MAJYPBAJPNUFPV-UHFFFAOYSA-N 0.000 description 1
- WRPDZHJNLYNFFT-UHFFFAOYSA-N Histidinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CC1=CN=CN1 WRPDZHJNLYNFFT-UHFFFAOYSA-N 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 229960000310 ISOLEUCINE Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N Kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 241000588749 Klebsiella oxytoca Species 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 125000000415 L-cysteinyl group Chemical group O=C([*])[C@@](N([H])[H])([H])C([H])([H])S[H] 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- 108091000084 L-lactate dehydrogenases Proteins 0.000 description 1
- 102000003855 L-lactate dehydrogenases Human genes 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- JXNRXNCCROJZFB-RYUDHWBXSA-N L-tyrosyl-L-arginine Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JXNRXNCCROJZFB-RYUDHWBXSA-N 0.000 description 1
- 101700021119 LEUC Proteins 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N Leu-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 241001182316 Linda Species 0.000 description 1
- 239000006391 Luria-Bertani Medium Substances 0.000 description 1
- QCZYYEFXOBKCNQ-STQMWFEESA-N Lys-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCZYYEFXOBKCNQ-STQMWFEESA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- JPNRPAJITHRXRH-UHFFFAOYSA-N Lysyl-Asparagine Chemical compound NCCCCC(N)C(=O)NC(C(O)=O)CC(N)=O JPNRPAJITHRXRH-UHFFFAOYSA-N 0.000 description 1
- 101710029649 MDV043 Proteins 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 229960002160 Maltose Drugs 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 108020004999 Messenger RNA Proteins 0.000 description 1
- UASDAHIAHBRZQV-YUMQZZPRSA-N Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N UASDAHIAHBRZQV-YUMQZZPRSA-N 0.000 description 1
- PBOUVYGPDSARIS-IUCAKERBSA-N Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C PBOUVYGPDSARIS-IUCAKERBSA-N 0.000 description 1
- ZYTPOUNUXRBYGW-YUMQZZPRSA-N Met-Met Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CCSC ZYTPOUNUXRBYGW-YUMQZZPRSA-N 0.000 description 1
- BJFJQOMZCSHBMY-YUMQZZPRSA-N Met-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O BJFJQOMZCSHBMY-YUMQZZPRSA-N 0.000 description 1
- JMEWFDUAFKVAAT-UHFFFAOYSA-N Methionyl-Asparagine Chemical compound CSCCC(N)C(=O)NC(C(O)=O)CC(N)=O JMEWFDUAFKVAAT-UHFFFAOYSA-N 0.000 description 1
- VMGAPWLDMVPYIA-HIDZBRGKSA-N N'-amino-N-iminomethanimidamide Chemical compound N\N=C\N=N VMGAPWLDMVPYIA-HIDZBRGKSA-N 0.000 description 1
- 101710017470 NT5C3B Proteins 0.000 description 1
- 210000004940 Nucleus Anatomy 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- GZMNRLSDVBKSJL-UHFFFAOYSA-N P(O)(O)OCC(C=O)O Chemical compound P(O)(O)OCC(C=O)O GZMNRLSDVBKSJL-UHFFFAOYSA-N 0.000 description 1
- 101700002320 PIR Proteins 0.000 description 1
- 101700061424 POLB Proteins 0.000 description 1
- 101710042137 PRSS1 Proteins 0.000 description 1
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 1
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Chemical compound C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 1
- DTBNBXWJWCWCIK-UHFFFAOYSA-N Phosphoenolpyruvic acid Natural products OC(=O)C(=C)OP(O)(O)=O DTBNBXWJWCWCIK-UHFFFAOYSA-N 0.000 description 1
- 102000012435 Phosphofructokinase-1 Human genes 0.000 description 1
- 108010022684 Phosphofructokinase-1 Proteins 0.000 description 1
- IUGYQRQAERSCNH-UHFFFAOYSA-N Pivalic acid Chemical compound CC(C)(C)C(O)=O IUGYQRQAERSCNH-UHFFFAOYSA-N 0.000 description 1
- SHAQGFGGJSLLHE-BQBZGAKWSA-N Pro-Gln Chemical compound NC(=O)CC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 SHAQGFGGJSLLHE-BQBZGAKWSA-N 0.000 description 1
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 1
- 229960002429 Proline Drugs 0.000 description 1
- HXNYBZQLBWIADP-UHFFFAOYSA-N Prolyl-Cysteine Chemical compound OC(=O)C(CS)NC(=O)C1CCCN1 HXNYBZQLBWIADP-UHFFFAOYSA-N 0.000 description 1
- 229960004063 Propylene glycol Drugs 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J Pyrophosphate Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 101700054624 RF1 Proteins 0.000 description 1
- 101710044706 SFA1 Proteins 0.000 description 1
- 101710042981 SHMT1 Proteins 0.000 description 1
- 101700027726 SPE1 Proteins 0.000 description 1
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 1
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- CZMRCDWAGMRECN-GDQSFJPYSA-N Sucrose Natural products O([C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](CO)O1)[C@@]1(CO)[C@H](O)[C@@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-GDQSFJPYSA-N 0.000 description 1
- 102100009594 TPI1 Human genes 0.000 description 1
- 101700000513 TYRP1 Proteins 0.000 description 1
- JZRWCGZRTZMZEH-UHFFFAOYSA-N Thiamine Natural products CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 description 1
- 229960000344 Thiamine hydrochloride Drugs 0.000 description 1
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 1
- UQTNIFUCMBFWEJ-UHFFFAOYSA-N Threoninyl-Asparagine Chemical compound CC(O)C(N)C(=O)NC(C(O)=O)CC(N)=O UQTNIFUCMBFWEJ-UHFFFAOYSA-N 0.000 description 1
- 229940104230 Thymidine Drugs 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N Trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K Trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- PEEAINPHPNDNGE-JQWIXIFHSA-N Trp-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 PEEAINPHPNDNGE-JQWIXIFHSA-N 0.000 description 1
- PWIQCLSQVQBOQV-AAEUAGOBSA-N Trp-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 PWIQCLSQVQBOQV-AAEUAGOBSA-N 0.000 description 1
- MYVYPSWUSKCCHG-JQWIXIFHSA-N Trp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 MYVYPSWUSKCCHG-JQWIXIFHSA-N 0.000 description 1
- 101710042194 Trpgamma Proteins 0.000 description 1
- GRQCSEWEPIHLBI-UHFFFAOYSA-N Tryptophyl-Asparagine Chemical compound C1=CC=C2C(CC(N)C(=O)NC(CC(N)=O)C(O)=O)=CNC2=C1 GRQCSEWEPIHLBI-UHFFFAOYSA-N 0.000 description 1
- YBRHKUNWEYBZGT-UHFFFAOYSA-N Tryptophyl-Threonine Chemical compound C1=CC=C2C(CC(N)C(=O)NC(C(O)C)C(O)=O)=CNC2=C1 YBRHKUNWEYBZGT-UHFFFAOYSA-N 0.000 description 1
- BMPPMAOOKQJYIP-WMZOPIPTSA-N Tyr-Trp Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C([O-])=O)C1=CC=C(O)C=C1 BMPPMAOOKQJYIP-WMZOPIPTSA-N 0.000 description 1
- OYOQKMOWUDVWCR-RYUDHWBXSA-N Tyr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OYOQKMOWUDVWCR-RYUDHWBXSA-N 0.000 description 1
- QZOSVNLXLSNHQK-UHFFFAOYSA-N Tyrosyl-Aspartate Chemical compound OC(=O)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 QZOSVNLXLSNHQK-UHFFFAOYSA-N 0.000 description 1
- BNQVUHQWZGTIBX-IUCAKERBSA-N Val-His Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 BNQVUHQWZGTIBX-IUCAKERBSA-N 0.000 description 1
- YSGSDAIMSCVPHG-YUMQZZPRSA-N Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C YSGSDAIMSCVPHG-YUMQZZPRSA-N 0.000 description 1
- 229930003779 Vitamin B12 Natural products 0.000 description 1
- 239000005862 Whey Substances 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- SDXUTBWRNPRIJW-SLPGGIOYSA-N [(2R,3R,4S,5R)-2,3,4,5-tetrahydroxy-6-oxohexyl] dihydrogen phosphite Chemical compound OP(O)OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O SDXUTBWRNPRIJW-SLPGGIOYSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000002378 acidificating Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 1
- WWMWAMFHUSTZTA-KQYNXXCUSA-N adenosine 5'-(pentahydrogen tetraphosphate) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O WWMWAMFHUSTZTA-KQYNXXCUSA-N 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 101700074583 agp Proteins 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- HXHWSAZORRCQMX-UHFFFAOYSA-N albendazole Chemical compound CCCSC1=CC=C2NC(NC(=O)OC)=NC2=C1 HXHWSAZORRCQMX-UHFFFAOYSA-N 0.000 description 1
- 230000001476 alcoholic Effects 0.000 description 1
- GZCGUPFRVQAUEE-SLPGGIOYSA-N aldehydo-D-glucose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O GZCGUPFRVQAUEE-SLPGGIOYSA-N 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- DQPBABKTKYNPMH-UHFFFAOYSA-M amino sulfate Chemical compound NOS([O-])(=O)=O DQPBABKTKYNPMH-UHFFFAOYSA-M 0.000 description 1
- 101700012916 ampC Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000002528 anti-freeze Effects 0.000 description 1
- 239000007900 aqueous suspension Substances 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 101700065210 asp1 Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000037358 bacterial metabolism Effects 0.000 description 1
- 244000052616 bacterial pathogens Species 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-M bisulfite Chemical class OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 1
- CPELXLSAUQHCOX-UHFFFAOYSA-M bromide Chemical compound [Br-] CPELXLSAUQHCOX-UHFFFAOYSA-M 0.000 description 1
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic Effects 0.000 description 1
- 101710025091 cbbGC Proteins 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 235000013351 cheese Nutrition 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 125000003346 cobalamin group Chemical group 0.000 description 1
- 238000007398 colorimetric assay Methods 0.000 description 1
- 230000000295 complement Effects 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 239000003636 conditioned culture media Substances 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 235000005824 corn Nutrition 0.000 description 1
- 230000001808 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 230000001082 cryoprotectant Effects 0.000 description 1
- 239000002577 cryoprotective agent Substances 0.000 description 1
- 150000001923 cyclic compounds Chemical class 0.000 description 1
- 108010081447 cytochrophin-4 Proteins 0.000 description 1
- 230000003247 decreasing Effects 0.000 description 1
- 101700047627 deoR Proteins 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 101700029304 dhaB Proteins 0.000 description 1
- 235000013681 dietary sucrose Nutrition 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing Effects 0.000 description 1
- 238000007824 enzymatic assay Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 101700043757 eryH Proteins 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- ANUSOIHIIPAHJV-UHFFFAOYSA-N fenticlor Chemical compound OC1=CC=C(Cl)C=C1SC1=CC(Cl)=CC=C1O ANUSOIHIIPAHJV-UHFFFAOYSA-N 0.000 description 1
- 229960004642 ferric ammonium citrate Drugs 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 238000005755 formation reaction Methods 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 101700038358 glpF Proteins 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N gly ala Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- 150000002314 glycerols Chemical class 0.000 description 1
- 230000002414 glycolytic Effects 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine zwitterion Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 101700042169 gspH Proteins 0.000 description 1
- 101700004055 gspI Proteins 0.000 description 1
- ZRALSGWEFCBTJO-UHFFFAOYSA-N guanidine Chemical compound NC(N)=N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000002209 hydrophobic Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- GSXOAOHZAIYLCY-HSUXUTPPSA-N keto-D-fructose 6-phosphate Chemical compound OCC(=O)[C@@H](O)[C@H](O)[C@H](O)COP(O)(O)=O GSXOAOHZAIYLCY-HSUXUTPPSA-N 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 101700005315 lerI Proteins 0.000 description 1
- 230000003902 lesions Effects 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- CSNNHWWHGAXBCP-UHFFFAOYSA-L magnesium sulphate Substances [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 229920002106 messenger RNA Polymers 0.000 description 1
- 230000002503 metabolic Effects 0.000 description 1
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Chemical compound C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 230000000813 microbial Effects 0.000 description 1
- 230000003278 mimic Effects 0.000 description 1
- 235000010755 mineral Nutrition 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 230000002438 mitochondrial Effects 0.000 description 1
- 108060005003 mnmG Proteins 0.000 description 1
- 235000013379 molasses Nutrition 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- MEFBJEMVZONFCJ-UHFFFAOYSA-N molybdate Chemical compound [O-][Mo]([O-])(=O)=O MEFBJEMVZONFCJ-UHFFFAOYSA-N 0.000 description 1
- 150000004682 monohydrates Chemical class 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 150000004712 monophosphates Chemical class 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 230000003505 mutagenic Effects 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000000065 osmolyte Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N p-acetaminophenol Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- ACVYVLVWPXVTIT-UHFFFAOYSA-M phosphinate Chemical compound [O-][PH2]=O ACVYVLVWPXVTIT-UHFFFAOYSA-M 0.000 description 1
- NJRWNWYFPOFDFN-UHFFFAOYSA-L phosphonate(2-) Chemical compound [O-][P]([O-])=O NJRWNWYFPOFDFN-UHFFFAOYSA-L 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000000865 phosphorylative Effects 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 230000027086 plasmid maintenance Effects 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 108091008117 polyclonal antibodies Proteins 0.000 description 1
- 229920000728 polyester Polymers 0.000 description 1
- 229920002635 polyurethane Polymers 0.000 description 1
- 239000004814 polyurethane Substances 0.000 description 1
- 101700051431 pqn-8 Proteins 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 235000013772 propylene glycol Nutrition 0.000 description 1
- 230000001681 protective Effects 0.000 description 1
- 230000001012 protector Effects 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 239000012429 reaction media Substances 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000000754 repressing Effects 0.000 description 1
- 230000001718 repressive Effects 0.000 description 1
- 230000002104 routine Effects 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- LIVNPJMFVYWSIS-UHFFFAOYSA-N silicon monoxide Inorganic materials [Si-]#[O+] LIVNPJMFVYWSIS-UHFFFAOYSA-N 0.000 description 1
- 239000000344 soap Substances 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 235000011121 sodium hydroxide Nutrition 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- NESXXUAWVHXRGX-UHFFFAOYSA-M sodium;1-carboxyethenyl hydrogen phosphate Chemical compound [Na+].OP(O)(=O)OC(=C)C([O-])=O NESXXUAWVHXRGX-UHFFFAOYSA-M 0.000 description 1
- 238000009331 sowing Methods 0.000 description 1
- 230000003595 spectral Effects 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 239000012085 test solution Substances 0.000 description 1
- 235000019157 thiamine Nutrition 0.000 description 1
- 229960003495 thiamine Drugs 0.000 description 1
- 239000011721 thiamine Substances 0.000 description 1
- KYMBYSLLVAOCFI-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SCN1CC1=CN=C(C)N=C1N KYMBYSLLVAOCFI-UHFFFAOYSA-N 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 235000019190 thiamine hydrochloride Nutrition 0.000 description 1
- 239000011747 thiamine hydrochloride Substances 0.000 description 1
- 101710031202 tpiA1 Proteins 0.000 description 1
- 101710031201 tpiA2 Proteins 0.000 description 1
- 239000011778 trisodium citrate Substances 0.000 description 1
- 108060005004 trmFO Proteins 0.000 description 1
- 101700075735 tyr-1 Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 101710035147 ura3 Proteins 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 235000019163 vitamin B12 Nutrition 0.000 description 1
- 239000011715 vitamin B12 Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- 101710013713 xcpT Proteins 0.000 description 1
- 101710025069 yplQ Proteins 0.000 description 1
- 229910000368 zinc sulfate Inorganic materials 0.000 description 1
- DXJZITDUDUPINW-UHFFFAOYSA-N γ-glutamyl-Asparagine Chemical compound NC(=O)CCC(N)C(=O)NC(CC(N)=O)C(O)=O DXJZITDUDUPINW-UHFFFAOYSA-N 0.000 description 1
- NJMYZEJORPYOTO-UHFFFAOYSA-N γ-glutamyl-Proline Chemical compound NC(=O)CCC(N)C(=O)N1CCCC1C(O)=O NJMYZEJORPYOTO-UHFFFAOYSA-N 0.000 description 1
- ZQFAGNFSIZZYBA-UHFFFAOYSA-N γ-glutamyl-Tryptophan Chemical compound C1=CC=C2C(CC(NC(=O)C(CCC(N)=O)N)C(O)=O)=CNC2=C1 ZQFAGNFSIZZYBA-UHFFFAOYSA-N 0.000 description 1
Abstract
Recombinant organisms are provided comprising genes encoding a glycerol-3-phosphate dehydrogenase and/or a glycerol-3-phosphatase activity useful for the production of glycerol from a variety of carbon substrates. The organisms further contain disruptions in the endogenous genes encoding proteins having glycerol kinase and glycerol dehydrogenase activities.
Description
METHOD FOR THE PRODUCTION OF GLYCEROL BY RECOMBINANT ORGANISMS
FIELD OF THE INVENTION
The present invention concerns the field of molecular biology and the use of recombinant organisms for the production of glycerol and compounds from glycerol biosynthetic processes. More specifically, the invention describes the construction of a recombinant cell for the production of glycerol and compounds derived from a carbon substrate, the cell containing foreign genes that encode proteins having glycerol-3-phosphate dehydrogenase (G3PDH) and glycerol activities. 3-phosphatase (G3P phosphatase) where the endogenous genes that encode glycerol converting glycerol activities hydrogenase and glycerol kinase, have been eliminated.
BACKGROUND
Glycerol is a highly demanded compound for use in the cosmetics, liquid soaps, food, pharmaceutical, lubricants, REF. 120688 anti-freeze, and numerous other applications. Glycerol esters are important in the fats and oils industry. Historically, glycerol has been isolated from animal fat and similar sources; however, the process is laborious and inefficient. Microbial production of glycerol is preferred.
Not all organisms have a natural ability to synthesize glycerol. However, the biological production of glycerol is known for some species of bacteria, algae, and yeasts. The bacteria Bacill us licheniformis and Lactobacillus lycopersica synthesize glycerol. The production of glycerol is found in the seaweed Dunali ella sp. and Asteromonas gracilis halotolerantes for protection against high concentration of external salts (Ben-A otz et al., (1982) Experientia 38: 49-52). Similarly, several yeast-tolerant yeasts synthesize glycerol as a protective measure. Mostly Saccharomyces strains produce glycerol during alcoholic fermentation and their production can be increased by the application of osmotic stress (Alberryn et al., (1994) Mol Cell Biol .. 14, 4135-4144). At the beginning of this century glycerol was commercially produced with cultures of Saccharomyces to which reactants were added to induce them such as sulfites or alkalis. Through the formation of an inactive complex, the inducing agents block or inhibit the conversion of acetaldehyde or ethanol; thus, excess reducing equivalents (NADH) are available for or "induced" to dihydroxyacetone phosphate (DHAP) for reduction to produce glycerol. This method is limited by the partial inhibition of yeast development that is due to sulfites. This limitation can be partially overcome by the use of alkalies that create excess NADH equivalents by a different mechanism. In this practice, the alkalis initiate a Cannizarro disproportion to produce ethanol and acetic acid from two equivalents of acetaldehyde. In that way, although glycerol production is possible from organisms as they are naturally found, production is often subject to the need to control the osmotic tension of crops and the production of sulfites. A method free of this limitation is desirable. The production of glycerol by biosynthetic process is a possible way for such a method.
A number of the genes involved in the biosynthetic procedure for glycerol have been isolated. For example, the gene encoding glycero-3-phosphate dehydrogenase (DAR1, GPD1) has been cloned and sequenced from Saccaromyces diastaticus (Wang et al., (1994), J. Bact. 176: 7091-7095). The DAR1 gene was cloned into a traveling vector and used to transform E. coli where the expression produced the active enzyme. Wang et al., Supra, recognizes that DAR1 is regulated by the cellular osmotic environment but does not suggest how the gene should be used to improve the production of glycerol in a recombinant organism.
Other glycerol-3-phosphate dehydrogenase enzymes have been isolated. For example, sn-glycerol-3-phosphate dehydrogenase has been cloned and sequenced from S. Serevisiae (Larason et al., (1993) Mol.Microbiol., 10: 1101). Albertyn et al., (1994) Mol. Cell. Biol., 14: 4135) discloses the cloning of GPD1 encoding glycerol-3-phosphate dehydrogenase from S. Serevisiae. In the same way as Wang et al., Both Alberthyn et al. And Larason et al. Recognize the osmo-sensitivity of the regulation of this gene but do not suggest how the gene should be used in the production of glycerol in a recombinant organism. As with G3PDH, glycerol 3-phosphatase has been isolated from Saccharomyces cerevisiae and the protein identified as being encoded by GPP1 and GPP2 genes (Norbecck et al., (1996) J. Biol. Chem., 271: 13875). In the same way as the genes that encode G3PDH, it seems that GPP2 is osmotically induced.
Although the genes encoding G3PDH and G3P phosphatase have been isolated, there is no teaching in the art that demonstrates the production of glycerol from recombinant organisms with G3PDH / G3P phosphatase expressed together or separately. Additionally there is no teaching to suggest that efficient production of glycerol desired some type of wild type organism is possible using these two enzymatic activities that does not require any tension (salts or an osmolyte) to the cell. In fact, experience suggests that G3PDH activities may not affect the production of glycerol- For example, Eustace
((11987), Can. J. Microbiol., 33: 112-117)) shows hybridized yeast strains that produce glycerol at higher levels than the parent strain. However, Eustace also shows that the G3PDH activity remained constant or slightly lower in the hybridized strains as opposed to the wild type.
Glycerol is an industrially useful material. However, other compounds may be derivatives of the glycerol biosynthetic process which also have commercial significance. For example, glycerol producing organisms can be designed to produce 1, 3-propanediol (USA 5686276), a monomer that has potential utility in the production of polyester fibers and in the manufacture of polyurethanes and cyclic compounds. It is known for example that in some organisms, glycerol is converted to 3-hydroxypropionaldehyde and then to 1,3-propanediol through the actions of a dehydratase enzyme and an oxidoreductase enzyme, respectively. Bacterial strains capable of producing 1,3-propanediol have been found, for example, in the group ci Trobacter, Clostridium, Enterobacter, Iliobacter, Klebsi ella,
Lactobacillus, and Pelobacter. The glycerol dehydratase and dehydratase diol systems are described by Seyfried et al. (1996) J. Bacteriol, 178: 5793-5796 and Tobimatsu et al. (1995) J. Biol. Chem. 270: 7142-7148, respectively. Recombinant organisms, which contain exogenous dehydratase enzyme, which are capable of producing 1,3-propanediol have been described (USA 5686276). Although these organisms produce 1,3-propanediol, it is clear that they would benefit from a system that would minimize the conversion of glycerol.
There are a number of engineering advantages of glycerol producing organisms for the production of 1,3-propanediol where the conversion of glycerol is minimized. A microorganism capable of efficiently producing glycerol under physiological conditions is industrially desirable, especially when the glycerol itself will be used as a substrate in vivo as part of a complex catabolic or biosynthetic process that could be disturbed by omotic tension or the addition of inducing agents (eg. example, production of 1,3-propanediol). Some attempts to create mutants of glycerol kinase and glycerol dehydrogenase have been made. Or example, De Koning et al. (1990) Appl. Microbiol. Biotechnology. 32: 693-698 reports that the methanol-dependent production of dihydroxyacetone and glycerol by mutants of the methylphenic yeast Hansenula polymorpha blocked in dihydroxyacetone kinase and glycerol kinase. Methanol and an additional substrate required to replenish the xylose-5-phosphate, co-substrate of the assimilation reaction, was used to produce glycerol; however, a dihydroxyacetone reductase (glycerol dehydrogenase) is also required. Similarly, Shaw and Cameron, Book of Abstracts, 211ava. ACS National Meeting, Neew Orleans, LA, March 24-28 (1996), BIOT-154 Publisher; American Chemical Society,
Washimgton, D. C. investigates the elimination of IdhA
(lactate dehydrogenase) gJpK (glycerol kinase), and tpiA
(triosephosphate isomerase) for the production optimization of 1,3-propanediol. They do not suggest the expression of genes cloned by G3PDH or G3P phosphatase for the production of glycerol or 1,3-propanediol and they do not discuss the impact of glycerol dehydrogenase.
The problem to be solved, however, in the lack of a process to direct coal flow towards coal production by the addition of enhancers of certain enzymatic activities, especially G3PDH and G3P phosphatase that respectively catalyze the conversion of dihydroxyacetone phosphate ( DHAP) to glycerol-3-phosphate (G3P and then to glycerol.) The problem is complicated by the need to control the flow of carbon away from glycerol by eliminating or decreasing the activities of certain enzymes, especially glycerol kinase and glycerol dehydrogenase which respectively they catalyze the conversion of glycerol plus ATPP to G3P and glycerol to dihydroxyacetone (or glyceraldehyde).
BRIEF DESCRIPTION OF THE INVENTION The present invention provides a method for the production of glycerol from a recombinant organism comprising: transforming a suitable host cell with an expression cartridge comprising either or both of (a) a gene encoding a protein having glycerol-3-phosphate dehydrogenase activity and (b) a gene encoding a protein having glycerol 3-phosphate phosphatase activity, wherein the suitable host cell contains an interruption in either one or both of (a) a gene encoding an endogenous glycerol kinase and (b) a gene encoding an endogenous glycerol dehydrogenase, wherein the disruption prevents the expression of the active gene product; culturing the transformed host cell in the presence of at least one carbon source selected from the group consisting of - monosaccharides, oligosaccharides, polysaccharides, and single carbon substrates, by means of which glycerol is produced; and recovering the glycerol produced.
The present invention further provides a process for the production of 1,3-propanediol from a recombinant organism comprising: transforming a suitable host cell with an expression cartridge comprising either or both of (a) a gene encoding a protein having glycerol 3-phosphate dehydrogenase activity and (b) a gene encoding a protein having glycerol-3-phosphate phosphatase activity, the suitable host cell having at least one gene encoding a protein having a dehydratase activity and having an interruption in either or both of (a) a gene encoding an endogenous glycerol kinase and (b) a gene encoding an endogenous glycerol dehydrogenase, in which the disruption in the genes of (a) or (b) prevent the expression of the active gene product; culturing the transformed host cells in the presence of at least one carbon source selected from the group consisting of monosaccharides, oligosaccharides, polysaccharides, and single carbon substrates by which 1,3-propanediol is produced; and recovering the 1, 3-propanediol produced.
Additionally the invention provides for a process for the production of 1,3-propanediol from a recombinant organism where multiple copies of endogenous genes are introduced.
Additional embodiments of the invention include host cells transformed with heterologous genes for the glycerol method as well as host cells containing endogenous genes for the glycerol process.
Additionally, the invention provides recombinant cells suitable for the production of either glycerol or 1,3-propanediol, the host cells having genes that express either or both of a glycerol-3-phosphate dehydrogenase activity and a glycerol activity. - 3-phosphate phosphatase wherein the cell also has breaks in either or both of a gene encoding a glycerol kinase and a coding for a glycerol dehydrogenase, wherein disruption in the genes prevents the expression of an active gene product .
BRIEF DESCRIPTION OF THE FIGURES, BIOLOGICAL DEPOSITS AND LIST OF SEQUENCES.
Figure 1 illustrates the representative enzymatic procedure involving the metabolism of glycerol.
Applicants have made the following biological deposits under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the purposes of Patent Procedures:
Reference for Int Designation Date of Depositor's Identification Depositor's Deposit
Escherichia coli ATCC 98187 26 from pñH21 / DH5a September (containing the 1996 GPP2 gene)
Escherichia coli ATCC 98248 6 of DAR1A / AA200) November containing the 1996 gene DAR1)
FM5 Escherichia ATCC 98597 25 of Coli RJFl Om (November Contains a 1997 in terption) Reference for Int Designation Date of Depositor Identification Depositor's Deposit
FM5 Escherichia ATCC 98598 25 of Coli MSP33.6 November (containing 1997) an interruption gldA)
"ATCC" refers to the American Type Culture Collection international deposit located at 12301 Paarklawn Drive, Rockville, MD 20852 USA. The designation is the access number of the deposited material.
Applicants' has provided 43 sequences in accordance with the Rules for the Standard Representation of Sequences of Nucleotides and Amino Acids in Patent Applications (Annexes I and II of the Decision of the President of the EPO, published in Supplement No. 2 of PJ EPO , 12/1992) and with 37 CFR 1. 821- 1825 and Appendices A and B (Requirements for Requests Exhibits containing Nucleotide and amino acid sequences).
DETAILED DESCRIPTION OF THE INVENTION
The present invention solves the problem set forth in providing a method for the biological production of glycerol from a source of fermentable carbon in a recombinant organism. The method provides a fast, cheap and environmentally responsible source of glycerol useful in the cosmetic and pharmaceutical industries. The method uses a microorganism containing cloned homologous and heterologous genes encoding glycerol-3-phosphate dehydrogenase (G3PDH) and / or glycerol-3-phosphatase (G3P phosphatase). These genes are expressed in a recombinant host that has interruptions in genes encoding endogenous glycerol kinase and / or glycerol dehydrogenase enzymes. The method is useful for the production of glycerol, as well as some final products for which glycerol is an intermediate. The recombinant microorganism is contacted with a carbon source and cultured and then the glycerol or some final products derived therefrom are isolated from the conditioned medium. The genes can be incorporated into the host microorganisms separately or together for the production of the glycerol.
The process of the applicants has not previously been described for a recombinant organism and required the isolation of genes encoding the two enzymes and their subsequent expression in a host cell that has disruptions in the endogenous kinase and dehydrogenase genes. It will be appreciated by those familiar with the art that the applicants process can generally be applied for the production of compounds in which glycerol is a key intermediate, for example, 1,3-propanediol.
As used herein, the following terms may be used for the interpretation of the Claims and specification.
The terms "glycerol-3-phosphate dehydrogenase" and "G3PDH" refer to a polypeptide responsible for an enzymatic activity that catalyzes the conversion of dihydroxyacetone phosphate (DHAP) to glycerol-3-phosphate (G3P). In vivo G3PDH can be NADH; NADPH; or dependent on FAD. The NADH-dependent enzyme (EC 1.1.1.8) is encoded, for example, by several genes including GPDl (GenBank Z74071x2), or GPD2 (Genbank Z35169xl), or GPD3 (GenBank G984182), or DAR1 (GenBank Z74071x2). The NADPH-dependent enzyme (EC 1.1.1.94) is encoded by gpsA (GenBank U321643, (cds 197911-196892) G466746 and L45246). The FAD-dependent enzyme (EC 1.1.99.5) is encoded by GUT2 (GenBank Z47047x23), or glpD (GenBank G147838), or glpRBC (GenBank M20938).
The terms "glycerol-3-phosphatase", "sn-glycerol-3-phosphatase" or "d, 1-glycerol phosphatase", and "G3P phosphatase" refer to a polypeptide responsible for an enzymatic activity that catalyzes the conversion of glycerol - 3- phosphate and water to glycerol and inorganic phosphate. G3P phosphatase is encoded for example, by GPP1 (GenBank Z47047x25), or GPP2 (GenBank U18813x11).
The term "glycerol kinase" refers to a polypeptide responsible for an enzymatic activity that catalyzes the conversion of glycerol and ATP to glycerol-3-phosphate and ADP. The high energy of the phosphate donor ATP can be replaced by physiological substitutes (for example phosphoenolpyruvate). Glycerol kinase is encoded, for example, by GUT1 (GenBank U11583xl9) and glpK (GenBank L19201).
The term "glycerol dehydrogenase" refers to a polypeptide responsible for an enzymatic activity that catalyzes the conversion of glycerol to dihydroxyacetone (E.C. 1.1.1.6) or glycerol to glyceraldehyde (E.C.1.1.1.72). A responsible polypeptide - of an enzymatic activity that catalyzes the conversion of glycerol to dihydroxyacetone is also referred to as a "dihydroxyacetone reductase". Glycerol dehydrogenase may be dependent on NADH (E.C.l.1.1.6), NADPH (E .C.1.1.1.72), or other cofactors (e.g., E.C.1.1.99.22). A glycerol dehydrogenase dependent on NADH is encoded, for example, by gldA (GenBank U00006).
The term "enzyme dehydratase" will refer to any enzyme that is capable of isomerizing or converting a glycerol molecule to the product 3-hydroxypropionaldehyde. For the purposes of the present invention the dehydratase enzymes include a glycerol dehydratase (E.C. 4.2.1.30) and a diol dehydratase (E.C..2.1.28) having preferred substrates of glycerol and 1, 2-propanediol, respectively. In Citrobacter freundii, for example, glycerol dehydratase is encoded by three polypeptides whose gene sequences are represented by dhaB, dhaC and dhaE (GenBank U09771: base pairs 8556-10223, 10235-10819, and 10822-11250, respectively). In Klebsiella oxytoca, for example, diol dehydrate is encoded by three polypeptides whose gene sequences are represented by pddA, pddB, and pddC (GenBank D45071): base pairs 121-1785, 1796-2470, and 2485-3006, respectively).
The terms "GPDl", "DAR1", "0SG1", "D2830", and "YDL022W" will be used interchangeably and refer to a gene encoding a cytosolic glycerol-3-phosphate dehydrogenase and is characterized by the base sequence as SEC. ID NO: 1.
The term "GPD2" refers to a gene encoding a cytosolic glycerol-3-phosphate dehydrogenase and is characterized by the sequence given in SEQ ID NO: 2.
The terms "GUT2" and "YIL155C" are used interchangeably and refer to a gene encoding a mitochondrial glycerol-3-phosphate dehydrogenase and is characterized by the base sequence given in SEQ ID NO: 3.
The term "GPP1", "RHR2" and "YIL053W" are used interchangeably and refer to a .gen coding for a cytosolic glycerol-3-phosphatase and is characterized by the base sequence given in SEQ ID NO: 4.
The terms "GPP2", "H0R2" and "YER062C" are used interchangeably and refer to a gene encoding a cytosolic glycerol-3-phosphatase and characterized by the base sequence given as SEQ ID NO: 5.
The term "GUT1" refers to a gene encoding a cytosolic glycerol cinase and is characterized by the base sequence given as SEQ ID NO: 6. The term "glpK" refers to another gene encoding a glycerol cinase and is characterized by the base sequence given in GeneBank L19201, base pairs 77347-78855.
The term "gldA" refers to a gene encoding a glycerol dehydrogenase and is characterized by the base sequence given in GeneBank U00006, base pairs 3174-4316. The term "dad" refers to another gene encoding glycerol dehydrogenase and is characterized by the base sequence given in GeneBank U09771, peres bese 2577-3654.
How it is used in the present term
"Function" and "Enzyme function" refers to the catalytic ectivity of an enzyme to elterate the energy required to effect a specific chemical reaction. Such an activity can be applied to an equilibrium reaction where the production of both product and substrate can be achieved under suitable conditions.
The terms "polypeptide" and "protein" are used interchangeably. The terms "carbon substrate" and "carbon source" refers to a source capable of being metabolized by guest orgenisms of the present invention and particularly means carbon sources selected from the group consisting of monosaccharides, oligosaccharides, polysaccharides, and substrates of a carbon or mixtures of these.
"Conversion" refers to the metabolic process of an organism or cell that by means of a chemical reaction degrades or destroys the complexity of a chemical compound or substrate.
The terms "host cell" and "host organism" refer to a microorganism capable of receiving extirpant or heterologous genes and additional copies of endogenous genes and expressing these genes to produce an active gene product.
The terms "cell production" or "production of orgenisms" refer to a cell designed for the production of glycerol or compounds that can be derived from the glycerol biosynthetic process. Cell production will be recombinant and will contain either or both of a gene encoding a protein having an activity glycerol-3-phosphate dehydrogenase and a gene encoding a protein having a glycerol-3-phosphatase activity. In addition, in the genes of G3PDH and G3P phosphate, the host cell will contain interruptions in one or both of a gene encoding an endogenous glycerol kinase and a gene encoding an endogenous glycerol dehydrogenase. Where cell production is designated to produce 1, 3-propendiol, it will additionally contain a gene encoding a protein having dehydratase activity.
The terms "foreign gene", "DNA extrenjero", "heterologous gene", and "heterologous DNA" all refer to and genetic material net of an organism that has been placed in a different host organism.
The term "endogenous" as used herein with reference to genes or polypeptides expressed by genes, refers to genes or polypeptides that are native to a production of. cells and are not derived from other organisms. In the same way an "endogenous glycerol kinase" and an "endogenous glycerol dehydrogenase" are terms that refer to codified polypeptides by native genes for cell production.
The terms "recombinant organism" and "transformed host" refer to any organism transformed with heterologous or foreign genes. The recombinant organisms of the present invention express foreign genes encoding G3PDH and G3P phosphate for the production of glycerol from edecuted ceric sub-strains. Additionally, the terms "recombinant orgenisms" and "transformed hosts" refer to any organism transformed with endogenous genes (or homologs) as well as increase the number of copies of the genes.
"Gene" refers to a fragment of nucleic acid that expresses a specific protein, which includes regulatory sequences that precede non-coding) and follow e (3 'non-coding) coding region. The terms "native" and "wild type" refer to the gene as it is found in the netureleze with its own regulatory regimens.
The term "coding" refers to the process by which a gene, through the mechanisms of transcription and transfection, produces an amino acid sequence. The process of coding a specific amino acid sequence is the means to include DNA sequences that may involve base changes that do not cause a change in the encoded amino acid, or. that involves base changes that can alter one or more amino acids, but does not affect the functional properties of the protein encoded by the DNA sequence. For this, the invention aberca more than the specific exemplary sequences. Modifications in the sequence, such as eliminations, insertions, or substitutions in the sequence that produce imperceptible signals that do not substense the remaining functional properties of the resulting protein molecule are also contemplated. For example, alterations in the sequence of the gene that reflect the degeneration of the genetic code, or that result in the production of an amino acid chemically equivalent in a given site, are contemplated; similarly, a codon stops the amino acid alanine, a hydrophobic amino acid, can be replaced by a codon encoding other less hydrophobic residues, such as glycine, or a more hydrophobic residue, such as valine, leucine, or isoleucine. - Similarly, changes that result in the replacement of a negatively charged residue by another, such as aspartic acid with glutamic acid, or a positively charged residue by another, such as lysine by arginine, can also be expected to produce a biologically equivalent product. . Changes in the nucleotides that result in alteration in the N-terminal and C-terminal portions of the protein molecule would not be expected to alter the protein's activity either. In some cases, it may in fact be desirable to make mutants of the sequence in order to study the effect of the alteration of the biological activity of the protein. One of the proposed changes is in the routines of experts in the field, as is the determination of the retention of biological activity in codified products. In addition, the experts in the field recognize that the sequences encompassed by this invention are also defined by their capecided pare hybridizer, under conditions of severity (0.1X SSC, 0.1 L SDS, 65 ° C), with the sequence exemplified herein. .
The term "expression" refers to the transcription and triage of the product of the gene from a gene coding for the sequence of the gene product.
The terms "plasmid", "vector" and "cartridge" as used herein refer to an extrachromosomal element that often drives genes that are not part of the cell's central metabolism and usually in the form of circular DNA molecules of double filament. Such elements can be autonomously replicating sequences, sequences that integrate the genome, phage or nucleotide sequences, linear or circular, in which a number of nucleotide sequences have been joined or recombined in a unique construction that is capable of introducing a promoter fragment and DNA sequence for a selected gene product together with the appropriate non-translocated 3 'sequence in a cell, "transformation cartridge" refers to a specific vector that contains a foreign gene and that has elements in addition to the foreign gene that they facilitate the transformation of a particular host cell. "Expression cartridge" refers to a specific vector that contains a foreign gene and that has elements in addition to the foreign gene that allows enhanced expression of said gene in a foreign host cell.
The terms "transformation" and "transfection" refer to the acquisition of new genes in a cell after incorporation of nucleic acid. The acquired genes can be integrated into a chromosomal DNA or introduced as extrachromosomal replicating sequences. The term "transfor" refers to the cell resulting from a transformation.
The term "genetically altered" refers to the process of changing materiel herediterio by trensformeción or mutation. The terms "interruption" and "interrupted gene" as applied to genes refer to a method of genetically altering an organism by adding to or eliminating a gene a significant portion of said gene such as the protein e encoded by said gene if not is expressed and is not expressed in an effective form.
Biosynthetic glycerol procedure
It is contemplated that glycerol can be produced in recombinant orgenisms by manipulation of the synthetic glycerol process found in most microorganisms. Typically, a carbon substrate such as glucose is converted to glucose 6-phosphatase and hexocinese in the presence of ATP.
The glucose-phosphate isomerase catalyzes the conversion of glucose-6-phosphite and fructose-6-phosphate and then e-fructose-1, 6-diphosphate through the action of 6-phosphofructokinase. The diphosphate is then carried in dihydroxyacetone phosphate (DHAP) via aldolate. Finally G3PDH NADH-dependent converts DHAP to glycerol-3-phosphate which is then dephosphorylated to glycerol by G3P phosphatase. (Agarwel (1990), Adv. Biochem. Eng. 41: 114).
Genes encoding G3PDH, glycerol dehydrogenase, G3P phosphatase and glycerol kinase
The present invention provides genes suitable for the expression of the activities of G3PDH and G3P phosphates in a host cell.
Genes encoding G3PDH are known, for example, GPD1 has been isolated from Sacharomyces and has the base sequence given by SEQ ID NO: 1, which encodes the amino acid sequence given in SEQ ID NO: 7 (Wang and coleborers, supra). Similarly, the activity of G3PDH has been isolated from Saccharomyces encoded by GPD2 having the sequence given in SEQ ID NO: 2 coding for the amino acid sequence given in SEQ ID NO: 8 (Erikson et al., 1995). Microbiol., 17: 95).
For the purposes of the present invention, it is contemplated that any gene encoding a polypeptide responsible for the activity of G3PDH is determined to dictate the ability of the cetelizer to convert dihydroxycetone phosphite (DHAP) to glycerol-3-phosphate (G3P). . Additionally it is contemplated that any gene encoding the amino acid sequence of G3PDH as given by SEQ ID NO: 7, 8, 9, 10, 11 and 12 corresponding to the genes GPD1, GPD2, GUT2, gpsA, glpD, and the a subunit of glpABC respectively, be functional in the present invention in which said amino acid sequence can encompass substitutions, deletions or additions, of amino acids that do not alter the function of the enzyme. The experts in le meterie appreciate that genes encoding G3PDH isolated from other sources also be suitable for use in the present invention. For example, genes isolated from prokaryotes include accesses to GeneBank M34393, M20938, L06231, U12567, L45246, L45323, L45324, L45325, U32164, U32689, and U39682. Fungal isolated genes include accesses to GeneBank U30625, U30876 and X56162; genes isolated from insects include accesses to GeneBank X61223 and X14179; and genes isolated from source mammals include eats and GeneBank U12424, M25558 and X78593.
The genes that encode G3P phosphatase are known. For example, GPP2 has been isolated from Saccharomyces servisiae and has the base sequence of SEQ ID NO: 5, which encodes the amino acid sequence given in SEQ ID NO: 13 (Norbeck et al., (1996), J. Biol. Chem, 271: 13875).
For the purposes of the present invention, any gene encoding a G3P phosphite activity is suitable for use in the method in which the activity is capable of catalyzing the conversion of glycerol-3-phosphate and water to glycerol and organic phosphate. Additionally, any gene encoding the amino acid sequence of G3P phosphates as given by the sequence SEQ ID NO: 13 and 14 corresponding to the GPP2 and GPP1 genes respectively, will be functional in the present invention that includes any sequence of amino acids that ebarce substitutions, eliminations or additions of amino acids that do not alter the function of the enzyme G3P phosphatase. The experts in the materie will appreciate that genes encoding G3P phosphatase isolated from other sources will be suitable for use in the present invention. For example, the dephosphorylation of glycerol-3-phosphate to produce glycerol can be achieved with one or more of the following generic or specific phosphorylates: phosphonate elcelin (EC 3.1.3.1.) [GenBank M19159, M29663, U02550 or M33965]; acid phosphate (EC 3.1.3.2) [GenBank U51210, U19789, U28658 or L20566]; glycerol-3-phosphatase (EC 3.3.3.-) [GenBank Z38060 or U18813xll]; glucose-1-phosphatase (EC 3.3.3.10) [GenBank M33807]; glucose-6-phosphatase (EC 3.1.3.9) [GenBank U00445]; fructose-1, 6-bisphosphatase (EC 3.1.3.11) [GenBank X 12545 or J03207] or phosphotidyl glycero phosphate phosphatase (EC 3.3.3.27) [GenBank M23546 and M23628].
Genes encoding glycerol kinase are known. For example, GUT1 coding for glycerol kinase from Saccharomyces has been isolated and sequenced (Pavlik and collaborators (1993), Curr. Genet., 24: 21) and the sequence is from SEQ ID NO: 6, which codes sequence of amino acids dade in SEQ ID NO: 15. alternetically, glpK encodes a glycerol kinase from E. coli and is characterized by the base sequence given in Genebank L19201, peres bese 77347-78855.
Genes that encode glycerol dehydrogenase are known. For example, gldA encodes glycerol dehydrogenase from E. coli and is characterized by the base sequence given in GeneBank U00006, base pairs 3174-4316. Alternatively, ded refers to another gene encoding glycerol dehydrogenase from Ci trobacter freundii and is charac terized by the base sequence given in GeneBank U09771, base pairs 2557-3654.
Host cells deduced for the recombinant production of glycerol by the expression of G3PDH and G3P phosphates can be either prokaryotic or eukaryotic and will be limited only by their ability to express enzymes ectives. Preferred host cells will be bacteria, yeasts, and fungal lesions typically useful for the production of glycerol such as Ci trobacter, Enterobacter, Clostridium, Klebsiella, Aerobacter, Lactobacillus, apergillus, Saccaharomyces, Schizosaccharomyces, Zygosaccharomyces, Pichia,
Kluyveromyces, Candida, hansenula, Debaryomyces, Mucor, Torulopsis, Methylobacter, Escherichia, Salmonella, Bacillus, Streptomyces and Pseudomonas. Preferred in the present invention are E. Coli and Saccharomyces.
Where glycerol is a key intermediate in the production of 1,3-propanediol, the host cell will either have an endogenous gene that codes for a protein that has a dehydrating activity or will produce a tel transforming gene. Host cells particularly followed by the production of 1,3-propanediol are Ci trobacter, Enterobacter, Clostridium, Klebsiella, Aerobacter, Lactobacillus, and Salmonella, which have endogenous genes that encode enzymes dehydrate. In addition, cell hosts lacking an endogenous tel gene include E. coli.
Vectors and Cartridges of Expression
The present invention provides a variety of suitable expression and transformation vectors and transducers for the cloning, transformation and expression of G3PDH and G3P phosphatase in a suitable host cell. Appropriate vectors will be those that are compatible with the bacteria used. Suitable vectors can be derived, for example, from a bacterium, a virus (such as bacteriophage T7 or an M13 derived from phage), a cosmid, a yeast, or a plant. Protocols for obtaining and using such vectors are known to those skilled in the art (Sambrook et al., Molecular Cloning: A Leboretory Menuel- Volumes 1, 2, 3 (Cold Spring Herbor Laboratory: Cold Spring Harbor, NY, 1989)).
Typically, the vector or cartridge contains sequences that direct the transcription or translation of the appropriate gene, a selectable marker, and sequences that autonomously allow replication or chromosomal integration. Suitable vectors comprise a 5 'region of the gene harboring transcriptional initiation controls and a 3' region of the DNA fragment that controls transcriptional termination. It is mostly preferred when both control regions are derived from homologous genes of the transformed host cell. Such control regions do not need to be derived from the net genes of the specific species selected as a host for production.
The initiation control regions, or promoters, which are useful to drive the expression of the G3PDH and G3P phosphatase genes in the desired host cell are numerous and familiar to the experts in the meter.
Virtually any promoter capable of driving these genes is suitable for the present invention including but not limited to CYC1, HIS3, GAL1, GALIO, ADH1, PGK, PH05, GAPDH, ADC1, TRP1, URA3, LEU2, ENO, and TPI (useful for expression in Saccharomyces); AOXl (useful for expression in Pichia); and lac, trp,? P,? PR, T7, tac, and trc, (useful for expression in E. coli).
Control regions of the termination can also be derived from true native genes of the preferred hosts. Optionally, a termination site may not be necessary; however, it is mostly preferred if it is included.
For effective expression of the instantaneous enzymes, the DNA encoding the enzymes in linkage operebly and between initiation codons and selected expression control regions such that the expression results in the transformation of the appropriate messenger RNA.
Transformation of Suitable Guests and Expression of G3P Phosphatase for the Production of Glycerol
Once the right cartridges are built, they are used to keep the guests in their cells. Introduction of the cartridge containing the genes encoding G3PDH and / or G3P phosphatase into the host cell can be achieved by known methods such as transfection, for example, using cells permeabilized with calcium, electroporation, or by transfection using a phage virus recombinant (Sambrook et al., supra).
In the present invention, cartridges AH21 and DAR1 were used to transform the E. Coli DH5 and FM5 as fully described in the GENERAL METHODS AND EXAMPLES.
Alternately, it was contemplated that host cells comprising genes of endogenous G3PDH and / or G3P phosphatase can be manipulated so that the relevant genes are upregulated for the production of glycerol.
Methods for overregulation of endogenous genes are well known in the art. For example, in order to overregulate the desired genes, a structural gene is generally placed at the end of a promoter region on the DNA that is recognized by the recipient microorganism. In addition to the promoter one may include other regulatory sequences that increase or control expression from heterologous genes. In addition, one can alter the regulatory sequences of endogenous genes by any genetic manipulation for the same purpose. The expression can be controlled by an inducer or a repressor so that the microorganism coordinately expresses the genes necessary to complete the desired metabolic procedure.
At the time of the invention, host cells containing endogenous genes encoding the activities of G3PDH and / or G3P phosphatase could be placed under the control of regulated promoters (e.g., lac or osmy) or constitutive promoters. For example, a certucho can be constructed to contain an inducible or constitutive promoter, flenked by DNA of sufficient extension and homologous with the net genes to allow the objective. The introduction of adequate growth conditions will result in homologous recombinations between the cartridge and the target portion of the gene and the replacement of the relevant native promoter with the regulatable promoter. Such methods can be used to effect the up-regulation of endogenous genes coding for G3PDH and / or G3P phosphatase activities for the production of glycerol.
Specific and random Mutagenesis Sites to Disrupt Enzymatic Activities
The enzymatic procedure by the cell and the organisms metabolize glycerol are known in the art. Figure 1. Glycerol is converted to glycerol-3-phosphate (G3P) by an ATP-dependent glycerol kinase; The G3P can then be oxidized to DHAP by G3PDH. In a second procedure, the glycerol is oxidized and dihydroxycetone (DHA) by a glycerol hydrogenase; DHA can then be converted to DHAP by a DHA cinase ATP dependent. In a third procedure, glycerol is oxidized and glycereldehyde by a glycerol dehydrogenase; Glyceraldehyde can be phosphorylated to glyceraldehyde-3-phosphite through an ATP-dependent gland. DHAP and glyceraldehyde-3-phosphate, interconverted by the action of triosephosphate isomerase, can be further metabolized by the central metabolism procedure. These procedures, by the introduction of by-products, are deterioration for the production of glycerol.
One aspect of the present invention is the ability to provide an organism production for the production of glycerol wherein the glycerol kinase and glycerol dehydrogenase activities that convert glycerol have been eliminied. Methods of creating mutants by elimination are common and well known in the art. For example, wild type cells can be exposed to a variety of agents such as radiation or chemical agents and then screened for the desired phenotype. When the creation of mutagens through radiations either ultraviolet (UV) or ionizing radiation can be used. Suitable short UV wavelengths for genetic mutations will fall in the range of 200 nm to 300 nm where 254 is preferred. UV radiation at this wavelength mainly causes changes in the nucleic acid sequence from guanidine and cytosine to adenine and thymidine. Since all cells have DNA repair mechanisms that would repair most of the UV-induced mutations, agents such as caffeine and other inhibitors can be added to interrupt the repairing process and maximize the number of effective mutations. Mutations at long UV wavelengths using rays in the range of 300 n up to 400 nm are also possible but are not generally as effective as short UV wavelengths unless the rays are used in conjunction with several ectivadoree teles as tintures psorelem that interectúen with DNA.
Mutagenesis with chemical agents is also effective in generating mutant and commonly used substances that include chemicals that effect neuroreplication of DNA such as HNO_ and NH ^ OH, as well as agents that affect DNA replication such as acridine tinctures, remarkable for cause mutations of mobile structure. Specific methods to create mutants using radiation or chemical agents are well documented in the materie. See for example Thomas D. Brook in Biotechology: A Textbook Industrial Microbiology, second edition (1989) Sineuer Associetes, Inc., Sunderland, MA., Or Deshpande, Mukund V., api .. Biochem. Biotechnol., 36, 227, 1992), incorporated herein by reference.
After the mutagenesis took place, mutants having the desired phenotype can be selected by a variety of methods. It is the most common screener where the mutagenized cells are selected for the ability to produce the desired product or intermediary. Alternatively, selective isolation of mutants can be effected by growth of a mutagenized population on selective medium where only resistant colonies can develop. Methods of mutate selections are highly developed and well known in the field of industrial microbiology. See Brock, Supra, De Mancilha et al., Food Chem., 14, 313, (1984).
Biological mutagenic agents which target genes randomly are well known in the art. See for example De Bruijn and Rossbach in Methods for General and Molecular Bacteriology (1994) American Society for Microbiology, Washington, DC, alternatively, provided that the gene sequence is known, interruption in chromosomal genes with specific elimination or replacement is achieved by homologous recombination with an appropriate plasmid. See, for example, Hemilton et al. (1989) J. Bacteriol. 171-4622, Balbes y colaboredores (1993) Gene 136: 211-213, Gueldener et coleboradores, (1996) Nucleic Acid. Res. 24: 2519-2524, and Smith and coleboredores (1996) Methods Mol. Cell. Biol .. 5: 270-277.
It is contemplated that any of the above-mentioned methods can be used for the elimination or inactivation of glycerol kinase and glycerol dehydrogenase activities in the production of preferred orgenisms.
Medium and Carbon Substrates
The fermentation media of the present invention should contain edecuedos carbon substrates. Suitable substrates may include but are limited to monosaccharides such as glucose and fructose, oligosaccharides such as lectose or secerose, polysaccharides such as starch or cellulose or mixtures thereof and unlabeled mixtures of renewable food stocks such as drained whey from cheese, corn soaked liquor, beet sugar molasses, and barley malt. Additionally, the carbon substrate may also be of one-carbon substrates such as carbon dioxide, or methanol for the conversion of the methobolic conversion in chemical intermediates has been demonstrated.
The production of glycerol from single carbon sources (eg methanol, formaldehyde or formate) has been reported in methyltrophobic yeasts (Yamada and collaborators (1989), Agrie, Biol .. Chem., 53 (2): 541-543) and in Becteries (Unter et al., (1985), Biochemistry, 24: 4148-4155). These organisms can assimilate compounds of a single carbon, which vary in the oxidation state from methane to formate, and produce glycerol. The carbon assimilation process can be through ribulose monophosphate, through serine, or through monophosphate of xylulose (Gontschalk, Bacterial Metabolism, Second Edition, Springer-Verlag, New York (1986)). The ribulose monophosphate process involves the condensation of formate with ribulose-5-phosphate to form an ezurium of 6 cerbons which is transformed into fructose and eventually into a three-carbon product, glyceraldehyde-3-phosphate.
In the same way, the serine procedure mimics compounds of a carbon in the glycolytic process via methylenetetrahydrofolate.
In addition to one and two carbon substrates, methyltrophic orgenisms are well known to utilize a number of other carbon-containing compounds such as methylamine, glucosamine and a variety of amino acids for meteorological activity. For example, methyltrophic yeasts are known to utilize the cerium from methylamine to trehalose or glycerol (Bellion et al. (1993), Microb. Growth Cl Compd., [Int.Symp.], 7th, 415.32.) Editors: Murrell, J. Collin, Kelly, Don P. Publishers: Intercept, Andover, UK). Similarly, several species of Candida etabolizer elenin or oleic acid (Sulter et al., (1990), Arch. Microbiol, 153 (5): 485-9). Accordingly, the carbon source used in the present invention may encompass a variety of sub-strains containing cerbono and will only be limited by the selection of the microorganism.
Although all cerbono substrates are mentioned and mixed -of which they are suitable in the present invention, preferred carbon substrates are monosaccharides, oligosaccharides, polysaccharides, single-celled sub-strands or mixtures thereof. More preferred are sugars such as glucose, fructose, sucrose, maltose, lactose and single carbon substrates such as methanol and carbon dioxide. Although all above-mentioned carbon substrates and mixtures thereof are suitable for the present invention, preferred cerium substrates are monoseccharides, oligssecharides, polysaccharides, single carbon substrates or mixtures thereof. More preferred are sugars such as glucose, fructose, saccharose, meltose, lactose and single carbon substrates such as methanol and carbon dioxide. The most preferred of the carbon substrates is glucose.
In addition to an appropriate carbon source, the fermentation medium must contain minerals, salts, cofactors, regulators and other edecuted components, known to those skilled in the art, suitable for the development of crops and promotion of the enzymatic process necessary for glycerol production.
The cells are typically developed at 30 ° C in appropriate medium. Preferred detergent media are commercially available prepared media such as Luria Bertani broth (LB), Dextrose Saboured broth (SD), or medium-halide cell (YM). Other defined or synthetic means of development can also be used and the appropriate medium for the development of the pertinent micro-genomics will be known to experts in microbiology or fermentation sciences. The use of known agents to modulate the repression of cetebolite directly or indirectly, for example adenosine 3'-5'-cyclic monophosphate, can also be incorporated into the reaction medium. Similarly, the use of known agents to modulate enzymatic activities (eg, sulfites, bisulfites, and alkalis) that lead to improved glycerol production can be used in conjunction with or as an alterneive for genetic manipulations.
Suitable pH ranges for fermentation are between pH 5.0 to pH 9.0 where the pH range of 6.0 to 8.0 is preferred for initial conditions.
The reactions can be carried out under aerobic or anaerobic conditions, where aneerobic or microaerobic conditions are preferred.
Identification of the activities of G3PDH, glycerol dehydrogenase, G3P phosfetase, and glycerol
The expression levels of the G3PDH, G3P phosphatase, glycerol dehydrogenase, and glycerol kinase proteins are measured by enzymatic assays. Generally, assays for the activity of G3PDH and the glycerol dehydrogenase activity depend on the spectral properties of the co-substrate, NADH, in the conversion of DHAP to G-3-P and the conversion of DHA and glycerol, respectively. NADH tuwo UV absorption intrinsic / vis and its consumption can be monitored spectrophotometrically and 340 mm. The ectivided G3P phosphinate can be measured by any method of measuring the inorganic phosphate released in the reaction. The most commonly used detection method uses the visible spectroscopic determination of the blue colored ammonium phosphomolybdate complex. The glycerol kinase activity can be measured by the detection of G3P from glycerol and ATP, for example by NMR. The teachings can be targeted with more specific characteristics of individual enzymes if necessary, for example, by the use of alternate cofactors.
Identification and recovery of glycerol and other products (for example 1, 3-propanediol)
Glycerol and other products (for example 1,3-propanediol) can be identified and quantified by analysis by liquid chromatogram of this execution
(HPLC) and gas chromatography / mass spectroscopy
(GC // MS) on the extremities of the free cells. The
HPLC is a preferred method in which the fermentation media are analyzed on an analytical column of exchange using mobile phase of sulfuric acid 0.01 N in the isocratic style.
The methods for the recovery of glycerol from the fermentation media are known in the art. For example, glycerol can be obtained from the cell medium by subjecting the reaction mixture to the following sequence of steps: filtration; water removal; extraction of organic solvent; Frection distillation (US Pat. No. 2,986,495).
Description of the Preferred Modalities
Glycerol production
The present invention describes a method for the production of glycerol from a carbon source edecueda using a recombinant organism. Particularly well-known in the invention is a guest cell becteriene, trensformede with a certucho of expression that carries one or another or embos a gene that encodes a protein that has an activity glycerol-3-phosphide dehydrogenase and a gene that encodes and binds protein which has glycerol-3-phosphatase activity. In addition to the G3PDH and G3P phosphates gens, the host cell will contain interruptions in one or the other of the genes encoding the endogenous glycerol kinase and glycerol dehydrogenase enzymes. The combined effect of the foreign G3PDH and G3P phosphatase genes (which provide a carbon-to-glycerol source procedure) with gene disruptions (which block the conversion of glycerol) results in an orgenism that is efficient and efficient. trustworthy glycerol.
Although the optimal origin for the production of glycerol contains the interruption of the mentioned gene, the production of glycerol with a host cell that contains one or the other or both of the G3PDH and G3P phosphatase genes in the absence of such interruptions. For example, the recombinant E. coli strain AA200 carrying the DAR1 gene (Example 1) was able to produce between 0.38 g / L and 0.48 g / L glycerol depending on the fermentation parameters. Similarly, E. Coli DH5a, which carries and expresses the GPP2 gene (Example 2), was capable of a production of 0.2 g / L. Where both genes are present in conjunction with an elimination of endogenous glycerol cinase activity, a reduction in the conversion of glycerol can be seen (Example 8). In addition, witness the ectivided glycerol dehydrogenase is linked to the conversion of glycerol under limited glucose conditions; thus, it is anticipated that the removal of the glycerol dehydrogenase activity will result in the reduction of the glycerol conversion (Example 8).
Production of 1, 3-propanediol
The present invention can also be suitable for the production of 1,3-propanediol by the use of recombinant organisms expressing the G3PDH and / or G3P phosphatese genes that are extirpable and which contain interruptions in the activities of endogenous glycerol kinase and / or glycerol dehydrogenase.
Additionally, the invention is provided for the process for the production of 1,3-propanediol from a recombinant organism wherein multiple copies of endogenous genes are introduced. In addition to these genetic alterations, the production of cells will require the presence of a gene that encodes an active dehydrotic enzyme. The enzyme of the enzyme dehydrate can be either a glycerol dehydrate or a diol dehydratase. The activity of the enzyme dehydretase can result in either the expression of an endogenous gene or the expression of a transfected foreign gene in the host organism. The isolation and expression of genes encoding dehydrated enzymes edecuedes are well known in the art, and are disclosed by applicants in PCT / US 96/06705, filed November 5, 1996 and US 5686276 and US 5633362, so they are incorporated herein by reference. It will be appreciated that, since glycerol is an intermediate in the production of 1,3-propanediol, where the host cell contains a dehydrated ectivided in conjunction with the G3PDH and / or G3P phosphoretic genes and in the absence of activities. glycerol kinase and glycerol dehydrogenase that convert glycerol, the cell will be followed by the production of 1,3-propendiol.
The present invention is further defined in the following Examples, which indicate preferred embodiments of the invention, which are given by way of illustration only. From the foregoing discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and elcence thereof, can make various changes and modifications of the invention to adapt it to various uses and conditions.
EXAMPLES
GENERAL METHODS
Methods for phosphorylations, ligations, and transforms are well known in the art. Techniques suitable for use in the following examples can be found in Sambrook et al., Molecular Cloning: a Laboratory Manual, second edition, Cold Spring Harbor Laboratory Press (1989).
Suitable materials and methods for the maintenance and development of bacterial cultures are well known in the materie. Suitable techniques for use in the following examples can be found in Manuel of Methods for Generel Becteriology (Phillipp Gerherdt, R. G. Murrie, Relph N. Cosstilow, Eugene W. Nester,
Willis A. Wood, Moel R. Krieg and G. Briggs Phillips, eds),
Americen Society for Microbiology, Washington, DC. (1994) or in Biotechnology: A Textbook of Industrial Microbiology
(Thomas D. Brock, Second Edition (1989) Sinauer Associares, Inc., Sunderland, MA). All reagents and materials for the development and maintenance of bacterial cells were obtained from Aldrich Chemicals (Milwaukee, Wl), DIFCO Laboretories (Detroit, MI), GIBCO / BRL (Gaithersburg, MD), or Sigma Chemical Company (St. Louis , MO) unless otherwise specified.
The meaning of the abbreviations are as follows: "h" means time (s), "min" means minute (s), "sec" means second (s), "d" means day (s), "L" means milliliters, "L" means liters.
"Cellular strains"
The following strains of E. coli were used for transformation and expression of G3PDH and G3P phosphate. Strains were obtained from the E. Coli Genetic Stock Center, ATCC. Ó Life Technology (Gaithersburg, MD).
AA200 (garN10fhuA22 ompF627 fadL701 relay pit-10 spoTl tpi-1 phoM510 mcrBl) (Anderson and coleboradores, (1970), J. Gen. Microbiol, 62: 329).
BB20 (tonA22 AphoA8 fadL701 reJAI g! PR2 gJpD3 pi t-10 gpsA20 spoTl T2R) (Croman et al., J. bact., 118: 598).
DH5a (deoR endAl gyrA96 hsdR17 recAl relAl supE4 thi-1? DacZYA- argFV169) phl801acZ? M115 F-) (Woodcock et al., (1989), Nuci Acids Res., 17: 3469).
FM5 Escherichia coli (ATCC 53911)
Identification of Glycerol
The glucose conversion was monitored by HPLC and / or GC. Analyzes were carried out using standard and materiel techniques available to an expert in the chromatography material. An edecuedo method used a system Weters Maximum 820 HPLC using UV (210 nm) and IR detection. The samples were injected onto a Shodex SH-1011P precolumn (6 mm x 50 mm), temperature controlled at 50 ° C, using H SO ^ 0.01 N as the mobile phase at a flow of 0.69 ml / min. When the quantitative analysis was desired, the samples were prepared with a known amount of trimethylacetic acid as an external standard. Typically, the retention times of 1,3-propanediol (IR detection), glycerol (IR detection) and glucose (IR detection) were 21.39 min, 17.03 min and 12.66 min, respectively.
The gliocerol was also added by GC / MS. The detections by gas chromatography with masse spectrometry, seperation and glycerol cuentification were performed using DB-WAX column (30 m, 0.32 mm ID, 0.25 μm film thickness, J &W Scientific, Folsom, CA) the following conditions: Injector: portion, 1: 15; Sample volume: 1 μL; tempereture profile: 150 ° C initial temperature with 30 seconds of waiting, 40 ° C / min at 180 ° C, 20 ° C up to 240 ° C, wait for 2.5 min. Detection: Mass Spectrometry (Hewlett Packard 5971, San Fernando, CA), quantitative SIM using 61 m / z ions and 64 m / z target ions for glycerol and glycerol-d8, and 43 m / z ion qualifying ion for glycerol. Glycerol-d8 was used as an internal standard.
Assay for glycerol-3-phosphatase, G3P phosphatase
The assay for enzyme activation was carried out by incubation of the extract with a subset of organic phosphatide in a bis-Tris or MES and megnesium regulator, pH 6.5. the sub-residue used was either 1-a-glycerol phosphate, or d, 1-a-glycerol phosphate. The final concentrations of the reagents in the assay were: regulator (20 mM bis-Tris or 50 M MES); MgCl2 (10 mM); and substrate (20 mM). If the total protein in the sample was low and no visible precipitation occurs with an acid extinguisher, the sample was conveniently assayed in the cell. This method involves incubation of an enzyme sample in a cell that contained 20 mM of regulatory substrate (50 μL, 200 mM) of MES, 10 M of MgCl_, pH 6.5. The assay volume of the phosphatese finel was 0.5 mL. show you that the enzyme contained was added to the reaction mixture; the contents of the cell were mixed and then the cell was placed in a normal water bath at T = 37 ° C for 5 e 120 min, the time period elapsed depending on whether the phosphatase ectivided in the enzyme sample varied from 2 to 0.2 U / mL. The enzymatic reaction was quenched by the addition of acidic molybdate reagent (0.4 mL). After the Fiske SubbaRow reagent (0.1 mL) and dilute distillate (1.5 mL) were added, the solution was mixed and maintained for further development. After 10 min allow color development, the ebsorbance of the samples was read at 660 nm using a Cery 219 UV / Vis spectrophotometer. The centred of inorganic phosphate liberated was compared with a standard curve that was prepared by using a stock solution of inorganic phosphate (0.65 M) and preparation of 6 standards with fine concentrations of inorganic phosphite in the range of 0.026 hete 0.130 μmoles / mL.
Enseyo Spectrophotometric for the Activity of Glycerol 3-Phosphate Dehydrogenase (G3PDH)
The following procedure was used as the modified one subsequently published by Bell and coleboredores (1975), J. Biol .. Chem., 250: 7153- 8. This method involved incubating an enzyme sample in a cell containing 0.2 mM NADH; 2.0 mM dihydroxyacetone phosphate (DHAP), and enzyme in 0.1 M Tris / HCl, buffer pH 7.5 with 5 mM DTT, in a total volume of 1.0 ml at 30 ° C. The spectrophotometer was placed in monitor of absorbance changes at the fixed wavelength of 340 nm. The instrument was adjusted with a target in a cell containing regulator only. After the enzyme was added to the cell, an absorbency reading was made. The first substrate, NADH (50 μL 4 mM NADH, "the absorbance will increase approximately 1.25 AU), was added to determine the background ratio, the ratio would continue for at least 3 minutes, the second substrate, DHAP (50 μL 40 mM). of DHAP), was then determined and the ebsorbency monosite against time was monitored for at least 3 minutes to determine the important proportion.The activity of G3PDH was defined by subtracting the background proportion from the important proportion.
Assay of Activity 13, C-NMR for Glycerol Kinase
An appropriate amount of enzyme, typically a cell-free crude extract, was added to a reaction mixture containing 40 mM ATP, 20 mM MgSO4, 21 mM glycerol uniformly labeled 1OC (99, Cambridge Isotope Laboretories), and Tris -HCl 0.1 M, pH 9 for 75 min at 25 ° C. the conversion of glycerol to glycerol 3-phosphate was detected by: ~ 'C-NMR (125 MHz); - glycerol (63.11 ppm, d, J = 41 Hz and 72.66 ppm, t, J = 41 Hz); glycerol 3-phosphate (62.93 ppm, d, J = 41 Hz, 65.31 ppm, br d, J = 43 Hz, and 72.66 ppm, dt, J = 6.41 Hz).
Glycerol Dehydrogenase Assay Linked to NADH
The activity of glycerol dehydrogenase linked to N7? DH in cepes of. E. Coli (gldA) was determined after separation of the protein by electrophoresis on undenatured polyacrilemide gel. The conversion of glycerol plus NAD + e dihydroxyacetone plus NADH was echopleted with the conversion of 3- [4,5-dimethylthiazol-2-yl] -2,5-diphenyl tetrazolium bromide (PMS) as a mediator. (Tang et al (1997) J. Bacteriol.140: 182).
Electrophoresis was performed in duplicate by standard procedures using native gels (8-16 i TG, 1.5 mm, 15 fringes of Novex gel, San diego, CA). The residual glycerol was removed from the gels by washing 3x with 50 mM Tris or potassium carbonate regulator, pH 9 for 10 minutes. The duplicate of gels was developed, with and without glycerol (eXexclusively finel concentration of 0.16 M), in 15 ml of test solution containing 50 mM Tris or potassium carbonate, pH 9, 60 mg of ammonium sulfate, 75 mg of NAD +, 1.5 mg of MTT, and 0.5 mg of PMS. The presence or absence of glycerol dehydrogenase activity in the form of NADH in strains of E. coli (gldA) was also determined, following electrophoresis on polyacrylamide gel, by reaction with polyclonal antibodies produced for K. pneumoniae glycerol dehydrogenase (ded).
CONSTRUCTION OF PLASMIDS AND CONSTRUCTION OF STRAINS Cloning and Expression of Glycerol 3- Phosphatase for Increase in Glycerol Production in E. Coli DH5a and FM5
The clone 6592 of chromosome V lembda of Saccharomyces cerevisiae (GeneBank, accession No. U18813 x 11) was obtained from ATCC. The glycerol 3-phosphite phospholipid (GPP2) gene was cloned by cloning from the lambde clone as the target DNA using synthetic primers (SEQ ID NO: 16 with SEQ ID NO: 17) incorporating a BamHI-RBS-XbaI site at the 5 end. 'and a Smal site at the 3' end. The product was subcloned in pCR-Script (Stratagene, medison, Wl) on the SrfT site to generate plasmids pAH15 containing GPP2. Plasmid pAH15 contains the GPP2 gene in the inective direction for expression from the promoter reads in pCR-Script SK +. The BamHI-Smal fragment from pAH15 containing the GPP2 gene was inserted into pBlueScriptII SK + to generate the plasmid pAH19. PAH19 contains the GPP2 gene in the run expression orientation from the lac promoter. The Xbal-Pstl fragment of pAH19 containing the GPP2 gene was inserted into pPHOX2 in the created pAH21 plasmid. The pAH21 / DH5a is the plasmid expression.
Plasmids for Overexpression of PARÍ in E. coli
DAR1 was isolated by PCR cloning of S. cerevisiae genomic DNA using synthetic primers
(SEQ ID NO: 18 with SEQ ID NO: 19) Successful cloning by PCR places a Ncol site at the 5 'end and from DAR1 where the ATG in Ncol is the methionineinitiator of DARl. at the 3 'end of DAR1 a BamHl site is introduced following the translational terminator. The PCR fragments were designated with Ncol + BamHl and cloned in the same sites in the expression plasmid pTrc99A
(Phermecie, Piscawey, NJ) to give pDARl.
In order to create a better ribosome binding site at the 5 'end of DAR1, a Spel-RBS-NcoI stele obtained by chilling followed by cooling of synthetic primers (SEQ ID NO: 20 with SEQ ID NO: 21) was inserted into the Ncol site of pDARlA to create pAH40. Plasmid pAH40 contains the new RBS and the DAR1 gene in the correct orientation for expression from the trc promoter of pTrc99A (Pharmecia, Piscatewey, NJ). The NcoI-BamHI fragment of pDARlA and a second Spel-RBS-Ncol slender group obtained by heating and cooling the synthetic primers (SEQ ID NO: 22 with SEQ ID NO: 23) was inserted into the SpelI-BamHI site of pBC- SK + (Stretegene, medison, Wl) to create the pAH42 plasmid. Plasmid pAH42 contains a chloramphenicol resistant gene.
Construction of Expression Cartridges for PARÍ and
GPP2
Expression cartridges for DAR1 and GPP2 were conjugated from the individual subclones DAR1 and GPP2 described above using standard methods of molecular biology. The BamHI-PstI fragment of pAH19 containing the ribosome linker site (RBS) and the GPP2 gene was inserted into pAH40 to create pAH43. The BamHI-Pstl fragment of pAH19 containing the RBS and the GPP2 was inserted into pAH42 to believe pAH45.
The ribosome site appearing at the 5 'end of GPP2 was modified as follows. A BamHI-RBS-Spel link, obtained by heating and then cooling of synthetic primers (GATCCAGGAAACAG (SEQ ID NO: 24) with CTAGTCTGTTTCCTG (SEQ ID NO: 25) in the X al-Pstl fragment from pAH19 containing the GPP2 gene, was inserted into the BamHI-PstI site of pAH40 to create pAH448. Plasmid pAH48 c contains the DAR1 gene, the modified RBS, and the GPP2 gene in the correct orientation for expression from the trc promoter of pTrc99A (Pharmacie, Piscetaway, NJ).
Transformation of E. Coli
All plasmids described herein were transformed into E. Coli DH5a or FM5 using standard molecular biology techniques. The trensformentes were verified by their petrón DNA RFLP.
EXAMPLE 1
PRODUCTION OF GLYCEROL DESPE E. COLI TRANSFORMATES WITH THE G3PPH GENE
Medium
Synthetic medium was used for anaerobic or aerobic glycerol production using E. Cold cells transformed with pDARlA. The medium contained per liter 6.0 g of Na ^ HPO ^, 3.0 g of KH: P0, 1.0 g of NH4C1, 0.5 g of NaCl, 1 mL of Mg.SO; .7H_0 to 20?, 8.0 g of glucose, 40 mg of casemino acids, 0.5 ml of thiamine hydrochloride at 1; , 100 mg of empiciline.
Development Conditions
Cepe AA200 protected in pDARl or the vector pTrc99A was developed under aerobic conditions in 50 ml of medium shaking at 250 rpm in a 250 ml flask at 37 ° C. A 6JJ 0.2-0.3 isoprohio-β-galactoside was added to a finel concentration of 1 mM and continuous incubation for 48 hours. Stop samples of aneerbic development of induced cells were used to fill Falcon tubes No. 2054 that were capped and mixed gently by rotation at 37 ° C for 48 hours. The glycerol production was determined by HPLC analysis of the supernatant cultures. The cepes pDARlA / AA200 yielded 0.38 g / L of glycerol after 48 hours, under anaerobic conditions, and 0.48 g / L of eerobic conditions.
EXAMPLE 2
PRODUCTION OF GLYCEROL PESPE E. COLI TRANSFORMAPO CO GENE G3PFOSFATASA
Medium Synthetic medium phoA was used in agitation units to demonstrate the increase of glycerol by the expression of GPP2 in E. coli. The medium phoA, contained per liter: amisoy, 12 g; amino sulfate, 0.62 g; MOPS, 10.5 g; sodium citrate, 1.2 g; NaOH (IT), 10 mL; MgSO 1 M, 12 mL; trace elements 100X, 12 mL; 50% glucose, 10 L; 1% thiamine, 10 mL; 100 mg / mL of L-proline, 10 mL; 2.5 mM FeCl_, 5 L; Regulator with phosphate mixture, 2 ml (5 ml of 0.2M NaH2P04 + 9 ml of 0.2 M K_HP0), and pH 7.0. The trace elements 100X for the medium phoA / L contained: ZnS04. H "0., 0.58 g; MnS0 .HO, 0.34 g; CuS0 .5H_0, 0.49 g; CoC1..6H20, 0.47 g; H3B03, 0.12 g, NaMo04.2H.O, 0.48 g.
Agitation of Experiment flasks
Strains p? H21 / DH5a (containing the GPP2 gene) and pPH0X2 / DH5a (control were developed in 45 ml of medium (phoA medium, 50 μg / mL of carbeciline, and 1 μg / mL of vitamin B12) in a matre of agitation of 250 mL at 37 ° C. The cultures were developed under erogenous conditions (shaking at 250 rpm) for 24 hours.The glycerol production was determined by HPLC analysis of the supernatant culture PAH21 / DH5a produced 0.2 g / L of glycerol after 24 hours.
EXAMPLE 3
PROPUCCION PE GLICEROL PESPE P-GLUCOSE USING RECOMBINANT E. COLI CONTAINING BOTH GPP2 AND PARI.
The development of the increased glycerol production by E. Coli DH5a containing pAH43 proceeds aerobically at 37 ° C in shake flask cultures (erlen meyer flasks, liquid volume / 5 totel volume).
Crops in minimal medium / 1% glucose shake flasks are initiated by inoculation of overnight LB / culture in glucose at 11 with antibiotic selection. The minimum medium is: defined medium in sterilized filter, final pH 6.8 (HCl), medium content Minimum: 12.6 g of (NH) .S0, 13.7 g of KHP04, 0.2 g of levedura extract (Difco), 1 g of NaHC03, 5 mg of vitemine B12, 5 L of Solution of elements in Trezes de Balch Modified (the composition of the stem can be found in Methods for General and Molecular Bacteriology (P. Gerhardt et al., Eds. P. 158, American Society for Microbiology, Washington, DC (1994).) Methods of egitization are incubated at 37 ° C with vigorous all-night aeration, after which they are sampled for GC analysis of the supernatant.The glycerol production shown by pAH43 / DH5a 3.8 g / L after 24 hours.
EXAMPLE 4
PRODUCTION OF GLYCEROL PESPE P-GLUCOSE USANPO E. COLI RECOMBINANT THAT CONTAINS BOTH GPP2 AND PARI
Example 4 illustrates the production of glucose from recombinant E. coli DH5a / pAH48 which contains both GPP2 and DAR1 genes.
Strain DH5a / pAH48 was constructed as described above in the GENERAL METHODS.
Pre-cultivation
DH5a / pAH48 were precultured by seeding in a fermentation run. The components and protocol for pre-cultivation are enlisted and continued.
Pre-culture medium KH2P04 30.0 g / L Citric acid 2.0 g / L MgSO0 .7H O 2.0 g / L H S04 98 2.0 mL / L
Ammonium ferric citrate 0.3 g / L
CaCl- .2H O 0.2 g / L
Extract of Levadure 5.0 g / L
Trazes of metals 5.0 mL / L
Glucose 10.0 g / L
Carbecillin 100.0 mg / L.
The components of the previous medium were mixed together and the pH was adjusted to 6.8 with NH4OH. The medium was then sterilized by filtrate.
Trace metals were used according to the following recipe: Citric acid, monohydrate 4.0 g / L MgSO4-7H ^ 0 3.0 g / L MnS0 .H_0 0.5 g / L NaCl 1.0 g / L FeS0 .7H-0 0.1 g / L CoCl .6H O 0.1 g / L CeCl 0.1 g / L ZnS0 .7H O 0.1 g / L A1K (S0) ^. 12H.0 10.0 mg / L H-: BOJ 10 mg / L Ne2Mo04.2H20 10 mg / L
NiS04.6H20 10 mg / L
Na "Se03 10 mg / L
Na-W04.2H-0 10 mg / L
The cultures were initiated from inoculated culture sowings from 50 μL of frozen reserve
(glycerol at 15 μl as a freezing protector) in 600 mL of medium in a 2 L Erlen Meyer matre. The cultures were developed at 30 ° C in an incubator and 250 rpm for approximately 12 hours and then used to sow the fermenter.
Development of Fermentation
Container
Agitated tank of 15 L
Medium
KH P0 6.8 g / L Citric acid 2.0 g / L MgSO4.7H-O 2.0 g / L H S0 98 2.0 mL / L Ammonium ferric citrate 0.3 g / L CaCl-.2H-0 0.2 g / L
Defoamer Mazu DF204 1.0 ml / L
The above components were sterilized together in the fermentation tank. The pH was raised to 6.7 with NH4OH. Extract of levadure (5 g / L and solution of trace minerals (5 ml / L were aseptically added from the reserve solutions sterilized by filtration, Glucose was added from the feeding of 60 r: to give a final concentration of 10 g / L. Carbeciline was added to 100 mg / L. The volume after inoculation was 6 L.
Environmental Conditions for Fermentation The temperature was controlled up to 36 ° C and the air flow was controlled up to 6 liters per minute. The return pressure was controlede e 1 ber. The indexer was set at 350 rpm. Aqueous ammonium was used for pH control at 6.7. The proportion glucose ali entede (60% glucose monohydrate) was controlled to maintain excess glucose.
Results
The results of the run fermentation are given in Table 1
Table 1
6 4.7 4.0 2.0 49 14 8 5.4 0 3.6 71 25 10 6.7 0.0 4.7 116 33 12 7.4 2.1 7.0 157 49 14.2 10.4 0.3 10.0 230 70 16.2 18.1 9.7 15.5 259 106 18.2 12.4 14.5 305 20.2 11.8 17.4 17.7 353 119 22.2 11.0 12.6 382 24.2 10.8 6.5 26.6 404 178 26.2 10.9 6.8 442 28.2 10.4 10.3 31.5 463 216 30.2 10.2 13.1 30.4 493 213 32.2 10.1 8.1 28.2 512 196 34.2 10.2 3.5 33.4 530 223 36.2 10.1 5.8 548 38.2 9.8 5.1 36.1 512 233
EXAMPLE 5
ENGINEERING PE MUTANTS PE GLYCEROL CINASA PE E. COLI
FM5 FOR THE PROPUCCION PE GLICEROL PESPE GLUCOSE
Construction of integration plasmids for glycerol kinase gene replacement in E. coli EM5 Genomic DNA from E. coli FM5 was prepared using the Puregene DNA isolation kit (Gentra Systems, Minneapolis, MN). A 1 kb DNA fragment containing the gens glpF percial and glycerol kinase (gJpK) was amplified by PCR (Mullis and Faloone, Methods of Enzymol., 155: 335-350, 1987) from FM5 genomic DNA using primers SEQ ID NO: 26 and SEQ ID NO: 27. A 1.1 kb DNA fragment containing the partial glpK and glp genes was amplified by PCR from FM5 genomic DNA using primers SEQ ID NO: 28 and SEQ ID NO: 29. One site Muñí was incorporated into the primer SEQ ID NO: 28. The primer at the 5 'end of SEQ ID NO: 28 was the reverse complement of the SEPARATOR SEQ ID NO: 27 to facilitate the subsequent overlap of the extension by PCR. The union of the gene by the technique of superposition of the extension (Horton and coleboredores, BioTecniques, 8: 528-535, 1990) was used to generate a 2.1 kb fragment by PCR using the two above fragments as primers and primers SEQ ID NO: 26 and SEQ ID NO: 29. these fragments represented a 0.8 deletion kb from the core region of the 1.5 kb glpK gene. Globelly, this fragment had 1.0 kb and 1.1 kb in lateral regions on either side of the Muñí cloning site (in the partial glpK) to allow the replacement of the chromosomal gene by homologous recombination.
The above 2.1 kb PCR fragment was blunt ended and cloned into the smooth vector PCR using the "Zero Blunt PCR Cloning Kit (Invitrogen, San diego, CA) to produce the 5.6 kb pRNIOO plasmid containing genes resistant to kanemicine and Zeocin.The 1.2 kb HincII fragment of pLoxCetl (unpublished results), which contains a chloramphenicol resistant gene flanked by becteriofego sites
- Pl loxP (Sneith T. collaborators, Gene, 166: 173-174,
(1995), was used to interrupt the glpK fragment in plasmid pRNIOO by binding to Muñí (and with a smooth end) - pRNI O plasmid digested to produce plasmid pRNIQI-l of 6.9 kb. A 376 kb fragment containing the R6K origin was amplified by PCR from vector pGP704 (Miller and Mekalanon, J. Bacteriol, 170: 2575-2583, 1988) using primers SEQ ID NO: 30 and SEQ ID NO: 31, and bound to the 5.3 kb Asp718-AatII fragment of pRN101-l to produce the 5.7 kb pRN102-l plasmid containing genes resistant to kanemicin and chlormephenicol. Read substitution of the ColEl region of origin in pRNlOl-l with the origin R6K to generate pRN102-1 also involved elimination of most of the genes resistant to Zeocin. The host for the replication of pRN102-l was E. coli SY327 (Miller and Mekalanon, J. Bacteriol., 170: 2575-2583, 1988) which contains the pir gene necessary for the function of the R6K origin.
Engineering Glycerol Mutant Kinase RJFllOm with Chloramphenicol Resistance with Gene Break
E. coli FM5 was electrotransformed with the non-replicable integration plasmid pRN102-l and transformants that were resistant to chloramphenicol (12.5 μg / mL) and sensitive to kanemicine (30 μg / mL) were further screened for non-glycerol use on M9 medium minimum containing 1 mM glycerol. An EcoRI digester of genomic DNA from a tel mutein, RJFlOm, when tested with the intact gipK gene and Southern analysis (Southern, J. Mol. Biol., 98: 503-507, 1975) indicated that it was a member double-crossing (replacement of the gipK gene) since the two expected bands of 7.9 kb and 2.0 kb were observed, which is characteristic of the presence of an additional EcoRI site in the gene with resistance to chloramphenicol. Wild type control produced the expected single band of 9.4 kb.
An analysis by 13C NMR of the mutant RJFlOm confirmed that it was incapez to convert glycerol 13C-market and ATP glycerol-3-phosphite. This mutein glpK was further amplified by genomic PCR utilizing co-binations of primers SEQ ID NO: 32, SEQ ID NO: 34 and S? C ID NO: 35, and SEQ ID NO: 32 and SEQ ID NO: 35 that produced the cleavage by PCR, 2.3 kb, 2.4 kb and 4.0 kb respectively. Wild type control produced the expected 3.5 kb band with the primers SEQ ID NO: 32 and SEQ ID NO: 35. The glpK mutant, RJFlOm, was electrotransformed with the pAH48 plasmid to allow the production of glycerol from glucose. The mutant of glpK E. coli RJFlOm has been deposited with ATCC under the terms of the Budapest Tretedo on November 20, 1997 ..
Engineering Glycerol Kinase Mutant RJFlOm with Chloramphenicol Resistance with Removal of Gene Removal
After developing all night over medium
YENB (yeast extract at 0.75 é, broth nutrient al
0. 8 í. ) at 37 ° C, e. Coli RJFlOm in an aqueous suspension was electrotransformed with the plasmid pJW168
(resultsss not published), which contained the bacteriophage gene Pl Cre recombinase under the control of the indiccble-IPTG lacUVS promoter, a temperature-sensitive pSClOl replicon, and a resistant ampicillin gene. In overdevelopment in SOC medium at 30 ° CC, transformants were selected at 30 ° C (permissible temperature for replicate of pJW168) on eger medium LB supplemented with carbecillin (50 μg / mL) and IPTG (lmM). Two series of overnight transfers of groups of colonies were performed at 30 ° C on freshly prepared LB medium supplemented with carbecillin and IPTG in order to allow cleavage of chromosomal gene resistance to chloremfenicol via recombination at ioxP sites. involved in Cre recombinase (Hoess and Abremski, J. Mol. Biol., 181: 351-362, 1985). The resulting colonies were replicated on medium ager LB supplemented with carbecillin and IPTG and LB agar supplemented with chloramphenicol (12.5 μg / mL) to identify colonies that were resistant to carbecillin and sensitive to chloramphenicol indicating the removal of the marker gene. An overnight culture at 30 ° C of such a colony was used to inoculate 10 mL of LB medium. In development at 30 ° C to OD (600 n) of 0.6, the culture was incubated - at 37 ° C overnight. Several dilutions were plated on preheated LB agar medium and the pieces incubated overnight at 42 ° C (the temperature does not allow for replication of pJW168). The resulting scolonies were plaque-replica on the eger medium LB and medium eger LB supplemented with carbecillin (75 μg / mL) to identify colonies that were sensitive to carbecillin indicating loss of plasmid pJW168. Such a mutant of glpK, RJF10, was further analyzed by genomic PCR using the primers SEQ ID NO: 32 and SEQ ID NO: 35 and yielded the expected band of 3.0 kb which confirms the cleavage of the mercer gene. The non-utilization of the glycerol by the RJF10 mutant was confirmed by lack of growth on minimal M9 medium containing 1 mM glycerol. The RJF10 mutant of glpK was electrotransformed with the pAH48 plasmid to allow the production of glycerol from glucose.
EXAMPLE 6
CONSTRUCTION OF PE PE COL. WITH THE AGOTAPO GLPA GENE
The gldA gene was isolated from E. coli by PCR (KB Mullis and FA Faloona (1987) Meth. Enzymol 155: 335-350) using the primers SEQ ID NO: 36 and SEV ID NO: 37, which incorporate the Sphl sites and Xbal, repectively, and cloned (T. Menietis 1982 Molecular Cloning, A Laboratory Manual, Cold Spring Harbor, Cold Spring Harbor, NY) between the Sphl and Xbal sites in pUCld, pare generer pKP8. pKP8 was cut in the only Sail and Ncol sites in the gldA gene, the ends smoothed with Klenow and religated resulted in an elimination of 109 bp in half of gldA and the regeneration of a single Sail site, to generate pKP9. A 1.4 kb DNA fragment containing the gene that confers resistance to kanemicine (ken), and includes 400 bps of DNA eproximidately at the start of the transletional initiation codon and approximately 100 bps of DNA towards the end of the stop codon traslecionel, was isolated from pET-28A (+) (Novagen, Madison, Wis.) by PCR using the primers SEQ ID NO: 38 and SEQ ID NO: 39, incorporating the Sail terminals, and subclone in the Sali unique site of KP9, pere generer pKP13. A DNA fragment of 2.1 KB that begins 204 bps towards the end of 1 codon of initiation traslecionel of gldA and that finishes 178 bps to the beginning of the stop codon translational of gldA, and that -contains the insert kan, was isolated from pKP13 by PCR using the primers SEQ ID NO: 40 and SEQ ID NO: 41, which incorporate the terminal sites Sphl and Xbal, respectively, were subcloned between the Sphl and Xbal sites, in pMAK705 (Genencor International, Palo Alto, Calif.), to generate pMP33. E. coli FM5 was transformed with pM33 and selected over 20 μg / mL of ken at 30 ° C, which is the permit tempereture for replication of pMAK705. One colony was expanded to overnight at 30 ° C in culture medium supplemented with 20 μg / mL ken. Approximately 32,000 cells were plated on 20 μg / mL ken and incubated for 16 hours and 44 ° C, which is the restricted temperature for the replication of pMAK705. Transformants that grew at 44 ° C had plasmids integrated into the chromosome, which occur at a frequency of approximately 0.0001. Analysis by PCR and Menchedo Southern (E.M. Southern 1975 J. Mol. Biol .. 98: 503-517) were used to determine the nature of the events of chromosomal integration in the transformants. Western blot analysis (H. Torbin et al. (1979) Proc. Nati. Acad. Sci. 76: 4350) was used to determine whether the glycerol dehydrogenase protein, the gldA product, is produced in the transformants. An activity assay was used to determine whether the glycerol dehydrogenase activity remained in the transformants. The activity in the glycerol dehydrogenase bendes on native gels was determined by coupling the conversion of glycerol + NAD (+) - dihydroxyacetone + NADH for the conversion of a tetureolium tinture, MTT [3- (4,5-dimethylthiol) bromide - 2- il) -2,5-diphenyltetrazoliu] for a deep colored formazan, with phenazine methosulafate as an intermediate. Glycerol dehydrogenase also requires the presence of 30 mM of ammonium sulfate and 100 mM of Tris, pH 9 (C.-T. Tang, and collaborators (1997) J. Becteriol, 140: 182). Out of 8 three non-standardized forms, 6 were determined to be agonal. E. coli MSP33.6 has been deposited with ATCC under the conditions of the Budepest Treaty on November 20, 1997.
EXAMPLE 7
CONSTRUCTION OF THE CEPA E. COLI WITH GLPK AND THE GONE SOLD OUT GLDA
Ur fragment of 1.6 kb DNA containing the gene gldA and including 2? bps of DNA towards the start of the rlaslacicpal start coder and 220 bps of DNA at the end of the translational stop codon was isolated from E. coll by PCR using the primers SEQ ID NO: 42 and SEQ ID NO: 43, which incorporate the Sphl and Xbal sites, repec iva ep e, and cloned between the sites Sphl and Xbal ae pUCló, to generate p N2, pQN2 was cut in the unique Sail and ccl sites in the gidA gene, the smooth end with Klenow and religated, resulting in a half-elimination of gldA and regeneration of a single SalI site, to generate pQN4 , A 1.2 kb DNA fragment containing the gene that confers resistance to kanemycin (kan), and flanked by l oxP sites was isolated from pLoxKan2 (Genencor International, Palo Alto, Calif.) As a Stul / XhoI fragment, and the ends smoothed with Klenow, and subcloned into the pQN4 site after isolation with Klenow, to generate pQN8, A 0.4 kb DNA fragment containing the replica R6K origin was isolated from pGP704 (Miller and Mekalenos, J. Baetericl, 170: 2575-2583, 1988) by PCR using the primers SEQ ID NO: 44 and SEQ ID NO: 45, which incorporate the Spñl and Xabl sites, respectively, and ligated to the Sphl / Xbal DNA fragment of 2.8 kb containing the gene that confers resistance to chloramphenicol (cam), and flanks by sites- icxP was isolated from pLoxCat2 Genencor International, Palo Alto, Calif.) as a fragment Xba l, and sucked in pK22 on the Xbal site, to generate pKP23. The RJF1C strain of E. coli (see EXAMPLE-5, which is glpK, was transformed with pKP23 and transformants with the kanRcamS phenotype was isolated, indicating double cross-integration, which was confirmed by Southern blot analysis. Glyceryl dehydrogenase activity (as described in EXAMPLE 61 demonstrated that glycerol dehydrogenase ective is not present in these transformants.) The kan marker was removed from the chromosome using the plasmid pJW168 that produces Cre, as described in EXAMPLE 5, to produce KLP23 cepes, several isolates with the kanS phenotype demonstrated no glycerol dehydrogenase activity, and Southern blot analysis confirmed the loss of the kan marker.
SEC ID NO: 44
CACGCATGCAGTTCAACCTGTTGATAGTAC
SEC ID NO: 45:
GCGTCTAGATCCTTTTAAATTAAAAATG
EXAMPLE 8
CONSUMPTION OF GLYCEROL PROPUCIPO PESPE P-GLUCOSE BY E.
RECOMBINANT COLI CONTAINING BOTH GPP2 AND PARÍ CCON AND WITHOUT ACTIVIPAP PE GLYCEROL KINASE (GLPK)
The £ example illustrates glycerol consumption by E. coli FM5 / pAH48 recc binante and RJF10 / pAH48. Strains FM5 / pAH48 and RJF10 / pAH48 were constructed as described previously in GENERAL METHODS.
Pre-cultivation
FM5 / pAH48 and RJF10 / pAH48 were pre-cultured by sowing in a fermentor in the same medium used for fermentation, or in LB supplemented with 1 l of glucose. Either carbeciin or ampicillin were used (100 mg / L) for plasmid maintenance. The means for fermentation is as described in EXAMPLE 4.
The cultures were initiated from frozen stocks (15 ae glyceroi as cryoprotectant) in 600 ml medium in a 2 L Erlen Meyer flask, developed at 30 ° C on a shaker at 250 rpm for approximately 12 h, and used to seed the
Development of Fermentation
A 15 L fermenting tank with 5-7 liters of initial volume was prepared as described in EXAMPLE 4. Either carbecillin or ampicillin were used (100 mg / L! For maintenance of the plasmid.
Environmental Conditions for Evaluating Glycerol Kinase Activity (glpK)
The temperature was controlled at 30 ° C and the airflow controlled at 6 liters per minute standard. The return pressure was controlled at 0.5 bar. The tension # of dissolved oxygen was controlled 10 i by agitation. Aqueous ammonium was used to control the pH to 6.7. The glucose feed ratio (60% glucose) was controlled by maintaining excess glucose until the glycerol was accumulated to at least 25 g / L. The glucose was depleted, resulting in the net metabolism of glycerol. 2 shows the resulting conversion of glyceroi.
Table 2
Conversion of glycerol by FM5 / pAH48 (weight) and RJF110 / pAH48 8glpK)
Dinner Nc. of consumption ratio? j glycerol syrups (g / OD / hr)
FM5 / pAH48 0.095 ± 0.015 Table 2 (continued)
Strain No. of consumption ratio Examples glycerol (g / OP / hr)
RJF110 / pAH48 0.021 ± 0.011
As can be seen in the data in Table 2, the glycerol consumption ratio decreases approximately 4-5 times where the endogenous activity of glyceroi kinase is eliminated.
Environmental conditions to evaluate the Glycerol Dehydrogenase Activity (GldA)
The temperature was controlled at 30 ° C and the air flow controlled at the standard of 6 liters per minute. The return pressure was controlled up to 0.5 bar. The dissolved oxygen tension was controlled at 10% by egitetion. Aqueous ammonium was used for pH control at €. ~? . In the first fermentation, the glucose was conserved in excess for the duration of the fermentation. The second fermentation was operated with non-residual glucose after the first 25 hours. Samples against the time of the two fermentations were taken to evaluate the activities of GlpK and GldA. The Teble 3 concludes that the fermentations of RJFI0 / pA48 show the effects of gldA on the selectivity for glycerol.
Table 3
Activities of GldA and GlpK from two Fermentations of RJF110 / pAH 8
Fermentation Time GldA GlpK Selectivity (hours) Total (g / g)
42
49
c _ 54
41
4 c '14
DI 12 As can be seen from the data in Table 3, the presence of glycerol dehydrogenase activity is linked to the conversion of glycerol under limited glucose conditions; thus, it is anticipated that eliminating the glycerol dehydrogenase ectivided will reduce the conversion of glycerol.
EXAMPLE 9
PRODUCTION PE GLYCEROL PESDE P-GLUCOSE USEFUL RECOMBINANT E. BLATTAE CONTAINING BOTH GPP2 AND DAR1
Example 9 illustrates the production of glycerol from D-glucose from recombinant E. blattae containing 15 amino acids senes GP? 2 v DARl.
E. bla z zae, obtained from the ATCC and that has no. AT access 33429, was developed at 30 ° C until the cultivation reached a Ct of approximately 0.6 AU to 600
np .. The culture was then transformed with pAH48, a plasmid cosing genes GPP2 and DAR1 (described in WO 95/21341, using electroporation techniques) The transformants were confirmed by the RFLP DNA standard and antibiotic resistance. mL of carbicillin).
- ^ Transformed E. biattae was developed aerobically at 35 ° C in shake flask cultures. The cultures were developed in a defined medium plus 2 z of glucose with selection of entibiotic and were initiated by inoculum from an overnight culture developed in LB plus 1 glucose with antibiotic selection. The defined medium contained per liter: 27.2 g of KH_P0, 2 g of citric acid, 2 g of MgSO, .7H 0, 1.2 ml of H_S04 to 98 i, 0.3 g of ferric ammonium citrate, 0.2 g of CaC1 ^ .2H0 , 10 g. of yeast extract (Difco), 5 L of Trace solution of Modified Blach elements (the composition of the stem can be found in Methods for General and Molecular Beeteriology (P. Gerhardt and coleboredores, eds P. 158, American Society for Microbiology, Washington, DC
(1994.) The defined medium was sterilized by filtration and adjusted to a final p.sub.E of .8 with NH.sub.0 H. The agitation contents were then subjected to HPLC analysis for the presence of glycerci After the overnight incubation, E. bi ate tae containing pAH48 produced 7.63 g / L glyceroi The control, which was E. bia t tae wild type (ATCC 33429) developed under the same conditions, produced = 0.2 g / L glycerol.
EXAMPLE 10 PRODUCTION OF GLYCEROL FROM D-GLUCOSE USING RECOMBINANT COLI DEFICIENT IN GLPA AND GLPK AND CONTAINING BOTH GPP2 AND DAR1 INTEGRATED IN THE CHROMOSOME
This example illustrates the production of glycerol from
D glucose from recombinant E. coli with the gldA and the gene egotedo glpK and containing embos GPP2 and DAR1 coding loe genes integrated into the. chromosome of the host cell.
The KLP23 strain of E. coli, prepared as described in EXAMPLE 7, is deficient in both activities giicerol kinase (product of glpK) and glyceryl dehydrogenase (product of gldA). KLP23 containing DAR1, and a chloramphenicol-resistant GPP2 gene flanked by io > : P integrated into the chromosome at the ampZ location was prepared and is mentioned as AH76RJcm.
The integration plasmids were designed and constructed on the basis of er. an integration system cre-lox íHoes, supra). In order to create the integration plasmids, a Hindi-Smal fragment of pLoxCatl was inserted into Hind III and linearized Smal pAH48 to create pAH48cm2. Plasmid pAH48 contains the DAR1 and GPP2 genes expressed under the control of the promoter. The 3.5 kb ApaL I fragment from pAH48cm2 was finalized in smooth with T4 DNA polymerase (Boehringer Mannheim Biochemiccel) and dNTPs and inserted into linearized NruI plnt-amp C (Genencor International, CA), using E. coli SY327 (Miller and collaborators, J Bacteriol 170: 2575-2583, 1998) as a host to create pAH76 and pAH76R. The "R" stands for reverse orientation of the integration cartridge. Both plasmids, pAH76 and pAH76R contain a replica R6K origin and are not capable of replicating in KLP23. Plasmids pAH76 and pAH76R were used to transform KLP23 by integration into the ampC location of the E. coli chromosome. Transformants were selected on 10 μg / ml of chloramphenicol and were sensitive kanamycin, producing double cross-over integration. These transformants of E. coli are called AH76Icm and AH76Ricm.
The cultures of AH7óRicm were developed in shake flasks in defined medium (described in Example 9) plus 2.5 glucose initiated by inoculation from an overnight culture of LB having 1:. of glucose and aptibiotic selection. The shake flasks (Erlen Meyer flasks, 1/5 volume volume volume) were incubated at 37 ° C with vigorous shaking overnight, after the supernatant was sampled for glycerol using a colorimetric enzyme (Sigma, Procedure No. 337) On a Monarch 2000 instrument (Instrumentacttion Laboratory Co., Lexington, MA). AH77Ricm showed glycerol production of 6.7 g / L after 25 hours.
E. coli pAH76RI had the chloramphenicol gene removed from AH76Ricm. The chloramphenicol gene was removed from the chromosome using the Cre-producing plasmid, pJW168, as described in Example 5. Transformants were selected for resistance to carbecillin and sensitivity to chloramphenicol under 1 mM and IPTG induction at 30 ° C. After removal of the cioramfeniccl gene, AH76RI was developed on LB medium without any antibiotic to cure pJW168. The final version of AH76RI nc is capable of developing on the selection of chloramphenicol or carbecillin.
The crops of AH76RI were developed in shake flasks in a defined medium plus 2 ° of glucose initiated by inoculation from overnight LB / 1 glucose culture. The shake flasks were vigorously agitated overnight, after the supernatant was sampled for glycerol using a colorimetric assay (Sigma, procedure No. 337) on a Monarch 2000 instrument.
(Instrumentation Laboratory Co. Lexington, MA). AH77RI showed glycerol production of 4.6 g / L after 24 hours.
All the plasmids described in this example were transformed into E. coli KLP23 using standard techniques of molecular biology. The transtormins were verified by DNA RFLP patterns, antibiotic resistance, amplification by. PCR, or phosphatise G3P assay.
LISTING PE SEQUENCES
GENERAL INFORMATION:
(i! APPLICANT:
(A) NAME: E.l. DU PONT DE NEMOURS AND COMPANY (B) STREET: 1007 MARKET STREET (C) CITY: ILMINGTON J; STATE: DELAWARE (E) COUNTRY: E.U.A. (F) POSTAL CODE: 19898 (G) TELEPHONE: 302- 892- 8112 (H) TELEFAX: 302-773- 0164 (I? TELEx: 6717325
(il) TITLE D? THE INVENTION: METHOD FOR THE GLYCLECL CLINIC BY RECOMBINANT ORGANISMS
l li) NUMBER D? SEQUENCES: 3
Jv) READING COMPUTER:
(A HALF TYPE: Z,: SQUET, 3.5 INCHES 3 'COMPUTER: PC COMPATIBLE WITH IBM (C) OPERATING SYSTEM: MICROSOFT WINDOWS 95 (D) PROGRAM: MICROSOFT WORD VERSION 7.0a
(v) NORMAL APPLICATION DATES:
(A) APPLICATION NUMBER: (B) REGISTRATION DATE: (C) CLASSIFICATION:
(v) PRIORITY DATA OF THE APPLICATION:
(A) APPLICATION NUMBER: 08 / 982,783 (B) REGISTRATION DATE: DECEMBER 2, 1997 (C) CLASSIFICATION: 1 (vile, INFORMATION OF OFFICER / APPORTER:
(A1 NAME: FL0YO, LINDA AXAMETRY (B) REGISTRATION NUMBER: 33, 692 20 (C) REFERENCE / POWER NUMBER: CR- 9981-C
.2) INFORMATION OF THE ID NO: 1:
(i) CHARACTERISTICS OF THE SEQUENCE: - (A) EXTENSION: 1380 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(ii) MOLECULA TYPE: DNA (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: i
CTTTAATTTT CTTTTATCTT ACTCTCCTAC ATAAGACATC AAGAAACAAT TGTATATTGT 60
ACACCCCCCC CCTCCACAAA CACAAATATT GATAATATAA AGATGTCTGC TGCTGC GAT 120
AGATTAAACT TAACTTCCGG CCACTTGAAT GCTGGTAGAA AGAGAAGTTC CTCTTCTGTT 1S0
TCTTTGAAG3 CTGCCGAAAA GCCTTTCAAG GTTACTGTGA TTGGATCTGG TAACTGGGGT 24C
W rtílOjTuU i. TGTAAGGGAT ACCCAGAAGT TTTCGCTCCA 300
ATAGTACAAA CGAAGAAGAG ATCAATGGTG AAAAATTGAC TGAAATCATA 360
AATACTAGAC ATCAAAACGT GAAATACTTG CCTGGCATCA CTCTACCCGA CAATTTGGTT 420
GCTAATCCAG TTCAGTCAAG GATGTCGACA TCATCGTTTT CAACATTCCA 480
CATCAATTTT TGCCCCGTAG CTG AGCCAA TTGAAAGGTC ATGTTGATTC ACACGTCAGA 540 TTTTGAAGT GGTGCTAAAG GTGTCCAATT GCTATCCTCT 600
TACATCACTG AGGAACTAGG GGTGCTCTAT CTGGTGCTAA CATTGCCACC 660
GAAGTCGCTC AAGAACACTC GTC * GAAACA ACAGTTGCTT ACCACATTCC AAAGGATTTC 720
AVJAVJLJL, .unuu Uv ??? neither.- . ? -U? L. r.4? rtÜ u * * Ti va Cu ..G ..? ~ C CAGACCTTAC 7S0
TTCCACGTTA GTGTCATCGA GGTATCTCCA TCTGTGGTGC TTTGAAGAAC 840
GTTGTTGCCT TAGGTTGTGG TTTCGTCGAA GGTCTAGGCT GGGGTAACAA CGCTTCTGCT 900
GCCATCCAAA GAGTCGGTTT GGGTGAGATC ATCAGATTCG GTCAAATGTT TTTCCCAGAA 960
TCTAGAGAAG AAACATACTA CCAAGAGTCT GCTGGTGTTG CTGATTTGAT CACCACCTGC 1020 GCTGGTGGTA GAAACGTCAA GGTTGCTAGG CTAATGGCTA CTTCTGGTAA GGACGCCTGG 1080
GAATGTGAAA AGGAGTTGTT GAATGGCCAA TCCGCTCAAG GTTTAATTAC CTGCAAAGAA 1140
GTTCACGAAT GGTTGGAAAC ATGTGGCTCT GTCGAAGACT TCCCATTATT TGAAGCCGTA 1200
TACCAAATCG TTTACAACAA CTACCCAATG AAGAACCTGC CGGACATGAT TGAAGAATTA 1260
GATCTACATG AAGATTAGAT TTATTGGAGA AAGATAACAT ATCATACTTC CCCCACTTTT 1320
TTCGAGGCTC TTCTATATCA TATTCATAAA TTAGCATTAT GTCATTTCTC ATAACTACTT 1380
(2) INFORMATION OF THE ID NO: (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 2946 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(ii) MOLECULES TYPE: DN: genomic)
(xi; DESCRIPTION D? THE SEQUENCE: SEQ ID NO: 2
GAATTCGAGC CTGAAGTGCT GATTACCTTC AGGTAGACTT CATCTTGACC CATCAACCCC 60
AGCGTCAATC CTGCAAATAC ACCACCCAGC AGCACTAGGA TGATAGAGAT AATAGAGTAC 120
GTGGTAACGC TTGCCTCATC ACCTACGCTA TGGCCGGAAT CGGCAACATC CCTAGAATTG 180
AGTACGTGTG ATCCGGATAA CAACGGCAGT GAATATATCT TCGGTATCGT AAAGATGTGA 240
TATAAGATGA TGTATACCCA ATGAGGAGCG CCTGATCGTG ACCTAGACCT TAGTGGCAAA 300
AACGACATAT ?? -. Ihift? 1 GGGGAGAGTT TCGTGCAAAT AACAGACGCA GCAGCAAGTA 360
ACTGTGACG TATCAACTCT TTTTTTATTA TGTAATAAGC AAACAAGCAC GAATGGGGAA 420 n u.? * U ?? CAATCACCAA GGTCGTCCCT TTTTTCCCAT TTGCTAATTT AGAATTTAAA 480
GAAACCAAAA GAATGAAGAA AGAAAACAAA TACTAGCCCT AACCCTGACT TCGTTTCTAT 540
GATAATACCC TGCTTTAATG AACGGTATGC CCTAGGGTAT ATCTCACTCT GTACGTTACA 600
AACTCCGGTT ATTTTATCGG AACATCCGAG CACCCGCGCC TTCCTCAACC CAGGCACCGC 660
CCCAGGTAA CG GCGCGAT GAGCTAATCC TGAGCCATCA CCCACCCCAC CCGTTGATGA 720
CAG A * T1 AA AAAACTG GAGCAAGGAA TTACCATCAC CGTCACCATC 780
ACCATCATAT CGCCTTAGCC TCTAGCCATA GCCATCATGC AAGCGTGTAT CTTCTAAGAT 840
TCAGTCATCA TCATTACCGA GTTTGTTTTC CTTCACATGA TGAAGAAGGT TTGAGTATGC 900
TCGAAACAAT AAGAC CGA TGGCTCTGCC ATTGGTTATA TTACGCTTTT GCGGCGAGGT 960
GCCGATGGGT TGCTGAGGG AAGAGTGTTT AGCTTACGGA CCTATTGCCA TTGTTATTCC 1020
GATTAATCTA TTGTTCAGCA GCTCTTCTCT ACCCTGTCAT TCTAGTATTT TTTTTTTTTT 1080
TTT * TGGTTT * a * • * • ** T *** T "» • * •• ** TCTTCTTGCC TTTTTTTCTT GTTACTTTTT TTCTAGTTTT 1140
TTTTCCTTCC ACTAAGCTTT TTCCTTGATT TATCCTTGGG TCTTCTTTC TACTCCTTTA 1200 TTATATATTA ATTTTTAAGT TTATGTATTT TGGTAGATTC AATTCTCTTT 1260
CCCTTTCCTT TTCCTTCGCT CCCCTTCCTT ATCAATGCTT GCTGTCAGAA GATTAACAAG 1320
ATACACATTC CTTAAGCGAA CGCATCCGGT GTTATATACT CGTCGTGCAT ATAAAATTTT 1380
- > Á GCCTTCAAGA TCTACTTTCC TAASAAGATC ATTATTACAA ACACAACTGC ACTCAAAGAT 144 C GACTGCTCAT ACTAATATCA AACAGCACAA ACACTGTCAT GAGGACCATC CTATCAGAAG 1500 ATCGGACTCT GCCGTGTCAA TTGTACATTT GAAACGTGCG CCCTTCAAGG TTACAGTGAT 156C TGGTTCTGGT AACTG3GGGA CCACCATCGC CAAAGTCATT GCGGAAAACA CAGAATTGCA 1620 TTCCCATATC TTCGAGCCAG AGGTGAGAAT GTGGGTTTTT GATGAAAAGA TCGGCGACGA 1680 AAATCTGACG GATATCATAA ATACAAGACA CCAGAACGTT AAATATCTAC CCAATATTGA 1740 CCTGCCCCAT AATCTAGTGG CCGATCCTGA TCTTTTACAC TCCATCAAGG GTGCTGACAT 1800 CCTTGTTTTC AACATCCCTC ATCAATTTTT ACCAAACATA GTCAAACAAT TGCAAGGCCA 1860 CGTGGCCCCT CATGTAAGGG CCATCTCGTG TCTAAAAGGG TTCGAGTTGG GCTCCAAGGG 1920 TGTGCAATTG CTATCCTCCT ATGTTACTGA TGAGTTAGGA ATCCAATGTG GCGCACTATC 1980 GCTGCAAA TTGGCACCGG AAGTGGCCAA GGAGCATTGG TCCGAAACCA CCGTGGCTTA 2040 CCAACTACCA AAGGATTATC AAGGTGATGG CAAGGATGTA GATCATAAGA TTTTGAAATT 2100 GCTGTTCCAC AGACCTTACT TCCACGTCAA TGTCATCGAT GATGTTGCTG GTATATCCAT 2160 TGCCGGTGCC TTGAAGAACG TCGTGGCACT TGCATGTGGT TTCGTAGAAG GTATGGGATG 2220 GGG TAACAAT GCCTCCGCAG CCATTCAAAG GCTGGGTTTA GGTGAAATTA TCAAGTTCGG 2280 TAGAATGTTT TTCCCAGAAT CCAAAGTCGA GACCTACTAT CAAGAATCCG CTGGTGTTGC 2340 AGATCTGATC ACCACCTGCT CAGGCGGTAG AAACGTCAAG GTTGCCACAT ACATGGCCAA 2400
G "w? J» JtJ-v < -7 TCAGCC7TGG AAGCAGAAAA GGAATTGCTT AACGGTCAAT CCGCCCAAGG 2460
G ATC CA TGCAGAGAA3 T7CACGAGTG GCTACAAACA TGTGAGTTGA CCCAAGAATT 2520 CCCAATTATT CGAGGCAGTC TACCAGATAG TCTACAACAA CGTCCGCATG GAAGACCTAC 2580 CGGAGATGAT TGAAGAGCTA GACATCGATG ACGAATAGAC ACTCTCCCCC CCCCTCCCCC 2640
TCTGATCTTT CCTGTTGCCT CTTTTTCCCC CAACCAATTT ATCATTATAC ACAAGTTCTA 2700 CAACTACTAC TAGTAACATT ACTACAGTTA TTATAATTTT CTATTCTCTT TTTCTTTAAG 2760 AATCTATCAT TAACGTTAAT TTCTATATAT ACATAACTAC CATTATACAC GCTATTATCG 2820 TTTACATATC ACATCACCGT TAATGAAAGA TACGACACCC TGTACACTAA CACAATTAAA 2880 TAATCGCCAT AACCTTTTCT GTTATCTATA GCCCTTAAAG CTGTTTCTTC GAGCTTTTCA CTGCAG 2940 2946 (2) INFORMATION SE ID NO: 3:E CHARACTERISTICS:
(A) EXTENSION: 3178 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(li) MOLECULE TYPE: DN (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 3
CTGCAGAACT TCGTCTGCTC TGTGCCCA7C CTCGCGGTTA GAAAGAAGCT GAATTGTTTC 60
ATGCGCAAGG GCATCAGCGA GTGACCAATA ATCACTGCAC TAATTCCTTT TTAGCAACAC 120
ATACTTATAT ACAGCACCAG ACCT7ATGTC TTTTCTCTGC TCCGATACGT TATCCCACCC 180
AACTTTTATT TCAGTTTTGG CAGGGGAAAT TTCACAACCC CGCACGCTAA AAATCGTATT 240
TAAACTTAAA rt? N nn? . CACAAATAGG GAACt ^? G < j? CTAAACGAAG GACTCTCCCT 300
CCCTTATCTT GACCGTGCTA TTGCCATCAC TGCTACAAGA CTAAATACGT ACTAATATAT 360
GTTTTCGGTA ACGAGAAGAA GAGCTGCCGG TGCAGCTGCT GCCATGGCCA CAGCCACGGG 420
GACGCTGTAC TGGATGACTA GCCAAGGTGA TAGGCCGTTA GTGCACAATG ACCCGAGCTA 480 hi or * A? «. N TTCCCCACCG CCGCTCCACC GGCAGuTCTC TAGACGAGAC CTGCTGGACC 540
GTCTGGACAA * Ji-W w ?? ^ .n_-v TTCG? CGTGT TGATCATCGG TGGCGGGGCC ACGGGGACAG 600
GATGTGC CT? UnTuv. «GCJ TCAATGTGGC CCTTGTTGAA AAGGGGGATT 660
TTGCCTCGGG AACG CC C AAATCTACCA AGATGATTCA CGGTACTTAG_720_Go *. ? Gnu 780
AGCGTAAACA TCTTATCAAC ACTGCCCCTC ACCTGTGCAC GGTGCTACCA ATTCTGATCC 840 CCATCTACAG CACCTGGCAG GTCCCGTACA TCTATATGGG CTGTAAATTC TACGATTTCT 900
TTGGCGGTTC CCAAAACTTG AAAAAATCAT ACCTACTGTC CAAATCCGCC ACCGTGGAGA 960
AGGCTCCCAT GCTTACCACA GACAATTTAA AGGCCTCGCT TGTGTACCAT GATGGGTCCT 1020
TTAACGACTC GCGTTTGAAC GCCACTTTAG CCATCACGGG TGTGGAGAAC GGCGCTACCG 1080
TCTTGATCTA TGTCGAGGTA CAAAAATTGA TCAAAGACCC AACTTCTGGT AAGGTTATCG 1140
GTGCCGAGGC CCGGGACGTT GAGACTAATG AGCTTGTCAG AATCAACGCT AAATGTGTGG 1200
TCAATGCCAC GGGCCCATAC AGTGACGCCA TTTTGCAAAT GGACCGCAAC CCATCCGGTC 1260
TGCCGGACTC CCCGCTAAAC GACAACTCCA AGATCAAGTC GACTTTCAAT CAAATCTCCG 1320
TCATGGACCC GAAAATGGTC ATCCCATCTA TTGGCGTTCA CATCGTATTG CCCTCTTTTT 1380
ACTCCCCGAA GGATATGGGT TTGTTGGACG TCAGAACCTC TGATGGCAGA GTGATGTTCT 144C
T7TTACCTTG GCAGGGCAAA GTCCTTGCCG GCACCACAGA CATCCCACTA AAGCAAGTCC 1500
CAGAAAACCC TATGCCTACA GAGGCTGATA TTCAAGATAT CTTG? AAGAA CTACAGCACT 1560
ATATCGAA7T CCCCGTGAAA AGAGAAGACG TGCTAAGTGC ATGGGCTGGT GTCAGACCTT 162C
TGGTCAGAGA TCCACGTACA ATCCCCGCAG ACGGGAAGAA GGGCTCTGCC ACTCAGGGCG 166C
TGGTAAGATC CCACTTCTTG TTCACTTCGG ATAATGGCCT AATTACTATT GCAGGTGGTA 1740
AATGGACTA TTACAGACAA ATGGCTGAGG AAACAGTCGA CAAAGTTGTC GAAGTTGGCG 1800
GATTCCACAA CCTGAAACCT TGTCACACAA GAGATATTAA GCTTGCTGGT GCAGAAGAAT 1860
GGACGCAAAA CTATGTGGCT TTATTGGCTC AAAACTACCA TTTATCATCA AAAATGTCCA 1920
ACTACTTGGT TCAAAACTAC GGAACCCGTT CCTCTATCAT TTGCGAATTT TTCAAAGAAT 1980
CCATGGAAAA TAAACTGCCT TTGTCCTTAG CCGACAAGGA AAATAACGTA ATCTACTCTA 2040
GCGAGGAGAA CAACTTGGTC AATTTTGATA CTTTCAGATA TCCATTCACA ATCGGTGAGT 2100
TAAAGTATTC CATGCAGTAC GAATATTGTA GAACTCCCTT GGACTTCCTT TTAAGAAGAA 2160
CAAGATTCGC CTTCTTGGAC GCCAAGGAAG CTTTGAATGC CGTGCATGCC ACCGTCAAAG 2220
TTATGGGTGA TGAGTTCAAT TGGTCGGAGA AAAAGAGGCA GTGGGAACTT GAAAAAACTG 2280
TGAACTTCAT CCAAGGACGT TTCGGTGTCT AAATCGATCA TGATAGTTAA GGGTGACAAA 2340
GATAACATTC ACAAGAGTAA TAATAATGGT AATGATGATA ATAATAATAA TGATAGTAAT 2400
AACAATAATA ATAATGG7G0 TAATGGCAAT GAAATCGCTA TTATTACCTA TTTTCCTTAA 2460
TGGAA3AGTT AAAG7AAAC? AAAAAAAC7A CAAAAATATA TGAAGAAAAA AAAAAAAAAA 2520 GGTAATAGAC TCTACTACTA CAATTGATCT TCAAATTATG ACCTTCCTAG TGTTTATATT 2580
CTATTTCCAA TACATAATAT AATCTATATA ATCATTGCTG GTAGACTTCC GTTTTAATAT 2640
CGTTTTAATT ATCCCCTTTA TCTCTAGTCT AGTTTTATCA TAAAATATAG AAACACTAAA 2700
TAATATTCTT CAAACGGTCC TGGTGCATAC GCAATACATA TTTATGGTGC AAAAAAAAAA .2760
ATGGAAAATT TTGCTAGTCA TAAACCCTTT CATAAAACAA TACGTAGACA TCGCTACTTG 2820
AAATTTTCAA GTTTTTATCA GATCCATGTT TCCTATCTGC CTTGACAACC TCATCGTCGA 2880
AATAGTACCA TTTAGAACGC CCAATATTCA CATTGTGTTC AAGGTCTTTA TTCACCAGTG 2940
ACGTGTAATG GCCATGATTA ATGTGCCTGT ATGGTTAACC ACTCCAAATA GCTTATATTT 3000
CATAGTGTCA TTGTTTTTCA ATATAATGTT TAGTATCAAT GGATATGTTA CGACGGTGTT 3060
ATTTTTCTTG GTCAAATCGT AATAAAATCT CGATAAATGG ATGACTAAGA TTTTTGGTAA 3120
AGTTACAAAA TTTATCGTTT TCACTGTTGT CAATTTTTTG TTCTTGTAAT CACTCGAC L76
(2) INFORMATION D? THE? ID NO: (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 616 base pairs (B) TYPE.-nucleic acid (C) FILAMENTO: unites only (D) TOPOLOGY: linear
(ii) MOLECULUS TYPE: DNA (genomic) (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 4
ATGAAACGTT TCAATGTTTT AAAATATATC AGAACAACAA AAGCAAATAT ACAAACCATC 60
GCAATGCCTT TGACCACAAA ACCTTTATCT TTGAAAATCA ACGCCGCTCT ATTCGATGTT 12C
GACGGTACCA TCATCATCTC TCAACCAGCC ATTGCTGCTT TCTGGAGAGA TTTCGGTAAA 180
GACAAGCCTT ACTTCGATG ATTCACATCT CTCACGGTTG GAGAACTTAC 240
GATGCCATTG CCAAGTTCGC TCCAGA * - A GCTGATGAAG AATACGTTAA CAAGCTAGAA 300
GGTGAAATCC CAGAAAAGTA CGGTGAACAC TCCATCGAAG TTCCAGGTGC TGTCAAGTTG 360
TGTAATGCTT TGAACGCCTT GCCAAAGGAA AAATGGGCTG TCGCCACCTC TGGTACCCGT 420
GACATGGCCA AU? A TUU * T CGACATT .. AAGATCAAGA GACCAGAATA 480
GCCAATGATG TCAAGCAAGG TAAGCCTCAC CCAGAACCAT ACTTAAAGGG TAGAAACGGT 540
-Tui-iu * - n .._ AAu.rtCCCA TCCAAATCTA AGGTTGTTGT CTTTGAAGAC 600
GCACCAGCTG GTATTGC7GC TGGTAAGGCT GCTGGCTGTA AAATCGTTGG TATTGCTACC 660
ACTTTCGATT * oGACTTCT G AuG AAAG GGTTGTGACA TCATTGTCAA GAACCACGAA 720
TCTATCAGAG TCGGTGAA CAACGCTGAA ACCGATGAAG TCGAATTGAT CTTTGATGAC 780
TACTTATACG CTAAGGATG CTTGTTGAAA TGGTAA 816
(2) INFORMATION OF THE ID NO: 5:
CHARACTERISTICS D? SEQUENCE:
A) EXTENSION: 53 base pairs [B? TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(ii) MOLECULA TYPE: DNA (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 5
ATGGGATTGA CTACTAAACC TCTATCTTTG AAAGTTAACG CCGCTTTGTT CGACGTCGAC 60 GGTACCATTA TCATCTCTCA ACCAGCCATT GCTGCATTCT GGAGGGATTT CGGTAAGGAC 120
AAACCTTATT TCu? I VJ Tu / ". ACACGT ATC - rt? L ¿C v- ATGGTTGGAG AACGTTTGAT 180
GCCATTGCTA AGACTTTGCC AATGAAGAGT ATGTTAACAA ATTAGAAGCT 24C
GAAATTCCGG TCAAGTACGG TGAAAAATCC ATTGAAGTCC CAGGTGCAGT TAAGCTGTGC 300
AACGCTTTGA AAAAGAGAAA TGGGCTGTGG CAACTTCCGG 36C
ATG? Cnn? A? * «UTTI-? ATCAGGAGAC CAAAGTACTT CATTACCGCT 420
AATGATGTCA AACAGGGTAA GCCTCATCCA GAACCATATC TGAAGGGCAG GAATGGCTTA 480 TCAATGAGCA AGACCCTTCC AAATCTAAGG TAGTAGTATT TGAAGACGCT 54C
CCAGCAGG A *** - "r * r * **** -" • GGTTGTAAGA TCATTGGTAT TGCCACTACT 600
TTCGACTTGG ACTTCCTAAA GGAAAAAGGC TGTGACATCA TTGTCAAAAA CCACGAATCC 660
ATCAGAGTTG? UhTnv. ?? TGCCGAAACA GACGAAGTTG AATTCATTTT TGACGACTAC 720
TTATATGCTA AGGACGATCT GTTGAAATGG TAA 753
twenty
(2) INFORMATION FROM THE SE IL NC: 6:
CHARACTERISTICS LE LA SECUENC: IA:
- > (A) EXTENSION: 2520 pairs bas (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(ii) MOLECULA TYPE: DNA (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 6
TGTATTGGCC ACGATAACCA CCCTTTGTAT ACTGTTTTTG TTTTTCACAT GGTAAATAAC 60
GACTTTTATT AAACAACGTA TGTAAAAACA TAACAAGAAT CTACCCATAC AGGCCATTTC 120
GTAATTCTTC TCTTCTAATT GGAGTAAAAC CATCAATTAA AGGGTGTGGA GTAGCATAGT 180
GAGGGGCTGA CTGCATTGAC AAAAAAATTG AAAAAAAAAA AGGAAAAGGA AAGGAAAAAA 240
AGACAGCCAA GACTTTTAGA ACGGATAAGG TGTAATAAAA TGTGGGGGGA TGCCTGTTCT 300
CGAACCATAT AAAATATACC ATGTGGTTTG AGTTGTGGCC GG? ACTATAC AAATAGTTAT 360
ATGTTTTCCCCT CTCTCTTCCG ACTTGTAGTA TTCTCCAAAC GTTACATATT CCGATCAAGC 420
CAGCGCCTTT ACACTAGTTT AAAACAAGAA CAGAGCCGTA TGTCCAAAAT AATGGAAGAT 480
TTACGAAGTG ACTACGTCCC GCTTATCGCC AGTATTGATG TAGGAACGAC CTCATCCAGA 540
TGCATTCTGT TCAACAGATG GGGCCAGGAC GTTTCAAAAC ACCAAATTGA ATATTCAACT 600
TCAGCATCGA AGGGCAAGAT "" TGGGGTGTCT GGCCTAAGGA GACCCTCTAC AGCCCCAGCT 660
CGTGAAACAC CAAACGCCGG TGACATCAAA ACCAGCGGAA AGCCCATCTT TTCTGCAGAA 720 * -.- b n * * k-rtrtunh? hnni .Í? \ ?. -.- «- nvju A_ * J. \ J? - * a* . * - ^, rw or * - AACGAACCCA Cu.TGAAGTT CCCCAAAC G GGTTGGGTTG AGTGCCATCC GCAGAAATTA 6 * íC
CTGGTGAACG TCGTCCAA G CCTTGCCTCA AGTTTGCTCT CTCTGCAGAC TATCAACAG 900
GAACGTGTAG CAAACGGTCT CCCACCTTAC AAGGTAATAT GCATGGGTAT AGCAAACATG 560
AGAGAAACCA CAATTCTGTG GTZ CZZZGZ ACAGGAAAAC CAATTGTTAA CTACGGTATT 102C GTTTGGAACG ACACCAGAAC GATCAAAATC GTTAGAGACA AATGGCAAAA CACTAGCGTC 1 C80 GATAGGCAAC TGCAGCTTAG ACAGAAGACT GGATTGCCAT TGCTCTCCAC GTATTTCTCC 1140 TGTTCCAAGC TGCGCTGGTT CCTCGACAAT GAGCCTCTGT GTACCAAGGC GTATGAGGAG 1200 AACGACCTGA TGTGGACACA TGGCTGATTT ACCAATTAAC TAAACAAAAG 126C GCGTTCGTTT CTGACGTAAC CAACGCTTCC AGAACTGGAT TTATGAACCT CTCCACTTTA 1320 AAGTACGACA ACGAGTTGCT GGAATTTTGG GGTATTGACA AGAACCTGAT TCACATGCCC 1380 GAAATTGTGT CCTCATCTCA ATACTACGGT GACTTTGGCA TTCCTGATTG GATAATGGAA 1440 AAGCTACACG ATTCGCCAAA AACAGTACTG CGAGATCTAG TCAAGAGAAA CCTGCCCATA 1500 CAGGGCTGTC TGGGCGACCA AAGCGCATCC ATGGTGGGGC AACTCGCTTA CAAACCCGGT 1560 GCTGCAAAAT GTACTTATGG TACCGGTTGC TTTTTACTGT ACAATACGGG GACCAAAAAA 1620 TTGATCTCCC AACATGGCGC ACTGACGACT CTAGCATTTT GGTTCCCACA TTTGCAAGAG 1680 0 TACGGTGGCC AAAAACCAGA ATTGAGCAAG CCACATTTTG CATTAGAGGG TTCCGTCGCT 1740 GTGGCTGGTG CTGTGGTCCA ATGGCTACGT GATAATTTAC GATTGATCGA TAAATCAGAG 1800
GG-. TW JG? C L? R .. * or -? * TACGOTTCCT GATTCTGGTG GCGTAGTTTT CGTCCCCGCA 1660
TTTAGTGGCC TATTCGCTCC CTATTGGGAC CCAGATGCCA GAGCCACCAT AATGGGGATG 1920 TCTCAATTCA CTACTGCCTC CCACATCGCC AGAGCTGCCG TGGAAGGTGT TTGCTTTCAA 1980 GCCAGGGCTA TCTTGAAGGC AATGAGTTCT GACGCGTTTG GTGAAGGTTC CAAAGACAGG 2040 5 GACTTTTTAG AGGAAATTTC CGACGTCACA TATGAAAAGT CGCCCCTGTC GGTTCTGGCA 2100 GTGGATGGCG GGATGTCGAG GTCTAATGAA GTCATGCAAA TTCAAGCCGA TATCCTAGGT 2160 CCCTGTGTCA AAGTCAGAAG GTCTCCGACA GCGGAATGTA CCGCATTGGG GGCAGCCATT 2220 GCAGCCAATA TGGCTTTCAA GGATGTGAAC GAGCGCCCAT TATGGAAGGA CCTACACGAT 2280 GTTAAGAAAT GGGTCTTTTA CAATGGAATG GAGAAAAACG AACAAATATC ACCAGAGGCT 2340 0 CATCCAAACC TTAAGATATT CAGAAGTGAA TCCGACGATG CTGAAAGGAG AAAGCATTGG 2400 AAGTATTGGG AAGTTGCCGT GGAAAGATCC AAAGGTTGGC TGAAGGACAT AGAAGGTGAA 2460
CA.-GAACAGG TT: CTTCCAATAA CAACATAAAT AATT7CTATT AACAATGTAA 2520
- > «;
Phe Leu Prc Ar? He Cys Ser Gln Leu Lys Gly His Val Asp Ser HAS 130 135 140 Val Arg Wing He Ser Cys Leu Lys Gly Phe Glu Val Gly Wing Lys Gly 145 150 155 160
Val Gln Leu Leu Be Ser Tyr He Thr Glu Glu Leu Gly He Gln Cys 165 170 175
Gly Ala Leu Ser Gly Wing Asn He Wing Thr Glu Val Wing Gln Glu HIS 180 185 190 Trp Ser Glu Thr Thr Val Wing Tyr HIS He Pro Lys Asp Phe Arg Gly 195 200 205 Glu Gly Lys Asp Val Asp HIS Lys Val Leu Lys Wing Leu Phe His Arg 210 215 220 Pro Tyr Phe Hxs Val Ser Val He Glu Asp Val Wing Gly He Ser He 225 230 235 240
Cys Gly Ale Leu Lys Asr. Val Val Ala Leu Gly Cys Gly Phe Val Glu 245 250"" 255
Gly Leu Gly Trp Gly Asn Asn Wing Wing Wing He Gln Arg Val Gly 260 265 270 Gly Glu He He Arg Phe Gly Gln Met Phe Phe Prc Glu Ser Arg 275 280 • 285 Glu Glu Tnr Tyr Tyr Glr. Glu Be Wing Gly Val Wing Asp Leu He Thr 29C 295 300 Thr Cys Wing Giy Gly Arg Asn Val Lys Val Wing Arg Leu Met Wing Thr 305 310 315 320
Ser Gly Lys Asp Wing Trp Glu Cys Glu Lys Glu Leu Leu Asn Gly Gln 325 330 335
Be Ala Glr. Gly Leu He Thr Cys Ly. «= Glu Val Hs Glu Trp Leu Glu 340 345 350 Thr Cys Gly Ser Val Glu Asp Phe Pro Leu Phe Glu Ala Val Tyr Gln 355 360 365 le Val Tyr Asr. Asr. Tyr Pro Met Lys Asn Leu Pro Asp Met He Glu 370 375 380 Glu Leu Asp Leu His Glu Asp 3E5 39C (2) INFORMATION SEE ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 384 amino acids (B) TYPE: amino acid (C) FILAMENTO: unknown (D) TOPOLOGY: unknown
; ii) MOLECULA TYPE: protein
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 8
Met Thr Ala HIS Thr Asn He Lys Gln HIS Lys His Cys HIS Glu Asp 1 5 10 15
Kis Pro He Arg Arg Being Asp Being Wing Val Ser He Val HIS Leu Lys 20 25 30 Arg Wing Pro Phe Lys Val Thr Val He Gly Ser Gly Asn Trp Gly Thr 35 40 45 Thr He Wing Lys Val He Wing Glu Asn Thr Glu Leu HIS Be Hxs He 50 55 60 Phe Glu Pro Glu Val Arg Met Trp Val Phe Asp Glu Lys He Gly Asp 65 70 75 80
Glu Asn Le. Thr Asp He He Asn Tr.r Are HÍ S G n Asn Val Lys Tyr 85 90 95
Leu Prc Asn He Asp Leu Pro His Asn Leu Val Wing Asp Pro Asp Leu 100 105 i; 0 Leu HIS Be He Lys Giy Wing Asp He Leu Val Phe Asn He Pro H S 115 120 125 Gln Phe Leu Prc Asn He Vai Lys Gln Leu Gln Gly His Val Aia Prc 130 135 140 His Val Arg Ala He Ser Cys Leu Lys Gly Phe Glu Leu Gly Ser Lys
145 150 155 160
Gly Val Glp Leu Leu Ser Ser Tyr Val Thr Asp Glu Leu Gly He Glr. 165 1 0 175
Cys Gly Aia Leu Ser Gly Wing Asn Leu Aia Pro Giu Val Aia Lys Glu 180 185 190 His Trp Ser Glu Thr Thr Val Wing Tyr Gln Leu Prc Lys Asp Tyr Gln 195 200 205 Gly Asp Gly Lys Asp Val Asp His Lys He Leu Lys Leu Leu Phe His 210 215 220 Ar? Pro Tyr Phe His Val Asn Val lie Asp Asp Vai Aia Gly He Ser 225 230 235 240
'He Ala Gly Ala Leu Lys Asn Vai Val Ala Leu Ala Cys Gly Phe Val 245 250 255
Glu Gly Met Gly Trp Giy Asr. Asn Ala Be Ala Ala He Gln Arg Leu 260 * 265 270 Gly Leu Gly Glu lie He Lys Phe Gly Ar? Met Phe Phe Prc Giu Ser 275 280 285 Lys Val Giu Thr Tyr Tyr Glr. Glu Be Wing Gly Val Wing Asp Leu He 290 295 300 Thr Thr Cys Ser Gly Gly Arg Asn Val Lys Val Wing Thr Tyr Met Wing 305 310 315 320
Lys Thr Gly Lys Ser Ala Leu Glu Ala Glu Lys Glu Leu Leu Asn Gly 325 '330 335
Gln Ser Wing Gln Gly He He Thr Cys Arg Glu Val His Glu Trp Leu 340 345 350 Gln Thr Cys Glu Leu Thr Gln Glu Phe Pro He He Arg Gly Ser Leu 355 '• 360 365 Pro Asp Ser Leu Gln Gln Arg Pro His Gly Arg Pro Thr Gly Asp A = p 370 375 380
(2) INFORMATION D? LA SE ID NO: 9: CHARACTERISTICS OF THE SEQUENCE: (Ai EXTENSION: 614 amino acids (B) TYPE: amino acid (C) FILAMENTO: unknown (D) TOPOLOGY: unknown
(ii) MOLECULA TYPE: protein
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 9
Met Thr Arg Wing Thr Trp Cys Asn Pro Pro Pro Leu His Arg Glr. 1 5 10 15 '
Vai Ser Arg Arg Asp Leu Leu Asp Arg Leu Asp Lys Thr His Gin Phe 20 25 30 Asp Vai Leu He He Gly Gly Gly Wing Thr Gly Thr Gly Cys Ala Leu 35 40 45 Asp Ala Ala Thr Arg Giy Leu Asn Val Ala Leu Val Glu Lys Gly Asp 50 55 60 Pne Wing Ser Giy Thr Ser Ser Lys Ser Thr Lys Met He His Gly Gly 65 70 75 80
Val Arg Tyr Leu Giu Lys Wing Phe Trp Giu Phe Ser Lys Wing Gin Leu is 90 95
Asp Leu Val He Glu Ala Leu Asn Giu Arg Lys His Leu He Asn Thr 10C 105 HO Pro Wing His Leu Cys Tnr Val Leu Pro He Leu He Pro He Tyr Ser 115 120 125 Thr Trp Gir. V¿; Prc Tyr He Tyr Met Giy Cys Lys Phe Tyr Asp Phe 130 135 I Q Phe Giy Giy Ser Glr. Asn Leu Lys Lvs Ser Tyr Leu Leu Ser Lys Ser 145 15C 155 160
Wing Thr Vai Giu Lys Wing Pro Met Leu Thr Thr Asp Asn Leu Lys Wing 165 l 0? 75
Ser Leu Vai Tyr Hs Asp Gly Ser Phe Asn Asp Ser Arg Leu Asn Wing 180 185 190 Thr Leu Wing He Thr Gly Val Glu Asn Gly Wing Thr Val Leu He Tyr 195 200 205 Val Glu Val Gln Lys Leu He Lys Asp Pro Thr Ser Gly Lys Val He 210 215 220 Gly Wing Glu Wing Ar? Asp Val Glu Thr Asn Glu Leu Val Ar? He Asn 225 230 235 240 Wing Lvs Cys Val Vai Asn Wing Tnr Gly Prc Tyr Ser Asp Aia He Leu 245 250 255 Gln Met Asp Arg Asn Pro Ser Giy Leu Pro Asp Ser Pro Leu Asr. Asp 260 265 27C Asn Ser Lys He Lys Ser Thr Pne Asn Gln He Ser Val Met Asp Prc 2 5 '280 285 Lys Met Val He Pro Ser He Gly Val His He Val Leu Pro Ser Phe 290' 295 300 Tyr Ser Pro Lys Asp Met Gly Leu Leu Asp Val Arg Thr Ser Asp Gly 305 310 315 320 Arg Val Met Phe Phe Leu Pro Trp Glr. Gly Lys Val Leu Aia Gly Thr 325 330 335 Thr Asp He Pro Leu Lys Gin Val Pro Glu Asn Pro Met Pro Thr Glu 340 345 350 Wing Asp He 'Glp Asp He Leu Lys Glu Leu Gln KIS Tyr He Glu Phe
355 360 '365 Pro Vai Lys Arg Giu Asp Val Leu Ser Ala Trp Ala Giy Val Ar? Pro 37C 375 380 Leu Vil Arg Asp Pro Arg Thr He Pro Wing Asp Gly Lys Lys Gly Ser 385 390 395 400 Wing T.. R Gln Gly Val Val Are Ser His Phe Leu Phe Thr Ser Asp Asn 405 410 415 Gly Leu He Thr He Wing Gly Gly Lys Trp Thr Thr Tyr Arg Gln Met
420 425 430 Ala Glu Glu T.r Val Asp Ly-. Val Val Giu Val Gly Gly Phe His Asn 435 44C 445 Leu Lys Pro Cys Kis Thr Are Asp He Lys Leu Wing Gly Wing Glu Glu 450 455 460 Trp Tr.r Gln Asr, Tyr Val Aia Leu Leu Wing Gln Asn Tyr His Leu Ser 465 470 475 480
2 ^ Ser Lys Met Ser Asr. Tyr Leu Val Gln Asn Tyr Gly Thr Arg Ser Ser 485 490 495 He He Cys Glu Phe Phe Lys Giu Ser Met Glu Asn Lys Leu Pro Lßu 500 505 510 Ser Leu Wing Asp Lys Glu Asn Asn Val He Tyr Ser Ser Glu Glu Asn 515 520 525 Asn Leu Val Asn Phe Asp Thr Phe Arg Tyr Pro Phe Thr He Gly Glu 530 535 540 Leu Lys Tyr Ser Met Gin Tyr Glu Tyr Cys Arg Thr Pro Leu Asp Pne 545 550 555 560
Leu Leu Ara Arg Thr Arg Phe Wing Phe Leu Asp Wing Lys Giu Aia Leu 565 570 575
Asr. Wing Vai His Aia Thr Val Lys Val Met Gly Asp Glu Phe Asn Trp 580 585 590 Ser Glu Lys Arg Gln Trp Glu Leu Glu Lys Thr Val Asn Phe He 595"600 605 Gin Gly Arg Phe Gly Val 610
INFORMATION OF THE ID NO: 10:
(i) CHARACTERISTICS OF THE SEQUENCE:
l EXTENSION: 339 amino acids (Bj TYPE: amino acid (C) FILAMENTO: unknown? -: 'TOPOLOGY: unknown
MOLECULE TYPE: 'protein
ixi 'DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 10 Met Asn Gin Arg Asn Wing Being Met Thr Val He Gly Wing Gly Ser Tyr 1 5 10 15 Gly Thr Ala Leu Ala He Thr Leu Ala Arg Asn Gly His Glu Val Val 20 25 30 Leu Trp Giy His Asp Pro Glu His He Wing Thr Leu Glu Arg Asp Arg 3s "40 45 Cys Asn Aia Aia Pne Leu Pro Asp Val Pro Phe Pro Asp Thr Leu His 50 55 '60 Leu Giu Ser Asp Leu Ala Thr Ala Leu Ala Ala Ser Arg Asn He Leu 65 7C 75 80 Val Val Val Pro Ser His Val Phe Giy Glu Val Leu Arg Gln He Lys 85 90 95
Pro Leu Met Arg Pro Asp Wing Arg Leu Val Trp Wing Thr Lys Gly Leu 100 105 110 Glu Wing Glu Thr Gly Arg Leu Leu Gln Asp Val Wing Arg Glu Wing Leu 115 120 125 Gly Asp Gln He Pro Leu Wing Val He Ser Gly Pro Thr Phe Wing Lys 130 135 140 Glu Leu Wing Wing Gly Leu Pro Thr Wing Be Ser Leu Wing Ser Thr Asp 145 150 155 160
Gin Tr.r Fne Wing Asp Asp Leu Gir. Gir. Leu Leu H¿s Cys Giy Lys Ser 165 17C 175 Pne Arg Val Tyr Ser Asr. Pro Asp Phe He Gly Val Gir. Leu Gly Gly 180 185 190 Wing Val Lys Asn Vai He Wing He Gly Wing Gly Met Ser Asp Gly lie 195 200 205
twenty
Gly Phe Giy Wing Asn Wing Arg Thr Wing Leu He Thr Arg Gly Leu Wing 210 215 220 Glu Met Ser Arg Leu Gly Aia Wing Leu Gly Wing Asp Pro Aia Thr Phe 225 230 235 240
Met Gly Met Ala Giy Leu Gly Asp Leu Val Leu Thr Cys Thr Asp Aer. 245 250 255
Glp Ser Arg Asn Arg Arg Phe Gly Met Met Leu Gly Gln Giy Met Asp 260 265 270 Val Gir. Be Wing Gin Glu Lys He Gly Gin Vai Vai Glu Gly Tyr Arg 275 280 285 Asn Thr Lys Glu Val Arg Giu Leu Wing HAS Arg Phe Gly Val Glu Met 290 295 300 Pro He Thr Glu Giu He Tyr Gln Val Leu Tyr Cys Gly Lys Asp Ala 305 310 315 320
Arg Glu Ala Aia Leu Tnr Leu Leu Giy Arg Ala Ar? Lys Asp Glu Ar? 325 330 335
Ser Ser Kis
.'2) INFORMATION OF THE ID NO: 11.
li) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 501 amino acids (Bj TYPE: amino acid (C) FILAMENTO: unknown (D) TOPOLOGY: unknown
(ii) MOLECULE crctema; xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 11 Met Glu Thr Lys Asp Leu He Val He Gly Gly Gly He Asn Gly Ala 1 5 10 15
Gly He Wing Wing Asp Wing Wing Gly Arg Gly Leu Ser Val Leu Met Leu 20 25 30 Glu Wing Gln Asp Leu Wing Cys Wing Thr Ser Wing Being Ser Lys Leu 35 40 45 He His Gly Gly Leu Arg Tyr Leu Glu HAS Tyx Glu Phe Arg Leu Val 50 55 60 Ser Giu Aia Leu Aia Giu Arg Giu Val Leu Leu Lys Met Aia Prc His 65 7C 75 80
He Wing Pne Pro Met Arg Phe Arg Leu Pro HAS Arg Pro His Leu Arg 85 90 95
Pro Wing Trp Met He Arg He Gly Leu Phe Met Tyr Asp HAS Leu Giy 100 105 - 110 Lys Arg Tr.r Ser Leu Pro Gly Ser Thr Gly Leu Arg Phe Gly Wing Asn 115 120 '125 Ser Val Leu Lys Pro Glu He Lys Arg Gly Phe Glu Tyr Ser Asp Cys 130 135 140 Trp Vai Asp Asp Ala Arg Leu Val Leu Ala Asn Ala Gln Met Val Vai 145 150 155 160
Ar? Lys Giy Gly Gl ^ Vai Leu Thr Arg Thr Arg Wing Thr Ser Wing Arg 165 170 175
Arg Giu Asr Giy Leu He Val Glu Ala. Glu Asp He Asp Thr G.y 180 185 190 Lys Lys Tyr Ser Trp Glr. Aia Arg Gly Leu Val Asn Wing Thr Giy Pro 195 200 2C5 Trp Val Lys Gln Phe Phe Asp Asp Gly Met HAS Leu Pro Ser Pro ly- 210 215 220 Gly He Arg Leu He Lys Gly Ser KAS He Val Val Pro Aro Vai HAS 225 230 235"2 * 40
Thr Glr. Lys Gln Ala Tyr He Leu Gir. Asn Glu Asp Lys Arg He Val 245 250 255
Pne Val lie Prc Trp Met Asp Giu Pne Ser He Gly Thr Thr Asp 26C 265 27Q Val Glu Tyr Lys Giy Asp Pro Lys Wing Val Lys He Glu Glu Ser Glu 275 280 285 He Asn Tyr Leu Leu Asn Val Tyr Asn Thr His Phe Lys Lys Gln Leu 290 295 300 Ssx Arg Asp Asp He Val Trp Thr Tyr Ser Gly Val Arg Pro Leu Cys 305 * 310 315 320
A = p Asp Glu Ser Asp Ser Pro Gln Wing He Thr Arg Asp Tyr Thr Leu 325 330 335
Asp He HAS Asp Glu Asn Gly Lys Wing Pro Leu Leu Ser Val Phe Gly 340 345 350 Gly Lys Leu Thr Thr Tyr Arg Lys Leu Wing Glu HAS Wing Leu Glu Lys 355 360 365
Leu Tr.r Pro Tyr Tyr Gln Giy He Giy Pro Wing Trp Thr Lys Glu Se: 370 375 380 Val Leu Pro Gly Giy Aia He Giu Gly Asp Arg Asp Asp Tyr Aia Wing 385 390 395 400
Are Leu Arg Arg Arg Tyr Pro Phe Leu Thr Glu Ser Leu Aia Arg HAS 405 410 415
Tyr Ala Ar? Thr Tyr Gly Ser Asn Ser Glu Leu Leu Leu Gly Asr. Aia 420 425 430 Giy Thr Vai Ser Asp Leu Gly Glu Asp Phe Gly HAS Glu Phe Tyr Glu 435 440 445 Wing Giu Leu -ys Tyr Leu Val Asp HAS Glu Trp Val Arg Are Wing Asp 450 455 460 Asp Aia Leu Trp Arg? Rg Thr Lys Gln Gly Met Trp Leu Asn Wing Asp 465 470 475 B0
Glr. Gln Ser Are Vai Ser Gin Trp Leu Val Glu Tyr Thr Gln Gln Arg 485 490 95
Leu Ser Leu Ala Ser 50C
(2) INFORMATION OF THE ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 542 amino acids (B) TYPE: amino acid (C) FILAMENTO: unknown (D) TOPOLOGY: unknown
ii) MOLECULA TYPE: proteine
(xi) DESCRIPTION OF THE. SEQUENCE: SEQ ID NO: 12
Met Lys Thr Arg Asp Ser Gln Ser Ser Asp Val He He He Giy Gly 1 5 10 15
Gly Aia Thr Gly Wing Gly He Wing Arg Asp Cys Wing Leu Arg Gly Leu 20 25 30 -Arg Val He Leu Val Glu Arg HAS Asp He Wing Thr Gly Wing Thr Gly 35 40 45 Arg Asn HAS Gly Leu Leu HAS Ser Gly Wing Arg Tyr Ala Val Thr Asp 50 55 60 Wing Glu Ser Wing Arg Glu Cys He Ser Glu Asn Gln He Leu Lys Arg 65 7 ° 75 80
He Wing Arg His Cys Val Giu Pro Thr Asn Giy Leu Phe He Thr Leu 85 90 95
Pro Giu Asp A = p Leu Ser Phe Gln Wing Thr Phe II 100 e Arg Wing C \ 105 110
Glu Ala Gly lle Be Ala Ala Giu Ala He Asp Pr 115 or Gln Gln AJ 120 to Arg Zlt 12. He Giu Pro Ala Vai 130 Asr. Pro Ala Leu He Gly Ala Vai Lys Val 135 140 A 1s4p5 Gly Thr Val Asp Pro Phe Arg Leu Thr Ala Ala Asn Met Leu Asi 150 155 160
Aia Lys Glu HAS Gl Gly Ala Val He Leu Thr Ala HAS Glu Val Thr GH- 1i7 / DU 2-, 5 - *
Le lie A: g Giu Gly Wing Thr Val Cys Gly Val Ar 180 g Val Are Asr. HAI 185 190 Leu Thr Gly Giu Tr.r Gln Ala Leu His Al a Pro Val Val Vai Asn Wing 200 205 to Giy He Trp Gly Gln HAS He Wing Glu Tyr Wing Asp Leu Arg H 210 215 220 Arg Met Pne Pro Wing Lys Gly Being Leu Leu He Met Asp HAS Arg He 225 230 235 240
Asn Gln HAS Val He Asn Arg Cys Arg Lys Pro As Asp Aia Asp He 245 250 255
Leu Val Prro Gly, .A. ^ Spry T .n..r- H ..- e S • er Lte-u H "• e Gi¡.y.}. Thr Thr Ser Leu Arg 26C 265 270 He Asp Tyr Asn Glu He Asp Asp Asr. Arg Vai Thr Wing Glu Glu Val 275"2", ° 8"C 285 He leu Ar? Gi. Giy Glu 290 Leu Ala Pro Val Met Ala Lys 295 300 Thr Arg lle Leu Arg Ala Tyr Ser Gly Val Arg Pro Leu Val Ala Ser 305 210 315 320
Asp Asp Asp Pro Be Gly Arg Asn Leu Be Arg Gly He Val Leu Leu 325 3"3n0 335
Asp HAS Ala Glu Aro itr, r. . ^ Arg Asp Gly Leu Asp Gly Phe Ilß Thr? Thr J45 ,,. Gly 350 Gly V *: C.vs Arg Lys Leu Gly Asn Thr Arg prQ Cy = ^ 375 Thr Wing Asp Leu 380 Wing Leu Prc Giy Ser Gin Glu Pro Wing Glu Val Thr Leu Arg Lys Val 385 3-9"0- 3 , 9n5c 400 He Be Pro Leu Pro Pro Leu Arg Gly Ser Aia Val Tyr Arg HAS Glv 405 410 415
Asp Arg Thr Pro Wing Trp Leu Ser Giu Giy Arg Leu His Are Ser Leu 420 425 430 Val Cys Glu Cys Glu Ala Val Thr Ala Giy Giu Val Gin Tyr Ala Val 435 440 445 Giu Asn Leu Asn Val Asp Ser Leu Leu Asp Leu Arg Arg Arg Thr Arg 450 455 460 Val Gly Met .Giy Thr Cys Gln Gly Glu Leu Cys Wing Cys Ring Wing Wing 465 470 475 '480
Gly Leu Leu Gin Arg Pne Asn Val Thr Thr Ser Wing Gln Ser lie Glu 485 490 95
Gin Leu Ser Thr Phe Leu Asn Glu Arg Trp Lys Giy Vai Gln Pro He 50C 505 sio Wing Trp Gly Asp Wing Leu Arg Glu Ser Glu Phe Thr Arg Trp Val Ty- 515 520 '525 Glr. Gly Leu Cys Giy Leu Glu Lys Glu Gln Lys Asp Aia Leu 530 535 54th
INFORMATION OF THE ID NO: 13
CHARACTERISTICS OF THE SEQUENCE:
(A; EXTENSION: 250 amino acids (B) TYPE: amino acid (C) FILAMENTO: unknown (D) TOPOLOGY: unknown
ü) MOLECULA TYPE: protein
[xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 13 Met Gly Leu Thr Thr Lys Pro Leu Ser Leu Lys Val Asn Ala Ala Leu 1 5 10 15
Phe Asp Vai Asp Gly Thr He He He Ser Gln Pro Wing He Wing Wing 20 25 30 Phe Trp Arg Asp Phe Giy Lys Asp Lys Pro Tyr Phe Asp Wing Glu HAS. 35 40 4s Val He Gln Val Ser HAS Gly Trp Arg Thr Phe Asp Wing Wing Lys 50 55 60 Phe Wing Pro Asp Phe Wing Asn Glu Glu Tyr Val Asn Lys Leu Glu Wing 65 70 75 80
Glu He Pro Vai Lys Tyr Gly Glu Lys Ser He Glu Val Pro Gly Ala
85 90 95
Val Lys Leu Cys Asp Ala Leu Asn Aia Leu Pro Lys Giu Lys rp n_ = .. 100 105 110
Val Ala Tr.r Ser Giy Thr Arg Asp Met Ala Gir. Lys Trp Phe Glu KAS 115 '120 125 Leu Giy He Arg Arg Pro Lys Tyr Pne He Thr Aia Asn Asp Val Lys 130 135 140 Glp Gly Lys Pro HAS Pro Glu Prc Tyr Leu Lys Gly Arg Asn Giy Leu 145 150 155 160
Gly Tyr Pro He Asn Glu Gln Asp Pro Ser Lys Ser Lys Val Val Val 165 170 175
Phe Glu Asp Aia Pro Wing Giy He Wing Aia Gly Lys Aia Wing Gly Cys 180 r 185 190 Lys He He Giy He Wing Thr Thr Phe Asp Leu Asp Phe Leu Lys Glu 195 200 205 Lys Gly Cys Asp He He Val Lys Asr. HAS Glu Ser He Arg Val Gly 210 215 220 Gly Tyr Asn Aia Glu Thr Asp Giu Val Glu Phe He Phe Asp Asp Tyr 225 230 235 240
Leu Tyr Aia Lys Asp Asp Leu Leu Lys Trp 245 250 [2) INFORMATION OF SEE ID NO: 14:
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 271 amino acids (B) TYPE: amino acid (C) FILAMENTO: unknown (D) TOPOLOGY: unknown
(ii) MOLECULA TYPE: protein
(xi ^ DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 14
Met Lys Arg Pne Asn Val Leu Lys Tyr He Arg Thr Thr Lys Wing Asn x 5 10 5
He Gin Thr He Wing Met Pro Leu Thr Thr Lys Pro Leu Ser Leu Lys 20 25 30 He Asn Wing Aia Leu Phe Asp Val Asp Gly Thr He He Ser Ser Gln 35 40 45 Pro Wing Wing Wing Wing Phe Trp Arg Asp Phe Gly Lys Asp Lys Pro Tyr 50 55 60 Phe Asp Wing Glu HAS Val He HAS HAVE BE Gly Trp Arg Thr Tyr 65 70 75 80 Asp Wing Wing Lvs Pne Aia Pro Asp Phe Wing Asp Giu Glu Tyr Vai 85 90 95
Asn Lvs Leu Glu Gly Glu Lie Prc Giu Lys Tyr Gly Giu His Ser He 100 1C5 11C Giu Vai Pro Giy Wing Val Lys Leu Cys Asn Wing Leu Asn Wing Leu Pro 115 120 125 Lys Glu Lys Trp Wing Val Wing Thr Ser Gly Thr Ar ? Asp Ket Aia Lys 130 135 140 Lys Trp Phe Asp He Leu Lys He Lys Arg Prc Giu Tyr Phe He Thr 145 150 155 160
Wing Asn Asp Val Lys Gin Gly Lys Pro His Pro Glu Prc Tyr Leu Lys 165 170 175
Gly Arg Asr. Gly Leu Giy Phe Pro He Asn Glu Gln Asp Pro Ser Lys 130 185 190 Ser Lys Vai Val Val Phe Glu Asp Ala Pro Wing Gly He Wing Aia Glv 195 200 205 Lys Wing Aia Gly Cys Lys He Val Gly He Wing Thr Thr Phe Asp Leu 210 215 220 A = p Phe e Lys Glu Lys Giy Cys Asp He He Val Lys Asn His Glu 225 230 235 240
Ser He Arg Val Giy Glu Tyr Asn Aia Glu Thr Asp Glu Val Glu Leu 245 250 255 lle Fhe Asp Asp Tyr Leu Tyr Aia Lys Asp Asp Leu Leu Lys Trp 260 265 270
(2! INFORMATION FROM SE ID NO: 15:
(i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 709 amino acids (B) TYPE: amino acid id FILAMENT:. Unknown ÍD) TOPOLOGY: unknown
(ii) MOLECULE TYPE: protein (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 15
Met Phe Pro Ser Leu Phe Ar? Leu Val Val Phe Ser Lys Arg Tyr He
1 5 10 15
Phe Arg Ser Ser Gln Arg Leu Tyr Thr Ser Leu Lys Gln Glu Gln Ser 20 25 30 Arg Met Ser Lys He Met Glu Asp Leu Arg Ser Asp Tyr Val Pro Leu 35 40 45
Ali Se: He Asp Val Giy Thr Tr.r Ser Se: Are Cys He Phe 5C 60 Asn Are Trp Gly Gln Asp Val Ser Lys HAS Gln He Giu Tyr Ser Thr 65 70 75 80
Be Ala Be Lys Giy Lys He Giy Val Ser Gly Leu Arg Arg Prc Ser 85 90 95
Thr Ala Prc Wing Arg Glu Thr Pro Asn Aia Gly Asp He Lys Thr Ser
100 Í05 11C Gly Lys Pro He Phe Be Wing Glu Gly Tyr Aia He Gln Glu Thr Lys
115 120 125 Phe Le-_ Lys He Glu Glu Leu Asp Leu Asp Phe HAS Asr. Glu Pro Thr
130 125 140 Leu Lys Pne Prc Lys Pro Gly Trp Val Giu Cys His Pro Gln Lys Leu
145 150 '55 '160
Leu Val Asr. Val Vai Gln Cys Leu Ala Ser Ser Leu Leu Ser Leu Gir. 165 170 175
Thr He Asr. Ser Glu Arg Val Aia Asr. Gly Leu Pro Pro Tyr Lys Vai
180 185 190 He Cys Met Giy He Wing Asn Met Arg Glu Thr Thr He Leu Trp Ser
195 200 205 Arg Vai Trp Asn A = p Thr Are Tr.r He Lys He Val Arg Asp Lys Trp Glr. Asn Thr Ser Val
225 230 235 240
Asp Are Gir. Leu Gip Leu Arg Gin Lye Tr.r Gly Leu Prc Leu Leu Ser 245 250 255 Thr Tyr Phe Ser Cys Ser Lys Leu Ar? Trp Phe Leu Asp Asn Glu Prc 260 265 270 Leu Cys Thr Lys Wing Tyr Giu Glu Asn Asp Leu Met Phe Gly Thr Val 275 280 285 Asp Thr Trp Leu He Tyr Gln Leu Thr Lys Gln Lys Wing Phe Val Ser 290 295 300 Asp Vai Thr Asn Ala Ser Ar? Thr Gly Phe Met Asn Leu Ser Thr Leu 305 310 315 320 Lys Tyr Asp Asn Glu Leu Leu Glu Phe Trp Gly He Asp Lys Asn Leu 325 330 335 He His Met Pro Glu He Val Being Ser Gln Tyr Tyr Gly Asp Phe 340 345 350 Gly He Pro Asp Trp He Met Giu Lys Leu HAS Asp Ser Pro Lys Thr 355 360 365 Vai Leu Arg ASD Leu Val Lys Arg Asr. Leu Pro He Glp Giy Cys Leu
370 '375 380 Giy Asr; Gir. Ser Aia Ser Met Val Gly Gln Leu Wing Tyr Lys Pro Gly 3E5"390 395 400 Wing Aia Lys Cvs Thr Tyr Gly Thr Giy Cys Phe Leu Leu Tyr Asr. Thr 405 410 415 Gly Thr Lys Lys Leu He Ser Gln HAS Giy Ala Leu Thr Thr Leu Wing 420 425 430 15 Pne Trp Phe Pro HAS Leu Gln Glu Tyr Gly Gly Gln Lys Pro Giu Leu 435 440 445 Ser Lys Pro His Phe Aia Leu Glu Gly Ser Val Val Wing Ala Gly Wing 450 455 460 Val Val Gir. Leu Arg Asp Asn Leu Arg Leu He Asp Lys Ser Glu 465 470 475 480 Asp Val Gly Prc lle Aia Ser Tnr Val Pro Asp Ser Gly Gly Val Val 485 490 495
Pne Val Prc Wing Phe Ser Gly Leu Phe Wing Pro Tyr Trp Asp Pro Asp 5CC 505 510 Wing Arg Aia Thr He Met Giy Met Ser Gln Phe Thr Thr Ala Ser His 515 520 525 lle Wing Arg Wing Wing Vai Giu Giy Val Cys Phe Gln Ala Ar? Ala He 530 535 540 leu lys Ala Ke Ser Ser Asp Ale Phe Giy Giu Gly Ser Lys Asp Ar? 545 550 555 560 Asp Pne Leu Glu Glu He Ser Asp Val Thr Tyr Glu Lys Ser Pro Leu 565 570 575 Ser Vai Leu Wing Val Asp Gly Gly Met Ser Arg Ser Asn Glu Val Met 580 585 590 Gln He Gln Wing Asp He Leu Gly Pro Cys Val Lys Val Arg Arg Ser 595 600 605 Pro Thr Wing Glu Cys Thr Wing Leu Gly Wing Wing Wing Wing Wing Asn Met 610 615 620 Wing Phe Lys Asp Val A = n Glu Ar? Pro Leu Trp Lys Asp Leu His Asp 625 630 635 640
Val Lys Lys Trp Val Phe Tyr Asn Gly Met Glu Lys Asn Glu Gln He 645 650 655
Ser Prc Giu Ala HAS Prc Asn Leu Lys He Phe Are Glu Ser Asp 660 665 670 Asp Wing Glu Arg Arg Lys HAS Trp Lys Tyr Trp Glu Val Aia Val Giu 675 680 685 Arg Ser Lys Giy Trp Leu Lys Asp He Glu Gly Glu HAS Giu Glr. Vai 690"695 700 Leu Giu Asn Phe Gln 705
: 2 < INFORMATION OF THE ID SE NC: 16:
íl) CHARACTERISTICS D? SEQUENCE:
(A) EXTENSION: 51 paree base (B) TYPE: nucleic acid (C) FILAMENTO: one only ÍD) TOPOLOGY: linear
i? MOLECULUS TYPE: DNA (genomic) (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 16
GCGCGGATCC AGGAGTCTAG AATTATGGGA TTGACTACTA AACCTCTATC T
(2) INFORMATION OF THE ID NO: 17:
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 36 base pairs (B) TYPE: nucleic acid (C; FILAMENTO: one only (D) TOPOLOGY: linear
(ii) MOLECULA TYPE: DNA (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 17
GATACGCCCG GGTTACCATT TCAACAGATC GTCCTT
(2) INFORMATION OF THE ID NO: 18.
CHARACTERISTICS D? THE SEQUENCE: (A) EXTENSION: 34 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(ii) MOLECULA TYPE: DNA (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 1 8
GTGATAATAT AACCATGGCT GCTGCTGCTG ATAG
(2) INFORMATION OF THE ID NO: 19: (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 39 base pairs (B) TYPE: nucleic acid ÍC) FILAMENTO: one only (Di TOPOLOGY: linear
(ii) MOLECULUS TYPE: DNA (genomic) (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 19
GTATGATATG TTATCTTGGA TCCAATAAAT CTAATCTTC
(2) INFORMATION OF THE ID NO: 20: (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 24 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only ÍD) TOPOLOGY: linear
(ii) MOLECULA TYPE: DNA (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 20
CATGACTAGT AAGGAGGACA ATTC
? 2 INFORMATION OF THE ID NO: (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 24 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(ii) MOLECULA TYPE: DNA (genomic)
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 21
CATGGAATTG TCCTCCTTA TAG
Í2) INFORMATION OF THE ID NO: 2:
CHARACTERISTICS OF THE SEQUENCE:
ÍA; EXTENSION: 19 paree base (B) TYPE: nucleic acid (OR FILAMENT: one selo (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) [xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 22 CTAGTAAGGA GGACAATTC
(2) INFORMATION OF THE ID NO: 23 (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 19 base pairs (B) TYPE: nucleic acid IC) FILAMENT: one single TOPICOLOGY: lijieal
MOLECULE T ^ PD: DNA (genomic)
ixii DESCRIPTION D? THE SEQUENCE: SEQ ID NO: 23
CCTCCTTA
(2) INFORMATION OF THE ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 15 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D; TOPOLOGY: linear
(nor MOLECULA TYPE: other nucleic acid
A) DESCRIPTION: / desc = "primer"
(iii1 HYPOTHETICAL: NO
[iv > ANTICIPATION: NO
(xi SEQUENCE DESCRIPTION:? EC ID NO: 24
GATCCAGGAA ACAGA
2) INFORMATION OF THE ID NO: 25:
íl! CHARACTERISTICS L? THE SEQUENCE: (A) EXTENSION: 15 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(iii) MOLECULA TYPE: other nucleic acid
A) DESCRIPTION: / desc = "primer"
mi) HYPOTHETICAL.-NO
[iv) ANTICIPATION: NO
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 25
CTAGTCTGTT TCCTG
(2) INFORMATION OF THE ID NO: (i) CE CHARACTERISTICS THE SEQUENCE:
IA) EXTENSION: 22 base pairs (3 ^ TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(iv) MOLECULA TYPE: other nucleic acid
A) DESCRIPTION: / desc = "primer"
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 26
GCTTTCTGTG CTGCGGCTTT AG
"&!
) CHARACTERISTICS D? SEQUENCE:
(A- EXTENSION: 23 base pairs (B TIPG: nucleic acid [OR FILAMENT: one single (D.i TOPOLOGY: linear
(I saw MOLECULA ctrc nucleic acid
A > DESCRIPTION: / desc = "primer" (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 27
TGGTCGAGGA TCCACTTCAC TTT
(2) INFORMATION OF THE ID NO: 28: (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 51 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only ÍDj TOPOLOGY: linear
(vi) MOLECULA TYPE: other nucleic acid
A 'DESCRIPTION: / desc = "starter"
(xi) DESCRIPTION E? THE SEQUENCE: SEQ ID NO: 28
AAAGTGAAGT GGATCCTCGA CCAATTGGAT GGTGGCGCAG TAGCAAACAA
INFORMATION OF THE ID NO: 29: (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 23 pairs kiss (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(vii) MOLECULA TYPE: other nucleic acid
! DESCRIPTION: / desc = "primer"
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 26
GGATCACCGC CGCAGAAACT ACG
INFORMATION: D?
you, CHARACTERISTICS OF THE SEQUENCE:
(A'1 EXTENSION: 25 pairs case (3! TYPE: nucleic acid (C) FILAMENTO: one sclc (D: TOPOLOGY: linear (viii) MOLECULE TYPE: other nucleic acid
A) DESCRIPTION: / desc = "primer"
[xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 30
ÜTGTCAGCCG TTAAGTGTTC CTGTG
(2) INFORMATION OF THE ID NO: 31
[i! CHARACTERISTICS OF THE SEQUENCE:
EXTENSION: 23 base pairs TYPE: nucleic acid FILAMENTO: one only TOPOLOGY: linear
, IX 'MOLÉCULA TIfC: another nucleic acid
[ESCPIPTION: / .eec =: ebado
, x ?, D? SCRI? .? THE SEQUENCE: SEQ ID NO: 31 CAGTTCAACC TGTTGATAGT ACG
[2] INFORMATION OF THE ID NO: 3;
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 20 base pairs (SI TIPC: nucleic acid ÍC; FILAMENTO: one only (D) TOPOLOGY: linear
(x) MOLECULA TYPE: other nucleic acid
A) DESCRIPTION: / desc = "primer"
(x: DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 3
ATGAGTCAAA CATCAACCTT
! 2l INFORMATION OF THE ID NO: 33: CHARACTERISTICS D? THE SEQUENCE: (A) EXTENSION: 20 peres kiss (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(xi) MOLECULA TYPE: other nucleic acid
A) DESCRIPTION: / desc = "primer"
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 33
ATGGAGAAAA AAATCACTGG
[2! INFORMATION OF THE ID NO: 34
i A CHARACTERISTICS OF THE SEQUENCE:
(Ai EXTENSION: 20 base pairs (B) TYPE: nucleic acid (C'i FILAMENTO: one only (D) TOPOLOGY: linear
: nucleic acid tro
A) DESCRIPTION: / desc = "primer" (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 34
TTACGCCCCG CCCTGCCACT
[2) INFORMATION OF THE ID NO: 35:
ii) 'CHARACTERISTICS OF THE SEQUENCE:
and, EXT? NSION: 20 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(xiii) MOLECULA TYPE: other nucleic acid
TO; DESCRIPTION: / desc = "primer"
(xi! DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 35
TCAGAGGATG TGCACCTGCA
(2) INFORMATION D? LA ID NO: 36;
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 26 peres kiss (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: lineel
(xiv) MOLECULA TYPE: other nucleic acid
.) DESCRIPTION: / desc = "primer"
(xil DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 36
CGAGCATGCC GCATTTGGCA CTACTC 15
(2) INFORMATION OF THE ID NO: 37:
(il CHARACTERISTICS OF THE SEQUENCE: 20 ÍA) EXTENSION: 29 paree base Í3 'TIPC: ac nucleic nucleus ÍC) FILAMENT: one selo [D1 TOPOLOGY: linear * <;
(xv) MOLECULA TYPE: other nucleic acid
A) DESCRIPTION: / desc = "cebedor" (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 3 '
GCGTCTAGAG TAGGTTATTC CCACTCTTG
(2! INFORMATION OF THE ID NO: 38:
(i) SEQUENCE CHARACTERISTICS:
(AI EXTENSION: 26 base pairs (B) TYPE: nucleic acid (C, FILAMENTO: un solo ÍD) TOPOLOGY: linear
[xv: 'other nucleic acid
DESCRIPTION: / desc = "primer"
DESCRIPTION Z? THE SEQUENCE: SEQ ID NO: 38
GAAGTCGACC GCTGCGCCTT ATCCGG [2) INFORMATION OF SEE ID NO: 39:
(i) SEQUENCE CHARACTERISTICS:
(A) EXTENSION: 26 base pairs (B) TYPE: nucleic acid (C; FILAMENTO: one only and, TOPOLOGY: linear
[xvii) MOLECULA TYPE: other nucleic acid
l DESCRIPTION: / desc = "primer"
: xi? DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 39
CGCGTCGACG TTTACAATT7 CAGGTGGC
.2! INFORMATION OF THE ID NO: 40:
CHARACTERISTICS OF THE SEQUENCE:
: XTE 3I0N: res case (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(xvi ii) MOLECULA TYPE: other nucleic acid
A) DESCRIPTION: / desc = "primer"
(xi; DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 40
GCAGCATGCT GGACTGGTAG TAG
(2) INFORMATION OF THE ID NO: 41:
1 CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 27 base pairs (B? TIP0: nucleic acid (C? FILAMENTO: une sole 20 (D «TOPOLOGY: linear
(xix MOLECULA TYPE: other nucleic acid
A, DESCRIPTION: / desc = "primer '-> (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 41 CAGTCTAGAG TTATTGGCAA ACCTACC
[2] INFORMATION OF THE ID NO: 42:
(i) D CHARACTERISTICS? SEQUENCE:
(To the EXTENSION: 25 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(xx, another nucleic acid
A) DESCRIPTION: / desc = "primer
ixi) DESCRIPTION D? THE SEQUENCE: SEQ ID NO: 42
GATGCATGCC CAGGGCGGAG ACGGC
[2, INFORMATION 0? THE ?? ID NO: 43 (i) CHARACTERISTICS OF THE SEQUENCE:
(A) EXTENSION: 29 base pairs (B) TYPE: nucleic acid (C) FILAMENTO: one only (D) TOPOLOGY: linear
(xxi) MOLECULA TYPE: other nucleic acid
A- DESCRIPTION: / desc = "primer"
(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 43
CIAACGATTG TTCTCTAGAG AAAATGTCC
Í2 '> INFORMATION OF THE ID NO: 44:
CHARACTERISTICS D? SEQUENCE:
y: EXTENSION: 30 base pairs ÍB) TYPE: nucleic acid (C FILAMENTO: joins only (D- TOPOLOGY: linear íiii MOLECULA TYPE: nucleic acid chorus (A) DESCRIPTION: / desc = "primer"
(iii) HYPOTHETICAL: NO
; ÍV) ANTISENTIDO: NO
[xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 44
CACGCATGCA GTTCAACCTG TTGATAGTAC
(2) INFORMATION OF SEQ ID NO: 45:
(i) SEQUENCE CHARACTERISTICS:
ÍA) EXTENSION: 28 base pairs (B> TYPE: nucleic acid IC [FILAMENT: one only (Di TOPOLOGY: linear (ii <MOLECULE TYPE: other nucleic acid 20 (A) D? SCRIPTION: / desc = "primer"
iiii! HYPOTHETICAL: C
- > iv) ANTIS? NTIDO.-NC (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 45
GCGTCTAGAT CCTTTTAAAT TAAAAATG
It is noted that in relation to this date the best method known to the applicant to practice and practice the said invention is that which is clear from
the present description of the invention.
Having described the invention as above, it is claimed as property in the following:
!. ">
0
Claims (20)
1. A Method for the production of giicerol from a recombinant organism, which is characterized in that it comprises: (i) transforming a cell-guest to < decueda with a cartridge and expression comprising either or both of: (a) a gene encoding a protein having sii-cyano-3-phosphate dehydrogenase activity, and (b> a gene that encodes a protein that has gi gi-3-phosphate phosphatase activity, the host cell that has an i :: -tí: r- i J. either e :. ^; I in both of ?to; an enagenic gene encoding a polypeptide having glyceroi kinase activity, and r an alien gene that encodes a polypeptide that dehydrogenase, where the interruption prevents the expression of the piudu iu of the active gene, " (ii) cultivating the host cell transformed from (i) into ie presence of at least one carbon source selected from the group consisting of monosaccharides, oligosaccharides, polysaccharides, and substrates of a single carbon, by which is produced giiceroi; Y (iii) optionally recover the glycerol produced
2. The method of claim 1 which is characterized in that the expression cartridge comprises a gene coding for autologous and 3-phosphate dehydrogenase.
3. The method of claim 1 is characterized in that the expression cartridge comprises u;: yei; what c? dlíioe u :. glyceiul-3-phosphate rosettase enzyme.
4. The ethoac of claim 1, which is characterized in that the expression cartridge comprises genes encoding a giicerol-3-phosphate phos- phetase enzyme and a giiceroi- 3-phosphate dehydrogenase enzyme.
5. The cell of id claim 1 which is characterized in that the host cell contains an interruption in a gene encoding an endogenous giicerol kinase enzyme in which the interruption prevents the expression of the active gene product.
6. The method of claim 1 which is characterized in that the host cell contains an interruption in a gene encoding an endogenous enzyme giiceroi dehydrogenase in which the interruption prevents the expression of the active gene product.
7. Method of claim 1 which is characterized in that the host cell contains a) an interruption in a gene encoding an endogenous glycerol kinase enzyme and b; an interruption in a gene encoding an endogenous enzyme giicerol dehydrogenase, in which disruption in the respective genes prevents the expression of the active gene product.
8. The method of claim 1 which is characterized in that the suitable host cell is selected from the group consisting of becteria, yeasts, and filamentous fungi.
9. The method of claim 8 which is characterized in that a suitable host cell is selected from the group consisting of Ci trobacter, Enterobacter, Clostridium, Klebsieila, Aerobacter, Lactobacillus, Aspergillus, Saccharomyces, Schizosaccharomyces, Zygosaccharomyces, Pichia, Kl uyveromyces, Candida Hansenula. , Debaryomyces, Mucor, Torulopsis, Methylobacter, Escherichia, Salmonella, Bacillus, Streptomyces, and Pseudomonas.
10. The method of claim 9 which is characterized in that the appropriate host cell is E. Coli or Sd chai orr.y is sp.
11. r.1 method of claim 1 which is characterized in that the source of cerbono is glucose.
12. The method of claim 1 which is raised because the protein having giicerol-3-phosphate dehydrogenase activity corresponds to amino acid sequences selected from the group consisting of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 , SEQ ID NO: 10, SEQ ID NO: 11, and SEQ ID NO: 12 and in which the amino acid sequences aberce substitutions, 5 deletions or insertions of amino acids that do not alter the functional properties of the enzyme.
13. The method of claim 1 which is characterized because it protects it has activity of 10 giicerol-3-phosphatase corresponds to the amino acid sequences selected from the group consisting of SEQ ID NO: 13 and SEQ ID NO: 14, and the amino acid sequences may encompass substitutions, deletions or additions of amino acids not surrounding the function from 15 the enzyme.
14. A transformed host cell which is characterized in that it comprises: 20? A) a gene encoding a protein having an alicerol-3-phosphate phosphatase activity. (Or a gene that coats a protein that has aeziviaaa ae gncere_ - r> - resrato rosiatese. - > < (c) an interruption in a gene encoding an endogenous • -glycerol kinase; (d) an interruption in a gene encoding an endogenous ylicerol dehydrogenase; in which the disruption in the genes of (O and (d) prevent the expression of the active gene product, and wherein the host cell converts at least one carbon source selected from the group consisting of monosaccharides, cligosaccharides, polysaccharides, and subtracts of a single carbon to glycerol.
15. A transformed host cell that is "acterized because it comprises: . to; a gene that encodes a protein that has a nviaaa "o o .. - sratc aesnicrogenasa; tb, a gene that coacts a protein that has glycero-3-phosphate phosphatase activity; Y (Or an interruption in a gene encoding an endogenous rehydroxydehydrogenase; in which the interruption in the gene of (c) prevents the expression of the product of the ective gene, and where the guest cell converts at least one cerium source selected from the group consisting of monosaccharides, oiigosaccharides, poiisaccharides and substrates of a single carbon to glycerol.
16. A transformed host cell which is characterized in that it comprises: (a) a gene encoding e binds protein having a glycerol-3-phosphate dehydrogenase activity; ib) a gene encoding a protein having glycerol-3-phosphate phosphate activity; Y (c) an interruption of a gene that encodes e unites roomasoma enaognana, wherein the digestion in the gene of (c) prevents the expression of the product of the ective gene, and wherein the host cell conveys at least one carbon source selected from the group consisting of monosaccharides, ¿, Olxsdcei i ob, and substrates of a single carbon to alicercl.
17. Method for the production of 1,3-propanediol from a recombinant organism which is characterized in that it comprises: (i) transforming a suitable host cell with an expression cartridge comprising either one or both of (a) a gene encoding a protein having glycero-3-phosphate dehydrogenase activity, and (b'- a gene that codes for a protein that has glycerol-3-phosphate phosphatase activity = a, the appropriate host cell having at least one gene encoding a protein having an activity dehydrated and having an interruption in either one or both of; a) an endogenous gene encoding a polypeptide having glyceryl activity, and ib.- an endogenous gene that encodes a polypeptide that has glyceral dehydrogenase activity, in which the interruption in the genes of (a) or (b) prevents the expression of the product of the ective gene; (ii) culturing the transformed host cell of (i) in the presence of at least one carbon source selected from the group consisting of monosaccharides, oligosaccharides, polysaccharides, and single carbon substrates whereby 1,3-propanediol is produced; Y (iii) recover the 1,3-propanediol produced from [ii)
18. The method of claim 17 which is characterized in that the protein having an ectivided dehydrette is selected from the group consisting of a glycerol dehydrease enzyme and a diol dehydratase of the enzyme.
19. The method of claim 18 which is characterized in that the glycerol dehydrate enzyme is encoded by a gene, the gene isolated from a microorganism, the microorganism selected from the group consisting of Klebsiella, Lactobacillus, Enterobacter, Ci trobacter, Pelobacter, Ilyobacter and Clostridi. um.
20. The method of claim 1 which is characterized because the enzyme diol dehydrate is encoded by a gene, the gene from a microorganism, the microorganism selected from the group consisting of Kiebsieila and Seimonelle. METHOD FOR THE PRODUCTION OF GLYCEROL BY RECOMBINANT ORGANISMS SUMMARY OF THE INVENTION Recombinant organisms are provided- ^ an-t¿e- =; which comprise genes encoding glyceral-3-phosphate dehydrogenase activity and / or a glycerol-3-phosphatase useful for the production of slicerol in a variety C- i containing interruptions in the endogenous genes encoding the protein that has actividaaes of glycerol cinase and glycerin dehydrogenase.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/982,783 | 1997-12-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
MXPA00005478A true MXPA00005478A (en) | 2002-06-05 |
Family
ID=
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100550203B1 (en) | Method for The Production of Glycerol by Recombinant Organisms | |
US6358716B1 (en) | Method for the production of glycerol by recombinant organisms | |
KR100525325B1 (en) | Bioconversion of a Fermentable Carbon Source to, 1,3-Propanediol by a Single Microorganism | |
AU733534B2 (en) | Method for the production of 1,3-propanediol by recombinant microorganisms | |
DE60029971T2 (en) | PROCESS FOR THE BIOLOGICAL MANUFACTURE OF 1,3-PROPANEL | |
IL139466A (en) | Method for the production of 1, 3 - propanediol by recombinant organisms comprising genes for its synthesis and for vitamin b12 transport | |
MXPA00005478A (en) | Method for the production of glycerol by recombinant organisms | |
AU2005203028B2 (en) | Method for the production of glycerol by recombinant organisms | |
AU780783B2 (en) | Method for the production of glycerol by recombinant organisms | |
CA2624764C (en) | Method for the production of glycerol by recombinant organisms | |
MXPA99004406A (en) | Method for the production of glycerol by recombinant organisms | |
AU7156500A (en) | Bioconversion of a fermentable carbon source to 1,3-propanediol by a single microorganism | |
MXPA99004405A (en) | Method for the production of 1,3-propanediol by recombinant organisms | |
MXPA00010723A (en) | Method for the production of 1,3-propanediol by recombinant organisms comprising genes for vitamin b12 transport | |
MXPA99004337A (en) | Method for the recombinant production of 1,3-propanediol |