KR20200075813A - 개선된 철-황 클러스터 전달을 갖는 세포 공장 - Google Patents
개선된 철-황 클러스터 전달을 갖는 세포 공장 Download PDFInfo
- Publication number
- KR20200075813A KR20200075813A KR1020207002518A KR20207002518A KR20200075813A KR 20200075813 A KR20200075813 A KR 20200075813A KR 1020207002518 A KR1020207002518 A KR 1020207002518A KR 20207002518 A KR20207002518 A KR 20207002518A KR 20200075813 A KR20200075813 A KR 20200075813A
- Authority
- KR
- South Korea
- Prior art keywords
- ala
- leu
- gly
- arg
- glu
- Prior art date
Links
- BKWBIMSGEOYWCJ-UHFFFAOYSA-L iron;iron(2+);sulfanide Chemical compound [SH-].[SH-].[Fe].[Fe+2] BKWBIMSGEOYWCJ-UHFFFAOYSA-L 0.000 title abstract description 21
- 230000001976 improved effect Effects 0.000 title abstract description 11
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims abstract description 392
- 235000020958 biotin Nutrition 0.000 claims abstract description 196
- 229960002685 biotin Drugs 0.000 claims abstract description 196
- 239000011616 biotin Substances 0.000 claims abstract description 196
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 187
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 186
- 229920001184 polypeptide Polymers 0.000 claims abstract description 184
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 171
- JZRWCGZRTZMZEH-UHFFFAOYSA-N Thiamine Natural products CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N JZRWCGZRTZMZEH-UHFFFAOYSA-N 0.000 claims abstract description 148
- 238000004519 manufacturing process Methods 0.000 claims abstract description 133
- 235000019136 lipoic acid Nutrition 0.000 claims abstract description 81
- 235000019157 thiamine Nutrition 0.000 claims abstract description 81
- 239000011721 thiamine Substances 0.000 claims abstract description 81
- 229960002663 thioctic acid Drugs 0.000 claims abstract description 81
- AGBQKNBQESQNJD-UHFFFAOYSA-M lipoate Chemical compound [O-]C(=O)CCCCC1CCSS1 AGBQKNBQESQNJD-UHFFFAOYSA-M 0.000 claims abstract description 80
- 108700019146 Transgenes Proteins 0.000 claims abstract description 79
- 241000894006 Bacteria Species 0.000 claims abstract description 77
- 229960003495 thiamine Drugs 0.000 claims abstract description 57
- KYMBYSLLVAOCFI-UHFFFAOYSA-N thiamine Chemical compound CC1=C(CCO)SCN1CC1=CN=C(C)N=C1N KYMBYSLLVAOCFI-UHFFFAOYSA-N 0.000 claims abstract 13
- 101150076756 iscR gene Proteins 0.000 claims description 114
- 230000000694 effects Effects 0.000 claims description 101
- 230000014509 gene expression Effects 0.000 claims description 46
- 230000001965 increasing effect Effects 0.000 claims description 43
- 102000004169 proteins and genes Human genes 0.000 claims description 38
- AUTOLBMXDDTRRT-JGVFFNPUSA-N (4R,5S)-dethiobiotin Chemical compound C[C@@H]1NC(=O)N[C@@H]1CCCCCC(O)=O AUTOLBMXDDTRRT-JGVFFNPUSA-N 0.000 claims description 37
- 101710117026 Biotin synthase Proteins 0.000 claims description 35
- 150000001413 amino acids Chemical group 0.000 claims description 32
- 102000004190 Enzymes Human genes 0.000 claims description 31
- 108090000790 Enzymes Proteins 0.000 claims description 31
- 102000015478 lipoate synthase activity proteins Human genes 0.000 claims description 31
- 108010037535 lipoic acid synthase Proteins 0.000 claims description 31
- 238000006467 substitution reaction Methods 0.000 claims description 29
- 238000000034 method Methods 0.000 claims description 28
- 235000020955 thiamine monophosphate Nutrition 0.000 claims description 26
- 239000011621 thiamine monophosphate Substances 0.000 claims description 26
- GUGWNSHJDUEHNJ-UHFFFAOYSA-N thiamine(1+) monophosphate chloride Chemical compound [Cl-].CC1=C(CCOP(O)(O)=O)SC=[N+]1CC1=CN=C(C)N=C1N GUGWNSHJDUEHNJ-UHFFFAOYSA-N 0.000 claims description 26
- MEFKEPWMEQBLKI-AIRLBKTGSA-N S-adenosyl-L-methioninate Chemical compound O[C@@H]1[C@H](O)[C@@H](C[S+](CC[C@H](N)C([O-])=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 MEFKEPWMEQBLKI-AIRLBKTGSA-N 0.000 claims description 24
- 101710176059 4-amino-5-hydroxymethyl-2-methylpyrimidine phosphate synthase Proteins 0.000 claims description 19
- 101710098569 Phosphomethylpyrimidine synthase Proteins 0.000 claims description 19
- 238000012546 transfer Methods 0.000 claims description 19
- 101710197207 2-iminoacetate synthase Proteins 0.000 claims description 16
- 102000004316 Oxidoreductases Human genes 0.000 claims description 16
- 108090000854 Oxidoreductases Proteins 0.000 claims description 16
- 241000589516 Pseudomonas Species 0.000 claims description 16
- 108030003607 2-iminoacetate synthases Proteins 0.000 claims description 14
- 108010052919 Hydroxyethylthiazole kinase Proteins 0.000 claims description 14
- 239000001963 growth medium Substances 0.000 claims description 13
- 108010027436 Hydroxymethylpyrimidine kinase Proteins 0.000 claims description 12
- 108090000503 Phosphomethylpyrimidine synthases Proteins 0.000 claims description 11
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 claims description 11
- 108010031096 8-amino-7-oxononanoate synthase Proteins 0.000 claims description 10
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 claims description 10
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 10
- 108010004237 Glycine oxidase Proteins 0.000 claims description 10
- 230000001419 dependent effect Effects 0.000 claims description 10
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 claims description 9
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 9
- 239000002253 acid Substances 0.000 claims description 9
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 claims description 9
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 claims description 9
- 229910052799 carbon Inorganic materials 0.000 claims description 9
- 239000008103 glucose Substances 0.000 claims description 9
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 8
- 241000588722 Escherichia Species 0.000 claims description 8
- 108050005713 Lipoate-protein ligase A Proteins 0.000 claims description 8
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 claims description 8
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical compound C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 claims description 8
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 claims description 7
- 244000182625 Dictamnus albus Species 0.000 claims description 7
- 101100008681 Glycine max DHPS1 gene Proteins 0.000 claims description 7
- 241000186146 Brevibacterium Species 0.000 claims description 6
- 241000589876 Campylobacter Species 0.000 claims description 6
- 241000186216 Corynebacterium Species 0.000 claims description 6
- 241000186660 Lactobacillus Species 0.000 claims description 6
- 102000001172 Lipoyl synthase Human genes 0.000 claims description 6
- 108700040067 Lipoyl synthases Proteins 0.000 claims description 6
- 108091000080 Phosphotransferase Proteins 0.000 claims description 6
- 229960001570 ademetionine Drugs 0.000 claims description 6
- 125000000539 amino acid group Chemical group 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 6
- 102000020233 phosphotransferase Human genes 0.000 claims description 6
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 claims description 5
- SCVJRXQHFJXZFZ-KVQBGUIXSA-N 2-amino-9-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-3h-purine-6-thione Chemical compound C1=2NC(N)=NC(=S)C=2N=CN1[C@H]1C[C@H](O)[C@@H](CO)O1 SCVJRXQHFJXZFZ-KVQBGUIXSA-N 0.000 claims description 5
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 claims description 5
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 claims description 5
- 101710088194 Dehydrogenase Proteins 0.000 claims description 5
- 229930091371 Fructose Natural products 0.000 claims description 5
- 239000005715 Fructose Substances 0.000 claims description 5
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 claims description 5
- 241000194036 Lactococcus Species 0.000 claims description 5
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 claims description 5
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 claims description 5
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 claims description 5
- 229930006000 Sucrose Natural products 0.000 claims description 5
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 5
- 108030006413 Thiazole synthases Proteins 0.000 claims description 5
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 claims description 5
- RURRXTDCDGOQFC-UHFFFAOYSA-N [2-(hydroxymethyl)pyrimidin-4-yl]-oxido-oxophosphanium Chemical compound OCC1=NC=CC(P(=O)=O)=N1 RURRXTDCDGOQFC-UHFFFAOYSA-N 0.000 claims description 5
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 claims description 5
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 claims description 5
- 229930182830 galactose Natural products 0.000 claims description 5
- 239000008101 lactose Substances 0.000 claims description 5
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 claims description 5
- 239000005720 sucrose Substances 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 4
- 108090000371 Esterases Proteins 0.000 claims description 4
- 108010057366 Flavodoxin Proteins 0.000 claims description 4
- 108060004795 Methyltransferase Proteins 0.000 claims description 4
- 102000016397 Methyltransferase Human genes 0.000 claims description 4
- LCTONWCANYUPML-UHFFFAOYSA-M Pyruvate Chemical compound CC(=O)C([O-])=O LCTONWCANYUPML-UHFFFAOYSA-M 0.000 claims description 4
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 claims description 4
- 150000004702 methyl esters Chemical class 0.000 claims description 4
- 241000589220 Acetobacter Species 0.000 claims description 3
- 241000589291 Acinetobacter Species 0.000 claims description 3
- 241001453380 Burkholderia Species 0.000 claims description 3
- 101710154896 Octanoyltransferase Proteins 0.000 claims description 3
- 241000607720 Serratia Species 0.000 claims description 3
- 108020002494 acetyltransferase Proteins 0.000 claims description 3
- 102000005421 acetyltransferase Human genes 0.000 claims description 3
- 229940039696 lactobacillus Drugs 0.000 claims description 3
- 230000004952 protein activity Effects 0.000 claims description 3
- 108010073112 Dihydrolipoyllysine-residue acetyltransferase Proteins 0.000 claims description 2
- 102000009093 Dihydrolipoyllysine-residue acetyltransferase Human genes 0.000 claims description 2
- 150000004712 monophosphates Chemical class 0.000 claims description 2
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims 1
- 230000001580 bacterial effect Effects 0.000 abstract description 73
- 230000015572 biosynthetic process Effects 0.000 abstract description 39
- 210000004027 cell Anatomy 0.000 description 147
- 241000588724 Escherichia coli Species 0.000 description 89
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 86
- 241000282326 Felis catus Species 0.000 description 67
- 239000013612 plasmid Substances 0.000 description 61
- 125000003275 alpha amino acid group Chemical group 0.000 description 55
- 230000035772 mutation Effects 0.000 description 42
- 230000037361 pathway Effects 0.000 description 40
- 241000880493 Leptailurus serval Species 0.000 description 38
- 101150029327 bioB gene Proteins 0.000 description 38
- 230000012010 growth Effects 0.000 description 34
- 235000018102 proteins Nutrition 0.000 description 33
- 235000001014 amino acid Nutrition 0.000 description 32
- 230000001939 inductive effect Effects 0.000 description 31
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 30
- 230000002018 overexpression Effects 0.000 description 30
- 102200129094 rs118203933 Human genes 0.000 description 29
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 27
- 108010050848 glycylleucine Proteins 0.000 description 26
- 239000002609 medium Substances 0.000 description 26
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 25
- 238000003786 synthesis reaction Methods 0.000 description 25
- 108010047495 alanylglycine Proteins 0.000 description 24
- 230000006698 induction Effects 0.000 description 24
- 102200156862 rs121964891 Human genes 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 23
- 101100218845 Escherichia coli (strain K12) bioH gene Proteins 0.000 description 23
- 229940024606 amino acid Drugs 0.000 description 22
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 19
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 18
- 238000004166 bioassay Methods 0.000 description 18
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 17
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 17
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 17
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 17
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 17
- 108010061238 threonyl-glycine Proteins 0.000 description 17
- 108010087924 alanylproline Proteins 0.000 description 16
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 15
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 15
- 108010041407 alanylaspartic acid Proteins 0.000 description 15
- 108010092854 aspartyllysine Proteins 0.000 description 15
- 108010078144 glutaminyl-glycine Proteins 0.000 description 15
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 14
- 235000008170 thiamine pyrophosphate Nutrition 0.000 description 14
- 239000011678 thiamine pyrophosphate Substances 0.000 description 14
- YXVCLPJQTZXJLH-UHFFFAOYSA-N thiamine(1+) diphosphate chloride Chemical compound [Cl-].CC1=C(CCOP(O)(=O)OP(O)(O)=O)SC=[N+]1CC1=CN=C(C)N=C1N YXVCLPJQTZXJLH-UHFFFAOYSA-N 0.000 description 14
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 13
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 13
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 13
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 13
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 13
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 13
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 13
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 13
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 13
- 102100021277 Beta-secretase 2 Human genes 0.000 description 13
- 101710150190 Beta-secretase 2 Proteins 0.000 description 13
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 13
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 13
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 13
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 13
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 13
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 13
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 13
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 13
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 13
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 13
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 13
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 13
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 13
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 13
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 13
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 13
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 13
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 13
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 13
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 13
- 108010054813 diprotin B Proteins 0.000 description 13
- 239000000543 intermediate Substances 0.000 description 13
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 13
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 12
- 108010005233 alanylglutamic acid Proteins 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 12
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 12
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 12
- WWZKQHOCKIZLMA-UHFFFAOYSA-N octanoic acid Chemical compound CCCCCCCC(O)=O WWZKQHOCKIZLMA-UHFFFAOYSA-N 0.000 description 12
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 11
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 11
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 11
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 11
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 11
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 11
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 11
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 11
- 239000005090 green fluorescent protein Substances 0.000 description 11
- 239000006228 supernatant Substances 0.000 description 11
- 229960004441 tyrosine Drugs 0.000 description 11
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 10
- 241001013691 Escherichia coli BW25113 Species 0.000 description 10
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 10
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 10
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 10
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 10
- 239000000758 substrate Substances 0.000 description 10
- 101150029215 thiC gene Proteins 0.000 description 10
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 10
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 9
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 9
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 9
- 108010064235 lysylglycine Proteins 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 101150100613 thiH gene Proteins 0.000 description 9
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 8
- 108010083590 Apoproteins Proteins 0.000 description 8
- 102000006410 Apoproteins Human genes 0.000 description 8
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 8
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 8
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 8
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 8
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 8
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 8
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 8
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 8
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 8
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 8
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 8
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 8
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 8
- 229960000723 ampicillin Drugs 0.000 description 8
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 229960002433 cysteine Drugs 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- 101150091094 lipA gene Proteins 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 108010056582 methionylglutamic acid Proteins 0.000 description 8
- 108010090894 prolylleucine Proteins 0.000 description 8
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 8
- 241000186226 Corynebacterium glutamicum Species 0.000 description 7
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 7
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 7
- 210000000349 chromosome Anatomy 0.000 description 7
- 235000018417 cysteine Nutrition 0.000 description 7
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 101150100173 fdx gene Proteins 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 102220040294 rs193171026 Human genes 0.000 description 7
- 241000894007 species Species 0.000 description 7
- 229910052717 sulfur Inorganic materials 0.000 description 7
- 108010073969 valyllysine Proteins 0.000 description 7
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 6
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 6
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 6
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 6
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- 108010076441 Ala-His-His Proteins 0.000 description 6
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 6
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 6
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 6
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 6
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 6
- 101100481176 Bacillus subtilis (strain 168) thiE gene Proteins 0.000 description 6
- KVGPYKUIHZJWGA-BQBZGAKWSA-N Cys-Met-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O KVGPYKUIHZJWGA-BQBZGAKWSA-N 0.000 description 6
- 101100494344 Desulfobacterium autotrophicum (strain ATCC 43914 / DSM 3382 / HRM2) bzaF gene Proteins 0.000 description 6
- 241001646716 Escherichia coli K-12 Species 0.000 description 6
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 6
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 6
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 6
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 6
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 6
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 6
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 6
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 6
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 6
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 6
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 6
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 6
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 6
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 6
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 6
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 6
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 6
- 241000187747 Streptomyces Species 0.000 description 6
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 6
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 6
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 6
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 6
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 6
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 6
- 235000004279 alanine Nutrition 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010070944 alanylhistidine Proteins 0.000 description 6
- OBETXYAYXDNJHR-UHFFFAOYSA-N alpha-ethylcaproic acid Natural products CCCCC(CC)C(O)=O OBETXYAYXDNJHR-UHFFFAOYSA-N 0.000 description 6
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 6
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 6
- 230000027455 binding Effects 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 239000013613 expression plasmid Substances 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 238000012239 gene modification Methods 0.000 description 6
- 230000005017 genetic modification Effects 0.000 description 6
- 235000013617 genetically modified food Nutrition 0.000 description 6
- 108010089804 glycyl-threonine Proteins 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 108010005942 methionylglycine Proteins 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 6
- 108010073101 phenylalanylleucine Proteins 0.000 description 6
- GTQXMAIXVFLYKF-UHFFFAOYSA-N thiochrome Chemical compound CC1=NC=C2CN3C(C)=C(CCO)SC3=NC2=N1 GTQXMAIXVFLYKF-UHFFFAOYSA-N 0.000 description 6
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 6
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 5
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 5
- ODBSSLHUFPJRED-CIUDSAMLSA-N Asn-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ODBSSLHUFPJRED-CIUDSAMLSA-N 0.000 description 5
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 5
- 101100226150 Bacillus subtilis (strain 168) estA gene Proteins 0.000 description 5
- 241000196324 Embryophyta Species 0.000 description 5
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 5
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 5
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 5
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 5
- 101150013996 LIP gene Proteins 0.000 description 5
- 239000006137 Luria-Bertani broth Substances 0.000 description 5
- 241000589597 Paracoccus denitrificans Species 0.000 description 5
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 5
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 5
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 5
- 101100128403 Vibrio cholerae serotype O1 (strain ATCC 39315 / El Tor Inaba N16961) hlyC gene Proteins 0.000 description 5
- 239000008272 agar Substances 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 5
- 230000033228 biological regulation Effects 0.000 description 5
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 108010060199 cysteinylproline Proteins 0.000 description 5
- 150000004665 fatty acids Chemical class 0.000 description 5
- 230000037433 frameshift Effects 0.000 description 5
- 239000003112 inhibitor Substances 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 101150056138 lipA1 gene Proteins 0.000 description 5
- 101150114896 lipA2 gene Proteins 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 239000011593 sulfur Substances 0.000 description 5
- 101150113425 thiL gene Proteins 0.000 description 5
- -1 this Proteins 0.000 description 5
- 230000009261 transgenic effect Effects 0.000 description 5
- 239000013598 vector Substances 0.000 description 5
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 4
- DVLFYONBTKHTER-UHFFFAOYSA-N 3-(N-morpholino)propanesulfonic acid Chemical class OS(=O)(=O)CCCN1CCOCC1 DVLFYONBTKHTER-UHFFFAOYSA-N 0.000 description 4
- PKYFHKIYHBRTPI-UHFFFAOYSA-N 4-amino-2-methyl-5-phosphooxymethylpyrimidine Chemical compound CC1=NC=C(COP(O)(O)=O)C(N)=N1 PKYFHKIYHBRTPI-UHFFFAOYSA-N 0.000 description 4
- PDACUKOKVHBVHJ-XVFCMESISA-N 5-amino-1-(5-phospho-beta-D-ribosyl)imidazole Chemical compound NC1=CN=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 PDACUKOKVHBVHJ-XVFCMESISA-N 0.000 description 4
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 4
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 4
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 4
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 4
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 4
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 4
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 4
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 4
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 4
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 4
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 4
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 4
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 4
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 4
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 4
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 4
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 4
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 4
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 4
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 4
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 4
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 4
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 4
- 241000421688 Chloracidobacterium thermophilum B Species 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 4
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 4
- MBMLMWLHJBBADN-UHFFFAOYSA-N Ferrous sulfide Chemical class [Fe]=S MBMLMWLHJBBADN-UHFFFAOYSA-N 0.000 description 4
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 4
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 4
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 4
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 4
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 4
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 4
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 4
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 4
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 4
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 4
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 4
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 4
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 4
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 4
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 4
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 4
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 4
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 4
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 4
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 4
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 4
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 4
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 4
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 4
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 4
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 4
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 4
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 4
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 4
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 4
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 4
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 4
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 4
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 4
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 4
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 4
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 4
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 4
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 4
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 4
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 4
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 4
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 4
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 4
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 4
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 4
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 4
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 4
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 4
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 4
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 4
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 4
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 4
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 108010084455 Zeocin Proteins 0.000 description 4
- 238000007792 addition Methods 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 101150076754 bioA gene Proteins 0.000 description 4
- FLKYBGKDCCEQQM-WYUVZMMLSA-M cefazolin sodium Chemical compound [Na+].S1C(C)=NN=C1SCC1=C(C([O-])=O)N2C(=O)[C@@H](NC(=O)CN3N=NN=C3)[C@H]2SC1 FLKYBGKDCCEQQM-WYUVZMMLSA-M 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 235000014113 dietary fatty acids Nutrition 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 229930195729 fatty acid Natural products 0.000 description 4
- 239000000194 fatty acid Substances 0.000 description 4
- 101150057222 fpr gene Proteins 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 230000002401 inhibitory effect Effects 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 125000003977 lipoyl group Chemical group S1SC(C([H])([H])C(C(C(C(=O)[*])([H])[H])([H])[H])([H])[H])([H])C([H])([H])C1([H])[H] 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 4
- 108010025488 pinealon Proteins 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 108091005946 superfolder green fluorescent proteins Proteins 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 101150080237 thiF gene Proteins 0.000 description 4
- 230000001988 toxicity Effects 0.000 description 4
- 231100000419 toxicity Toxicity 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 4
- OCYMERZCMYJQQO-UHFFFAOYSA-N 4-methyl-5-(2-phosphonooxyethyl)thiazole Chemical compound CC=1N=CSC=1CCOP(O)(O)=O OCYMERZCMYJQQO-UHFFFAOYSA-N 0.000 description 3
- GUAHPAJOXVYFON-UHFFFAOYSA-N 8-amino-7-oxononanoic acid Chemical compound CC([NH3+])C(=O)CCCCCC([O-])=O GUAHPAJOXVYFON-UHFFFAOYSA-N 0.000 description 3
- 241000506839 Agrobacterium fabrum Species 0.000 description 3
- 241000589176 Agrobacterium vitis Species 0.000 description 3
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 description 3
- 244000063299 Bacillus subtilis Species 0.000 description 3
- 101100381793 Bacillus subtilis (strain 168) bioK gene Proteins 0.000 description 3
- 241000276408 Bacillus subtilis subsp. subtilis str. 168 Species 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- 108010078791 Carrier Proteins Proteins 0.000 description 3
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 3
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 3
- 108010074122 Ferredoxins Proteins 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 3
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 3
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 3
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 3
- 241000665848 Isca Species 0.000 description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- 102000003960 Ligases Human genes 0.000 description 3
- 108090000364 Ligases Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 241000030574 Ruegeria pomeroyi Species 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 241000218483 Streptomyces lydicus Species 0.000 description 3
- 108020005038 Terminator Codon Proteins 0.000 description 3
- 239000004098 Tetracycline Substances 0.000 description 3
- 108030007080 Thiamine-phosphate kinases Proteins 0.000 description 3
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 3
- 241001026323 Wolbachia endosymbiont of Cimex lectularius Species 0.000 description 3
- 238000000540 analysis of variance Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- 102000034356 gene-regulatory proteins Human genes 0.000 description 3
- 108091006104 gene-regulatory proteins Proteins 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 229910052742 iron Inorganic materials 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 229960004452 methionine Drugs 0.000 description 3
- 235000006109 methionine Nutrition 0.000 description 3
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000027756 respiratory electron transport chain Effects 0.000 description 3
- 229960002180 tetracycline Drugs 0.000 description 3
- 229930101283 tetracycline Natural products 0.000 description 3
- 235000019364 tetracycline Nutrition 0.000 description 3
- 150000003522 tetracyclines Chemical class 0.000 description 3
- 101150019895 thiE gene Proteins 0.000 description 3
- 101150040057 thiG gene Proteins 0.000 description 3
- 231100000331 toxic Toxicity 0.000 description 3
- 230000002588 toxic effect Effects 0.000 description 3
- 238000012070 whole genome sequencing analysis Methods 0.000 description 3
- ZNWPOBXVOKXYOW-UHFFFAOYSA-N (4-amino-2-methylpyrimidin-5-yl)methyl-phosphonooxyphosphinic acid Chemical compound CC1=NC=C(CP(O)(=O)OP(O)(O)=O)C(N)=N1 ZNWPOBXVOKXYOW-UHFFFAOYSA-N 0.000 description 2
- AJPADPZSRRUGHI-RFZPGFLSSA-N 1-deoxy-D-xylulose 5-phosphate Chemical compound CC(=O)[C@@H](O)[C@H](O)COP(O)(O)=O AJPADPZSRRUGHI-RFZPGFLSSA-N 0.000 description 2
- GLDQAMYCGOIJDV-UHFFFAOYSA-N 2,3-dihydroxybenzoic acid Chemical compound OC(=O)C1=CC=CC(O)=C1O GLDQAMYCGOIJDV-UHFFFAOYSA-N 0.000 description 2
- AGQJQCFEPUVXNK-UHFFFAOYSA-N 4-amino-2-methyl-5-diphosphooxymethylpyrimidine Chemical compound CC1=NC=C(COP(O)(=O)OP(O)(O)=O)C(N)=N1 AGQJQCFEPUVXNK-UHFFFAOYSA-N 0.000 description 2
- ALYNCZNDIQEVRV-UHFFFAOYSA-N 4-aminobenzoic acid Chemical compound NC1=CC=C(C(O)=O)C=C1 ALYNCZNDIQEVRV-UHFFFAOYSA-N 0.000 description 2
- QRZMXADUXZADTF-UHFFFAOYSA-N 4-aminoimidazole Chemical compound NC1=CNC=N1 QRZMXADUXZADTF-UHFFFAOYSA-N 0.000 description 2
- FJKROLUGYXJWQN-UHFFFAOYSA-N 4-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 2
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 2
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 2
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 2
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 2
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 2
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 2
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 2
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 2
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 2
- OAIGZYFGCNNVIE-ZPFDUUQYSA-N Ala-Val-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O OAIGZYFGCNNVIE-ZPFDUUQYSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- 108010039224 Amidophosphoribosyltransferase Proteins 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- BAVDUESNGSMLPI-CIUDSAMLSA-N Arg-Asn-Gly-Ser Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BAVDUESNGSMLPI-CIUDSAMLSA-N 0.000 description 2
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 2
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 2
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 2
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 2
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 2
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 2
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 2
- FANGHKQYFPYDNB-UBHSHLNASA-N Asn-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N FANGHKQYFPYDNB-UBHSHLNASA-N 0.000 description 2
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 2
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 2
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 2
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 2
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 2
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 2
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 2
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 2
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108010018763 Biotin carboxylase Proteins 0.000 description 2
- 241001622847 Buttiauxella Species 0.000 description 2
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 2
- QYKJOVAXAKTKBR-FXQIFTODSA-N Cys-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N QYKJOVAXAKTKBR-FXQIFTODSA-N 0.000 description 2
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 2
- ZIKWRNJXFIQECJ-CIUDSAMLSA-N Cys-Cys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZIKWRNJXFIQECJ-CIUDSAMLSA-N 0.000 description 2
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 2
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 2
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 2
- BSGXXYRIDXUEOM-IHRRRGAJSA-N Cys-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N BSGXXYRIDXUEOM-IHRRRGAJSA-N 0.000 description 2
- DQBRIEGWTLXALA-GQGQLFGLSA-N Cys-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N DQBRIEGWTLXALA-GQGQLFGLSA-N 0.000 description 2
- 102100035406 Cysteine desulfurase, mitochondrial Human genes 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 2
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 2
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 2
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 2
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 2
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 2
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 2
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 2
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 2
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 2
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 2
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 2
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 2
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 2
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 2
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- WRFOZIJRODPLIA-QWRGUYRKSA-N Gly-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O WRFOZIJRODPLIA-QWRGUYRKSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 2
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 2
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 2
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 2
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 2
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 2
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 2
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 2
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 2
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 2
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 2
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 2
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 2
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 2
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 2
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 2
- 101001023837 Homo sapiens Cysteine desulfurase, mitochondrial Proteins 0.000 description 2
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 2
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 2
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 2
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 2
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 2
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- 241000588749 Klebsiella oxytoca Species 0.000 description 2
- 241001477369 Kosakonia sacchari Species 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- 235000019766 L-Lysine Nutrition 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 2
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 2
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 239000007993 MOPS buffer Substances 0.000 description 2
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 2
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 2
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 2
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 2
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 2
- JZNGSNMTXAHMSV-AVGNSLFASA-N Met-His-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JZNGSNMTXAHMSV-AVGNSLFASA-N 0.000 description 2
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 2
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 2
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 2
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 2
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 2
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 2
- SBFPAAPFKZPDCZ-JYJNAYRXSA-N Met-Pro-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SBFPAAPFKZPDCZ-JYJNAYRXSA-N 0.000 description 2
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 2
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 2
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 2
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 2
- 108010006519 Molecular Chaperones Proteins 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- SEQKRHFRPICQDD-UHFFFAOYSA-N N-tris(hydroxymethyl)methylglycine Chemical compound OCC(CO)(CO)[NH2+]CC([O-])=O SEQKRHFRPICQDD-UHFFFAOYSA-N 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 241001057811 Paracoccus <mealybug> Species 0.000 description 2
- 241001329728 Paracoccus denitrificans PD1222 Species 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 2
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 2
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 2
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- 241000881813 Pluralibacter gergoviae Species 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 2
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 2
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- 241000589776 Pseudomonas putida Species 0.000 description 2
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 2
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 241000736110 Sphingomonas paucimobilis Species 0.000 description 2
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 2
- YTPLMLYBLZKORZ-UHFFFAOYSA-N Thiophene Chemical group C=1C=CSC=1 YTPLMLYBLZKORZ-UHFFFAOYSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 2
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 2
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 2
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 2
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 2
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 2
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 2
- 108090000340 Transaminases Proteins 0.000 description 2
- 102000003929 Transaminases Human genes 0.000 description 2
- 102000004357 Transferases Human genes 0.000 description 2
- 108090000992 Transferases Proteins 0.000 description 2
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 2
- QHWMVGCEQAPQDK-UMPQAUOISA-N Trp-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O QHWMVGCEQAPQDK-UMPQAUOISA-N 0.000 description 2
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 2
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 2
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 2
- RGYDQHBLMMAYNZ-IHRRRGAJSA-N Tyr-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N RGYDQHBLMMAYNZ-IHRRRGAJSA-N 0.000 description 2
- GHUNBABNQPIETG-MELADBBJSA-N Tyr-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O GHUNBABNQPIETG-MELADBBJSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 2
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 2
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- FRUYSSRPJXNRRB-GUBZILKMSA-N Val-Cys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FRUYSSRPJXNRRB-GUBZILKMSA-N 0.000 description 2
- ICFRWCLVYFKHJV-FXQIFTODSA-N Val-Cys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N ICFRWCLVYFKHJV-FXQIFTODSA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 2
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 229930003756 Vitamin B7 Natural products 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 2
- 239000003963 antioxidant agent Substances 0.000 description 2
- 230000003078 antioxidant effect Effects 0.000 description 2
- 235000006708 antioxidants Nutrition 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- UORVGPXVDQYIDP-UHFFFAOYSA-N borane Chemical compound B UORVGPXVDQYIDP-UHFFFAOYSA-N 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 238000006473 carboxylation reaction Methods 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 231100000221 frame shift mutation induction Toxicity 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 230000008826 genomic mutation Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 108010046775 glutamyl-isoleucyl-leucyl-aspartyl-valine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 230000003284 homeostatic effect Effects 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 2
- 230000008774 maternal effect Effects 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 239000011785 micronutrient Substances 0.000 description 2
- 235000013369 micronutrients Nutrition 0.000 description 2
- 238000007481 next generation sequencing Methods 0.000 description 2
- 125000002801 octanoyl group Chemical group C(CCCCCCC)(=O)* 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000036542 oxidative stress Effects 0.000 description 2
- VUUFCJOPLZRMQF-TYSVMGFPSA-N phosphoric acid (3S,4R)-3,4,5-trihydroxypentan-2-one Chemical compound OP(O)(O)=O.CC(=O)[C@@H](O)[C@H](O)CO VUUFCJOPLZRMQF-TYSVMGFPSA-N 0.000 description 2
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- NGVDGCNFYWLIFO-UHFFFAOYSA-N pyridoxal 5'-phosphate Chemical compound CC1=NC=C(COP(O)(O)=O)C(C=O)=C1O NGVDGCNFYWLIFO-UHFFFAOYSA-N 0.000 description 2
- 125000000714 pyrimidinyl group Chemical group 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 229960000268 spectinomycin Drugs 0.000 description 2
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- 125000004434 sulfur atom Chemical group 0.000 description 2
- 101150074714 thiD gene Proteins 0.000 description 2
- 101150045315 thiO gene Proteins 0.000 description 2
- 101150071180 thiS gene Proteins 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 235000011912 vitamin B7 Nutrition 0.000 description 2
- 239000011735 vitamin B7 Substances 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 1
- MDLNKYQRLGPVOG-UHFFFAOYSA-N (4-amino-2-methylpyrimidin-5-yl)methanol;phosphoric acid Chemical compound OP(O)(O)=O.CC1=NC=C(CO)C(N)=N1 MDLNKYQRLGPVOG-UHFFFAOYSA-N 0.000 description 1
- 101150084750 1 gene Proteins 0.000 description 1
- 229940082044 2,3-dihydroxybenzoic acid Drugs 0.000 description 1
- 229940090248 4-hydroxybenzoic acid Drugs 0.000 description 1
- XGYIMTFOTBMPFP-KQYNXXCUSA-N 5'-deoxyadenosine Chemical compound O[C@@H]1[C@H](O)[C@@H](C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 XGYIMTFOTBMPFP-KQYNXXCUSA-N 0.000 description 1
- KCEGBPIYGIWCDH-UHFFFAOYSA-N 7,8-diaminononanoic acid Chemical compound CC(N)C(N)CCCCCC(O)=O KCEGBPIYGIWCDH-UHFFFAOYSA-N 0.000 description 1
- GUAHPAJOXVYFON-SSDOTTSWSA-N 7-keto-8-aminopelargonic acid Chemical compound C[C@@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-SSDOTTSWSA-N 0.000 description 1
- 241000321865 Acidithiobacillus ferrivorans Species 0.000 description 1
- 241000751199 Agrobacterium fabrum str. C58 Species 0.000 description 1
- 241001235252 Agrobacterium vitis S4 Species 0.000 description 1
- 241000640374 Alicyclobacillus acidocaldarius Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 101100152731 Arabidopsis thaliana TH2 gene Proteins 0.000 description 1
- 241000490494 Arabis Species 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- 241000370685 Arge Species 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 101100268480 Bacillus subtilis (strain 168) gndA gene Proteins 0.000 description 1
- 101100204370 Bacillus subtilis (strain 168) sufU gene Proteins 0.000 description 1
- 241000606124 Bacteroides fragilis Species 0.000 description 1
- 241000588807 Bordetella Species 0.000 description 1
- 241000588780 Bordetella parapertussis Species 0.000 description 1
- 101100341204 Buchnera aphidicola subsp. Baizongia pistaciae (strain Bp) nifU gene Proteins 0.000 description 1
- BVHWQQWLJPFYDD-OVMPVWGRSA-N C[S+](CC[C@@H](C([O-])=O)N=O)C[C@H]([C@H]([C@H]1O)O)O[C@H]1N1C(N=CN=C2N)=C2N=C1 Chemical group C[S+](CC[C@@H](C([O-])=O)N=O)C[C@H]([C@H]([C@H]1O)O)O[C@H]1N1C(N=CN=C2N)=C2N=C1 BVHWQQWLJPFYDD-OVMPVWGRSA-N 0.000 description 1
- 241001115169 Candidatus Baumannia cicadellinicola Species 0.000 description 1
- 244000068645 Carya illinoensis Species 0.000 description 1
- 235000009025 Carya illinoensis Nutrition 0.000 description 1
- 241000205484 Cenarchaeum Species 0.000 description 1
- 241000205387 Cenarchaeum symbiosum Species 0.000 description 1
- 241000142757 Chromohalobacter Species 0.000 description 1
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 241000252867 Cupriavidus metallidurans Species 0.000 description 1
- QMNFFXRFOJIOKZ-UHFFFAOYSA-N Cycloguanyl Natural products CC1(C)N=C(N)N=C(N)N1C1=CC=C(Cl)C=C1 QMNFFXRFOJIOKZ-UHFFFAOYSA-N 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 description 1
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 101100425082 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) thiA gene Proteins 0.000 description 1
- 241001337151 Enterobacter timonensis Species 0.000 description 1
- 101100149206 Escherichia coli (strain K12) selO gene Proteins 0.000 description 1
- 101150093530 Fer gene Proteins 0.000 description 1
- 241000102723 Gallionella capsiferriformans Species 0.000 description 1
- 208000001613 Gambling Diseases 0.000 description 1
- 244000287680 Garcinia dulcis Species 0.000 description 1
- 241001494297 Geobacter sulfurreducens Species 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 101100111505 Haemophilus influenzae (strain PittGG) bioB gene Proteins 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- UQSXHKLRYXJYBZ-UHFFFAOYSA-N Iron oxide Chemical compound [Fe]=O UQSXHKLRYXJYBZ-UHFFFAOYSA-N 0.000 description 1
- 241001026509 Kata Species 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- 239000004201 L-cysteine Substances 0.000 description 1
- 235000013878 L-cysteine Nutrition 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- 241001647841 Leclercia adecarboxylata Species 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- 241000193386 Lysinibacillus sphaericus Species 0.000 description 1
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 1
- 206010026749 Mania Diseases 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241001048922 Methanococcus aeolicus Nankai-3 Species 0.000 description 1
- 101100023016 Methanothermobacter marburgensis (strain ATCC BAA-927 / DSM 2133 / JCM 14651 / NBRC 100331 / OCM 82 / Marburg) mat gene Proteins 0.000 description 1
- 241000589346 Methylococcus capsulatus Species 0.000 description 1
- 239000012901 Milli-Q water Substances 0.000 description 1
- 102000002568 Multienzyme Complexes Human genes 0.000 description 1
- 108010093369 Multienzyme Complexes Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 description 1
- 108020004485 Nonsense Codon Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000193390 Parageobacillus thermoglucosidasius Species 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000186429 Propionibacterium Species 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 241000320117 Pseudomonas putida KT2440 Species 0.000 description 1
- 241000939704 Pusillimonas Species 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241001026379 Ruegeria pomeroyi DSS-3 Species 0.000 description 1
- ZJUKTBDSGOFHSH-WFMPWKQPSA-N S-Adenosylhomocysteine Chemical compound O[C@@H]1[C@H](O)[C@@H](CSCC[C@H](N)C(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZJUKTBDSGOFHSH-WFMPWKQPSA-N 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 102220519855 Serine protease inhibitor Kazal-type 7_C92A_mutation Human genes 0.000 description 1
- 102220519849 Serine protease inhibitor Kazal-type 7_C98A_mutation Human genes 0.000 description 1
- 241000607715 Serratia marcescens Species 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241000607762 Shigella flexneri Species 0.000 description 1
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 1
- 241001463631 Sphingobacterium sp. JB170 Species 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 241000173600 Streptomyces pratensis Species 0.000 description 1
- 241000349084 Streptomyces sp. A02 Species 0.000 description 1
- 241001453296 Synechococcus elongatus Species 0.000 description 1
- 241000192581 Synechocystis sp. Species 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- UZMAPBJVXOGOFT-UHFFFAOYSA-N Syringetin Natural products COC1=C(O)C(OC)=CC(C2=C(C(=O)C3=C(O)C=C(O)C=C3O2)O)=C1 UZMAPBJVXOGOFT-UHFFFAOYSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 239000007997 Tricine buffer Substances 0.000 description 1
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- 229930003471 Vitamin B2 Natural products 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 101150077561 aceF gene Proteins 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 238000005882 aldol condensation reaction Methods 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229960004050 aminobenzoic acid Drugs 0.000 description 1
- 101150073130 ampR gene Proteins 0.000 description 1
- 230000009604 anaerobic growth Effects 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 125000005110 aryl thio group Chemical group 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 101150043536 bioH gene Proteins 0.000 description 1
- 230000008436 biogenesis Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 229910000085 borane Inorganic materials 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- FAPWYRCQGJNNSJ-CTWWJBIBSA-L calcium;3-[[(2s)-2,4-dihydroxy-3,3-dimethylbutanoyl]amino]propanoate Chemical compound [Ca+2].OCC(C)(C)[C@H](O)C(=O)NCCC([O-])=O.OCC(C)(C)[C@H](O)C(=O)NCCC([O-])=O FAPWYRCQGJNNSJ-CTWWJBIBSA-L 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000011088 calibration curve Methods 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000019522 cellular metabolic process Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000078 claw Anatomy 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 125000000753 cycloalkyl group Chemical group 0.000 description 1
- 108010000742 dTMP kinase Proteins 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- TVMUHOAONWHJBV-UHFFFAOYSA-N dehydroglycine Chemical compound OC(=O)C=N TVMUHOAONWHJBV-UHFFFAOYSA-N 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000000378 dietary effect Effects 0.000 description 1
- 235000015872 dietary supplement Nutrition 0.000 description 1
- KCFYHBSOLOXZIF-UHFFFAOYSA-N dihydrochrysin Natural products COC1=C(O)C(OC)=CC(C2OC3=CC(O)=CC(O)=C3C(=O)C2)=C1 KCFYHBSOLOXZIF-UHFFFAOYSA-N 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 229940096118 ella Drugs 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 101150098737 erpA gene Proteins 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004129 fatty acid metabolism Effects 0.000 description 1
- 230000004136 fatty acid synthesis Effects 0.000 description 1
- 101150031113 fdxB gene Proteins 0.000 description 1
- 230000009123 feedback regulation Effects 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 101150019247 fldA gene Proteins 0.000 description 1
- 101150081680 fldB gene Proteins 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 229940049906 glutamate Drugs 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229960002743 glutamine Drugs 0.000 description 1
- 230000009036 growth inhibition Effects 0.000 description 1
- 150000002410 histidine derivatives Chemical class 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 101150038740 hyaA gene Proteins 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 101150089729 iscA gene Proteins 0.000 description 1
- 101150021879 iscS gene Proteins 0.000 description 1
- 101150027367 iscU gene Proteins 0.000 description 1
- 101150065042 isiB gene Proteins 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 101150031897 lipB gene Proteins 0.000 description 1
- GZQKNULLWNGMCW-PWQABINMSA-N lipid A (E. coli) Chemical compound O1[C@H](CO)[C@@H](OP(O)(O)=O)[C@H](OC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC)[C@@H](NC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCC)[C@@H]1OC[C@@H]1[C@@H](O)[C@H](OC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](NC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](OP(O)(O)=O)O1 GZQKNULLWNGMCW-PWQABINMSA-N 0.000 description 1
- FCCDDURTIIUXBY-UHFFFAOYSA-N lipoamide Chemical group NC(=O)CCCCC1CCSS1 FCCDDURTIIUXBY-UHFFFAOYSA-N 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 101150095537 lplA gene Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 101150095438 metK gene Proteins 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 238000012737 microarray-based gene expression Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 238000012243 multiplex automated genomic engineering Methods 0.000 description 1
- 230000017066 negative regulation of growth Effects 0.000 description 1
- 101150117431 nifJ gene Proteins 0.000 description 1
- 101150082753 nifS gene Proteins 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- TVMXDCGIABBOFY-UHFFFAOYSA-N octane Chemical compound CCCCCCCC TVMXDCGIABBOFY-UHFFFAOYSA-N 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 235000020030 perry Nutrition 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 125000003386 piperidinyl group Chemical group 0.000 description 1
- 101150067427 por gene Proteins 0.000 description 1
- 235000011056 potassium acetate Nutrition 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 125000001501 propionyl group Chemical group O=C([*])C([H])([H])C([H])([H])[H] 0.000 description 1
- 238000000751 protein extraction Methods 0.000 description 1
- 239000002516 radical scavenger Substances 0.000 description 1
- 239000003642 reactive oxygen metabolite Substances 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 229960002477 riboflavin Drugs 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 238000006798 ring closing metathesis reaction Methods 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 231100000241 scar Toxicity 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 1
- 229940055619 selenocysteine Drugs 0.000 description 1
- 235000016491 selenocysteine Nutrition 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 229960001153 serine Drugs 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000010802 sludge Substances 0.000 description 1
- 101150033650 soxS gene Proteins 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 101150037939 sufA gene Proteins 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 101150043604 thiI gene Proteins 0.000 description 1
- 101150054688 thiM gene Proteins 0.000 description 1
- 101150109483 thiP gene Proteins 0.000 description 1
- CYFJIBWZIQDUSZ-UHFFFAOYSA-N thioglycine Chemical compound NCC(S)=O CYFJIBWZIQDUSZ-UHFFFAOYSA-N 0.000 description 1
- 229960002898 threonine Drugs 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 238000006276 transfer reaction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 235000019164 vitamin B2 Nutrition 0.000 description 1
- 239000011716 vitamin B2 Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 101150027386 wrbA gene Proteins 0.000 description 1
- 101150072217 ykuN gene Proteins 0.000 description 1
- 101150003742 yqjI gene Proteins 0.000 description 1
- 101150087671 yumC gene Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/77—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Corynebacterium; for Brevibacterium
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/13—Transferases (2.) transferring sulfur containing groups (2.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/16—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing two or more hetero rings
- C12P17/167—Heterorings having sulfur atoms as ring heteroatoms, e.g. vitamin B1, thiamine nucleus and open chain analogs
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
- C12P17/185—Heterocyclic compounds containing sulfur atoms as ring hetero atoms in the condensed system
- C12P17/186—Heterocyclic compounds containing sulfur atoms as ring hetero atoms in the condensed system containing a 2-oxo-thieno[3,4-d]imidazol nucleus, e.g. Biotin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y201/00—Transferases transferring one-carbon groups (2.1)
- C12Y201/01—Methyltransferases (2.1.1)
- C12Y201/01197—Malonyl-CoA O-methyltransferase (2.1.1.197)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/01—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12Y203/01047—8-Amino-7-oxononanoate synthase (2.3.1.47)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y206/00—Transferases transferring nitrogenous groups (2.6)
- C12Y206/01—Transaminases (2.6.1)
- C12Y206/01062—Adenosylmethionine--8-amino-7-oxononanoate transaminase (2.6.1.62)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y208/00—Transferases transferring sulfur-containing groups (2.8)
- C12Y208/01—Sulfurtransferases (2.8.1)
- C12Y208/01006—Biotin synthase (2.8.1.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y208/00—Transferases transferring sulfur-containing groups (2.8)
- C12Y208/01—Sulfurtransferases (2.8.1)
- C12Y208/01008—Lipoyl synthase (2.8.1.8)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01085—Pimelyl-[acyl-carrier protein] methyl ester esterase (3.1.1.85)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/99—Other Carbon-Carbon Lyases (1.4.99)
- C12Y401/99017—Phosphomethylpyrimidine synthase (4.1.99.17)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/99—Other Carbon-Carbon Lyases (1.4.99)
- C12Y401/99019—2-Iminoacetate synthase (4.1.99.19)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y602/00—Ligases forming carbon-sulfur bonds (6.2)
- C12Y602/01—Acid-Thiol Ligases (6.2.1)
- C12Y602/01014—6-Carboxyhexanoate--CoA ligase (6.2.1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y603/00—Ligases forming carbon-nitrogen bonds (6.3)
- C12Y603/03—Cyclo-ligases (6.3.3)
- C12Y603/03003—Dethiobiotin synthase (6.3.3.3)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
본 발명은 돌연변이 철 황 클러스터 조절자 (IscR)를 코딩하는 변형된 유전자뿐만 아니라 비오틴, 리포산 또는 티아민의 생합성을 증가시키는 폴리펩티드를 코딩하는 하나 이상의 전이 유전자를 특징으로 하는, 개선된 철-황 클러스터 전달이 가능한 유전자 변형 박테리아 세포를 제공한다. 본 발명은 본 발명의 유전자 변형 박테리아를 사용하여 비오틴, 리포산 또는 티아민을 생산하는 방법뿐만 아니라; 비오틴, 리포산 또는 티아민 생산을 위한 유전자 변형 박테리아 세포의 사용을 제공한다.
Description
본 발명은 돌연변이 철 황 클러스터 조절자 (IscR)를 코딩하는 변형된 유전자뿐만 아니라 비오틴, 리포산 또는 티아민의 생합성을 증가시키는 폴리펩티드를 코딩하는 하나 이상의 전이 유전자를 특징으로 하는, 개선된 철-황 클러스터 전달이 가능한 유전자 변형 박테리아 세포에 관한 것이다. 본 발명은 또한 본 발명의 유전자 변형 박테리아를 사용하여 비오틴, 리포산 또는 티아민을 생산하는 방법뿐만 아니라; 비오틴, 리포산 또는 티아민 생산을 위한 유전자 변형 박테리아 세포의 사용에 관한 것이다.
비오틴 (비타민 B7 또는 비타민 H라고도 알려짐), 및 티아민 (비타민 B2이라고도 알려짐)은 사람에게 필수적인 식이 비타민인데, 이는 다른 후생동물들과 마찬가지로, 그들은 비오틴 또는 티아민을 생산할 수 없기 때문이다. 리포산 (LA: Lipoic acid)은 황-함유, 비타민-유사 항산화제이고, 박테리아, 식물 및 동물에서 소량으로 합성된다. 세가지 모두 식이 보충제로 널리 사용된다. 이들 비타민 또는 비타민-유사 화합물들의 생산은 현재 화학적 합성에 의존하는데, 이는 비용이 많이 든다. 이들의 제조를 위한 생합성 방법은 현재 및 미래 요구를 충족시키기 위한 대안적이고, 보다 비용효율적인 방법을 제공할 것이다.
비오틴은 모든 생물 형태에 존재하는 아세틸-CoA 카복실라제 (ACC: acetyl-CoA carboxylase)와 같은 특정 카르복실화 반응을 촉매시키는 효소의 필수적인 보조인자이고, 지방산 생합성의 중요한 구성 요소인 말로닐-CoA를 생산한다. 사실상, 비오틴은 지방산 생합성 경로와 관련된 선형 경로에 의해 합성된다. 대장균 (E. coli: Escherichia coli)에서 비오틴 생합성의 초기 기질은 지방산 합성의 개시 대사 산물이기도 한 말로닐-ACP이다. 지방산 사이클을 들어가기 전에, 말로닐-ACP는 SAM (S-아데노실메티오닌)-의존적 메틸트랜스퍼라제, BioC에 의해 마스킹되어, 말로닐-ACP 메틸 에스테르를 생성한다. 이어서, 2회의 지방산 사슬 신장은 피멜로일-에스테르-ACP 분자를 생성한다. 전용 에스테라제, BioH에 의한 피멜로일-에스테르-ACP의 O-메틸기의 가수분해는 이들 분자를 지방산 신장 사이클을 빠져나가게 한다. 이어서, 중간체(intermediate), 피멜로일-에스테르-ACP는 비오틴-특이적 경로를 통해 비오틴으로 전환된다 (도 1A). 이 경로에서, BioF는 피멜로일-ACP와 알라닌의 PLP-의존적 탈탄산 알돌 축합(decarboxylative aldol condensation)을 촉매하여 KAPA (8-아미노-7-옥소노나노에이트)를 생산한다. BioA (및 BioK)는 KATA의 PLP-의존적 아미노기 전이반응을 촉매하여 DAPA (7,8-디아미노펠라르고네이트)를 생산하고, 여기에서 공여자는 SAM이고; 부산물은 S-아데노실 옥소메티오닌이다. BioD는 ATP-구동 카르복실화 및 DAPA의 고리 폐쇄를 촉매하여 데스티오비오틴 (DTB: desthiobiotin)에서 티오펜(thiophane) 고리를 형성한다. 비오틴 합성 경로의 최종 단계는 BioB (비오틴 신타아제)에 의해 2개의 탄화수소 사이에 황 다리 결합(sulfur bridge)의 도입과 관련되어 있으므로, 비오틴을 생산하기 위한 알려진 가장 복잡한 반응 중 하나이다. BioB는 이합체로 발견되는, S-아데노실-L-메티오닌 (SAM 또는 AdoMet) 라디칼 효소이고, 이의 활성 부위에 2개의 철-황 클러스터: [2Fe-2S] 2+ 및 [4Fe-4S] 2+를 포함한다. DTB에서 티오펜 고리를 생성하는데 필요한 황 원자는 BioB에서 [2Fe-2S] 2+ 클러스터로부터 조달된 것으로 여겨진다. 결과적으로, DTB 합성에 소비되는 BioB 이합체에서 철-황 클러스터는 촉매반응의 각 라운드 후에 재생산되는 것으로 생각된다.
리포산 (LA: Lipoic acid)는 활성 산소종의 강력한 스캐빈저(scavenger)일 뿐만 아니라, 이에 따라 중요한 항-산화제이고, 또한 α-케토산 탈수소효소에 대한 보조인자이다. LA는 지방산 대사과정에서 중간체로부터 새롭게 합성된다 (도 2). E. coli에서 LA 합성에 참여하는 3개의 효소는 LplA (리포산-단백질 리가아제), LipB (옥탄오일 단백질 ACP 운반 단백질: 단백질 트랜스퍼라제), 및 LipA (리포산 신타아제)이다. lplA 유전자에 의해 코딩되는 LplA는 ATP-의존적인 방식으로 타겟 효소의 E2 서브유닛의 비리포일화된-아포-리포일 도메인(unlipoylated-apo-lipoyl domain)에 외인성 옥탄산의 컨쥬게이션을 촉매할 수 있다. lipB 유전자에 의해 코딩되는 LipB는 ACP로부터의 옥타닐 잔기의 타겟 효소의 E2 서브유닛의 아포-리포일 도메인으로의 이동을 촉매할 수 있다. AceF 유전자는 피루브산 탈수소효소의 E2 서브유닛의 리포일 도메인을 코딩한다. lipA 유전자에 의해 코딩되는 LipA는 2개의 C-S 결합의 형성을 담당한다. LipA-유도 반응은 이의 기능을 수행하기 위해 철-황 클러스터 (4Fe-4S) 및 SAM (metK 유전자에 의해 생성됨)을 필요로 한다. 리포산은 주로 다수의 다중-효소 복합체에서 단백질-결합된 리포아미드(lipoamide) 모이어티로서 세포에서 발견된다.
티아민 생합성은 박테리아, 일부 원생동물, 식물, 및 진균에서 특성화되었다. 티아민의 티아졸 및 피리미딘 모이어티는 별도로 합성된다 (도 3). 피리미딘 모이어티, 4-아미노-5-하이드록시메틸-2-메틸피리미딘 포스페이트 (HMP-P)는 드 노보(de novo) 퓨린 생합성 경로에서 중간체인 5-아미노이미다졸 리보타이드 (AIR: aminoimidazole ribotide)로부터 유래된다. 그람-음성 박테리아에서, AIR의 HMP-P로의 전환은 서브유닛 당 1개의 [4Fe-4S] 클러스터에 결합하는 thiC 유전자 산물인 HMP-P 신타제에 의한 라디칼 S-아데노실-L-메티오닌 (SAM)-의존적 반응에서 촉매된다.
이어서, HMP-P는 티아졸 유닛과의 커플링 전에 ThiD 키나아제에 의해 HMP-PP로 인산화된다. 티아졸 모이어티, 5-(2-하이드록시메틸)-4-메틸티아졸 포스페이트 (HET-P)는 L-티로신 및 1-데옥시-D-자일룰로스 포스페이트 (DXP) 및 시스테인으로부터 유래되고; 여기어세 황 원자는 L-시스테인으로부터 유래되는 것으로 예상된다. thiH 유전자에 의해 코딩되는 티로신 리아제(Tyrosine lyase)는 서브유닛 당 1개의 [4Fe-4S]에 결합하고, 티로신의 2-이미노아세테이트 및 4-크레솔로의 라디칼-매개 절단을 촉매작용한다. 티아졸 모이어티의 합성은 적어도 5개의 유전자 thiF, this, thiG, thiH 및 thiI의 발현을 필요로 한다.
이후, 피리미딘 및 티아졸 모이어티는 thiE에 의해 코딩되는 티아민-포스페이트 신타아제 (EC 2.5.1.3)의 작용에 의해 TMP를 형성하도록 조합된다. 따라서, TMP는 모든 공지된 티아민 생합성 경로의 첫번째 산물이다. 대장균 및 다른 장내 세균에서, TMP는 ATP의 존재 하에 thiL에 의해 코딩되는 티아민-포스페이트 키나아제 (EC 2.7.4.16)에 의해 보조인자 TPP로 인산화될 수 있다. 티아민 모노-포스페이트 포스파타제 (E.C 3.1.3.-)를 발현하는 전이 유전자를 포함하는 박테리아 균주들은 TMP를 티아민으로 전환시킬 수 있고, 이로써 티아민 생산을 증가시킬 수 있다.
박테리아-기반의 세포 공장의 사용은 비오틴, 리포산 및 티아민의 생합성 생산을 위한 잠재적 경로이다. 바이오-제품의 생산을 위한 세포 공장으로서 재조합 대장균의 이점은 다음과 같은 사실로 인해 널리 인식된다: (i) 이는 글루코스-염 배지에서 배양될 경우 및 최적의 환경 조건 하에서 약 20분의 배가 시간(doubling time)을 가진 비할데 없는 빠른 성장 속도를 갖는다, (ii) 높은 세포 밀도를 쉽게 달성한다; 여기에서 대장균 액체 배양물의 이론적 밀도 한계는 약 200 g 건조 세포 중량/l 또는 대략 1 x 1013 생존 가능한(viable) 박테리아/mL로 추정된다. 또한, 대장균은 이종 단백질의 발현에 다루기 쉬운(amenable) 유기체일 뿐만 아니라; 대장균의 유전자 변형을 위한 많은 분자 도구 및 사용 가능한 프로토콜이 있다; 이들 모두는 원하는 바이오-제품의 높은-수준의 생산을 수득하기 위해 필수적일 수 있다.
대장균에서, 비오틴 오페론 구조는 반대 가닥(bioO 유전자좌) 상의 중복 프로모터의 조절 하에 bioA 및 bioBFCD로 나눠지는데, bioH는 대장균 염색체의 다른 곳에 위치한다. 비오틴 오페론의 발현은 비오틴-결합 억제자 (BirA: biotin-bound repressor)에 의해 하향-조절되고; 비오틴-결합 억제자는 비오틴 오페론에서 오퍼레이터(operator)에 결합한다. BirA은 또한 비오틴을 세포의 카르복실라제로 전달하는 비오틴 리가아제로서 기능을 한다. 비오틴 리가아제로부터 전사 억제자로의 BirA의 기능 전환은 각각의 세포 내 비오틴 및 아포-카복실라제 풀에 의해 조절된다. 대장균에서 비오틴 오페론 (bioA 및 bioBFCD)의 과-발현은 성장을 저해하는 것으로 보고되었다 (Ifuku, 0. et al., 1995). 이 저해의 원인은 알려져 있지 않았고, 이는 비오틴 합성을 증가시키는 장애물을 형성한다.
일반적으로, 박테리아-기반의 세포 공장 (예를 들어, 대장균)에서 비오틴, 리포산 및 티아민의 생산을 용이하게 하기 위해 이러한 복잡한 생합성 경로들의 병목 현상을 규명할 필요가 있고, 박테리아-기반의 세포 공장은 이들 각각의 경로 효소들의 증가된 수준을 성장 및 생산하는 이들의 능력을 제한할수 있는 다양한 원인을 극복하도록 맞춤-제작된다.
발명의 요약
일 양상에 따르면, 본 발명은 비오틴, 리포산 또는 티아민 중 어느 하나의 증가된 생산을 위해 유전자 변형 박테리아를 제공하며; 상기 박테리아는:
ㆍ돌연변이 IscR 폴리펩티드를 코딩하는 유전자로 변형 내인성 iscR 유전자로서, 상기 돌연변이 IscR 폴리펩티드의 아미노산 서열은 서열번호 2, 4, 6, 8, 10, 12 및 14와 80% 서열 상동성을 갖는 것이고, 상기 아미노산 서열은 다음으로 이루어진 군으로부터 선택된 적어도 하나의 아미노산 치환을 갖는 것인:
o L15X, C92X, C98X, C104X, and H 107X; 상기 X는 서열번호 2, 4, 6, 8, 10, 12 및 14에 상응하는 아미노산 잔기 이외의 임의의 아미노산인 것인 유전자, 및
ㆍ 이들 중 선택된 폴리펩티드를 코딩하는 적어도 하나의 전이유전자:
o 증가된 비오틴 생산을 위한 비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드,
o 증가된 리포산 생산을 위한 리포산 신타아제 (EC 2.8.1.8) 활성을 갖는 폴리펩티드,
o 증가된 티아민 생산을 위한 HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 폴리펩티드, 및
o 증가된 티아민 생산을 위한 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 폴리펩티드.
바람직하게, 상기 돌연변이 IscR 폴리펩티드 내 적어도 하나의 아미노산 치환은 다음으로 이루어진 군으로부터 선택된다:
o L15X, 상기 X는 F, Y, M 및 W 중 어느 하나임;
o C92X, 상기 X는 Y, A, V, I, G, L, M, F 및 W 중 어느 하나임;
o C98X, 상기 X는 A, V, I, G, L, F 및 W 중 어느 하나임;
o C104X, 상기 X는 A V, I, G, L, F 및 W 중 어느 하나임; 및
o H 107X; 상기 X는 A, Y, M, F, W, V, I, G, 및 L 중 어느 하나임.
비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드를 코딩하는 하나의 전이 유전자를 포함하는 비오틴의 증가된 생산을 위한 본 발명에 따른 유전자 변형 박테리아는 다음으로 이루어진 군으로부터 선택된 하나 이상의 폴리펩티드를 코딩하는 추가적인 전이 유전자를 더 포함할 수 있다:
o SAM (S-아데노실메티오닌)-의존적 메틸트랜스퍼라제 (BioC; EC 2.1.1.197) 활성을 갖는 폴리펩티드;
o 7-케토-8-아미노펠라르곤산 (KAPA) 신타아제 (BioF; EC 2.3.1.47) 활성을 갖는 폴리펩티드;
o 7,8-디아미노펠라르곤산 (DAPA) 신타아제 (BioA; EC:2.6.1.62) 또는 L-리신:8-아미노-7-옥소노나노에이트 아미노트랜스퍼라제(BioK; EC:2.6.1.105) 활성을 갖는 폴리펩티드;
o 데티오비오틴 (dethiobiotin, DTB) 신타아제 (BioD; EC 6.3.3.3) 활성을 갖는 폴리펩티드, 및
o 피멜로일-[아실-운반 단백질] 메틸 에스테르 에스테라제 (BioH; EC 3.1.1.85)를 갖는 폴리펩티드 또는 6-카복시헥사노에이트-CoA 리가제 (BioW; EC 6.2.1.14) 활성을 갖는 폴리펩티드.
바람직하게, 비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드를 코딩하는 하나의 전이 유전자를 포함하는 비오틴의 증가된 생산을 위한 본 발명에 따른 유전자 변형 박테리아는 SAM (S-아데노실메티오닌)-의존적 메틸트랜스퍼라제 (BioC; EC 2.1.1.197) 활성; 7-케토-8-아미노펠라르곤산 (KAPA) 신타아제 (BioF; EC 2.3.1.47) 활성; 및 7,8-디아미노펠라르곤산 (DAPA) 신타아제 (BioA; EC:2.6.1.62) 활성을 갖는 폴리펩티드를 코딩하는 추가적인 전이 유전자를 더 포함할 수 있다.
리포산 신타아제 (EC 2.8.1.8) 활성을 갖는 폴리펩티드를 코딩하는 하나의 전이 유전자를 포함하는 리포산의 증가된 생산을 위한 본 발명에 따른 유전자 변형 박테리아는 다음으로 이루어진 군으로부터 선택된 하나 이상의 폴리펩티드를 코딩하는 추가적인 전이 유전자를 더 포함할 수 있다:
o 옥타노일트랜스퍼라제 (EC 2.3.1.181) 활성을 갖는 폴리펩티드, 및
o 피루브산 탈수소효소 (EC 2.3.1.12)의 디하이드로리포일라이신-잔기 아세틸트랜스퍼라제 성분을 포함하는 폴리펩티드, 및
o 리포에이트-단백질 리가아제 A (EC:6.3.1.20) 활성을 갖는 폴리펩티드.
HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 ThiC 폴리펩티드를 코딩하는 하나의 전이 유전자, 및/또는 티로신 리아제(EC 4.1.99.19) 활성을 갖는 ThiH 폴리펩티드를 코딩하는 하나의 전이 유전자를 포함하는 티아민의 증가된 생산을 위한 본 발명에 따른 유전자 변형 박테리아는 다음으로 이루어진 군으로부터 선택된 하나 이상의 폴리펩티드를 코딩하는 추가적인 전이 유전자를 더 포함할 수 있다:
o ThiS 아데닐트랜스퍼라제 (EC 2.7.7.73) 활성을 갖는 ThiF 폴리펩티드;
o 티아민 포스페이트 신타아제 (EC 2.5.1.3) 활성을 갖는 ThiE 폴리펩티드;
o 티아졸 신타아제 (E.C.2.8.1.10) 활성을 갖는 ThiG 폴리펩티드;
o 포스포하이드록시메틸피리미딘 키나아제 (EC 2.7.4.7) 활성을 갖는 ThiD 폴리펩티드;
o 황-운반 단백질 활성을 갖는 ThiS 폴리펩티드;
o 모노-포스페이트 포스파타제 (E.C. 3.1.3.-) 활성을 갖는 폴리펩티드; 및
o 글리신 옥시다아제 (EC 1.4.3.19) 활성을 갖는 ThiO 폴리펩티드; 및 선택적으로는, 하이드록시에틸티아졸 키나아제 (2.7.1.50) 활성을 갖는 ThiM 폴리펩티드를 코딩하는 추가적인 전이 유전자.
바람직하게, 상기 유전자 변형 박테리아는 다음의 폴리펩티드를 코딩하는 전이 유전자를 포함한다: ThiC (thiC 유전자에 의해 코딩됨); ThiD (thiD 유전자에 의해 코딩됨), ThiE (thiE 유전자에 의해 코딩됨), ThiF (thiF 유전자에 의해 코딩됨), sulfur-carrier protein (thiS 유전자에 의해 코딩됨), ThiG (thiG 유전자에 의해 코딩됨), TMP phophatase (TMP 포스파타제 유전자에 의해 코딩됨); 및 ThiH (thiH 유전자에 의해 코딩됨) 또는 ThiO (thiO 유전자에 의해 코딩됨). 구현예에 따르면, 상기 세포는 효소 ThiM (ThiM 유전자에 의해 코딩된)을 코딩하는 전이 유전자를 더 포함할 수 있다.
바람직하게 본 발명의 유전자 변형 박테리아에서 상기 적어도 하나의 전이 유전자 및 상기 하나 이상의 추가적인 전이 유전자는 항시성 프로모터(constitutive promoter)에 작동 가능하게 연결된 것이다 (상기 프로모터는 전이 유전자를 포함한 오페론에 작동 가능하게 연결된 것일 수 있음).
본 발명의 유전자 변형 박테리아는 바람직하게 에셔리키아 (Escherichia), 바실러스 (Bacillus), 브레비박테리움 (Brevibacterium), 버크홀데리아 (Burkholderia), 캄필로박터 (Campylobacter), 코리네박테리움 (Corynebacterium), 슈도모나스 (Pseudomonas), 셀라티아 (Serratia), 락토바실러스 (Lactobacillus), 락토코커스 (Lactocooccus), 아시네토박터 (Acinetobacter), 슈도모나스 (Pseudomonas), 및 아세토박터 (Acetobacter)로 이루어진 군으로부터 선택된 속의 종, 보다 바람직하게 에셔리키아 또는 코리네박테리움의 종은, 예를 들어, 에셔리키아 콜라이 (Escherichia coli) 또는 코리네박테리움 글루타미쿰 (Corynebacterium glutamicum)이다.
제2 구현예에 따르면, 본 발명은 비오틴을 생산하는 방법으로서:
o 본 발명에 따른 비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드를 코딩하는 전이 유전자를 포함하는 유전자 변형 박테리아를 배양물을 생산하기 위한 증식 배지 내로 도입하는 단계;
o 상기 배양물을 배양하는 단계; 및
o 상기 배양에 의해 생산된 비오틴을 회수하고, 선택적으로 회수된 비오틴을 정제하는 단계를 포함하는 방법을 제공한다.
제3 구현예에 따르면, 본 발명은 리포산을 생산하는 방법으로서:
o 본 발명에 따른 리포산 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드를 코딩하는 전이 유전자를 포함하는 유전자 변형 박테리아를 배양물을 생산하기 위한 증식 배지 내로 도입하는 단계;
o 상기 배양물을 배양하는 단계; 및
o 상기 배양에 의해 생산된 리포산을 회수하고, 선택적으로 회수된 리포산을 정제하는 단계를 포함하는 방법을 제공한다.
제4 구현예에 따르면, 본 발명은 티아민을 생산하는 방법으로서:
o 본 발명에 따른 HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 폴리펩티드를 코딩하는 전이 유전자, 및/또는 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 폴리펩티드를 코딩하는 전이 유전자를 포함하는 유전자 변형 박테리아를 배양물을 생산하기 위한 증식 배지 내로 도입하는 단계;
o 상기 배양물을 배양하는 단계; 및
o 상기 배양에 의해 생산된 티아민을 회수하고, 선택적으로 회수된 티아민을 정제하는 단계를 포함하는 방법을 제공한다.
바람직하게 비오틴, 리포산 및 티아민 중 어느 하나를 생산하는 방법에서 사용된 증식 배지는 배지는 글루코스, 말토스, 갈락토스, 프럭토스, 수크로스, 아라비노스, 자일로스, 라피노스, 만노스, 및 락토스, 또는 이들의 임의의 조합으로부터 선택된 탄소원을 포함한다.
제4 구현예에 따르면, 본 발명은 비오틴 신타아제를 코딩하는 전이 유전자를 발현하는 박테리아 세포에서 비오틴 생산을 증가시키기 위해 돌연변이 iscR 폴리펩티드를 코딩하는 유전적으로 변형된 유전자의 용도를 제공한다. 상기 돌연변이 iscR 폴리펩티드는 서열번호 2, 4, 6, 8, 10, 12 및 14와 적어도 80% 서열 상동성을 갖는 것이고; 상기 아미노산 서열은 L15X, Cys92X, Cys98X, Cysl04X, 및 Hisl07X로 이루어진 군으로부터 선택된 적어도 하나의 아미노산 치환을 갖는 것이고, 상기 X는 서열번호 2, 4, 6, 8, 10, 12 및 14에 상응하는 아미노산 잔기 이외의 임의의 아미노산이다.
제4 구현예에 따르면, 본 발명은 박테리아에서 비오틴, 리포산 또는 티아민 중 어느 하나의 생산을 증가시키기 위한 돌연변이 iscR 폴리펩티드를 코딩하는 유전적으로 변형된 유전자의 용도를 제공한다. 상기 박테리아는 이들으로부터 선택된 폴리펩티드를 코딩하는 적어도 하나의 전이 유전자를 포함하고 발현한다:
ㆍ비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드,
ㆍ리포산 신타아제 (EC 2.8.1.8) 활성을 갖느 폴리펩티드,
ㆍHMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 폴리펩티드, 및 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 폴리펩티드.
상기 유전적으로 변형된 유전자는 돌연변이 IscR 폴리펩티드를 코딩하는 내인성 iscR 유전자이고, 상기 돌연변이 IscR 폴리펩티드의 아미노산 서열은 서열번호 2, 4, 6, 8, 10, 12 및 14와 적어도 80% 아미노산 서열 상동성을 갖는 것이고,
상기 아미노산 서열은: L15X, C92X, C98X, C104X, 및 H107X로 이루어진 군으로부터 선택된 적어도 하나의 아미산 치환을 갖는 것이고; 상기 X는 서열번호 2, 4, 6, 8, 10, 12 및 14에 상응하는 아미노산 잔기 이외의 임의의 아미노산이다.
제5 구현예에 따르면, 본 발명은 비오틴, 리포산 또는 티아민의 증가된 생산을 위한 본 발명에 따른 유전자 변형 박테리아의 용도를 제공한다.
제6 구현예에 따르면, 본 발명은 비오틴, 리포산 또는 티아민 중 어느 하나의 증가된 생산을 위한 본 발명에 따른 유전자 변형 박테리아를 제공한다. 상기 박테리아는 전자 공여자 NADPH로부터 SAM-라디칼 이온-황 클러스터 효소의 [4Fe-4S]2+ 클러스터로의 증가된 전자 이동을 매개할 수 있는 폴리펩티드; 예를 들어, 플라보독신/페레독신 환원 효소 및 플라보독시 환원 시스템 또는 피루브산-플라보독신/페레독신 산화 환원 효소 시스템을 코딩하는 하나 이상의 유전자를 더 포함한다.
도 1 A) 박테리아의 비오틴 경로 및 비오틴의 합성을 일으키는 각각의 효소적 단계의 중간체를 나타내는 그림 (cartoon). SAM: S-아데노실-L-메티오닌, SAH: S-아데노실-L-호모시스테임, CoA: 코엔자임 A, ACP: 아실기 운반 단백질, KAPA: 7-케토-8-아미노펠라르곤산, AMTOD: S-아데노실-2-옥소-4-티오메틸부틸레이트, DAPA: 7,8-디아미노펠라르곤산, DTB: 데스티오비오틴, 5'DOA: 5'-데옥시아데노신. B) isc-오페론 구조 및 IscR의 조절 메커니즘뿐만 아니라 Fe-S-클러스터 형성에서의 역할을 나타내는 그림. 상기 isc 오페론은 다음의 유전자들의 발현을 조절하는 IscR을 코딩하는 iscR 유전자를 포함한다: iscS (시스테인 디설퍼라제 (cysteine desulphurase)), iscU (스캐폴드), iscA (A-타입 단백질), HscB (Dnaj-유사 코-샤페론), HscA (DnaK-유사 샤페론), 및 fdx (페레독신). IscR은 또한 hyaA, ydiU, erpA, 및 sufA 유전자를 포함하는 > 40 유전자를 조절한다.
도 2 박테리아의 리포산 경로 및 리포일화된 리포일 도메인 (리포산 합성)을 일으키는 각각의 효소적 단계의 중간체를 나타내는 그림. 상기 경로에서의 주요한 효소는 LipA (리포산 신타아제) 및 LipB (옥타노일 단백질 ACP 운반 단백질: 단백질 트랜스퍼라제)를 포함하고, 기질은 SAM: S-아데노실-L-메티오닌. LipA는 리포에이트-단백질 리가아제 A; EC: 6.3.1.20이다.
도 3 박테리아의 티아민 경로 및 티아민 (THI); 티아민 모노포스페이트 (TMP) 및 티아민 디포스페이트 (TPP)의 합성을 일으키는 각각의 효소적 단계의 중간체를 나타내는 그림. 중간체의 약어: 5-아미노이미다졸 리보뉴클레오티드 (AIR: aminoimidazole ribonucleotide), 4-아미노-2-메틸-5-(포스포옥시메틸)피리미딘 (HMP-P: 4-amino-2-methyl- 5-(phosphooxymethyl)pyrimidine), 4-아미노-2-메틸-5-(디포스포메틸)피리미딘 (HMP-PP: 4-amino-2-methyl-5-(diphosphomethyl) pyrimidine), 1-데옥시-D-자일룰로스 5-포스페이트 (DXP: 1-deoxy-D-xylulose 5-phosphate), 디하이드로글리신 (DHG: dehydroglycine), 4-메틸-5-(2-포스포옥시에틸)티아졸 (THZ- P: 4-methyl-5-(2-phosphooxyethyl)thiazole), 아데노신 트리포스페이트 (ATP: adenosine triphosphate), 아데노신 모노포스페이트 (AMP: adenosine monophosphate), S-아데노실-L-메티오닌 (SAM: S-adenosyl-L-methionine), 환원된 니코틴아마이드 아데닌 디뉴클레오티드 포스페이트 (NADPH: reduced nicotinamide adenine dinucleotide phosphate), 니코틴아마이드 아데닌 디뉴클레오티드 포스페이트 (NADP+: nicotinamide adenine dinucleotide phosphate), 환원된 페레독신 (Fdx red: reduced ferredoxin), 산화된 페레독신 (Fdx ox: oxidized ferredoxin).
도 4 IPTG 유도성 bioB 발현 플라스미드를 포함하는 △bioB 균주 (대징균 BW25113) (오른쪽 패널); 및 IPTG 유도성 프레임시프트된 bioB (조기 종결 코돈) 발현 플라스미드를 포함하는 표준 균주 (대장균 BW25113) (왼쪽 패널)의 시간 경과에 따라 측정된 세포 밀도 (OD600에서 측정됨)의 그래픽 표현 (Graphical presentation). 0.1 g/L DTB, 50 ㎍/mL 카나마이신 및 0 (점), 0.01 (삼각형), 또는 0.1 (사각형) mM IPTG를 갖는 200 ㎕ mMOPS에서 성장된 4개의 생물학적 복제물에 대하여 OD620은 멀티스칸을 사용하여 측정되고, OD600으로 전환되었다. 각 지수 성장률 값은 인접한 상자에 표시된다.
도 5 275 rpm 진탕을 하며 37℃에서 20시간 동안의 인큐베이션한 후에 (하기 기재된 비오틴 정량화하는 방법과 같음), 0 또는 0.244 ㎍ 비오틴/mL까지 증가하는 농도의 비오틴이 보충된 40 ㎍/mL 제오신을 갖는 150 ㎕ mMOPS 상에서 성장된 플라스미드 pBS451을 포함하는 대장균 BS1011의 배양물의 최종 세포 밀도 (OD600에서 측정됨)를 나타낸 산포도의 그래픽 표현. 수직의 회색 점선은 비오틴 바이오어쎄이를 위한 0.024 내지 0.24 ㎍ 비오틴/L의 최적의 농도 범위를 식별한다.
도 6 IPTG-유도성 bioB 발현 플라스미드 (pBS412)를 각각 포함하는 4개의 생물학적 복제물에서 아미노산 돌연변이를 갖는 돌연변이 IscR을 발현하는 3개의 상이한 iscR 돌연변이 균주 (BS1377, L15F), (BS1375, C92Y) 및 (BS1353, H107Y) 및 대장균 BW25113 bioB 균주 (BS1011, Ref) (균주에 대하여 표 1 참조)의 비오틴 생산을 나타내는 바 다이아그램. 균주들은 24시간 동안 37℃에서 275 rpm 진탕과 함께 0.1 g/l DTB 및 50 ㎍/mL 카나마이신을 갖는 400 ㎕ mMOPS에서 성장되었다. 바는 평균 비오틴 생산 값 (높이) 및 IPTG 유도 수준 (회색 음영)을 예시하고, 검은색 점은 개별적인 복제물 배양물로부터의 비오틴 생산을 나타내고, 수평의 점선은 참조 야생형 균주로부터의 최대 비오틴 생산을 표시한다. IPTG가 없이 배양된 경우 검출 가능한 수준의 비오틴을 생산한 균주는 없었다.
도 7 돌연변이 IscR을 발현하는 iscR 돌연변이 균주 (BS1353, H107Y), 및 대장균 BW25113 △bioB균주 (BS1011, Ref)의 세포 밀도 및 비오틴 생산의 그래픽 표현이고, 상기 각 균주는 IPTG-유도성 bioB 유전자 발현 플라스미드 (pBS412)를 포함한다. 상기 데이터는 25시간 동안 모니터링된 각각의 iscR H107Y 돌연변이 균주 (굵은 어두운 선) 및 표준 균주, 대장균 BW25113 △bioB 균주 (밝은 회색 점선)의 3개의 생물학적 복제물의 측정된 OD600 평균, 및 25시간 동안 모니터링된 각각의 iscR H107Y 돌연변이 균주 (굵은 어두운 점) 및 표준 균주, 대장균 BW25113 △bioB 균주 (밝은 회색 점)에 의한 비오틴 생산을 나타낸다. 상기 균주들은 250 mL 배플 진탕 플라스크 (baffled shake flask)에서 0.1 g/l DTB, 0.01 (A) 또는 0.5 mM IPTG (B) 및 50 ㎍/mL 카나마이신을 갖는 50 mL mMOPS에서 37℃에서 275 rpm 진탕과 함께 성장되었다. 성장률은 검정색 박스에 나타낸다.
도 8 동정된 돌연변이 균주의 iscR 유전자에서 뉴클레오티드 및 아미노산 서열 돌연변이의 위치를 나타내기 위해 주석이 달린 IscR 코딩 서열을 나타낸 그림
도 9 막대(sticks, WT 아미노산은 회색, 돌연변이 아미노산은 검정색으로 표시됨)로 표시된 L15F 및 H107Y iscR 돌연변이를 갖는 hya DNA 결합 부위 (검정색)에 결합된 IscR 이합체 (회색)의 결정 구조 (PDB entry 4HF1)를 나타낸 그림; 및 돌연변이된 잔기를 강조한 확대 이미지.
도 10 IPTG-유도성 bioB 발현 플라스미드 및 isc-오페론 (iscSUA-hscBA-fdx, iscR 유전자가 결핍된천연 대장균 isc 오페론 구조에 상응함) 또는 강한 리보좀 결합 부위 (RBS: strong ribosomal binding site)에 작동 가능하게 연결된 대장균 suf-오페론 (sufABCDSE)을 포함하는 플라스미드 및 미디엄 카피 넘버 플라스미드(p15A ori) 또는 대조군 플라스미드로부터의 T5 LacO 억제 프로모터를 포함하는 대장균 균주의 비오틴 생산을 나타내는 바 다이아그램. 상기 대조군 플라스미드는 suf- 또는 isc-오페론 대신에 IPTG-유도성 GFP를 코딩하는 유전자를 포함하였다. 각 균주의 생물학적 삼중물 (triplicates)은 100 μg/mL 암피실린 및 50 μg/mL 스펙티노마이신을 갖는 mMOPS에서 0.1 g/l (DTB)를 기질로서 제공하여 낮은 (0.01 mM IPTG) 및 높은 (0.1 mM IPTG) 유도 하에 배양되었다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm으로 성장되었다. 바는 평균 비오틴 성장 값 (높이) 및 IPTG 유도 수준 (회색 음영)을 예시하고, 검은색 점은 개별적인 복제물로부터의 비오틴 생산을 나타내고, X표(crosses)는 OD600으로 측정된 각 균주의 종점(end-point, end) 세포 밀도를 나타낸다. 0.01 mM IPTG로 유도된 경우, 검출 가능한 수준의 비오틴을 생산한 균주는 없었다.
도 11 삼중불에서 수행된 4개의 상이한 샘플에서의 BioB 단백질 발현 수준 및 비오틴 생산의 상관관계의 그래픽 표현. 상기 균주는 pBS430 (bioB 프레임시프트 IPTG 유도성 플라스미드)을 갖는 BS1013 (대장균 BW25113, 백그라운드 균주), pBS412 (bioB IPTG 유도성 플라스미드)를 갖는 BS1011 (△bioB를 갖는 BS1013), pBS412를 갖는 BS1353 (iscR H107Y 돌연변이를 갖는 BS1011)이었다. 균주들은 0.1 g/l DTB 및 그래프에 나타낸 바와 같은 IPTG를 갖는 mMOPS에서 성장되었다.
도 12 IPTG-유도성 bioB 발현 플라스미드 및 다음 중 iscR의 게놈 변이체를 포함하는 대장균 △bioB 균주의 비오틴 생산을 나타내는 바 다이아그램: 야생형 (iscR WT), 22번 위치에서 종결 코돈으로 돌연변이된 E22* 글루탐산을 코딩하는 녹-아웃 돌연변이 (iscR KO), 92번 위치에서 시스테인으로부터 티로신으로의 치환을 코딩하는 돌연변이 (iscR C92Y). 바는 IPTG 유도의 주어진 수준 (회색의 음영)에서의 비오틴 생산 평균 값 (높이)을 예시하고, 점은 개별적인 복제물로부터의 비오틴 생산을 나타낸다. 각 균주의 생물학적 삼중물은 100 μg/mL 암피실린을 갖는 mMOPS에서 0.1 g/l DTB를 기질로서 제공하여 없는 (0 mM IPTG), 낮은 (0.01 mM IPTG) 및 낮은 (0.1 mM IPTG) 유도 하에 배양되었다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 각 균주는 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm으로 성장되었다.
도 13 비오틴-오페론 플라스미드 및 다음 중 iscR의 게놈 변이체를 포함하는 대장균 △bioA △bioBFCD 균주에 의한 비오틴 생산을 나타내는 바 다이아그램: 야생형 (iscR WT), 돌연변이 iscR H107Y (107번 위치에서 히스티딘으로부터 티로신으로의 치환을 코딩함), 92번 위치에서 시스테인으로부터 티로신으로의 치환을 코딩하는 돌연변이 (iscR C92Y). 각 균주의 생물학적 사중물 (quadruplicate)은 0.1 데스티오비오틴 (DTB)을 기질로서 제공하거나 제공하지 않고 10 μg/mL 테트라사이클린을 갖는 mMOPS에서 배양되었다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm으로 성장되었다. 바는 비오틴 생산 평균 값 (높이) 및 DTB의 공급 여부 (회색의 음영)를 예시하고, 검은색 점은 개별적인 복제물로부터의 비오틴 생산을 나타낸다.
도 14 IPTG 유도성 lipA (pBS993, 표 4 참조)를 각각 포함하는 3개의 생물학적 복제물에서 아미노산 돌연변이를 갖는 돌연변이 IscR을 발현하는 2개의 상이한 iscR 돌연변이 균주 (BS1375 C92Y) 및 (BS1353, H107Y) 및 대조군 균주 (BS1011, IscR WT) (균주에 대하여 표 1 참조)의 리포산 생산 및 생산 (X표) 24시간 후의 종점(최종) OD600을 나타내는 바 다이아그램. 균주들은 100 g/mL 암피실린, 0.1 mM 비오틴, 0.6 g/l 옥탄산 및 0.01 mM IPTG을 갖는 400 ㎕ mMOPS에서 24시간 동안 37℃에서 275 rpm 진탕과 함께 성장되었다. 바는 리포산 생산 평균 값 (높이)을 예시하고, 검은색 점은 개별적인 복제물 배양물로부터의 리포산 생산을 나타낸다. 종점 OD600이 동일하게 유지되더라도, 평균 리포산 생산은 1.79-배 증가하는 것을 볼 수 있다.
도 15 표준 균주 (대장균 BW25113), △lipA (WT, 삼각형); 및 돌연변이 iscR (C92Y)를 갖는 △lipA 균주 (C92Y, 사각형)로서, 상기 균주 모두 IPTG 유도성 lipA 발현 플라스미드 (pBS1037)를 포함하는 균주의 시간 경과에 따라 측정된, 세포 밀도 (OD620에서 측정됨)의 그래픽 표현. OD620은 0.6 g/L 옥탄산, 100 g/mL 암피실린 및 0 내지 0.04 mM IPTG (증가에 따라 회색 음영의 어둠이 증가함)을 갖는 200 ㎕ mMOPS에서 성장된 6개의 생물학적 균주 복제물에 대하여 멀티스칸을 이용하여 측정되었다. 각 생존율 (GR: growth rate)은 오른쪽에 나타낸다.
도 16 전체 티아민 경로 유전자, thiCEFSGHMD를 발현하는 플라스미드 (pBS140)를 각각 포함하는 4개의 생물학적 복제물에서 아미노산 돌연변이를 갖는 돌연변이 IscR을 발현하는 2개의 상이한 iscR 돌연변이 균주 (BS2019, C92Y) 및 (BS2020, H107Y) 및 대장균 BW25113 △thiP, thiL* 균주 (BS750, Ref) (균주에 대하여 표 5 참조)의 티아민 생산을 나타내는 바 다이아그램. 균주들은 50 ㎍/mL 카나마이신을 갖는 400 ㎕ mMOPS에서 24시간 동안 37℃에서 275 rpm 진탕과 함께 성장되었다. 바는 종점 OD600에 대하여 보정된 티오크롬 어쎄이 (티아민, TMP 및 TPP를 함유함)에 의해 측정된 바와 같은 상층액에서의 티아민 생산 평균 값 (높이)을 예시하고, 검은색 점은 개별적인 복제물 배양물로부터의 티아민 생산을 나타낸다. OD 정규화(normalized) 역가는 돌연변이 균주 (BS2019 및 BS2020)에서 표준 균주 (BS750)와 비교하여 1.43-배 향상된 것을 확인할 수 있다.
도 17 IPTG-유도성 BioB 과발현 플라스미드 pBS679 단독으로 (BS1937) 또는 FldA-Fpr의 항시적 과발현을 갖는 pBS1112 (BS2185) 또는 GFP의 항시적 과발현을 갖는 pBS1054 (BS2707)를 더한 대장균 △bioABFCD iscR H107Y (107번 위치에서 히스티딘으로부터 티로신으로의 치환을 코딩함) 균주에 의한 비오틴 생산을 나타내는 바 다이아그램. 각 균주는 100 ㎍/ml 암피실린, 기질로서 0.1 g/L 데스티오비오틴 (DTB), 및 0, 0.01, 0.025, 0.05, 0.075 또는 0.1 mM IPTG를 갖는 mMOPS에서 배양되었다. BS2185 및 BS2707을 위한 배지는 50 ㎍/ml 카나마이신을 포함하는 것을 제외하고는 동일하였다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm 진탕과 함께 성장되었다. 바는 각각의 균주에 의한 비오틴 생산 값 (높이)을 예시한다: BS1937 (검은색 바); BS2185 (회색 바); 및 BS2707 (체크무늬 회색).
도 18 IPTG-유도성 BioB 과발현 플라스미드 pBS679를 포함하는 대장균 △bioABFCD iscR H107Y (107번 위치에서 히스티딘으로부터 티로신으로의 치환을 코딩함) 균주에 의한 비오틴 생산을 나타내는 바 다이아그램. BS2185는 FldA-Fpr의 항시적 과발현을 갖는 pBS1112를 더 포함한다. BS1937은 100 ㎍/ml 암피실린, 기질로서 0.1 g/L 데스티오비오틴 (DTB), 및 0.025 mM IPTG 유도를 갖는 mMOPS에서 배양되었다. BS2185를 위한 배지는 동일하나, 50 ㎍/ml 카나마이신을 더 포함하였다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm과 함께 성장되었다. 어두운 회색 바는 비오틴 생산 평균 값 (높이) (BS1937 n=6 및 BS2185 n=8)을 예시하고, 밝은 회색 바는 종점 OD600을 예시한다. 검은색 점은 비오틴 생산 및 개별적인 복제물로부터의 종점 OD600을 나타낸다.
도 2 박테리아의 리포산 경로 및 리포일화된 리포일 도메인 (리포산 합성)을 일으키는 각각의 효소적 단계의 중간체를 나타내는 그림. 상기 경로에서의 주요한 효소는 LipA (리포산 신타아제) 및 LipB (옥타노일 단백질 ACP 운반 단백질: 단백질 트랜스퍼라제)를 포함하고, 기질은 SAM: S-아데노실-L-메티오닌. LipA는 리포에이트-단백질 리가아제 A; EC: 6.3.1.20이다.
도 3 박테리아의 티아민 경로 및 티아민 (THI); 티아민 모노포스페이트 (TMP) 및 티아민 디포스페이트 (TPP)의 합성을 일으키는 각각의 효소적 단계의 중간체를 나타내는 그림. 중간체의 약어: 5-아미노이미다졸 리보뉴클레오티드 (AIR: aminoimidazole ribonucleotide), 4-아미노-2-메틸-5-(포스포옥시메틸)피리미딘 (HMP-P: 4-amino-2-methyl- 5-(phosphooxymethyl)pyrimidine), 4-아미노-2-메틸-5-(디포스포메틸)피리미딘 (HMP-PP: 4-amino-2-methyl-5-(diphosphomethyl) pyrimidine), 1-데옥시-D-자일룰로스 5-포스페이트 (DXP: 1-deoxy-D-xylulose 5-phosphate), 디하이드로글리신 (DHG: dehydroglycine), 4-메틸-5-(2-포스포옥시에틸)티아졸 (THZ- P: 4-methyl-5-(2-phosphooxyethyl)thiazole), 아데노신 트리포스페이트 (ATP: adenosine triphosphate), 아데노신 모노포스페이트 (AMP: adenosine monophosphate), S-아데노실-L-메티오닌 (SAM: S-adenosyl-L-methionine), 환원된 니코틴아마이드 아데닌 디뉴클레오티드 포스페이트 (NADPH: reduced nicotinamide adenine dinucleotide phosphate), 니코틴아마이드 아데닌 디뉴클레오티드 포스페이트 (NADP+: nicotinamide adenine dinucleotide phosphate), 환원된 페레독신 (Fdx red: reduced ferredoxin), 산화된 페레독신 (Fdx ox: oxidized ferredoxin).
도 4 IPTG 유도성 bioB 발현 플라스미드를 포함하는 △bioB 균주 (대징균 BW25113) (오른쪽 패널); 및 IPTG 유도성 프레임시프트된 bioB (조기 종결 코돈) 발현 플라스미드를 포함하는 표준 균주 (대장균 BW25113) (왼쪽 패널)의 시간 경과에 따라 측정된 세포 밀도 (OD600에서 측정됨)의 그래픽 표현 (Graphical presentation). 0.1 g/L DTB, 50 ㎍/mL 카나마이신 및 0 (점), 0.01 (삼각형), 또는 0.1 (사각형) mM IPTG를 갖는 200 ㎕ mMOPS에서 성장된 4개의 생물학적 복제물에 대하여 OD620은 멀티스칸을 사용하여 측정되고, OD600으로 전환되었다. 각 지수 성장률 값은 인접한 상자에 표시된다.
도 5 275 rpm 진탕을 하며 37℃에서 20시간 동안의 인큐베이션한 후에 (하기 기재된 비오틴 정량화하는 방법과 같음), 0 또는 0.244 ㎍ 비오틴/mL까지 증가하는 농도의 비오틴이 보충된 40 ㎍/mL 제오신을 갖는 150 ㎕ mMOPS 상에서 성장된 플라스미드 pBS451을 포함하는 대장균 BS1011의 배양물의 최종 세포 밀도 (OD600에서 측정됨)를 나타낸 산포도의 그래픽 표현. 수직의 회색 점선은 비오틴 바이오어쎄이를 위한 0.024 내지 0.24 ㎍ 비오틴/L의 최적의 농도 범위를 식별한다.
도 6 IPTG-유도성 bioB 발현 플라스미드 (pBS412)를 각각 포함하는 4개의 생물학적 복제물에서 아미노산 돌연변이를 갖는 돌연변이 IscR을 발현하는 3개의 상이한 iscR 돌연변이 균주 (BS1377, L15F), (BS1375, C92Y) 및 (BS1353, H107Y) 및 대장균 BW25113 bioB 균주 (BS1011, Ref) (균주에 대하여 표 1 참조)의 비오틴 생산을 나타내는 바 다이아그램. 균주들은 24시간 동안 37℃에서 275 rpm 진탕과 함께 0.1 g/l DTB 및 50 ㎍/mL 카나마이신을 갖는 400 ㎕ mMOPS에서 성장되었다. 바는 평균 비오틴 생산 값 (높이) 및 IPTG 유도 수준 (회색 음영)을 예시하고, 검은색 점은 개별적인 복제물 배양물로부터의 비오틴 생산을 나타내고, 수평의 점선은 참조 야생형 균주로부터의 최대 비오틴 생산을 표시한다. IPTG가 없이 배양된 경우 검출 가능한 수준의 비오틴을 생산한 균주는 없었다.
도 7 돌연변이 IscR을 발현하는 iscR 돌연변이 균주 (BS1353, H107Y), 및 대장균 BW25113 △bioB균주 (BS1011, Ref)의 세포 밀도 및 비오틴 생산의 그래픽 표현이고, 상기 각 균주는 IPTG-유도성 bioB 유전자 발현 플라스미드 (pBS412)를 포함한다. 상기 데이터는 25시간 동안 모니터링된 각각의 iscR H107Y 돌연변이 균주 (굵은 어두운 선) 및 표준 균주, 대장균 BW25113 △bioB 균주 (밝은 회색 점선)의 3개의 생물학적 복제물의 측정된 OD600 평균, 및 25시간 동안 모니터링된 각각의 iscR H107Y 돌연변이 균주 (굵은 어두운 점) 및 표준 균주, 대장균 BW25113 △bioB 균주 (밝은 회색 점)에 의한 비오틴 생산을 나타낸다. 상기 균주들은 250 mL 배플 진탕 플라스크 (baffled shake flask)에서 0.1 g/l DTB, 0.01 (A) 또는 0.5 mM IPTG (B) 및 50 ㎍/mL 카나마이신을 갖는 50 mL mMOPS에서 37℃에서 275 rpm 진탕과 함께 성장되었다. 성장률은 검정색 박스에 나타낸다.
도 8 동정된 돌연변이 균주의 iscR 유전자에서 뉴클레오티드 및 아미노산 서열 돌연변이의 위치를 나타내기 위해 주석이 달린 IscR 코딩 서열을 나타낸 그림
도 9 막대(sticks, WT 아미노산은 회색, 돌연변이 아미노산은 검정색으로 표시됨)로 표시된 L15F 및 H107Y iscR 돌연변이를 갖는 hya DNA 결합 부위 (검정색)에 결합된 IscR 이합체 (회색)의 결정 구조 (PDB entry 4HF1)를 나타낸 그림; 및 돌연변이된 잔기를 강조한 확대 이미지.
도 10 IPTG-유도성 bioB 발현 플라스미드 및 isc-오페론 (iscSUA-hscBA-fdx, iscR 유전자가 결핍된천연 대장균 isc 오페론 구조에 상응함) 또는 강한 리보좀 결합 부위 (RBS: strong ribosomal binding site)에 작동 가능하게 연결된 대장균 suf-오페론 (sufABCDSE)을 포함하는 플라스미드 및 미디엄 카피 넘버 플라스미드(p15A ori) 또는 대조군 플라스미드로부터의 T5 LacO 억제 프로모터를 포함하는 대장균 균주의 비오틴 생산을 나타내는 바 다이아그램. 상기 대조군 플라스미드는 suf- 또는 isc-오페론 대신에 IPTG-유도성 GFP를 코딩하는 유전자를 포함하였다. 각 균주의 생물학적 삼중물 (triplicates)은 100 μg/mL 암피실린 및 50 μg/mL 스펙티노마이신을 갖는 mMOPS에서 0.1 g/l (DTB)를 기질로서 제공하여 낮은 (0.01 mM IPTG) 및 높은 (0.1 mM IPTG) 유도 하에 배양되었다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm으로 성장되었다. 바는 평균 비오틴 성장 값 (높이) 및 IPTG 유도 수준 (회색 음영)을 예시하고, 검은색 점은 개별적인 복제물로부터의 비오틴 생산을 나타내고, X표(crosses)는 OD600으로 측정된 각 균주의 종점(end-point, end) 세포 밀도를 나타낸다. 0.01 mM IPTG로 유도된 경우, 검출 가능한 수준의 비오틴을 생산한 균주는 없었다.
도 11 삼중불에서 수행된 4개의 상이한 샘플에서의 BioB 단백질 발현 수준 및 비오틴 생산의 상관관계의 그래픽 표현. 상기 균주는 pBS430 (bioB 프레임시프트 IPTG 유도성 플라스미드)을 갖는 BS1013 (대장균 BW25113, 백그라운드 균주), pBS412 (bioB IPTG 유도성 플라스미드)를 갖는 BS1011 (△bioB를 갖는 BS1013), pBS412를 갖는 BS1353 (iscR H107Y 돌연변이를 갖는 BS1011)이었다. 균주들은 0.1 g/l DTB 및 그래프에 나타낸 바와 같은 IPTG를 갖는 mMOPS에서 성장되었다.
도 12 IPTG-유도성 bioB 발현 플라스미드 및 다음 중 iscR의 게놈 변이체를 포함하는 대장균 △bioB 균주의 비오틴 생산을 나타내는 바 다이아그램: 야생형 (iscR WT), 22번 위치에서 종결 코돈으로 돌연변이된 E22* 글루탐산을 코딩하는 녹-아웃 돌연변이 (iscR KO), 92번 위치에서 시스테인으로부터 티로신으로의 치환을 코딩하는 돌연변이 (iscR C92Y). 바는 IPTG 유도의 주어진 수준 (회색의 음영)에서의 비오틴 생산 평균 값 (높이)을 예시하고, 점은 개별적인 복제물로부터의 비오틴 생산을 나타낸다. 각 균주의 생물학적 삼중물은 100 μg/mL 암피실린을 갖는 mMOPS에서 0.1 g/l DTB를 기질로서 제공하여 없는 (0 mM IPTG), 낮은 (0.01 mM IPTG) 및 낮은 (0.1 mM IPTG) 유도 하에 배양되었다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 각 균주는 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm으로 성장되었다.
도 13 비오틴-오페론 플라스미드 및 다음 중 iscR의 게놈 변이체를 포함하는 대장균 △bioA △bioBFCD 균주에 의한 비오틴 생산을 나타내는 바 다이아그램: 야생형 (iscR WT), 돌연변이 iscR H107Y (107번 위치에서 히스티딘으로부터 티로신으로의 치환을 코딩함), 92번 위치에서 시스테인으로부터 티로신으로의 치환을 코딩하는 돌연변이 (iscR C92Y). 각 균주의 생물학적 사중물 (quadruplicate)은 0.1 데스티오비오틴 (DTB)을 기질로서 제공하거나 제공하지 않고 10 μg/mL 테트라사이클린을 갖는 mMOPS에서 배양되었다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm으로 성장되었다. 바는 비오틴 생산 평균 값 (높이) 및 DTB의 공급 여부 (회색의 음영)를 예시하고, 검은색 점은 개별적인 복제물로부터의 비오틴 생산을 나타낸다.
도 14 IPTG 유도성 lipA (pBS993, 표 4 참조)를 각각 포함하는 3개의 생물학적 복제물에서 아미노산 돌연변이를 갖는 돌연변이 IscR을 발현하는 2개의 상이한 iscR 돌연변이 균주 (BS1375 C92Y) 및 (BS1353, H107Y) 및 대조군 균주 (BS1011, IscR WT) (균주에 대하여 표 1 참조)의 리포산 생산 및 생산 (X표) 24시간 후의 종점(최종) OD600을 나타내는 바 다이아그램. 균주들은 100 g/mL 암피실린, 0.1 mM 비오틴, 0.6 g/l 옥탄산 및 0.01 mM IPTG을 갖는 400 ㎕ mMOPS에서 24시간 동안 37℃에서 275 rpm 진탕과 함께 성장되었다. 바는 리포산 생산 평균 값 (높이)을 예시하고, 검은색 점은 개별적인 복제물 배양물로부터의 리포산 생산을 나타낸다. 종점 OD600이 동일하게 유지되더라도, 평균 리포산 생산은 1.79-배 증가하는 것을 볼 수 있다.
도 15 표준 균주 (대장균 BW25113), △lipA (WT, 삼각형); 및 돌연변이 iscR (C92Y)를 갖는 △lipA 균주 (C92Y, 사각형)로서, 상기 균주 모두 IPTG 유도성 lipA 발현 플라스미드 (pBS1037)를 포함하는 균주의 시간 경과에 따라 측정된, 세포 밀도 (OD620에서 측정됨)의 그래픽 표현. OD620은 0.6 g/L 옥탄산, 100 g/mL 암피실린 및 0 내지 0.04 mM IPTG (증가에 따라 회색 음영의 어둠이 증가함)을 갖는 200 ㎕ mMOPS에서 성장된 6개의 생물학적 균주 복제물에 대하여 멀티스칸을 이용하여 측정되었다. 각 생존율 (GR: growth rate)은 오른쪽에 나타낸다.
도 16 전체 티아민 경로 유전자, thiCEFSGHMD를 발현하는 플라스미드 (pBS140)를 각각 포함하는 4개의 생물학적 복제물에서 아미노산 돌연변이를 갖는 돌연변이 IscR을 발현하는 2개의 상이한 iscR 돌연변이 균주 (BS2019, C92Y) 및 (BS2020, H107Y) 및 대장균 BW25113 △thiP, thiL* 균주 (BS750, Ref) (균주에 대하여 표 5 참조)의 티아민 생산을 나타내는 바 다이아그램. 균주들은 50 ㎍/mL 카나마이신을 갖는 400 ㎕ mMOPS에서 24시간 동안 37℃에서 275 rpm 진탕과 함께 성장되었다. 바는 종점 OD600에 대하여 보정된 티오크롬 어쎄이 (티아민, TMP 및 TPP를 함유함)에 의해 측정된 바와 같은 상층액에서의 티아민 생산 평균 값 (높이)을 예시하고, 검은색 점은 개별적인 복제물 배양물로부터의 티아민 생산을 나타낸다. OD 정규화(normalized) 역가는 돌연변이 균주 (BS2019 및 BS2020)에서 표준 균주 (BS750)와 비교하여 1.43-배 향상된 것을 확인할 수 있다.
도 17 IPTG-유도성 BioB 과발현 플라스미드 pBS679 단독으로 (BS1937) 또는 FldA-Fpr의 항시적 과발현을 갖는 pBS1112 (BS2185) 또는 GFP의 항시적 과발현을 갖는 pBS1054 (BS2707)를 더한 대장균 △bioABFCD iscR H107Y (107번 위치에서 히스티딘으로부터 티로신으로의 치환을 코딩함) 균주에 의한 비오틴 생산을 나타내는 바 다이아그램. 각 균주는 100 ㎍/ml 암피실린, 기질로서 0.1 g/L 데스티오비오틴 (DTB), 및 0, 0.01, 0.025, 0.05, 0.075 또는 0.1 mM IPTG를 갖는 mMOPS에서 배양되었다. BS2185 및 BS2707을 위한 배지는 50 ㎍/ml 카나마이신을 포함하는 것을 제외하고는 동일하였다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm 진탕과 함께 성장되었다. 바는 각각의 균주에 의한 비오틴 생산 값 (높이)을 예시한다: BS1937 (검은색 바); BS2185 (회색 바); 및 BS2707 (체크무늬 회색).
도 18 IPTG-유도성 BioB 과발현 플라스미드 pBS679를 포함하는 대장균 △bioABFCD iscR H107Y (107번 위치에서 히스티딘으로부터 티로신으로의 치환을 코딩함) 균주에 의한 비오틴 생산을 나타내는 바 다이아그램. BS2185는 FldA-Fpr의 항시적 과발현을 갖는 pBS1112를 더 포함한다. BS1937은 100 ㎍/ml 암피실린, 기질로서 0.1 g/L 데스티오비오틴 (DTB), 및 0.025 mM IPTG 유도를 갖는 mMOPS에서 배양되었다. BS2185를 위한 배지는 동일하나, 50 ㎍/ml 카나마이신을 더 포함하였다. 비오틴 생산이 성장 기반의 바이오어쎄이를 이용하여 평가된 후, 상기 균주들은 딥 웰 플레이트에서 24시간 동안 37℃에서 275 rpm과 함께 성장되었다. 어두운 회색 바는 비오틴 생산 평균 값 (높이) (BS1937 n=6 및 BS2185 n=8)을 예시하고, 밝은 회색 바는 종점 OD600을 예시한다. 검은색 점은 비오틴 생산 및 개별적인 복제물로부터의 종점 OD600을 나타낸다.
정의:
아미노산 서열 상동성: 본원에서 사용된 용어 “서열 상동성”은 실질적으로 동일한 길이의 2개의 아미노산 서열 사이의 상동성 정도의 정량적인 치수를 나타낸다. 비교되는 2개의 서열은 갭(gaps)의 삽입 또는 대안적으로, 단백질 서열의 말단에서의 절단에 의해 가능한 최상의 핏(fit)을 제공하도록 정렬되어야 한다. 서열 상동성은 ((Nref- Ndif) 100)/(Nref)로 계산될 수 있고, 상기 Ndif는 정렬된 경우의 2개의 서열에서 비-동일한 잔기의 총 수이고, Nref는 서열 중 하나의 잔기의 수이다. 서열 상동성 계산은 바람직하게는 BLAST 프로그램, 예를 들어, BLASTP 프로그램 (Pearson W.R and DJ. Lipman (1988)) (www.ncbi.nlm.nih.gov/cgi-bin/BLAST)을 사용하여 자동화된다. 다중 서열 정렬은 http://www2.ebi.ac.uk/clustalw/ 에서 이용 가능한 Thompson J., et al 1994에 의해 기술된 바와 같은 디폴트 파라미터를 갖는 서열 정렬 방법 ClustalW로 수행된다. 바람직하게, 폴리펩티드에서 하나 이상의 아미노산 잔기의 치환, 삽입, 첨가 또는 결실의 수는, 이의 비교 폴리펩티드와 비교하여, 즉, 1, 2, 3, 4, 5, 6, 7, 8, 9, 또는 10 이하의 치환, 1, 2, 3, 4, 5, 6, 7, 8, 9, 또는 10 이하의 삽입, 1, 2, 3, 4, 5, 6, 7, 8, 9, 또는 10 이하의 첨가, 및 1, 2, 3, 4, 5, 6, 7, 8, 9, 또는 10 이하의 결실로 한정된다. 바람직하게, 상기 치환은 보존적 아미노산 치환이다: 제1 군: 글리신, 알라닌, 발린, 류신, 이소류신; 제2 군: 세린, 시스테인, 셀레노시스테인, 트레오닌, 메티오닌; 제3 군: 프롤린; 제4 군: 페닐알라닌, 티로신, 트립토판; 제5 군: 아스파르테이트, 글루타메이트, 아스파라긴, 글루타민인 군의 구성원 내에서의 교환으로 제한됨.
아미노산 약어: 류신 (L), 시스테인 (C), 및 히스티딘 (H).
내인성 유전자: 숙주 박테리아와 기원이 동일한 박테리아 세포 게놈 내의 유전자 (즉 숙주 박테리아의 천연 유전자)이다. 내인성 유전자는 당업계에 알려진 도구를 사용하여 유전적으로 변형될 수 있으며, 이에 의해 유전적으로 변형된 내인성 유전자는 이것이 유래된 모체 내인성 유전자에 의해 코딩되는 폴리펩티드와 하나 이상의 위치에서 아미노산 서열이 상이한 돌연변이 폴리펩티드를 코딩한다.
게놈: 세포 또는 유기체에 존재하는 유전적 물질이다; 상기 세포 또는 유기체를 구축하고 유지하기 위해 필요한 모든 정보를 포함하는 게놈은 세포 또는 유기체 내에 존재하는 염색체(들) 및 플라스미드(들) 모두에서 유전 물질을 포함한다.
GFP: 녹색 형광 단백질.
gi 번호: 유전자정보 검색번호(genInfo identifier)는 DDBJ/EMBL/GenBank로부터의 뉴클레오티드 서열, SWISS-PROT, PIR 및 다른 많은 것들로부터의 단백질 서열을 포함하여, Entrez로 처리된 모든 서열에 NCBI에 의해 할당되는 데이터베이스 근원과 관계 없이, 특정 서열을 식별하는 독특한 정수이다.
Isc 경로: 철 황 클러스터 경로; iscR 유전자를 포함한 isc 오페론에 의해 코딩됨.
멀티스칸(Multiskan): 필터-기반의 마이크로플레이트 광도계; 600 - 620nm을 포함한, 340 내지 850 nm 범위의 파장에서 96 또는 384-웰 플레이트 포맷으로부터 흡광도를 측정하기 위함. 플레이트를 최대 50℃의 선택된 온도에서 광도계에서 인큐베이트된다. 광도계는 Thermo Scientific에 의해 제공된다.
천연 유전자 (Native gene): 숙주 박테리아와 동일한, 박테리아 세포 게놈 내에서 내인성 유전자.
비-천연 프로모터 (Non-native promoter): 본 발명의 유전자 변형 박테리아와 관련하여, 상기 세포에서 유전자 또는 전이 유전자에 작동 가능하게-연결된 프로모터이며, 상기 프로모터는 자연에서 발견된 박테리아 세포에서 상기 유전자 또는 전이 유전자에 작동 가능하게-연결된 것으로 발견되지 않을 것임.
OD (Optical Density): 광학 밀도
전이 유전자(Transgene): 게놈 공학에 의해 박테리아의 게놈 내로 도입된 외인성 유전자이다. 본 발명의 내용에서, 상기 개놈은 염색체 및 에피솜 유전자 요소를 모두 포함한다.
발명의 상세한 설명
비오틴, 리포산 및 티아민의 합성을 위한 생합성 경로의 공통적인 특징은 복잡한 라디칼-매개 분자 재배열을 촉매하기 위한 하나 이상의 SAM 또는 AdoMet 라디칼 효소에 대한 필요 조건이다. 비오틴 신타아제, 리포산 신타아제, HMP-P 신타아제, 및 티로신 리아제는 이들 경로에서 이들 필수 단계를 촉매하는 것으로 알려진 각각의 효소들이다. 비오틴 오페론의 과발현, 또는 심지어 BirA 억제자에 의한 피드백 조절에 영향을 받지 않는 돌연변이 비오틴 오페론의 사용에 의한, 대장균에서 비오틴 생합성을 증가시키기 위한 초기 시도의 실패는 성장의 강력한 억제 때문이었다 (Ifuku, 0. et al., 1995). 관찰된 성장 억제에 대하여 증거-기반의 어떠한 설명도 없는 경우; 비오틴 신타아제 과-발현의 독성을 설명할 수 있는 세포 인자를 규명하기 위한 대안적인 접근법이 필요했다.
본 발명에 의해 제공되는, 이 문제에 대한 해결책은 박테리아 세포 공장 (예를 들어, 대장균)의 세포들에서 비오틴 신타아제, 리포산 신타아제, HMP-P 신타아제 및 티로신 리아제의 발현을 증가시키기 위해 동일하게 적용 가능한 것으로 나타난다. 이 문제를 해결하기 위한 접근법은 불완전한 오류-정정(error-correcting) 폴리머라제에 의해 생성된 백그라운드 돌연변이의 축적으로 인해 진화된 게놈 다양성을 갖는 대장균 세포의 라이브러리를 생성하는 것이었다. 이러한 라이브러리의 세포를 IPTG-유도성 bioB 유전자 발현 카세트를 포함하는 플라스미드로 형질전환시켰다. 후보 돌연변이는 돌인변이 세포가 유래된 모체 대장균 균주에서 BioB 발현 독성을 유도하기에 충분한 농도에서 IPTG 존재 하에 성장할 수 있는 라이브러리 내의 세포들이었다.
선택된 BioB-발현 돌연변이 균주의 증식을 위한 유전적 기초는 전체 게놈 시퀀싱에 의해 확립되었다. 놀랍게도, 3개의 균주들은 천연 철 황 클러스터 조절 유전자 (iscR)에서 돌연변이를 갖는 것으로 밝혀졌다; 이것은 다면 발현성 전사 인자 (IscR) [서열번호 2]를 코딩한다. Fe-S 클러스터는 많은 단백질 및 필수적 효소들의 보조인자로서, 비오틴과 같은 S-합유 화합물의 합성을 위해 단독으로 필요할 뿐만 아니라, 산화환원- (redox-) 또는 철-관련 스트레스 조건에 대한 센서로서 이들에 다양한 생화학적 능력을 부여한다.
IscR은 Fe-S 클러스터 단순-단백질(holo-protein), 또는 Fe-S 클러스터가 없는 아포단백질(apoprotein)로서 2가지 상태로 존재한다. IscR의 Fe-S 클러스터의 어셈블리는 isc 오페론에 의해 코딩되는 Isc 경로에 의해 촉매된다. isc 오페론은 먼저 조절자 (IscR)를 코딩하고, 그런 다음 시스테인 디설퍼라제(desulphurase) (IscS), 스캐폴드 (IscU), A-타입 단백질 (IscA), Dnaj-유사 코-샤페론(co-chaperone) (HscB), DnaK-유사 샤페론 (HscA) 및 페레독신 (Fdx)을 코딩한다. Isc 경로는 IscR 완전효소의 어셈블리를 위해 필수적일 뿐만 아니라, 대장균에서 Fe-S 클러스터 생물발생(biogenesis)을 위한 주요한 경로이다 (도 1B).
IscR의 2가지 형태 간의 비율은 [2Fe-2S] 클러스터의 세포 수준에 의해 결정되고, 이는 결국 철- 및 산소 수준을 포함한 여러 요인에 의해 영향을 받는다 (Py, B. & Barras, 2010). 철-풍부한 조건 하에, IscR은 주로 완전 효소로 존재하고, isc 오페론의 전사 억제자로 작용한다. 그러나, 철-적은 ([2Fe-2S] 클러스터의 낮은 수준) 조건 하에, IscR은 아포-단백질 상태로 전환되고, 이는 isc 오페론의 전사를 가능하게 한다. 이의 아포-단백질 상태에서, IscR은 산화 스트레스 하에 Fe-S 클러스터 생물발생을 촉매하는 sufABCDSE 오페론의 활성인자 (activator)로서 역할을 한다.
대장균에서 2개의 Fe-S-클러스터 어셈블리의 발현을 조절하는 것 외에도, IscR은 산화 스트레스 매커니즘 (예를 들어, sodA), 특이적 및 전면적(global) 조절자 (예를 들어, yqjI 및 soxS), 아미노산 생합성 (예를 들어, argE), 및 알려지지 않은 기능을 가진 다양한 유전자와 같은 작용의 다양한 메커니즘에 관여된 >40 개의 유전자들을 조절한다. IscR의 역할은 IscR 조절 환경이 호기성 및 혐기성 조건 사이에서 변화한다는 사실에 의해 더욱 복잡해진다 (Martin, and Imlay, 2012; Giel et al., 2006).
IscR의 항상성 역할; 및 전면적 유전자 조절에서의 이의 역할의 관점에서; 이의 조절 특성의 임의의 변형의 결과는 예측할 수 없으며, 아마도 세포 대사에 대해 심오하다. 또한, 황 형성 (suf) 및 isc 경로 모두의 증가된 발현으로 인해, Fe-S 클러스터 생물발생이 증가되는 세포 조건은 축적된 Fe-S 클러스터가 팬톤 반응(fenton reactions)에 의해 퍼옥사이드 라디칼을 생성할 위험을 생성한다.
이러한 관점에서, 3개의 분리된 개별적인 돌연변이에 의해 입증된 바와 같이, IscR이 세포 BioB의 활성 및 독성에 대해 매우 중요한 것으로 발견되는 것은 매우 예상하지 못한 것이었다. 또한, IscR 단백질의 중요성은 예상치 못한 것이었고, 이는 Fe-S 클러스터를 합성하고 조립하는 증가된 능력을 제공하는 isc 오페론 또는 suf 오페론의 과-발현이 bioB를 과-발현하는 세포에서 비오틴 생산을 증가시키는 것으로 발견되지 않았기 때문이다 (실시예 1, 도 10 참조). 또한, iscR 유전자 녹-아웃에 의한 세포 iscR 조절의 제거는 세포에서 비오틴 생산을 증가시키는데 실패하였다 (실시예 1, 도 12 참조).
돌연변이 세포에서 BioB 발현의 독성을 제거하는 IscR 단백질에서의 3가지의 다른 돌연변이는 아미노산들의 단일 아미노산 치환, L15 [서열번호 16], C92 [서열번호 18] 및 H107 [서열번호 20]이었다 (도 8). 3개의 돌연변이 중 2개는 IscR의 잔기에 정확하게 상응하고, 이들 각각은 IscR 단순-단백질의 형성에 필수적인 것으로 알려져 있다. 대장균에서 관찰된 바와 같이, IscR은 특이한 Fe-S 클러스터 결찰(ligation) 메커니즘을 가짐으로써, Fe-S 클러스터 결찰에 필수적인 잔기는 H107뿐만 아니라, C92, C98, 및 C104이다. 이 비정형 결찰은 다른 Fe-S 단백질에 비해 IscR의 완전효소 상태의 낮은 안정성을 부여할 수 있고, 이는 결국 낮은 Fe-S 조건 동안 아포-단백질 상태로의 전환을 설명한다 (Fleischhacker et al., 2012).
이론에 의해 구속되고자 하는 것은 아니나, 이는 본 발명의 돌연변이 iscR 유전자를 발현하는 반면에, 철-황 클러스터 함유 유전자들(비오틴 신타아제, 리포산 신타아제, HMP-P 신타아제 및 티로신 리아제)의 어셈블리를 심지어 이들의 과-발현 동안에 촉진하는 세포에서, Fe-S 클러스터 생물발생의 항상성 조절 및 세포 성장에 필요한 전면적인 유전자 조절이 유례없이 보존됨을 시사한다.
요약하면, 본 발명자들은 Fe-S 클러스터의 결찰에 필요한 하나 이상의 아미노산 잔기의 부족을 특징으로 하는, 돌연변이 IscR 단백질을 코딩하는 돌연변이 iscR 유전자로서, 그 결과 발현된 돌연변이 IscR 단백질이 아포-단백질 형태로만 존재하도록 하는 iscR 유전자를 규명하였다. 박테리아에서 비오틴, 리포산 또는 티아민의 생산을 증가시키기 위해 노력으로, 효소를 함유하는 철-황 클러스터의 합성은 상당한 장애물을 구성하는 것으로 보여진다. 본 발명에 의해 제공된 바와 같이, 이 문제에 대한 해결책은 아포-단백질 상태로 존재하는 돌연변이 IscR 단백질을 코딩하는 유전자를 포함하는 세포 공장에서 이들 효소의 과-발현에 의해 촉진된다. 본 발명의 다양한 구현예는 하기에 보다 자세히 기술된다.
I 비오틴의 생산을 위한 유전자 변형 박테리아 세포
본 발명은 증가된 수준의 비오틴을 생산할 수 있는 유전자 변형 박테리아 세포를 제공한다. 상기 박테리아 세포는 야생형 IscR을 대체하여 돌연변이 IscR을 발현하고, 비오틴 신타아제 (EC 2.8.1.6을 갖는 비오틴 신타아제)를 코딩하는 전이 유전자를 포함하도록 유전적으로 변형된다. 선택적으로, 상기 유전자 변형 박테리아 세포는 비오틴 경로 (도 1A)의 추가적인 단계를 촉매하는 폴리펩티드를 코딩하는 하나 이상의 추가적인 전이 유전자를 더 포함할 수 있다. 비오틴 경로에서 단계들을 촉매하는 이들 폴리펩티드 수준의 증가는 박테리아 세포에서 비오틴 경로에서의 중간체 및 상기 경로의 최종 생성물 (비오틴) 모두의 합성을 향상시킨다.
유전자 변형 박테리아 세포에 의해 발현되는, 돌연변이 IscR 폴리펩티드는 폴리펩티드 백본 (아포-단백질)을 특징으로 하는 IscR 폴리펩티드 군의 야생형 구성원으로부터 유래된다. IscR 폴리펩티드 군의 야생형 구성원의 아미노산 서열은 서열번호 2, 4, 6, 8, 10, 12 및 14 중 어느 하나로부터 선택된 서열과 적어도 70, 75, 80, 85, 90, 95, 96, 98, 100% 아미노산 서열 상동성을 갖는다. 본 발명에 따른 돌연변이 IscR 폴리펩티드의 아미노산 서열은 적어도 하나의 아미노산 치환에 의해 유래된 상응하는 야생형 IscR 폴리펩티드의 아미노산 서열과 상이하다; 상기 치환은 L15X, C92X, C98X, C104X, 및 H107X로부터 선택된다; 상기 치환 아미노산인 X는 돌연변이가 유래된 야생형 IscR에서 상응하는 위치에서 발견되는 아미노산 이외의 임의의 아미노산이다.
대안적인 구현예에서, 상기 돌연변이 IscR에서 아미노산 치환은 L15X로서, 상기 X는 L 이외의 임의 아미노산이고, 보다 바람직하게 X는 페닐알라닌 (F), 티로신 (Y), 메티오닌 (M) 및 트립토판 (W)으로부터 선택되는 것인 L15X; C92X로서, 상기 X는 C 이외의 임의의 아미노산이고, 보다 바람직하게는 X는 티로신 (Y), 알라닌 (A), 메티오닌 (M), 페닐알라닌 (F) 및 트립토판 (W)으로부터 선택되는 것인 C92X; C98X로서, 상기 X는 C 이외의 임의의 아미노산이고, 보다 바람직하게는 X는 알라닌 (A), 발린 (V), 이소류신 (I), 류신 (L), 페닐알라닌 (F) 및 트립토판 (W)으로부터 선택되는 것인 C98X; Cys104X로서, 상기 X는 C 이외의 임의의 아미노산이고, 보다 바람직하게는 X는 알라닌 (A), 발린 (V), 이소류신 (I), 류신 (L), 페닐알라닌 (F) 및 트립토판 (W)으로부터 선택되는 것인 Cys104X; 및 His107X로서, 상기 X는 H 이외의 임의의 아미노산이고, 보다 바람직하게는 X는 알라닌 (A), 티로신 (Y), 발린 (V), 이소류신 (I), 및 류신 (L)으로부터 선택되는 것인 His107X로부터 선택되는 것이다. 예를 들어, 상기 돌연변이 IscR에서 아미노산 치환은 L15F, C92Y, C92A, C98A, Cys104A, H107Y, 및 H107A 중에서 선택되는 것일 수 있다.
본 발명의 유전자 변형 박테리아 세포에 의해(야생형 IscR을 대체하여) 발현되는 돌연변이 IscR은 염색체 상 또는 자기-복제 플라스미드 상에, 박테리아 세포의 게놈에 위치한, 유전적으로 변형된 유전자에 의해 코딩된다. 상기 염색체에서 유전적으로 변형된 iscR 유전자는 천연 게놈에서 야생형 iscR 유전자와 동일한 위치에 있는 게놈에 위치할 수 있다. 천연 야생형 iscR 유전자는 결실되거나 유전적으로 변형된 iscR 유전자에 의해 직접적으로 치환되므로, 본 발명의 유전자 변형 박테리아 세포의 게놈은 천연 야생형 iscR 유전자가 결여된다. 유전적으로 변형된 iscR 유전자의 발현을 구동하는 프로모터는 상기 유전적으로 변형된 iscR 유전자가 유래되거나 대체된 야생형 iscR 유전자의 천연 프로모터일 수 있다. 대안적으로, 상기 프로모터는 이종의 항시성 또는 유도성 프로모터일 수 있다. 상기 프로모터가 이종의 항시성 프로모터인 경우, 적합한 프로모터는: apFab 패밀리 [서열번호 230-232]을 포함하는 반면에, 적합한 유도성 프로모터는: pBad (아라비노스 유도성 [서열번호 233] 및 LacI [서열번호 234]를 포함한다. 적합한 종결자(terminator)는 [서열번호 235-237]를 포함하는 apFAB 종결자 패밀리의 구성원을 포함한다.
본 발명에 따른 비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드는 데스티오비오틴의 비오틴으로의 전환을 촉매하는 비오틴 신타아제 활성을 갖는 폴리펩티드이다. 비오틴 신타아제의 이 군의 구성원은 광범위한 속(genera)에 속하는 박테리아에서 발견된 유전자에 의해 코딩된다. 비오틴 신타아제 활성을 갖는 폴리펩티드의 아미노산 서열은 이들 중 어느 하나로부터 선택된 서열과 적어도 70, 75, 80, 85, 90, 95, 96, 98, 100% 아미노산 서열 상동성을 갖는다: 서열번호 22 (기원: 에셔리키아 콜라이(Escherichia coli)); 서열번호 27 (기원: 캔디다투스 클로라시도박테리움 써모필룸 B(Candidatus Chloracidobacterium thermophilum B)); 서열번호 29 (기원: 스트렙토마이세스 리디커스(Streptomyces lydicus)); 서열번호 31 (기원: 파라코커스 데니트리피칸스(Paracoccus denitrificans)); 서열번호 33 (기원: 파라코커스 데니트리피칸스 PD1222); 서열번호 35 (기원: 아그로박테리움 비티스(Agrobacterium vitis)); 서열번호 37 (기원: 루에제리아 포메로이(Ruegeria pomeroyi); 서열번호 39 (기원: 아그로박테리움 파브룸(Agrobacterium fabrum)); 서열번호 41 (기원: 시멕스 렉툴라리우스의 볼바키아속 내공생자(Wolbachia endosymbiont of Cimex lectularius)); 서열번호 43 (기원: 스핀고모나스 파우치모빌리스(Sphingomonas paucimobilis)); 서열번호 45 (기원: 애시디싸이오바실러스 페리보란스(Acidithiobacillus ferrivorans)); 서열번호 47 (기원: 갈리오넬라 캡시페리포르만스(Gallionella capsiferriformans)); 서열번호 49 (기원: 랄스토니아 유트로파(Ralstonia eutropha)); 서열번호 51 (기원: 보르데텔라 파라퍼투스(Bordetella parapertussis)); 서열번호 53 (기원: 푸실리모나스 종(Pusillimonas sp.)); 서열번호 55 (기원: 케나르카이움 심비오숨 종(Cenarchaeum symbiosum sp.)); 서열번호 57 (기원: 알리사이클로바실러스 아시도칼다리우스 종(Alicyclobacillus acidocaldarius sp.)); 서열번호 59 (기원: 게오바실루스 써모글루코시다시우스(Geobacillus thermoglucosidasius); 서열번호 61 (기원: 바실러스 서브틸리스(Bacillus subtilis)); 서열번호 63 (기원: 리시니바실러스 스파이리쿠스(Lysinibacillus sphaericus)); 서열번호 65 (기원: 메틸로코커스 캡슐라터스(Methylococcus capsulatus)); 서열번호 67 (기원: 레클레르시아 아데카르복시라타(Leclercia adecarboxylata)); 서열번호 69 (기원: 크로모할로박터 살렉시젠스(Chromohalobacter salexigens)); 서열번호 71, 73, 75, 77, 79, 81, 83, 85, 87 (기원: 슈도모나스 종(Pseudomonas spp)).
유전자 변형 박테리아 세포에서 추가적인 전이 유전자에 의해 코딩되고, 및 비오틴 경로의 중간체 및 생성물 모두의 합성을 증가시키는 역할을 활성을 갖는 폴리펩티드는 다음과 같다:
a) SAM (S-아데노실메티오닌)-의존적 메틸트랜스퍼라제 (BioC; EC 2.1.1.197) 활성을 갖는 폴리펩티드; 예를 들어, 서열번호 89와 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
b) 7-케토-8-아미노펠라르곤산 (KAPA) 신타아제 (BioF; EC 2.3.1.47) 활성을 갖는 폴리펩티드, 예를 들어, 서열번호 91과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
c) 7,8-디아미노펠라르곤산 (DAPA) 신타아제 (BioA; EC: 2.6.1.62) 활성을 갖는 폴리펩티드, 예를 들어, 서열번호 93과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드; 또는 L-리신:8-아미노-7-옥소노나노에이트 아미노트랜스퍼라제 (BioK; EC: 2.6.1.105) 활성을 갖는 폴리펩티드, 예를 들어, 서열번호 97과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
d) 데스티오비오틴 (desthiobiotin, DTB) 신타아제 (BioD; E.C 6.3.3.3) 활성을 갖는 폴리펩티드, 예를 들어, 서열번호 95와 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드; 및 선택적으로
f) 피멜로일-[아실-운반 단백질] 메틸 에스테르 에스테라제 (BioH; EC: 3.1.1.85) 활성을 갖는 폴리펩티드, 예를 들어, 서열번호 99와 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드; 또는
g) 6-카복시헥사노에이트-CoA 리가제 (BioW; EC 6.2.1.14) 활성을 갖는 폴리펩티드; 예를 들어, 서열번호 101과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
비오틴 경로의 추가적인 단계를 촉매하는 폴리펩티드를 코딩하는 하나 이상의 추가적인 전이 유전자와 함께, BioB를 코딩하는 전이 유전자는 박테리아 세포 염색체 내로 또는 자기-복제 플라스미드 상에 통합되는 유전자 변형 박테리아 세포의 게놈에 위치한다. BioB 및 비오틴 경로 효소들(BioABFCD 및 H 또는 W)에서 하나 이상의 효소를 코딩하는 전이 유전자는 하나 이상의 오페론 내의 게놈에 존재할 수 있다.
하나 이상의 추가적인 전이 유전자와 함께 BioB를 코딩하는 전이 유전자의 발현을 유도하는 프로모터는 바람직하게는 이종의 항시성-프로모터 또는 유도성-프로모터일 수 있는 비-천연 프로모터이다. 상기 프로모터가 이종의 항시성 프로모터인 경우, 적합한 프로모터는 apFab 패밀리 [서열번호 230-232]를 포함하는 반면에, 적합한 유도성 프로모터는: pBad (아라비노스 유도성 [서열번호 233] 및 LacI [서열번호 234]를 포함한다. 적합한 종결자는 [서열번호 235-237]를 포함하는 apFAB 종결자 패밀리의 구성원을 포함한다. 선택된 프로모터 및 종결자는 BioB에 대한 코딩 서열; 및 BioC, BioD, BioA, BioF, 및 BioW 또는 BioH 폴리펩티드에 대한 하나 이상의 코딩 서열에 작동 가능하게 연결되는 것일 수 있고, 또한 선택된 Bio 폴리펩티드를 코딩하는 하나 이상의 오페론에 작동 가능하게 연결되는 것일 수 있다.
II 본 발명에 따른 유전자 변형 박테리아를 이용하여 비오틴을 생산하고 검출하는 방법
비오틴은 비오틴의 생합성을 위해 적합한 탄소원을 포함하고; 배양에 의해 생산된 비오틴을 최종적으로 회수할 뿐만 아니라, 성장을 지지하기에 적합한 배양 배지 내로 세포를 도입함으로써, 본 발명의 유전적 변형된 박테리아 세포(예를 들어, 유전자 변형 대장균 세포)를 사용하여 비오틴은 생산되고 수출될 수 있다.
비오틴 신타아제 (BioB)를 코딩하는 전이 유전자를 포함하는 본 발명의 유전자 변형 박테리아 세포는 공급된 탄소원이 데스티오비오틴(DTB)를 포함하는 경우, 비오틴을 증가된 수준으로 생산할 것이다. 본 발명의 유전자 변형 박테리아 세포는 BioA, BioF, BioC, BioD, 및 BioH 또는 BioW 각각을 코딩하는 전이 유전자를 추가적으로 포함하고, 이는 공급된 탄소원이 글루코스, 말토스, 갈락토스, 프럭토스, 수크로스, 아라비노스, 자일로스, 라피노스, 만노스, 및 락토스 중에서 선택되는 경우, 비오틴을 생산할 것이다 (실시예 1, 도 13).
본 발명의 유전자 변형 박테리아 세포에 의해 생산된 세포 외 비오틴을 정량화하는 방법은 실시예 1.5에 기술된다. 상기 방법은 본 발명의 세포의 배양으로부터 유래된 세포 외 증식 배지로 보충된 비오틴-결핍 증식 배지 내에 플라스미드 pBS451을 포함하는 BS1011의 비오틴-결핍된(starved) 하룻밤 동안의 배양의 성장을 측정하는 것에 기초한, 바이오어쎄이 (bioassay)이다. 도 5에 나타낸 바와 같이, 비오틴 기준(standards)의 알려진 농도 범위로 보충된 경우, 비오틴 바이오어쎄이 검정 곡선 (calibration curve)이 비오틴-결핍된 하룻밤 동안의 배양의 성장을 측정함으써 작성된다.
III 리포산의 생산을 위한 유전자 변형 박테리아 세포
본 발명은 리포산의 증가된 수준을 생산할 수 있는 유전자 변형 박테리아 세포를 제공한다. 본 발명에 따르면, 상기 박테리아 세포는 리포산 신타아제 (EC 2.8.1.8)를 코딩하는 전이 유전자를 포함할 뿐만 아니라, 야생형 IscR 대신에 돌연변이 IscR을 발현하도록 유전적으로 변형된다 (섹션 I 참조). LipA는 2개의 황 결합의 형성을 촉진함으로써 공유결합된 옥타노일-도메인을 리포일 도메인으로의 전환을 촉매한다. 선택적으로, 상기 유전자 변형 박테리아 세포는 리포산 합성 경로의 추가적인 단계를 촉매하는 폴리펩티드를 코딩하는 하나 이상의 추가적인 전이 유전자를 더 포함할 수 있고 (도 2), 보다 구체적으로는 예를 들어 LipB 유전자의 상기 코딩된 폴리펩티드 LipB; EC:2.3.1.181, 및 예를 들어 aceF 유전자의 상기 코딩된 폴리펩티드 E2; EC:2.3.1.12. 박테리아 세포에서 리포산 경로에서의 단계를 촉매하는 이들 폴리펩티드의 수준의 증가는 상기 경로의 중간체 및 최종 산물 모두의 합성을 증가시킨다. LplA, 리포에이트-단백질 리가아제 A; EC:6.3.1.20를 코딩하는 추가적인 전이 유전자는 옥타노일 모이어티의 활성화된 리포일 도메인으로의 전달을 촉매함으로써, 옥탄산이 공급된 세포에서 리포산의 합성을 촉진하는 역할을 한다.
리포산 신타아제는 광범위한 속에 속하는 광범위한 박테리아 및 진균에서 발견되는 유전자에 의해 코딩된다. 리포산 신타아제 활성을 갖는 폴리펩티드의 아미노산 서열은 다음 중 어느 하나로부터 선택된 서열과 적어도 70, 75, 80, 85, 90, 95, 96, 98, 100% 아미노산 서열 상동성을 갖는다: 서열번호 103 (기원: 에셔리키아 콜라이); 서열번호 105 (기원: 바실러스 서브틸리스); 서열번호 107 (기원: 사카로미세스 세레비제(Saccharomyces cerevisiae)); 서열번호 109 (기원: 슈도모나스 푸티다 (Pseudomonas putida); 서열번호 111 (기원: 박테로이데스 프라길리스 (Bacteroides fragilis)); 및 서열번호 113 (기원: 스트렙토마이세스 씰리칼라 (Streptomyces coelicolor)).
유전자 변형 박테리아 세포에서 추가적인 전이 유전자에 의해 코딩되고, 리포산 경로의 중간체 및 생성물 모두의 합성을 증가시키는 역할을 하는 폴리펩티드는 다음과 같다:
a) 옥타노일트랜스퍼라제 활성을 갖는 폴리펩티드 (ACP로부터 타겟 효소의 E2 서브유닛의 아포-리포일 도메인으로의 옥타닐 잔기의 이동을 위한; LipB; EC: 2.3.1.181, 예를 들어, 서열번호 115 (기원: 에셔리키아 콜라이) 또는 서열번호 117 (기원: 시겔라플렉스너리 (Shigella flexneri))과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
b) 피루브산 탈수소효소 (E2; EC: 2.3.1.12)의 디하이드로리포일라이신-잔기 아세틸트랜스퍼라제 성분을 포함하는 폴리펩티드, 예를 들어 서열번호 119 (기원: 에셔리키아 콜라이), 또는 서열번호 121 (기원: 클렙시엘라 옥시토카 (Klebsiella oxytoca)) 또는 서열번호 239 (하이브리드 서열(hybrid sequence))과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드.
c) 리포에이트-단백질 리가아제 A (EC:6.3.1.20) 활성을 갖는 폴리펩티드, 예를 들어 서열번호 123 (기원: 에셔리키아 콜라이) 또는 서열번호 125 (기원: 클렙시엘라 옥시토카)과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드.
리포산 경로의 추가적인 단계를 촉매하는 폴리펩티드를 코딩하는 하나 이상의 추가적인 전이 유전자와 함께 리포산 신타아제를 코딩하는 전이 유전자는 박테리아 세포 염색체 내로 또는 자기-복제(self-replicating) 플라스미드 상에 통합되는 유전자 변형 박테리아 세포의 게놈에 위치한다. LipA를 코딩하는 전이 유전자 및 리포산 경로 효소를 코딩하는 하나 이상의 전이 유전자 (IpB, IplA, 및 AceF)는 하나 이상의 오페론 내에 게놈에 존재할 수 있다.
LipA를 코딩하는 전이 유전자 및 하나 이상의 추가적인 전이 유전자의 발현을 유도하는 프로모터는 바람직하게는 이종의 항시성-프로모터 또는 유도성-프로모터일 수 있는 비-천연 프로모터이다. 상기 프로모터가 이종의 항시성 프로모터인 경우, 적합한 프로모터는 apFab 패밀리 [서열번호 230-232]를 포함하는 반면에, 적합한 유도성 프로모터는: pBad (아라비노스 유도성 [서열번호 233] 및 LacI [서열번호 234]를 포함한다. 적합한 종결자는 [서열번호 235-237]를 포함하는 apFAB 종결자 패밀리의 구성원을 포함한다. 선택된 프로모터 및 종결자는 개별적인 유전자 조절을 제공하기 위해 또는 오페론의 조절을 위해, 각각의 유전자에 작동 가능하게 연결되는 것일 수 있다.
IV 본 발명의 유전자 변형 박테리아를 이용하여 리포산을 생산하고 검출하는 방법
실시예 2 및 도 14에 예시된 바와 같이, 리포산은 본 발명의 유전자 변형 박테리아 세포(예를 들어, 유전자 변형 대장균 세포)를 이용하여 적합한 배양 배지 내로 상기 세포를 도입하고; 세포에 의해 생산된 리포산을 최종적으로 회수함으로써 생산될 수 있다.
리포산 신타아제 (LipA)를 코딩하는 전이 유전자를 포함하는 본 발명의 유전자 변형 박테리아 세포는 공급된 탄소원이 옥탄산 (OA)를 포함하는 경우 리포산을 생산할 것이다. 상기 세포는 적합한 탄소원, 예를 들어, 글루코스, 말토스, 갈락토스, 프럭토스, 수크로스, 아라비노스, 자일로스, 라피노스, 만노스, 및 락토스 중에서 선택된 탄소원으로 공급될 경우, 리포산을 생산할 것이다.
본 발명의 유전자 변형 박테리아 세포에 의해 생산된 세포 외 리포산을 정량화하는 방법은 실시예 2에 기술된다. 상기 방법은 본 발명의 세포로부터 추출된 리포산으로 보충된 최소 배지 상에서 리포산-의존적 영양요구성(auxotrophic) 대장균 균주의 성장을 측정하는 것에 기초한, 바이오어쎄이이다.
V 티아민의 생산을 위한 유전자 변형 박테리아 세포
본 발명은 티아민을 증가된 수준으로 생산할 수 있는 유전자 변형 박테리아 세포를 제공한다. 본 발명에 따르면, 상기 박테리아 세포는 야생형 IscR을 대체하여 돌연변이 IscR을 발현하고, thiC에 의해 코딩되는 HMP-P 신타아제라고도 불리는 포스포메틸피리미딘 신타아제 (EC 4.1.99.17); 또는 thiH에 의해 코딩되는 티로신 리아제 (2-이미노아세테이트 신타아제 (EC 4.1.99.19)라고도 불림)를 코딩하는 전이 유전자를 포함하도록 유전적으로 변형된다.
상기 유전자 변형 박테리아 세포는 티아민 합성 경로에서 추가적인 단계를 촉매하는 폴리펩티드를 코딩하는 하나 이상의 전이 유전자를 더 포함할 수 있다 (도 3). 박테리아 세포에서 티아민 경로에서의 단계를 촉매하는 이들 폴리펩티드의 수준의 증가는 상기 경로의 중간체 및 최종 산물 모두의 합성을 증가시킨다. 예를 들어, 상기 박테리아 세포는 다음을 코딩하는 하나 이상의 전이 유전자를 더 포함할 수 있다: ThiE 티아민 포스페이트 신타아제 (EC 2.5.1.3); [ThiS] 아데닐일트랜스퍼라제 (EC 2.7.7.73) (예를 들어, thiF 유전자에 의해 코딩됨); ThiG 티아졸 신타아제 (E.C.2.8.1.10); ThiS 황-운반 단백질; ThiD 포스포하이드록시메틸피리미딘 키나아제 (EC 2.7.4.7) 및 티아민 모노-포스페이트 포스파타제 (E.C. 3.1.3.-); ThiO 글리신 옥시다아제 (EC 1.4.3.19); 및 ThiM 하이드록시에틸티아졸 키나아제 (2.7.1.50).
HMP-P 신타아제는 광범위한 속에 속하는 광범위한 박테리아 및 진균에서 발견되는 유전자에 의해 코딩된다. HMP-P 신타아제 활성을 갖는 폴리펩티드의 아미노산 서열은 다음 중 어느 하나로부터 선택된 서열과 적어도 70, 75, 80, 85, 90, 95, 96, 98, 100% 아미노산 서열 상동성을 갖는다: 서열번호 201 (기원: 에셔리키아 콜라이); 서열번호 203 (기원: 시네코커스_이롱가투스(Synechococcus_elongatus)); 서열번호 205 (기원: 코리네박테리움 글루타미쿰 (Corynebacterium glutamicum)); 서열번호 207 (기원 캔디다투스 바우마니아 시사델리니콜라 (Candidatus Baumannia cicadellinicola)). 티로신 리아제는 2-이미노아세테이트 신타아제 (EC 4.1.99.19)라고도 불린다. HMP-P 신타아제 활성을 갖는 폴리펩티드의 아미노산 서열은 서열번호 217의 서열과 적어도 70, 75, 80, 85, 90, 95, 96, 98, 100% 아미노산 서열 상동성을 갖는다.
유전자 변형 박테리아 세포에서 하나 이상의 추가적인 전이 유전자에 의해 코딩되고, 티아민 경로의 중간체 및 생성물 모두의 합성을 증가시키는 역할을 하는 폴리펩티드는 다음과 같다:
a) [ThiS] 아데닐일트랜스퍼라제 (EC 2.7.7.73) 활성을 갖는 폴리펩티드, 예를 들어 서열번호 211과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
b) 티아민 포스페이트 신타아제 (EC 2.5.1.3) 활성을 갖는 폴리펩티드, 예를 들어 서열번호 209와 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
c) 티아졸 신타아제 (E.C.2.8.1.10) 활성을 갖는 폴리펩티드, 예를 들어 서열번호 215와 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
d) 포스포하이드록시메틸피리미딘 키나아제 (EC 2.7.4.7) 활성을 갖는 폴리펩티드, 예를 들어 서열번호 225와 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
e) 글리신 옥시다아제 (EC 1.4.3.19) 활성을 갖는 폴리펩티드; 예를 들어 서열번호 219, 221, 및 223으로부터 선택된 서열과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
f) ThiS 황-운반 활성을 갖는 폴리펩티드 예를 들어 서열번호 213과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드;
g) 티아민 모노-포스페이트 포스파타제 (E.C. 3.1.3.-) 활성을 갖는 폴리펩티드; 예를 들어 서열번호 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199 중 어느 하나로부터 선택된 서열과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드; 및
h) ThiM 하이드록시에틸티아졸 키나아제 (2.7.1.50) 활성을 갖는 폴리펩티드, 예를 들어 서열번호 227과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는 폴리펩티드.
바람직하게는, 상기 유전자 변형 박테리아 세포는 다음의 효소를 코딩하는 전이 유전자를 포함한다: ThiC (thiC 유전자에 의해 코딩됨); ThiD (thiD 유전자에 의해 코딩됨), ThiE (thiE 유전자에 의해 코딩됨), ThiF (thiF 유전자에 의해 코딩됨), 황-운반 단백질 (thiS 유전자에 의해 코딩됨), ThiG (thiG 유전자에 의해 코딩됨), TMP 포스파타제 (TMP 포스파타제 유전자에 의해 코딩됨); 및 ThiH (thiH 유전자에 의해 코딩됨) 또는 ThiO (thiO 유전자에 의해 코딩됨). 구현예에 따르면, 상기 세포는 효소 ThiM (ThiM 유전자에 의해 코딩됨)을 코딩하는 전이 유전자를 더 포함할 수 있다.
본 발명의 유전자 변형 박테리아에서 티아민 합성 수준은 티아민-포스페이트 키나아제를 코딩하는 내인성 thiL 유전자의 돌연변이에 의해 더 증가될 수 있다. 돌연변이 thiL 유전자는 서열번호 228의 뉴클레오티드 서열을 가지고, 모체 야생형 유전자와 비교하여 G133D 치환을 갖는 폴리펩티드 [서열번호 229]를 코딩하는 뉴클레오티드 133-135에서의 돌연변이 (GGT에서 GAC)를 갖는다.
티아민 경로에서 추가적인 단계를 촉매하는 폴리펩티드를 코딩하는 하나 이상의 추가적인 전이 유전자와 함께, thiC에 의해 코딩되는, HMP-P 신타아제 (EC 4.1.99.17); 또는 thiH에 의해 코딩되는 티로신 리아제 (EC 4.1.99.19)를 코딩하는 전이 유전자는 박테리아 세포 염색체 내로 또는 자기-복제 플라스미드 상에 통합되는 유전자 변형 박테리아 세포의 게놈에 위치한다. thiC 또는 thiH 전이 유전자 및 티아민 경로에서의 효소를 코딩하는 하나 이상의 전이 유전자는 하나 이상의 오페론 내에 게놈에 위치할 수 있다.
thiC 또는 thiH 전이 유전자 및 하나 이상의 추가적인 전이 유전자의 발현을 유도하는 프로모터는 바람직하게는 이종의 항시성-프로모터 또는 유도성-프로모터일 수 있는 비-천연 프로모터이다. 상기 프로모터가 이종의 항시성 프로모터인 경우, 적합한 프로모터는 apFab 패밀리 [서열번호 230-232]를 포함하는 반면에, 적합한 유도성 프로모터는: pBad (아라비노스 유도성 [서열번호 233] 및 LacI [서열번호 234]를 포함한다. 적합한 종결자는 [서열번호 235-237]를 포함하는 apFAB 종결자 패밀리의 구성원을 포함한다. 선택된 프로모터 및 종결자는 개별적인 유전자 조절을 제공하기 위해 또는 오페론의 조절을 위해, 각각의 유전자에 작동 가능하게 연결되는 것일 수 있다.
VI 본 발명에 따른 유전자 변형 박테리아를 이용하여 티아민을 생산하고 검출하는 방법
실시예 3 및 도 16에 예시된 바와 같이, 티아민, 티아민 모노포스페이트 (TMP) 및 티아민 디포스페이트 (TPP)는 본 발명의 유전자 변형 박테리아 세포 (예를 들어, 유전자 변형 대장균 세포)를 이용하여 적합한 배양 배지 내로 상기 세포를 도입하고; 최종적으로 티아민, 및 추가적으로는 상기 세포에 의해 생산된 TPP 및 TMP를 회수함으로써 생산될 수 있다.
HMP-P 신타아제를 코딩하는 전이 유전자를 포함하는 본 발명의 유전자 변형 박테리아 세포는 공급된 탄소원이 글루코스, 말토스, 갈락토스, 프럭토스, 수크로스, 아라비노스, 자일로스, 라피노스, 만노스, 및 락토스 중에서 선택되는 경우, 티아민, TPP 및 TMP를 생산할 것이다.
본 발명의 유전자 변형 박테리아 세포에 의해 생산된 티아민을 정량화하는 방법은 실시예 3에 기술되고; 티아민 기준과 비교한 고압력 액체 크로마토그래피 (High Pressure Liquid Chromatography)의 사용을 포함할 수 있다.
VII 비오틴, 리포산 또는 티아민의 생산을 위한 유전자 변형 박테리아 세포를 설계하는 방법
본 발명의 박테리아 세포에서 비오틴, 리포산 또는 티아민의 합성과 관련된 효소 활성을 갖는 하나 이상의 폴리펩티드를 코딩하는 하나 이상의 전이 유전자를 클로닝하고 도입하기에 적합한 통합 (Integration) 및 자기-복제 벡터는 통상의 기술자에게 상업적으로 이용 가능하고 알려져 있다 (예를 들어, Sambrook et al., Molecular Cloning : A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989 참조). 박테리아 세포는 이종 DNA의 세포로의 도입에 의해 유전적으로 조작된다. 본 발명의 박테리아 세포에서 비오틴, 리포산 또는 티아민 합성과 관련된 효소 활성을 갖는 하나 이상의 폴리펩티드를 코딩하는 유전자의 이종 발현은 실시예 1, 2 및 3에서 각각 입증된다.
본 발명의 박테리아 세포에서 비오틴, 리포산 또는 티아민 합성과 관련된 효소 활성을 갖는 하나 이상의 폴리펩티드를 코딩하는, 핵산 분자는 자기-복제 벡터에 의해 숙주 세포 내로 도입되거나, 당업계의 표준인 방법 및 기술을 사용하여 숙주 세포 게놈 내로 선택적으로 통합될 수 있다. 예를 들어, 핵산 분자는 화학적 변형 (chemical transformation) 및 전기천공법을 포함한 형질전환 (transformation), 형질도입 (transduction), 유전자총법 (particle bombardment), 등과 같은 표준 프로토콜에 의해 도입될 수 있다. 청구된 발명의 효소를 코딩하는 핵산 분자를 발현하는 것은 게놈 내로 핵산 분자를 통합함으로써 달성될 수 있다.
본 발명의 박테리아 세포에서 천연 내인성 iscR 유전자의 유전적 변형은 표준 재조합 방법을 적합한 모체 박테리아 세포에 적용함으로써, 내인성 iscR 유전자의 결실(녹아웃) 및 섹션 I에 기재된 바와 같은 돌연변이 IscR 폴리펩티드를 코딩하는 전이 유전자를 이용한 삽입/치환에 의해 수행될 수 있다 (Datsenko KA, et al. ; 2000).
비오틴, 리포산 또는 티아민의 생산을 위한, 본 발명에 따른 유전자 변형 박테리아 세포는 박테리아(bacterium)일 수 있고, 적합한 박테리아의 비-완전한(exhaustive) 리스트는 다음과 같이 제공된다: 에셔리키아 (Escherichia), 브레비박테리움 (Brevibacterium), 버크홀데리아 (Burkholderia), 캄필로박터 (Campylobacter), 코리네박테리움 (Corynebacterium), 슈도모나스 (Pseudomonas), 셀라티아 (Serratia), 락토바실러스 (Lactobacillus), 락토코커스 (Lactocooccus), 아세토박터 (Acetobacter), 아시네토박터 (Acinetobacter), 슈도모나스 (Pseudomonas) 등으로 이루어진 군으로부터 선택된 박테리아의 속에 속하는 종.
본 발명의 바람직한 박테리아 종은 에셔리키아 콜라이, 슈도모나스 푸티다, 셀라티아 마르센세스 (Serratia marcescens), 및 코리네박테리움 글루타미쿰이다.
VIII 본 발명의 유전자 변형 박테리아 세포의 비오틴 생산 능력은 증가된 전자 전달에 의해 향상된다.
산화된 [4Fe-4S]2+ 클러스터를 포함하는 SAM-라디칼 철-황 클러스터 효소, 예를 들어, BioB, ThiC 및 LipA는 [4Fe-4S]+ 클러스터로 환원하기 위해 전자 이동이 필요하다. 환원된 [4Fe-4S]+ 클러스터만이 촉매에 필요한 SAM-라디칼을 생성할 수 있다. 전자 공여자 NADPH로부터 [4Fe-4S]2+로의 전자 이동은 플라보독신/페레독신 환원 효소 (Fpr) 및 플라보독신 (FldA) 환원 시스템에 의해 또는 피루브산-플라보독신/페레독신 산화환원 효소 시스템에 의해 매개될 수 있다.
추가적인 구현예에서, 비오틴, 리포산 또는 티아민을 생산할 수 있는 본 발명에 따른 유전자 변형 박테리아 세포는 다음의 군으로부터 선택된 하나 이상의 유전자를 더 포함한다: 플라보독신/페레독신-NADP 환원 효소 (EC:1.18.1.2 및 EC 1.19.1.1)를 코딩하는 유전자; 피루브산-플라보독신/페레독신 산화 환원 효소 (EC 1.2.7)를 코딩하는 유전자; 플라보독신을 코딩하는 유전자; 페레독신을 코딩하는 유전자; 플라보독신 및 페레독신-NADP 환원 효소를 코딩하는 유전자. 상기 하나 이상의 유전자에 작동 가능하게-연결된 프로모터는 상기 박테리아에서 상기 하나 이상의 유전자의 발현을 증가시킬 수 있다; 상기 하나 이상의 유전자는 천연 유전자 또는 전이 유전자일 수 있다. 바람직하게, 상기 작동 가능하게-연결된 프로모더는 본 발명의 유전자 변형 박테리아가 유래된 모체 박테리아 보다 더 높은 수준으로 상기 박테리아에서 상기 하나 이상의 유전자의 발현을 증가시킨다. 바람직하게, 본 발명의 유전자 변형 박테리아 세포는 플라보독신/페레독신-NADP 환원 효소 (EC:1.18.1.2 및 EC 1.19.1.1)를 코딩하는 유전자 및 플라보독신을 코딩하는 유전자; 또는 플라보독신 및 페레독신-NADP 환원 효소 모두에 대한 코딩 서열을 포함하는 단일 유전자를 포함한다. 추가적으로 상기 유전자 변형 박테리아 세포는 페레독신을 코딩하는 유전자를 더 포함할 수 있다.
본 발명의 유전자 변형 박테리아 세포에서 전자 이동 경로의 구성요소를 발현하는 유전자의 과발현은 이들의 SAM-라디칼 철-황 클러스터 효소의 세포 활성을 증가시킨다 (본 발명의 비오틴-생산 세포에 대한 실시예 4에 예시된 바와 같음).
바람직하게, 본 발명의 유전자 변형 박테리아 세포에서 천연 유전자 또는 전이 유전자에 의해 코딩되는 폴리펩티드는 플라보독신/페레독신 환원효소 (EC:1.18.1.2 및 EC 1.19.1.1) 활성을 가지고, 이는 다음 중 어느 하나로부터 선택된 서열과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는다: 서열번호 241 (기원: 대장균으로부터 fpr 유전자); 서열번호 243 (기원: 바실러스 서브틸리스 168로부터 yumC 유전자); 서열번호 245 (기원: 슈도모나스 푸티다 KT2440로부터 fpr-I 유전자); 서열번호 247 (기원: 스트렙토마이세스 베네주엘라 ATCC 10712 -로부터 SVEN_0113 유전자); 서열번호 249 (기원: 코리네박테리움 글루타미쿰 ATTCC 13032로부터 Cgl2384 유전자), 및 서열번호 251 (기원: 스핑고박테리움 종 (Sphingobacterium sp.) JB170으로부터 SJN15614.1 유전자).
바람직하게, 본 발명의 유전자 변형 박테리아 세포에서 천연 유전자 또는 전이 유전자에 의해 코딩되는 폴리펩티드는 피루브산-플라보독신/페레독신 산화 환원 효소 (EC 1.2.7) 활성을 가지고, 이는 다음 중 어느 하나로부터 선택된 서열과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는다: 서열번호 253 (기원: 대장균 K12 MG1655로부터 YdbK 유전자); 서열번호 255 (기원: 지오박터 설퍼레두신스 (Geobacter sulfurreducens) AM-1으로부터 por 유전자); 서열번호 257 (기원: 스트렙토마이세스 프라텐시스 (Streptomyces pratensis) ATCC 33331로부터 Sfla_2592 유전자; 서열번호 259 (기원: 프로피오니박테리움 프레우덴레이치트 (Propionibacterium freudenreichit) DSM 20271로부터 RM25_0186 유전자); 서열번호 261 (기원: 시네코시스티스 종 (Synechocystis sp.) PCC 6803으로부터 nifJ 유전자)
바람직하게, 본 발명의 유전자 변형 박테리아 세포에서 천연 유전자 또는 전이 유전자에 의해 코딩되는 폴리펩티드는 플라보독신이고, 이는 다음 중 어느 하나로부터 선택된 서열과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는다: 서열번호 263 (기원: 대장균 K12 MG1655로부터 fldA 유전자); 서열번호 265 (기원: 대장균 K12 MG1655로부터 fldB 유전자); 서열번호 267 (기원: 바실러스 서브틸리스 168로부터 ykuN 유전자); 서열번호 269 (기원: 시네코시스티스 종 PCC 6803로부터 isiB 유전자; 서열번호 271 (기원: 스트렙토마이세스 베네주엘라 ATCC 10712로부터 wrbA 유전자); 서열번호 273 (기원: 메타노코커스 아이올리쿠스 난카이-3 (Methanococcus aeolicus Nankai-3)으로부터 PRK06242 유전자).
바람직하게, 본 발명의 유전자 변형 박테리아 세포에서 천연 유전자 또는 전이 유전자에 의해 코딩되는 폴리펩티드는 페레독신이고, 이는 다음 중 어느 하나로부터 선택된 서열과 80, 85, 90, 95 또는 100% 서열 상동성을 갖는 아미노산 서열을 갖는다: 서열번호 275 (기원: 대장균으로부터 fdx 유전자); 서열번호 277 (기원: 바실러스 서브틸리스 168로부터 fer 유전자); 서열번호 279 (기원: 코리네박테리움 글루타미쿰 ATTCC 13032로부터 fdxB 유전자); 서열번호 281 (기원: 시네코시스티스 종 PCC 6803로부터 fdx 유전자); 서열번호 283 (기원: 스트렙토마이세스 베네주엘라 ATCC 10712로부터 SVEN_7039 유전자); 서열번호 285 (기원: 메타노코커스 아이올리쿠스 난카이-3 으로부터 fdx 유전자).
유전자 발현을 증가시킬 수 있는 프로모터는 상기 박테리아에서 전자 이동 경로의 폴리펩티드를 코딩하는 천연 유전자 또는 전이 유전자에 작동 가능하게-연결된 경우 바람직하게 비-천연 프로모터이다. 상기 프로모터는 항시성 apFab309 프로모터 패밀리 [서열번호 230-232]의 구성원일 수 있다. 바람직하게 상기 비-천연 프로모터는 상기 천연 유전자 또는 전이 유전자에 작동 가능하게-연결된 경우 상기 유전자 변형 박테리아에서 이 박테리아가 유래된 모체 박테리아 보다 높은 수준까지 상기 코딩된 폴리펩티드의 발현을 증가시킨다. 상기 천연 유전자 또는 전이 유전자에 작동 가능하게-연결될 수 있는 적합한 종결자는 apFAB378 종결자 패밀리 [서열번호 235-237]를 포함한다.
실시예
실시예 1: 비오틴 생산을 향상시킬 수 있는 유전자 변형
대장균
균주의 동정 및 특성화
1 방법
1. 1 : 실시예에서 사용된 하기 에셰리키아 균주를 하기 열거한다.
명칭 | 설명 |
BS1013 | 하기 유전자형을 갖는 E. coli K-12 BW25113 모체 균주:rrnB3 △lacZ4787 hsdR514 △(araBAD)567 △(rhaBAD)568 rph-1 |
BS1011 | E coli K-12 BW25113로부터 유래된 △bioB 1(JW0758-1) |
BS1353 | iscR 유전자에 H107Y 돌연변이를 포함하는 BS1011 파생체(derivative) |
BS1113 | IPTG-유도성 BioB 발현을 제공하는 pBS412 플라스미드를 포함하는 BS1011 파생체 |
BS1375 | iscR 유전자에 C92Y 돌연변이를 포함하는 BS1011 파생체 |
BS1377 | iscR 유전자에 L15F 돌연변이를 포함하는 BS1011 파생체 |
1△bioB 유전자의 결실 전 뉴클레오티드 서열은 서열번호 21이었다.
1.2 : 실시예에서 사용된 하기 플라스미드를 하기 나열한다.
명칭 | 설명 |
pBS412 | T5 lacO 억제 프로모터(repressed promoter)로부터의 BioB [서열번호 22] 과발현 플라스미드 (kanR, SC101) |
pBS430 | T5 lacO 억제 프로모터 [서열번호 25]로부터의 bioB 1(kanR, SC101)에서 초기 프레임 시프트 돌연변이를 갖는 pBS412 |
pBS451 | 항시적으로 발현되는 GFP [서열번호 287] (zeoR, p15A) |
pBS281 | 미디엄 카피 넘버 플라스미드(medium copy number plasmid) (p15A ori)에서 복제된 IPTG 유도성 T5 프로모터로부터의 E. coli isc 오페론 (iscSUA-hscBA-fdx) |
pBS282 | 미디엄 카피 넘버 플라스미드 (p15 ori)에서 복제된 IPTG 유도성 T5 프로모터로부터의 E. coli suf 오페론 (sufABCDSE) |
pBS231 | IPTG 유도성 T5 프로모터로부터 sfGFP 단백질을 코딩하는 유전자를 발현하는 미디엄 카피 넘버 플라스미드 (p15 ori) |
pBS936 | bio 오퍼레이터 위치에 "타입 9(type 9)" 돌연변이를 갖는 E coli 유래의 천연 비오틴-오페론 (Ifuku et al., 1993) |
1: bioB 프레임시프트(frameshift) 유전자의 뉴클레오티드 서열은 서열번호 23을 갖는다.
1.3 배지 및 첨가제:
각각의 실시예에서 사용된 증식 배지(mMPOS)는 다음의 조성물을 갖는다: 1.32 mM K2HP04; 2 g/l D-글루코스; 0.0476 mg/l 판토텐산 칼슘; 0.0138 mg/l p-아미노벤조산; 0.0138 mg/l p- 하이드록시벤조산; 0.0154 mg/l 2,3-디하이드록시벤조산, 및 1x 변형된 MOPS 버퍼.
10 x 변형된 MOPS는 0.4 M MOPS (3-(N-모르폴리노)프로판 설폰산 (3-(N-morpholino)propane sulfonic acid)); 0.04 M 트리신(Tricine); 0.1 mM FeS047H20; 95 mM NH4CI; 2.76 mM K2S04; 5 μΜ CaCI22H20; 5.25 mM MgCI2; 0.5 M NaCI; 및 미량영양소 스톡 용액(micronutrient stock solution)의 5000x 희석물을 포함한다.
미량영양소 스톡 용액:
하기의 항생제 스톡(stocks)이 적용되었다: 1000x 희석물을 수득하기 위해 표시된 바와 같이 증식 배지에 첨가된; 암피실린 (amp, 100 mg/mL), 카나마이신 (kan, 50 mg/mL), 제오신(zeocin, zeo, 40 mg/mL).
1.4 대장균 균주 라이브러리의 구축:
진화된 게놈 다양성을 갖는 대장균 라이브러리는 세포를 kan으로 보충된 mMOPS 배지(mMOPS-kan)에서 하룻밤 동안 정치 배양하고, mMOPS-kan에서 생성된 배양물의 100x 희석물을 제조하고, 하룻밤 동안의 배양 및 희석의 연속적인 단계를 5회 반복함으로써 대장균 균주 BS1011의 세포로부터 유래된다. 이 과정은 불완전한 오류-정정 폴리머라제에 의해 생성된 백그라운드 돌연변이의 축적을 허용함으로써 유전적 다양성을 만든다. 배양 및 희석의 각 라운드 후에, 증가된 BioB 발현을 견디도록 적응된 세포의 진화를 검출하기 위해, 세포 배양물의 샘플은 IPTG를 갖는 mMOPS 플레이트 상에 플레이팅하였다(하기 참조). 이어서, 각 라이브러리의 세포는 BioB 과-발현 플라스미드, pBS412로 형질전환되었다.
1.5 돌연변이 균주의 선별
IPTG를 0, 0.0001, 0.001, 0.01, 0.1 및 1 mM의 농도로 포함하는 mMOPS (Ø=9츠)를 포함하는 일련의 1.5% 아가 플레이트 상에서 pBS412를 포함하는 BS1011의 mMOPS-kan에서 하룻밤 동안(o/n)의 배양으로부터 유래된 104, 105, 106 및 107 각각의 세포를 플레이팅함으로써, 선별 어쎄이(selection assay)가 개발되었다. 이어서, 상기 플레이트는 37℃에서 최대 36시간 동안 인큐베이션되었고, 세포 성장은 간격을 두고 평가되었다. 이러한 조건들 하에, 0.1 mM IPTG으로 pBS412로부터의 BioB 발현의 유도는 최대 10Λ5 세포의 성장을 억제하는 것으로 발견된 반면에, 1 mM IPTG으로의 유도는 싱글 페트리 디쉬 상에 플레이팅된 경우에 적어도 10Λ7 세포의 성장을 억제하였다. 10Λ5 세포의 세포 집단에 대한 1 mM IPTG로의 유도를 포함한 선택압(selection pressure)은 BioB 발현에 대한 더 높은 강건함을 갖는 균주를 식별하기 위해 최적인 건으로 발견되었고; 이에 따라 다음과 같이 실시되었다:
1) 섹션 1.4에 기재된 바와 같이, 각 라이브러리로부터 약 105 세포는 각 mMOPS-kan-1 mM IPTG 아가 플레이트 상에 플레이팅되었고, 최대 24시간 동안 37℃에서 인큐베이션되었다.
2) 단일 콜로니는 전-배양물(pre-cultures)을 생산하기 위해 mMOPS-kan 액체 배지에서 성장되었고, 이어서 전-배양물은 0.00, 0.01 또는 0.1 mM IPTG로 보충된 mMOPS-kan에서 수행된 비오틴 바이오어쎄이를 이용함으로써 이들의 비오틴 생산에 대해 평가되었다 (하기 섹션 1.6에 기재된 바와 같음). 각각의 전-배양물의 세포는 20% 글리세롤에서 글리세롤 스톡으로 보존되었다.
3) 1.5 mg 이상의 비오틴/l (세포 외 비오틴으로 검출됨)을 생산하는 콜로니는 mMOPS-kan 아가 플레이트 상에 재-배열되었고 (re-streaked), 최대 24시간 동안 37℃에서 인큐베이션되었고, 이어서 생물학적 복제물에서 비오틴 생산에 대하여 재-바이오어쎄이되었다 (re-bioassayed) (하기 섹션 1.6에 상세히 기재된 바와 같음).
4) 선별된 비오틴 과-발현 균주의 세포는 다음과 같이 평가되었다:
a) 모체 균주 BS1011의 게놈과 비교하여 선별된 균주의 세포의 게놈에서의 유전적 돌연변이를 식별하기 위해 전체 게놈 시퀀싱을 선별된 균주의 세포로부터 분리된 DNA 상에서 다음과 같이 수행하였다: 선별된 균주는 5-10 mL mMOPS-kan에서 성장되었고, 상기 세포는 이후 수확되었다; 게놈 DNA는 Invitrogen Purelink 게놈 DNA 추출 키트: (https://www.thermofisher.com/order/cataloq/product/K182001)를 이용하여, 수확된 세포로부터 분리되었다; 상기 추출된 DNA는 전체 게놈 시퀀싱에 적용되었다.
b) pBS412 플라시므디의 선택된 균주의 세포를 양생(curing)하는 단계; 이어서 상기 양생된 균주의 세포를 pBS412 플라스미드로 재-형질전환시키는 단계; 및 최종적으로 상기 형질전환된 균주의 세포의 배양물의 비오틴 생산에 대한 바이오어쎄이를 다시하는 단계. pBS412 플라스미드의 세포를 양생하는 단계는 항생제를 포함하지 않고 1 mM IPTG를 갖는 풍부한(rich) 루리아 브로스(LB: Luria Broth) 배지에서 세포를 37℃에서 하룻밤 동안 증식시키고; 생성된 배양물의 세포를 LB 아가 플레이트 상에 배열(streaking out)시키고 37℃에서 하룻밤 동안 인큐베이션 함으로써 수행되었다. 아가 플레이트로부터 단일 콜로니는 50 μl LB 배지에서 희석되었고, 5 μl는 37℃에서 하룻밤 동안 인큐베이션된 LB 및 LB-amp 아가 플레이트 상에 점을 표시하기 위해 사용되었다. LB-amp 플레이트 상이 아닌, LB 플레이트 상에서 성장된 이들 단일 콜로니는 재-형질전환을 위해 양성된 균주로 사용되는 단일 콜로니를 수득하기 위해 LB 플레이트 상에 재-배열되었다.
생물학적 복제물에서 측정된 형질전환된 균주의 세포에 의한 비오틴 생산 (하기 섹션 1.6에 자세히 기재됨. 간단히, 비오틴 생산은 0.00, 0.01 또는 0.1 mM IPTG로 보충된 mMOPS-kan에서 생물학적 복제물에 대해 재-평가되었다. 동시에, 각각의 형질전환된 균주의 세포의 성장률은 멀티스칸 FC에서 24시간의 기간 동안의 혐기성 성장을 위해 “빠른 진탕(fast shaking)”으로 37℃에서 투명한 통기성 밀봉으로 밀봉된 미량 정량판(microtiter plate)에서 200 μL mMOPS-kan 배지에서 측정되었다. 세포 성장은 30분 마다 OD620을 측정함으로써 모니터링되었다.
1.6 비오틴 생산을 정량화하기 위한 바이오어쎄이
전-배양물은 96 딥(deep)-웰 플레이트에서 400 μL mMOPS-kan에서 선별된 단일 세포 콜로니로부터 각각 제조되었고, 16-18시간 동안 275 rpm에서 진탕하면서 37℃에서 인큐베이션되었다. 생산 배양물(production cultures)은 ~0.03의 초기 OD600을 제공하기에 충분한 전-배양물의 4 μL을 갖는, 96 딥-웰 플레이트에서, 0.1 g/L 데스티오비오틴으로 보충된 400 μL mMOPS-kan을 접종하고, 선택적으로 최종 농도가 최대 1 mM까지 IPTG를 포함함으로써 생산되었다. 이어서, 배양물은 24시간 동안 275 rpm 진탕으로 37℃에서 성장되었다. 배양물의 OD600을 측정한 후 8분 동안 4000 G에서의 원심분리에 의해, 96 딥-웰 플레이트의 세포는 펠렛화(pelleted)되었다. 각 배양물 상층액으로부터의 상층액은 초순수(ultrapure water, Mili-Q)에서 0.05 nM 내지 0.50 nM 비오틴의 농도 범위까지 희석되었다. 동시에, 0.1 nM (0. 024 μg 비오틴/L) 내지 1 nM (0.24 μg 비오틴/L)의 농도 범위에서 >5 비오틴 표준물(biotin standards)은 Milli-Q 물에서 제조되었다. 각 희석된 상층액의 15 μL 및 각각의 비오틴 표준은 미량정량판의 웰에 첨가되었다; 상기 각 웰은 플라스미드 pBS451을 포함하는 BS1011의 비오틴-결핍된 하룻밤 동안의 배양물의 135 μL를 포함하고, 상기 하룻밤 동안의 배양물은 제오신이 보충된 mMOPS에서 0.01의 초기 OD620까지 희석되었다. 상기 플레이트는 통기성 밀봉으로 밀봉되었고, OD600이 측정되기 전에 20시간 동안 275 rpm 진탕으로 37 ℃에서 인큐베이션되었다. 비오틴 표준물의 범위를 사용하여, 이 바이오어쎄이로 수득된 비오틴 바이오어쎄이 검정 곡선은 도 5에 나타낸다.
1.7 게놈 돌연변이의 동정
모든 차세대 염기서열 분석(NGS: Next-Generation Sequencing) 데이터의 경우, CLC 게놈 워크벤치 버전 9.5.3 (CLC genomic workbench version 9.5.3, Qiagen에 의해 제공됨)이 모체 균주 게놈과 비교하여 선별된 균주의 세포의 게놈에서의 돌연변이(변이체 검출에 의한 단일 또는 일부 치환, 결실 또는 삽입 및 인델(InDels) 및 구조적 변이체에 의한 더 큰 삽입/결실)를 동정하기 위해 사용되었다. 80%의 컷-오프는 시퀀싱 과정에 의해 도입된 잘못된 뉴클레오티드로부터의 게놈 돌연변이와 구별하기 위해, 돌연변이가 제공된 박테리아 균주의 세포로부터 분리된 DNA 분자 (게놈) 집단의 85% 이상에서 존재해야 한다는 것을 의미하는 "유의미한 돌연변이”로 정의하는데 사용되었다.
NCBI의 게놈 수탁 번호 CP009273은 참조 서열로서 사용되었고, 시퀀싱에 의해 서열이 입증된Keio △bioB 흉터 돌연변이 (scar mutation)를 고려하였다.
1.8 iscR 돌연변이의 프로테오믹스(proteomics) 랜드스케이프(landscape)의 특성화
1 mM IPTG 유도에서의 BS1353 + pBS412뿐만 아니라 0.025 mM IPTG 유도 수준에서의 BS1013 + pBS430, BS1011 + pBS412 및 BS1353 + pBS412의 단백질 함량은 LC-MS 및 효율적인 단백질 추출을 조합한 최근에 개발된 접근 방법에 의해 측정되었다 (Schmidt et al, 2015). 2.0% FDR의 펩티드 역치(threshold)에 따른 분석을 위해 식별되는 펩티드의 최소 개수로서, 3개의 펩타이드가 선택되었다. 단백질 발현에서 유의적인 변화는 스캐폴드 뷰어 4.7.5 (Scaffold Viewer 4.7.5)를 사용한 다중 테스팅에 대한 Benjamini-Hochberg 보정을 통한 분산 분석 (ANOVA, Analysis of Variance)에 기초하여 0.5% 신뢰 구간으로 보고된다.
0.5의 OD600에 도달할 때까지, 약 10 세대 동안의 IPTG 유도로 mMOPS에서 성장되었다. 108 세포는 최대 속도에서 4℃에서의 원심분리에 의해 수확되었고; 아이스-콜드 PBS 버퍼에서 1회 세척되었고; 최대 속도에서 4℃에서의 원심분리에 의해 재-펠렛화되고 PBS 버퍼를 제거한 후 액체 질소에서 순간-동결 (snap-frozen)되었다.
2. 결과
2.1 BioB의 과발현은 유독하다
저-카피 플라스미드로부터의 고유한 비오틴 신타아제 유전자 (bioB)를 보유하지만, IPTG-유동성 프레임시프트된 대장균 bioB 유전자(조기 종결 코돈으로 인해 비-기능성 비오틴 신타아제를 코딩함)를 발현하는 대장균은 IPTG가 있거나 없는 mMOPS-kan 배지에서 호기성으로 성장할 수 있다 이는 대장균 BW25113, BioB 프레임시프트 돌연변이의 지수 성장곡선을 나타낸 도 4 (왼쪽 패널)에 예시된다. 반대로, 대장균 녹-아웃 균주 △bioB에서 기능성 비오틴 신타아제 유전자 (bioB)의 과-발현은 성장에 유독하여, 유도기 (lag phase)에서 매우 유의적인 확장을 야기한다. 이는 저-카피 플라스미드 (Sc101 복제 기점) 상에서 IPTG-유도성 T5 프로모터로부터 대장균 bioB 유전자를 발현하는 대장균 녹-아웃 균주 △bioB의 성장을 나타낸 도 4 (오른쪽 패널)에 예시된다. 도 4에 나타낸 바와 같이, IPTG 수준의 증가에 반응한 bioB 발현의 증가의 유도는 (회색의 어둠) 유도기에 유의적으로 영향을 미치는 반면, 성장률은 약간 영향을 받는다 (검정색 박스).
2.2 향상된 비오틴 생산 역가 (production titers)를 갖는 iscR 돌연변이 균주의 분리
진화된 게놈 다양성을 갖는 대장균 라이브러리 (섹션 1.4 및 1.5 참조)는 bioB 유전자 발현에 대한 향상된 내성 및 증가된 비오틴 생산을 갖는 균주에 대해 스크리닝되었다. 선별된 균주의 전체 게놈 시퀀싱으로 3개의 특이한 돌연변이를 동정하였고, 이들 각각은 L15F, C92Y 및 H107Y 중에서 하나의 아미노산 치환을 갖는 iscR 폴리펩티드를 코딩하는 철-황 클러스터 조절자 (iscR) 유전자를 포함하고, 상기 코딩된 조절자의 아미노산 서열은 각각 서열번호 16, 18 및 20이다. 섹션 1.6 (및 도 5)에 기재된 바와 같이, 비오틴 생산 수준은 바이오어쎄이를 이용하여 측정되었다. 대장균 BW25113 △bioB 표준 균주 (reference strain)뿐만 아니라, 각각의 iscR 돌연변이 균주에 대한 비오틴 생산 역가는 2개의 상이한 IPTG 농도 수준의 부재 또는 존재에서 (증가된 IPTG는 더 진한 회색임) 0.1 g/l DTB로 보충된 mMOPS에서 성장된 4개의 생물학적 복제물 (검정색 점)에 대하여 도 6에 나타낸다. 표준 균주에서 비오틴 생산은 표준 균주의 성장에 유독한 IPTG 수준에 상응하는 0.01 mM 이상의 IPTG 수준에서 억제되었고 (도 4 참조), 반면에 iscR 돌연변이 ?누는 0.01 - 0.1 mM IPTG에서 성장하였고 비오틴을 생산하였다. 3개의 iscR 돌연변이 균주 모두 0.01 mM의 IPTG 농도에서 표준 균주 (점선)에 비해 약 1.5배 더 많은 비오틴을 생산한다. iscR 돌연변이 균주는 0.1 mM의 IPTG 농도에서 표준 균주의 가장 높은 생산 역가 (~ 1.5 mg biotin/l)에 비해 최대 2배 더 많이 (~3.2 mg 비오틴/l) 생산한다.
2.3 IscR H107Y 돌연변이 균주에서의 비오틴 생산 및 성장
iscR (H107Y) 돌연변이 균주의 성장 특성 (growth profile) 및 비오틴 생산 역가는 2개의 상이한 IPTG 유도 수준 (도 7A에서 0.01 mM) 및 0.05 mM (도 7B)에서 250 mL 진탕-플라스크 실험에서 0.1 g/l DTB로 보충된 50 mL mMOPS에서 특성화되었다. 낮은 IPTG 수준에서 (도 7A), IscR 돌연변이 균주 (어두운 회색) 및 표준 균주 (밝은 회색)는 ~ 1.1 mg 비오틴/l의 최종 역가로 성장 및 비오틴 생산 역가에 대하여 유사하였다. 그러나, 높은 IPTG 유도 수준에서 (도 7B에서 0.5 mM) 표준 균주 (밝은 회색)의 성장은 심각하게 억제되었던 반면에, IscR 돌연변이 균주는 낮은 IPTG 유도 수준에서와 동일한 성장 특성을 보유하였다. 또한, IscR 돌연변이 균주의 비오틴 생산 역가는 25시간의 성장 후에 최대 ~2.2 mg 비오틴/l까지, 약 2배로 증가되었다.
2.4 IscR 돌연변이의 작용의 메커니즘
도 6에 나타낸 바와 같이, 향상된 비오틴 내성 표현형은 동정된 3개의 IscR 돌연변이 균주 모두에 대하여 명확하게 입증되었다. C92 돌연변이 (C92Y)의 비오틴 내성을 향상시키는 능력은 IscR의 [Fe-S] 클러스터 결합 특성에서 C92의 역할에 의한 것으로 제안된다. C92Y 돌연변이로 인한 [Fe-S] 클러스터 결합 특성의 손실은 IscR의 Isc-오페론 억제 반응을 불활성화시키는 것으로 제안된다. 동시에, C92Y 돌연변이 IscR에서 IscR의 프로모터 기능은 온전하게 유지되어, 다중 세포 과정에 필수적인 다른 경로를 활성화시키는데 그 기능을 유지하는 것으로 제안된다. IscR의 [Fe-S] 클러스터 결합 특성을 제공하는 유사한 필수적 역할은 H107에 기인하고; 여기에서 H107Y 돌연변이는 대장균에서 비오틴 내성을 유사하게 향상시킬 수 있다. IscR에서 L15F 또한 철-황 클러스터 결합을 방해하여, 철-황 클러스터 고갈을 부분적으로 극복하는 것으로 제안된다. 도 9는 DNA에 결합될 때 L15 및 H107의 위치를 나타내고 (hya, PDB 엔트리 4HF1), L15는 IscR 각각의 서브유닛의 내부에 위치함을 알 수 있다. 페닐알라닌은 류신보다 상당히 큰 아미노산이고, 단백질의 3차원 폴딩을 방해할 수 있다.
2.5 대장균 균주 단독에서의 isc-오페론 또는 suf 오페론의 과발현은 비오틴 생산을 향상시키기에 충분하지 않다.
isc-오페론 (iscSUA-hscBA-fdx, iscR 유전자가 제외된 천연 대장균 오페론 구조와 상응함) 또는 suf-오페론 (sufABCDSE, 천연 대장균 오페론 구조와 상응함)의 대장균에서의 비오틴 생산에 대한 직접적인 효과를 측정하기 위해, 각각의 오페론은 강한 RBS 및 IPTG 유도성 T5 프로모터의 조절 하에 배치된 미디엄 카피 넘버 플라스미드 (p15A ori) 내로 클로닝하였다. isc- 또는 suf-오페론 대신에 슈퍼 폴더 녹색 형광 단백질 (sfGFP: super folder Green Fluorescent Protein)을 코딩하는 유전자를 포함하는 플라스미드는 대조군으로 적용되었다. 각각의 플라스미드는 IPTG-유도성 bioB 발현 플라스미드를 포함하는 대장균 균주의 세포 내로 형질전환되었다. IPTG-유도성 bioB 발현 플라스미드 뿐만 아니라, 1) IPTG-유도성 isc-오페론, 2) IPTG-유도성 suf-오페론 또는 3) IPTG-유도성 GFP (대조군): 중 하나를 포함하는 생물학적 3중 (triplicate) 콜로니는 100 μg/mL 암피실린 및 50 μg/mL 스펙티노마이신을 갖는 400 L mMOPS에서 낮은 (0.01 mM IPTG) 및 높은 (0.1 mM IPTG) 유도 하에서 배양한 후, 비오틴 생산에 대하여 분석되었다 (섹션 1.5에 기재된 바와 같음).
그래프 (도 10)으로부터, 모든 균주에서 비오틴 생산은 IPTG-유도성임에도 불구하고; 검출 가능한 비오틴 생산 수준에 도달하기에 필요한 IPTG 농도는 도 6에 나타낸 표준 및 돌연변이 iscR 균주와 비교하였을 때, 0.01 mM IPTG 내지 0.1 mM IPTG로 증가되었다. 또한, 비오틴 생산 역가는 isc- 또는 suf-오페론의 과발현에 의해 도 10의 sfGFP 균주와 비교하였을 때, 현저히 감소되었다. 또한, isc-오페론의 과발현은 suf-오페론의 과발현 보다 훨씬 더 비오틴 생산을 억제하였다. iscR에 싱글 포인트 돌연변이 (single point mutation)를 갖는 돌연변이 균주에서의 비오틴 생산의 관찰된 증가와 함께, 이들 균주에서 isc-오페론의 결과적인 억제-해제(de-repression)는 이들 균주에서의 개선된 비오틴 생산의 유일한/주요한 원인일 가능성은 낮다.
2.6 BioB 단백질 함량은 비오틴 생관과 관련이 있다
야생형 및 돌연변이 백그라운드 균주에서 BioB 과발현의 분자적 효과를 조사하기 위해, 야생형 백그라운드 균주: BS1013 보유 (holding) pBS430; 비오틴 생산 플라스미드를 갖는 야생형 iscR 균주: BS1011 보유 pBS412; 및 bioB 생산 플라스미드를 갖는 돌연변이 iscR 균주: BS1353 보유 pBS412에 대하여 프로테오믹스 측정이 수행되었다. 모든 균주는 0.1 g/L DTB 및 0.025 mM IPTG로 mMOPS에서 성장되었다. 후자 균주는 1 mM IPTG 유도에서 추가로 성장되었다. 세포는 프로테오믹스 분석을 위해 수확되었고, 나머지 세포 배양물은 다른 곳에 기재된 바이오어쎄이를 이용하여 비오틴 생산이 측정되기 전에, 총 24시간 동안 인큐베이션이 유지되었다.
그래프 (도 11)로부터, 측정된 비오틴 단백질 수준은 비오틴 생산과 밀접한 관련이 있다 (R2 값0.96). 선형 상관관계는 향상된 BioB 발현을 촉진시키는 것이 IscR 돌연변이 세포 공장에서 비오틴 생산을 향상시키는 핵심임을 나타낸다. 프로테오믹스 데이터의 ANOVA 분석은 추가적인 29 단백질에서 발현의 현저한 증가 (95% 신뢰 구간, p-값 0.00166)를 나타냈다. 이들 중에서 isc-오페론 (IscA 및 IscS) 및 suf-오페론 (SufB 및 SufS)의 구성원이 있다.
2.7 iscR 녹아웃 돌연변이에서 비오틴 생산은 향상되지 않는다
iscR 유전자의 번역 녹아웃 (translational knockout)은 iscR에서의 22번 위치 상에서 글루탐산을 코딩하는 코돈 (E, GAA)을 종결 코돈 (*, TGA)으로 변환함으로써, MAGE에 의해, BW25113 △bioB 균주 내로 도입되었다. 상기 코돈의 성공적인 변환은 상기 영역의 PCR 증폭 및 이후의 생거 시퀀싱 (Sanger sequencing)에 의해 입증되었따. 야생형 iscR, iscR 녹아웃 (E22*), 및 돌연변이 iscR (C92Y)를 코딩하는 유전자를 갖는 균주는 IPTG-유도성 bioB 플라스미드 pBS412로 형질전환되었고, 상기 기재된 바와 같은 3개의 상이한 IPTG 유도 수준 (0, 0.01, 및 0.1 mM)에서 0.1 g/l DTB 및 50 ㎍/l 카나마이신으로 보충된 mMOPS에서 성장된 생물학적 복제물 (n=3)에서 비오틴 생산에 대해 시험하였다.
IPTG 유도에 의해 bioB 발현을 유도한 경우 iscR 녹아웃 (iscR KO) 및 야생형 iscR (iscR WT) 사이에 비오틴 생산에 있어 유의적인 차이는 관찰되지 않았다. 이는 iscR을 녹아웃하는 것이 비오틴 생산을 개선시키지 않는다는 증거를 제공한다. iscR WT 및 iscR KO 균주 모두와 비교하여, IscR C92Y 치환을 코딩하는 돌연변이 iscR에서 비오틴 생산에 있어 유의적인 개선이 다시 관찰되었다.
2.8 본 발명의 iscR 돌연변이 균주에서 드-노보 (De-novo) 비오틴 생산이 향상된다
bioA 유전자 및 전체 비오틴-오페론 (△bioB-△bioD)이 결실되고, iscR WT, iscR H107Y 돌연변이 또는 iscR C92Y 유전자를 포함하는, BW25113 대장균 균주는 천연 대장균 bioA 및 bioO 오퍼레이터 상에 싱글 포인트 돌연변이 (타입 9 돌연변이, Ifuku et al., 1993)를 갖는 비오틴-오페론을 항시적으로 과발현하는 테트라사이클린 내성 플라스미드로 형질전환되었다. 상기 기재된 바와 같이 (도 13), 0.1 g/l DTB를 첨가하거나 첨가하지 않고 10 ㎍/ml 테트라사이클린을 갖는 mMOPS (2 g 글루코스/l)에서의 생물학적 복제물 (n=4)에서 3개의 상이한 균주에 대하여 비오틴 생산이 평가되었다.
기질, DTB가 증식 배지에 첨가된 경우, 모든 3개의 균주에서 비오틴 역가의 현저한 증가가 관찰되었고, 이는 DTB를 비오틴으로 전환하는, bioB 효소 반응 자체가 더이상 이들 균주에서 비오틴 생산에 장애물 (bottleneck)이 아님을 나타낸다 (도 13). 또한, iscR WT 균주와 비교하여 iscR 돌연변이 균주 모두에서 글루코스로부터 비오틴의 드-노보 생산의 현저한 증가가 관찰되었다. 이들 결과를 고려하면, 본 발명의 모든 돌연변이 iscR 균주는 직접적인 전구체, DTB, 및 글루코스 모두로부터 향상된 비오틴 생산을 지지하는 것으로 추론될 수 있다.
실시예 2: 리포산 생산을 향상시킬 수 있는 유전자 변형
대장균
균주의 설계 및 특성화
실시예에서 사용된 하기의 에셰리키아 콜라이 균주는 하기에 나열된다.
명칭 | 설명 |
BS1912 | E.coli K-12 BW25113로부터 유래된 △lipdA |
BS2114 | iscR에서 H107Y 돌연변이를 포함하는 BS1912 파생체 |
실시예에서 사용된 하기의 플라스미드는 하기에 나열된다.
명칭 | 설명 |
pBS993 | AceF [서열번호 119] 발현의 추가적인 항시성 발현을 갖는 T5 lacO 억제 프로모터 [서열번호 234]로부터의 Lipid A [서열번호 103] 과발현 플라스미드 (kanR, SC101) |
pBS1037 | AceF [서열번호 119] 발현의 낮은 RBS 강도 및 SC101 대신에 p15A 복제 기점을 갖는 pBS993 파생체 |
pBS451 | 항시적으로 발현되는 GFP [서열번호 287] (zero, p15A) |
플라스미드 pBS1037 상에 클로닝된, LipA [서열번호 103]를 코딩하는 IIPTG-유도성 전이 유전자는, 실시예 1에 기재된 바와 같이, 천연 iscR 유전자를 포함하는 대장균 숙주 균주 또는 천연 iscR 유전자가 C92Y 또는 H107Y 치환을 갖는 IscR 단백질을 코딩하는 돌연변이 iscR 유전자로 치환된 대장균 숙주 균주 내로 도입되었다; 상개 2개의 균주는 bioB 또는 lipA의 녹-아웃을 더 포함한다. 상기 균주들은 0.1 mM 비오틴 (△bioB 균주를 위함), lipA 발현의 유도를 위한 IPTG 및 기질로서 0.6 g/l 옥탄산으로 보충된 mMOPS 배지 (섹션 1.3에 기재된 바와 같은)에서, 24시간 동안 37℃에서, 상층액에서의 자유(free) 리포산의 측정 전에, 배양되었다. △lipA 균주 (BS1912 및 BS2114, 표 3)의 경우 37℃에서 24시간의 성장이 뒤따랐다.
기재된 균주들의 배양된 세포에 의해 생산된, 리포산은 pBS451을 포함한 BS1912를 이용한 섹션 1.6에 기재된 것과 유사한 바이오어쎄이를 이용하여 상층액으로부터 측정되었다. 리포산의 정량화를 위한, 성장-기반의 바이오어쎄이는 리포산을 합성할 수 없는 영양요구성 대장균 단일 △lipA 돌연변이 균주를 이용하여 수행되었다 (Herbert and Guest, 1975) (pBS451을 포함하는 BS1912). 상기 상층액에서의 자유 리포산 농도는 리포산의 단독 공급원으로서 생산 균주로부터 회수된 리포산으로 보충되고, 탄소원으로서 50 nN na-숙시네이트를 갖는 최소 배지에서의 리포산 영양요구성 균주의 성장을 측정함으로써 결정되었다. 공지된 농도 범위의 리포산 표준으로 보충된 최소 배지 상에서 영양요구성 균주가 성장되는 경우와, 리포산 바이오어쎄이 검정 곡선은 병렬로 수행되었다.
상기 시험은 IscR 단백질의 돌연변이 형태 (C92Y 또는 H107 치환을 갖는 IscR 단백질)를 코딩하는 유전자를 포함하는 대장균 균주에서의 LipA 유전자의 과-발현은, IscR 단백질의 천연 형태를 코딩하는 유전자를 포함하는 모체 대장균 균주에서의 LipA 유전자의 과발현과 비교하여, 더 안정적인 생산 및 리포산 역가의 80% 증가를 나타냄을 입증한다 (도 14). 따라서 iscR WT 균주 (pBS993을 갖는 BS1011)의 생산 역가에서의 표준 편차는 2.73인 반면에 iscR C92Y (pBS993을 갖는 BS1375)의 경우 1.42이고 iscR H107Y (pBS993을 갖는 BS1353)의 경우 0.11로 낮다 (균주 표준에 대한 표 1 참조). 개별적인 균주의 평균 생산 역가에 기초하여, 리포산 생산은 WT 균주와 비교하여 돌연변이 균주에서 1.79-배 개선되었다 (도 14 참조).
WT iscR 균주 (삼각형, pBS1037을 포함하는 BS1912) 및 iscR 돌연변이 균주 (사각형, pBS1047을 포함하는 BS2114) 둘다의 경우에서, LipA의 과발현은 IPTG에 의한 LipA의 증가된 유도에 반응하여 성장률이 감소하는 명확한 경향을 나타내었다 (더 어두운 음영을 넣은 회색, 도 15). 그러나, WT iscR 균주의 성장률은 0.01 mM 내지 0.03 mM 내지의 시험된 모든 IPTG 유도 수준에서 돌연변이 iscR 균주와 비교하여 보다 극심하게 감소되었다 (도 15 참조).
실시예 3 티아민 생산을 향상시킬 수 있는 유전자 변형
대장균
균주의 설계 및 특성화
실시예에서 사용된 하기의 에셰리키아 콜라이 균주는 하기에 나열된다.
명칭 | 설명 |
BS750 | 코딩된 TMP 키나아제에서 G133D 치환을 일으키는 천연 thiL 유전자: 코돈 133에서 GGT로부터 GAC로의 점 돌연변이를 포함하는 BS1013 파생체, BW25113 △thiP |
BS2019 | iscR에 C92Y 돌연변이를 포함하는 BS750 파생체 |
BS2020 | iscR에 H107Y 돌연변이를 포함하는 BS750 파생체 |
실시예에서 사용된 하기의 플라스미드는 하기에 나열된다.
명칭 | 설명 |
pBS140 | thiC 오페론 (apFAB46 프로모터 [서열번호 147] 및apFAB377 종결자 [서열번호 153]에 기능적으로 연결됨) 및 thiM 오페론 (apFAB71 프로모터 [서열번호 149] 및 apFAB378 종결자 [서열번호 152]에 기능적으로 연결됨)의 조합으로 구성된; 대장균 티아민 경로 유전자 thiCEFSGHMD를 포함하는 벡터 |
pBS100 | pBS140의 구성에 사용되기 위한 빈 벡터 |
pBS93 | pFAB70 프로모터 [서열번호 148] 및 ap FAB381 종결자 [서열번호 154]에 기능적으로 연결된 대장균에서의 발현에 최적화된 애기장대 (Arabidopsis thaliana) AT5G32470.1 포스파타아제 코돈을 코딩하는 합성 유전자 (synthetic gene)를 포함하는 벡터 |
pBS209 | 대장균으로부터 추가적인 thiC를 갖는 pBS140에 기반한 플라스미드 |
실시예 1에 기재된 바와 같이, 플라스미드 pBS140에 클로닝된 티아민 경로 유전자 thiCEFSGHMD는 천연 iscR 유전자를 포함하는 대장균 숙주 균주 (BS750) (표준 균주)뿐만 아니라 상기 천연 iscR 유전자가 C92Y 또는 H107Y 치환을 각각 갖는 IscR을 코딩하는 돌연변이 iscR 유전자에 의해 치환된 이 표준 균주 (BS2019 및 BS2020)의 파생체 내로 도입되었다. 상기 균주들은 딥 배양 플레이트의 개별적인 웰에서 24시간 동안 37℃에서 mMOPS 배지 (섹션 1.3에 기재된 바와 같음)에서 배양되었다.
기재된 균주들의 배양된 세포들에 의해 생산된 세포 외 및 세포 내 티아민, TMP 및 TPP는 다음과 같이 회수되고 추출되었다: 각 배양물의 0.4 mL는 배양 플레이트에서 5분 동안 4000 x g에서 원심분리에 의해 4℃에서 수확되었다. 나머지 모든 단계는 얼음 상에서 수행되었다. 세포 외 TPP, TMP 및 티아민의 분석을 위해 상층액의 40 ㎍은 부드럽게 제거되었다. 남은 상층액을 따라낸 후, 배양 플레이트는 잔류 배지를 제거하기 위해 뒤집은 후 볼텍싱되었다 (voltexed). 100μL 아이스-콜드 HPLC 그레이드 메탄올 (HPLC grade methanol)을 배양 플레이트의 각 웰에 첨가되었고; 세포는 다시 볼텍싱되었다. 얼음 상에서 최소 20분 동안 인큐베이션 후, 세포 잔해물은 5분 동안 4000 x g에서의 원심분리에 의해 펠렛화되었다. 상층액은 추가적인 분석을 위한 세포 내 추출물로서 사용되었다.
형광 검출기를 이용하여 TPP, TMP 및 티아민을 검출하기 위해, 각 배양물에 의해 생산된 티아민 화합물은 강한 형광성인 티오크롬으로 유도체화되었다 (derivatized). 모든 단계는 실온에서 수행되었다. 세포 외 및 세포 내 추출물의 40 ㎕ 부피는 4M 포타슘 아세테이트의 80 ㎕에 첨가되었고, 피펫팅 (pipetting)에 의해 혼합되었다. 새로 제조된 7M NaOH에서의 3.8 mM 포타슘 페리사이아나이드 (potassium ferricyanide) 40 ㎕가 첨가되고 혼합되었다. 새로 제조된 포화된 KH2P04에서의 0.06% H202 40 ㎕의 첨가에 의해 상기 반응은 억제되었다 (quenched). 상기 추출물은 6M HCl 47 ㎕의 첨가에 의해 중화되었고 이후 HPLC 또는 멀티스칸을 이용한 직접적인 형광 측정에 의해 분석되었다. 유도체화된 모든 화합물은 상기 분석된 추출물과 병렬하여 티오크롬으로 유도체화된 새로 제조된 TPP, TMP 및 티아민 표준의 형광 표준 곡선을 이용하여 정량화되었다.
상기 시험은 IscR 단백질의 돌연변이 형태 (C92Y 또는 H107Y 치환을 갖는 IscR 단백질)를 코딩하는 유전자를 포함하는 숙주 대장균 균주 (BS2019 및 BS202)에서의 TMP 포스파타제 유전자 (At5g32470)와 조합하여 thiC 유전자 및 thiH 유전자를 포함한 티아민 경로 유전자의 과-발현은, IscR 단백질의 천연 형태를 코딩하는 유전자를 포함하는 숙주 대장균 균주 (BS750)에서의 과-발현과 비교하여, 티아민, TMP 및 TPP, 특히 티아민의 생합성을 향상시킴을 입증한다.
보다 구체적으로, 상기 시험은 WT iscR을 갖는 균주 (BS750)와 pBS140을 사용하는 경우 iscR 돌연변이를 코딩하는 균주 (BS2020, H107Y 또는 BS2019, C92Y) 사이에 티아민 (티아민, TMP 및 TPP)의 OD-표준화된 세포 외 생산에 있어 1.43 배의 증가를 나타내었다 (도 16).
실시예 4 비오틴을 생산할 수 있는 유전자 변형
대장균
균주의 생산성을 증가시키기 위한 플라보독신/페레독신 환원 효소 (Fpr)의 과발현 및 플라보독신 (FldA) 환원 시스템
1 방법
1.1 : 실시예에서 사용된 하기의 에셰리키아 콜라이 균주는 하기에 나열된다.
명칭 | 설명 |
BS1013 | 다음의 유전자형을 갖는 E. coli K-12 BW25113 모체 균주: rrnB3 △lacZ4787 hsdR514 △(araBAD)567 △(rhaBAD)568 rph-1 |
BS1011 | E. coli K-12 BW25113로부터 유래된 △bioB (JW0758-1) |
BS1353 | iscR에 H107Y 돌연변이를 포함하는 BS1011 파생체 |
BS1615 | △bioAFCD의 추가적인 결실을 갖는 BS1011 파생체 |
BS1937 | IPTG-유도성 BioB 발현을 제공하는 pBS679 플라스미드를 포함하는 BS1615 파생체 |
BS2185 | IPTG-유도성 BioB 발현을 제공하는 pBS679 플라스미드 및 FldA-Fpr 항시적 발현을 제공하는 pBS1112를 포함하는 BS1615 파생체 |
BS2707 | IPTG-유도성 BioB 발현을 제공하는 pBS679 플라스미드 및 GFP 항시적 발현을 제공하는 pBS1054를 포함하는 BS1615 파생체 |
실시예에서 사용된 하기의 플라스미드는 하기에 나열된다.
명칭 | 설명 |
pBS679 | T5 lacO 억제 프로모터 [서열번호 25]로부터의 BioB [서열번호 22] 과발현 플라스미드 (ampR, pSC101) 및 |
pBS1054 | apFAB378 종결자 [서열번호 292]를 갖는 apFAB309 항시적 프로모터 [서열번호 291]로부터의 GFP [서열번호 276] 과발현 플라스미드 (kanR, pBR322) |
pBS1112 | apFAB306 항시적 프로모터 (apFAB306-FldA-Fpr 유전자-apFAB378 종결자 [서열번호 288])로부터의 FldA-Fpr 과발현 플라스미드 (kanR, pBR322) |
BioB를 코딩하는 IPTG-유도성 전이 유전자는 플라스미드 pBS679 상에 클로닝되었고; GFP를 코딩하는 항시적으로-조절되는 전이 유전자는 플라스미드 pBS1054에 클로닝되었고; FldA-렉을 코딩하는 합성 오페론 (synthetic operon)를 포함하는 항시적으로-조절되는 전이 유전자는 플라스미드 pBS1112 상에 클로닝되었다. pBS679는 실시예 1에 기재된 바와 같이, 천연 iscR 유전자가 H107Y 치환을 갖는 IscR 단백질을 코딩하는 돌연변이 iscR 유전자에 의해 치환되고, BS1937 균주를 생성하는 bioAFCD 유전자의 녹-아웃을 더 포함하는 대장균 숙주 균주 (BS1615) 내로 도입되었다. 상기 BS1937 균주는 BS2707 (대조군 균주) 및 BS2185 균주 각각을 생성하도록 플라스미드 pBS1054 또는 pBS1112로 더 형질전환되었다.
상기 균주들은 적합한 항생제, BioB-매개 촉매를 위한 기질로서 0.1 g/l을 가지고, BioB 유전자의 발현을 유도하기 위한 0, 0.01, 0.025, 0.05, 0.075 또는 0.1 mM IPTG로 보충된 mMOPS 배지 (실시예 1.3에 기재된 바와 같음)에서 배양되었다. 상기 세포들은 딥 웰 배양 플레이트의 개별적인 웰에서 24시간 동안 37℃에서 인큐베이션되었다. 최종 OD가 예측되었고, 상층액은 원심분리에 의해 수확되었고, 실시예 1.6에 의해 기재된 바와 같이 비오틴 바이오어쎄이에 의해 상층액으로부터 비오틴이 정량화되었다.
균주 BS1937에 대한 도 12 및 도 17에 나타낸 바와 같이, 유전적으로 변형된 내인성 iscR 유전자를 포함하는 대장균 세포에서 BioB 유전자 발현이 IPTG 농도의 증가에 의해 유도된 경우, 상기 세포는 상응하는 비오틴 생산에 있어 점진적인 증가를 나타낸다. 이들 유전자 변형 세포에서 비오틴 생산은 이의 모체 균주 BS1937 및 FldA-Fpr 대신에 GFP를 코딩하는 전이유전자를 발현하는 대조군 균주와 비교하여, FldA-Fpr을 코딩하는 전이 유전자의 공동-발현 (BS2185 균주)에 의해 더 향상된다.
IscR 단백질의 돌연변이 형태 (H107Y 치환을 갖는 IscR 단백질)를 코딩하는 유전자, BioB (pBS679) 및 FldA-Fpr 유전자 (pBS1112)의 과발현을 위한 플라스미드를 포함하는 BS2185 균주의 비오틴 생산은 대조군 균주 BS1937 (FldA-Fpr 유전자의 과발현 없음)과 비교하여 2.12-배 향상된다 (도 18).
참고문헌
Datsenko KA, Wanner BL. (2000) One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci U S A. ;97(12) : 6640-5.
Fleischhacker, A. S. et al. (2012) Characterization of the [2Fe-2S] cluster of Escherichia coli transcription factor IscR. Biochemistry 51 : 4453-4462.
Giel, J. L., Rodionov, D., Liu, M., Blattner, F. R. & Kiley, P. J. (2006) IscR-dependent gene expression links iron-sulphur cluster assembly to the control of 02-regulated genes in Escherichia coli. Mol. Microbiol. 60, 1058-1075.
Herbert AA, Guest JR. (1975) Lipoic acid content of Escherichia coli and other microorganisms. Arch Microbiol. ; 106(3) :259-66. Epub 1975/12/31. pmid: 814874
Ifuku, O. et al. Sequencing analysis of mutation points in the biotin operon of biotin-overproducing Escherichia coli mutants. Biosci Biotechnol Biochem 57, 760-765 (1993).
Ifuku, O. et al., (1995) "Molecular analysis of growth inhibition caused by overexpression of the biotin operon in Escherichia coli." Bioscience, biotechnology, and biochemistry 59(2): 184-189.
Martin, J. E. & Imlay, J. A. (2012) Replication during periods of iron starvation. 80, 319-334.
Py, B. & Barras, F. (2010) Building Fe-S proteins: bacterial strategies. Nat. Rev. Microbiol. 8, 436-446.
Schmitt, A, Kochanowski K., Vedelaar S., Ahrne E., Volkmer B., Callipo L., Knoops K., Bauer M., Aebersold R., & Heinemann M., (2015) The quantitative and condition-dependent Escherichia coli proteome. Nature Biotechnology 2015; doi: 10.1038
SEQUENCE LISTING
<110> Biosyntia ApS
<120> Cell factory having improved iron-sulfur cluster delivery
<130> P2296PC00
<150> EP17181503.8
<151> 2017-07-14
<160> 292
<170> PatentIn version 3.5
<210> 1
<211> 489
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(489)
<223> iscR WT gene encoding Iron Sulfur Cluster Regulator protein
(IscR)
<400> 1
atg aga ctg aca tct aaa ggg cgc tat gcc gtg acc gca atg ctt gac 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtt gcg ctc aac tct gaa gcg ggc ccg gta ccg ttg gct gat att tcc 96
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgt cag gga att tcc ctt tct tat ctg gaa caa ctg ttt tcc cgt 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgt aaa aat ggt ctg gtt tcc agc gta cgt gga cca ggc ggt ggt 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg tta ggc aaa gat gcc agc agc atc gcc gtt ggc gaa gta att 240
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcc gtt gac gaa tct gta gat gcc acc cgt tgt cag ggt aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cac gcg ctg tgg cgt gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ttg agc gac cgt ctc acc ggt ttt ctc aac aac att act tta ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtt aat aac cag gaa gtg ctg gat gtg tct ggt cgt cag cat act 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
cac gac gcg cca cgc acc cgc aca caa gac gcg atc gac gtt aag tta 480
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
cgc gct taa 489
Arg Ala
<210> 2
<211> 162
<212> PRT
<213> Escherichia coli
<400> 2
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
Arg Ala
<210> 3
<211> 486
<212> DNA
<213> Shigella sonnei
<220>
<221> CDS
<222> (1)..(486)
<223> iscR gene encoding Iron Sulfur Cluster Regulator
<400> 3
atg cgc ctg acc agc aaa ggc cgc tat gcg gtg acc gcg atg ctg gat 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtg gcg ctg aac agc gaa gcg ggc ccg gtg ccg ctg gcg gat att agc 96
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgc cag ggc att agc ctg agc tat ctg gaa cag ctg ttt agc cgc 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgc aaa aac ggc ctg gtg agc agc gtg cgc ggc ccg ggc ggc ggc 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg ctg ggc aaa gat gcg agc agc att gcg gtg ggc gaa gtg att 240
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcg gtg gat gaa agc gtg gat gcg acc cgc tgc cag ggc aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cat gcg ctg tgg cgc gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ctg agc gat cgc ctg acc ggc ttt ctg aac aac att acc ctg ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtg aac aac cag gaa gtg ctg gat gtg agc ggc cgc cag cat acc 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
cat gat gcg ccg cgc acc cgc att ctg gat gcg att gat gtg aaa ctg 480
His Asp Ala Pro Arg Thr Arg Ile Leu Asp Ala Ile Asp Val Lys Leu
145 150 155 160
cgc gcg 486
Arg Ala
<210> 4
<211> 162
<212> PRT
<213> Shigella sonnei
<400> 4
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
His Asp Ala Pro Arg Thr Arg Ile Leu Asp Ala Ile Asp Val Lys Leu
145 150 155 160
Arg Ala
<210> 5
<211> 489
<212> DNA
<213> Citrobacter pasteurii
<220>
<221> CDS
<222> (1)..(489)
<223> iscR gene encoding Iron Sulfur Cluster Regulator
<400> 5
atg cgc ctg acc agc aaa ggc cgc tat gcg gtg acc gcg atg ctg gat 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtg gcg ctg aac agc gaa acc ggc ccg gtg ccg ctg gcg gat att agc 96
Val Ala Leu Asn Ser Glu Thr Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgc cag ggc att agc ctg agc tat ctg gaa cag ctg ttt agc cgc 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgc aaa aac ggc ctg gtg agc agc gtg cgc ggc ccg ggc ggc ggc 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg ctg ggc aaa gat gcg agc agc att gcg gtg ggc gaa gtg att 240
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcg gtg gat gaa agc gtg gat gcg acc cgc tgc cag ggc aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cat gcg ctg tgg cgc gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ctg agc gat cgc ctg acc ggc ttt ctg aac aac att acc ctg ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtg aac aac cag gaa gtg ctg gat gtg agc ggc cgc cag cat acc 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
cat gat gcg ccg cgc acc aac cgc gcg cag gat gcg att gat gtg aaa 480
His Asp Ala Pro Arg Thr Asn Arg Ala Gln Asp Ala Ile Asp Val Lys
145 150 155 160
ctg cgc gcg 489
Leu Arg Ala
<210> 6
<211> 163
<212> PRT
<213> Citrobacter pasteurii
<400> 6
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Thr Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
His Asp Ala Pro Arg Thr Asn Arg Ala Gln Asp Ala Ile Asp Val Lys
145 150 155 160
Leu Arg Ala
<210> 7
<211> 489
<212> DNA
<213> Enterobacter timonensis
<220>
<221> CDS
<222> (1)..(489)
<223> iscR gene encoding Iron Sulfur Cluster Regulator
<400> 7
atg cgc ctg acc agc aaa ggc cgc tat gcg gtg acc gcg atg ctg gat 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtg gcg ctg aac agc gaa gcg ggc ccg gtg ccg ctg gcg gat att agc 96
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgc cag ggc att agc ctg agc tat ctg gaa cag ctg ttt agc cgc 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgc aaa aac ggc ctg gtg agc agc gtg cgc ggc ccg ggc ggc ggc 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg ctg ggc aaa gat gcg ggc agc att gcg gtg ggc gaa gtg att 240
Tyr Leu Leu Gly Lys Asp Ala Gly Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcg gtg gat gaa agc gtg gat gcg acc cgc tgc cag ggc aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cat gcg ctg tgg cgc gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ctg agc gat cgc ctg acc ggc ttt ctg aac aac att acc ctg ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtg aac aac cag gaa gtg ctg gat gtg agc ggc cgc cag cat agc 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Ser
130 135 140
cat gat agc cag cgc aac acc cgc gcg cag gat gcg att gat gtg aaa 480
His Asp Ser Gln Arg Asn Thr Arg Ala Gln Asp Ala Ile Asp Val Lys
145 150 155 160
ctg cgc gcg 489
Leu Arg Ala
<210> 8
<211> 163
<212> PRT
<213> Enterobacter timonensis
<400> 8
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Gly Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Ser
130 135 140
His Asp Ser Gln Arg Asn Thr Arg Ala Gln Asp Ala Ile Asp Val Lys
145 150 155 160
Leu Arg Ala
<210> 9
<211> 489
<212> DNA
<213> Pluralibacter gergoviae
<220>
<221> CDS
<222> (1)..(489)
<223> iscR gene encoding Iron Sulfur Cluster Regulator
<400> 9
atg cgc ctg acc agc aaa ggc cgc tat gcg gtg acc gcg atg ctg gat 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtg gcg ctg aac agc gaa agc ggc ccg gtg ccg ctg gcg gat att agc 96
Val Ala Leu Asn Ser Glu Ser Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgc cag ggc att agc ctg agc tat ctg gaa cag ctg ttt agc cgc 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgc aaa aac ggc ctg gtg agc agc gtg cgc ggc ccg ggc ggc ggc 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg ctg ggc aaa gat gcg ggc agc att gcg gtg ggc gaa gtg att 240
Tyr Leu Leu Gly Lys Asp Ala Gly Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcg gtg gat gaa agc gtg gat gcg acc cgc tgc cag ggc aaa gcg 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Ala
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cat gcg ctg tgg cgc gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ctg agc gat cgc ctg acc ggc ttt ctg aac aac att acc ctg ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtg aac aac cag gaa gtg ctg gat gtg agc gat cgc cag cat att 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Asp Arg Gln His Ile
130 135 140
cat gaa acc cag cgc agc acc cgc agc cag gat gcg att gat gtg aaa 480
His Glu Thr Gln Arg Ser Thr Arg Ser Gln Asp Ala Ile Asp Val Lys
145 150 155 160
ctg cgc gcg 489
Leu Arg Ala
<210> 10
<211> 163
<212> PRT
<213> Pluralibacter gergoviae
<400> 10
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ser Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Gly Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Ala
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Asp Arg Gln His Ile
130 135 140
His Glu Thr Gln Arg Ser Thr Arg Ser Gln Asp Ala Ile Asp Val Lys
145 150 155 160
Leu Arg Ala
<210> 11
<211> 486
<212> DNA
<213> Buttiauxella
<220>
<221> CDS
<222> (1)..(486)
<223> iscR gene encoding Iron Sulfur Cluster Regulator
<400> 11
atg cgc ctg acc agc aaa ggc cgc tat gcg gtg acc gcg atg ctg gat 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtg gcg ctg aac agc gaa agc ggc ccg gtg ccg ctg gcg gat att agc 96
Val Ala Leu Asn Ser Glu Ser Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgc cag ggc att agc ctg agc tat ctg gaa cag ctg ttt agc cgc 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgc aaa aac ggc ctg gtg gcg agc gtg cgc ggc ccg ggc ggc ggc 192
Leu Arg Lys Asn Gly Leu Val Ala Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg ctg ggc aaa gaa gcg agc gcg att gcg gtg ggc gaa gtg att 240
Tyr Leu Leu Gly Lys Glu Ala Ser Ala Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcg gtg gat gaa agc gtg gat gcg acc cgc tgc gcg ggc aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Ala Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cat gcg ctg tgg cgc gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ctg agc gat cgc ctg acc ggc ttt ctg aac aac att acc ctg ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtg aac aac cag gaa gtg ctg gat gtg agc ggc cgc cag cat aac 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Asn
130 135 140
gaa aac cat cgc agc acc cgc agc cag gat gcg att gat gtg aaa ctg 480
Glu Asn His Arg Ser Thr Arg Ser Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
cgc gcg 486
Arg Ala
<210> 12
<211> 162
<212> PRT
<213> Buttiauxella
<400> 12
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ser Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ala Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Glu Ala Ser Ala Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Ala Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Asn
130 135 140
Glu Asn His Arg Ser Thr Arg Ser Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
Arg Ala
<210> 13
<211> 489
<212> DNA
<213> Kosakonia sacchari
<220>
<221> CDS
<222> (1)..(489)
<223> iscR gene encoding Iron Sulfur Cluster Regulator
<400> 13
atg cgc ctg acc agc aaa ggc cgc tat gcg gtg acc gcg atg ctg gat 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtg gcg ctg aac agc gaa gcg ggc ccg gtg ccg ctg gcg gat att agc 96
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgc cag ggc att agc ctg agc tat ctg gaa cag ctg ttt agc cgc 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgc aaa aac ggc ctg gtg gcg agc gtg cgc ggc ccg ggc ggc ggc 192
Leu Arg Lys Asn Gly Leu Val Ala Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg ctg ggc aaa gat gcg aac acc att gcg gtg ggc gaa gtg att 240
Tyr Leu Leu Gly Lys Asp Ala Asn Thr Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcg gtg gat gaa agc gtg gat gcg acc cgc tgc cag ggc aaa agc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Ser
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cat gcg ctg tgg cgc gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ctg agc gat cgc ctg acc ggc ttt ctg aac aac att acc ctg ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtg aac aac cag gaa att ctg gat gtg agc gat cgc cag cat aac 432
Leu Val Asn Asn Gln Glu Ile Leu Asp Val Ser Asp Arg Gln His Asn
130 135 140
aac gaa agc cat cgc aac acc cgc ggc cag gat gcg att gat gtg aaa 480
Asn Glu Ser His Arg Asn Thr Arg Gly Gln Asp Ala Ile Asp Val Lys
145 150 155 160
ctg cgc gcg 489
Leu Arg Ala
<210> 14
<211> 163
<212> PRT
<213> Kosakonia sacchari
<400> 14
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ala Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Asn Thr Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Ser
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Ile Leu Asp Val Ser Asp Arg Gln His Asn
130 135 140
Asn Glu Ser His Arg Asn Thr Arg Gly Gln Asp Ala Ile Asp Val Lys
145 150 155 160
Leu Arg Ala
<210> 15
<211> 489
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(489)
<223> iscR mutant gene encoding Iron Sulfur Cluster Regulator protein
(IscR) with L15F substitution
<400> 15
atg aga ctg aca tct aaa ggg cgc tat gcc gtg acc gca atg ttt gac 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Phe Asp
1 5 10 15
gtt gcg ctc aac tct gaa gcg ggc ccg gta ccg ttg gct gat att tcc 96
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgt cag gga att tcc ctt tct tat ctg gaa caa ctg ttt tcc cgt 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgt aaa aat ggt ctg gtt tcc agc gta cgt gga cca ggc ggt ggt 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg tta ggc aaa gat gcc agc agc atc gcc gtt ggc gaa gta att 240
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcc gtt gac gaa tct gta gat gcc acc cgt tgt cag ggt aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cac gcg ctg tgg cgt gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ttg agc gac cgt ctc acc ggt ttt ctc aac aac att act tta ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtt aat aac cag gaa gtg ctg gat gtg tct ggt cgt cag cat act 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
cac gac gcg cca cgc acc cgc aca caa gac gcg atc gac gtt aag tta 480
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
cgc gct taa 489
Arg Ala
<210> 16
<211> 162
<212> PRT
<213> Escherichia coli
<400> 16
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Phe Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
Arg Ala
<210> 17
<211> 489
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(489)
<223> iscR mutant gene encoding Iron Sulfur Cluster Regulator protein
(IscR) with C92Y substitution
<400> 17
atg aga ctg aca tct aaa ggg cgc tat gcc gtg acc gca atg ctt gac 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtt gcg ctc aac tct gaa gcg ggc ccg gta ccg ttg gct gat att tcc 96
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgt cag gga att tcc ctt tct tat ctg gaa caa ctg ttt tcc cgt 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgt aaa aat ggt ctg gtt tcc agc gta cgt gga cca ggc ggt ggt 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg tta ggc aaa gat gcc agc agc atc gcc gtt ggc gaa gta att 240
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcc gtt gac gaa tct gta gat gcc acc cgt tat cag ggt aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Tyr Gln Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc cac gcg ctg tgg cgt gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
ttg agc gac cgt ctc acc ggt ttt ctc aac aac att act tta ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtt aat aac cag gaa gtg ctg gat gtg tct ggt cgt cag cat act 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
cac gac gcg cca cgc acc cgc aca caa gac gcg atc gac gtt aag tta 480
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
cgc gct taa 489
Arg Ala
<210> 18
<211> 162
<212> PRT
<213> Escherichia coli
<400> 18
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Tyr Gln Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr His Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
Arg Ala
<210> 19
<211> 489
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(489)
<223> iscR mutant gene encoding Iron Sulfur Cluster Regulator protein
(IscR) with H107Y substitution
<400> 19
atg aga ctg aca tct aaa ggg cgc tat gcc gtg acc gca atg ctt gac 48
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
gtt gcg ctc aac tct gaa gcg ggc ccg gta ccg ttg gct gat att tcc 96
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
gaa cgt cag gga att tcc ctt tct tat ctg gaa caa ctg ttt tcc cgt 144
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
ctg cgt aaa aat ggt ctg gtt tcc agc gta cgt gga cca ggc ggt ggt 192
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
tat ctg tta ggc aaa gat gcc agc agc atc gcc gtt ggc gaa gta att 240
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
agc gcc gtt gac gaa tct gta gat gcc acc cgt tgt cag ggt aaa ggc 288
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
ggc tgc cag ggc ggc gat aaa tgc ctg acc tac gcg ctg tgg cgt gat 336
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr Tyr Ala Leu Trp Arg Asp
100 105 110
ttg agc gac cgt ctc acc ggt ttt ctc aac aac att act tta ggc gaa 384
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
ctg gtt aat aac cag gaa gtg ctg gat gtg tct ggt cgt cag cat act 432
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
cac gac gcg cca cgc acc cgc aca caa gac gcg atc gac gtt aag tta 480
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
cgc gct taa 489
Arg Ala
<210> 20
<211> 162
<212> PRT
<213> Escherichia coli
<400> 20
Met Arg Leu Thr Ser Lys Gly Arg Tyr Ala Val Thr Ala Met Leu Asp
1 5 10 15
Val Ala Leu Asn Ser Glu Ala Gly Pro Val Pro Leu Ala Asp Ile Ser
20 25 30
Glu Arg Gln Gly Ile Ser Leu Ser Tyr Leu Glu Gln Leu Phe Ser Arg
35 40 45
Leu Arg Lys Asn Gly Leu Val Ser Ser Val Arg Gly Pro Gly Gly Gly
50 55 60
Tyr Leu Leu Gly Lys Asp Ala Ser Ser Ile Ala Val Gly Glu Val Ile
65 70 75 80
Ser Ala Val Asp Glu Ser Val Asp Ala Thr Arg Cys Gln Gly Lys Gly
85 90 95
Gly Cys Gln Gly Gly Asp Lys Cys Leu Thr Tyr Ala Leu Trp Arg Asp
100 105 110
Leu Ser Asp Arg Leu Thr Gly Phe Leu Asn Asn Ile Thr Leu Gly Glu
115 120 125
Leu Val Asn Asn Gln Glu Val Leu Asp Val Ser Gly Arg Gln His Thr
130 135 140
His Asp Ala Pro Arg Thr Arg Thr Gln Asp Ala Ile Asp Val Lys Leu
145 150 155 160
Arg Ala
<210> 21
<211> 1041
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1041)
<223> bioB gene encoding biotin synthase (EC 2.8.1.6)
<400> 21
atg gct cac cgc cca cgc tgg aca ttg tcg caa gtc aca gaa tta ttt 48
Met Ala His Arg Pro Arg Trp Thr Leu Ser Gln Val Thr Glu Leu Phe
1 5 10 15
gaa aaa ccg ttg ctg gat ctg ctg ttt gaa gcg cag cag gtg cat cgc 96
Glu Lys Pro Leu Leu Asp Leu Leu Phe Glu Ala Gln Gln Val His Arg
20 25 30
cag cat ttc gat cct cgt cag gtg cag gtc agc acg ttg ctg tcg att 144
Gln His Phe Asp Pro Arg Gln Val Gln Val Ser Thr Leu Leu Ser Ile
35 40 45
aag acc gga gct tgt ccg gaa gat tgc aaa tac tgc ccg caa agc tcg 192
Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys Tyr Cys Pro Gln Ser Ser
50 55 60
cgc tac aaa acc ggg ctg gaa gcc gag cgg ttg atg gaa gtt gaa cag 240
Arg Tyr Lys Thr Gly Leu Glu Ala Glu Arg Leu Met Glu Val Glu Gln
65 70 75 80
gtg ctg gag tcg gcg cgc aaa gcg aaa gcg gca gga tcg acg cgc ttc 288
Val Leu Glu Ser Ala Arg Lys Ala Lys Ala Ala Gly Ser Thr Arg Phe
85 90 95
tgt atg ggc gcg gcg tgg aag aat ccc cac gaa cgc gat atg ccg tac 336
Cys Met Gly Ala Ala Trp Lys Asn Pro His Glu Arg Asp Met Pro Tyr
100 105 110
ctg gaa caa atg gtg cag ggg gta aaa gcg atg ggg ctg gag gcg tgt 384
Leu Glu Gln Met Val Gln Gly Val Lys Ala Met Gly Leu Glu Ala Cys
115 120 125
atg acg ctg ggc acg ttg agt gaa tct cag gcg cag cgc ctc gcg aac 432
Met Thr Leu Gly Thr Leu Ser Glu Ser Gln Ala Gln Arg Leu Ala Asn
130 135 140
gcc ggg ctg gat tac tac aac cac aac ctg gac acc tcg ccg gag ttt 480
Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe
145 150 155 160
tac ggc aat atc atc acc aca cgc act tat cag gaa cgc ctc gat acg 528
Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr Gln Glu Arg Leu Asp Thr
165 170 175
ctg gaa aaa gtg cgc gat gcc ggg atc aaa gtc tgt tct ggc ggc att 576
Leu Glu Lys Val Arg Asp Ala Gly Ile Lys Val Cys Ser Gly Gly Ile
180 185 190
gtg ggc tta ggc gaa acg gta aaa gat cgc gcc gga tta ttg ctg caa 624
Val Gly Leu Gly Glu Thr Val Lys Asp Arg Ala Gly Leu Leu Leu Gln
195 200 205
ctg gca aac ctg ccg acg ccg ccg gaa agc gtg cca atc aac atg ctg 672
Leu Ala Asn Leu Pro Thr Pro Pro Glu Ser Val Pro Ile Asn Met Leu
210 215 220
gtg aag gtg aaa ggc acg ccg ctt gcc gat aac gat gat gtc gat gcc 720
Val Lys Val Lys Gly Thr Pro Leu Ala Asp Asn Asp Asp Val Asp Ala
225 230 235 240
ttt gat ttt att cgc acc att gcg gtc gcg cgg atc atg atg cca acc 768
Phe Asp Phe Ile Arg Thr Ile Ala Val Ala Arg Ile Met Met Pro Thr
245 250 255
tct tac gtg cgc ctt tct gcc gga cgc gag cag atg aac gaa cag act 816
Ser Tyr Val Arg Leu Ser Ala Gly Arg Glu Gln Met Asn Glu Gln Thr
260 265 270
cag gcg atg tgc ttt atg gca ggc gca aac tcg att ttc tac ggt tgc 864
Gln Ala Met Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Cys
275 280 285
aaa ctg ctg acc acg ccg aat ccg gaa gaa gat aaa gac ctg caa ctg 912
Lys Leu Leu Thr Thr Pro Asn Pro Glu Glu Asp Lys Asp Leu Gln Leu
290 295 300
ttc cgc aaa ctg ggg cta aat ccg cag caa act gcc gtg ctg gca ggg 960
Phe Arg Lys Leu Gly Leu Asn Pro Gln Gln Thr Ala Val Leu Ala Gly
305 310 315 320
gat aac gaa caa cag caa cgt ctt gaa cag gcg ctg atg acc ccg gac 1008
Asp Asn Glu Gln Gln Gln Arg Leu Glu Gln Ala Leu Met Thr Pro Asp
325 330 335
acc gac gaa tat tac aac gcg gca gca tta tga 1041
Thr Asp Glu Tyr Tyr Asn Ala Ala Ala Leu
340 345
<210> 22
<211> 346
<212> PRT
<213> Escherichia coli
<400> 22
Met Ala His Arg Pro Arg Trp Thr Leu Ser Gln Val Thr Glu Leu Phe
1 5 10 15
Glu Lys Pro Leu Leu Asp Leu Leu Phe Glu Ala Gln Gln Val His Arg
20 25 30
Gln His Phe Asp Pro Arg Gln Val Gln Val Ser Thr Leu Leu Ser Ile
35 40 45
Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys Tyr Cys Pro Gln Ser Ser
50 55 60
Arg Tyr Lys Thr Gly Leu Glu Ala Glu Arg Leu Met Glu Val Glu Gln
65 70 75 80
Val Leu Glu Ser Ala Arg Lys Ala Lys Ala Ala Gly Ser Thr Arg Phe
85 90 95
Cys Met Gly Ala Ala Trp Lys Asn Pro His Glu Arg Asp Met Pro Tyr
100 105 110
Leu Glu Gln Met Val Gln Gly Val Lys Ala Met Gly Leu Glu Ala Cys
115 120 125
Met Thr Leu Gly Thr Leu Ser Glu Ser Gln Ala Gln Arg Leu Ala Asn
130 135 140
Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe
145 150 155 160
Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr Gln Glu Arg Leu Asp Thr
165 170 175
Leu Glu Lys Val Arg Asp Ala Gly Ile Lys Val Cys Ser Gly Gly Ile
180 185 190
Val Gly Leu Gly Glu Thr Val Lys Asp Arg Ala Gly Leu Leu Leu Gln
195 200 205
Leu Ala Asn Leu Pro Thr Pro Pro Glu Ser Val Pro Ile Asn Met Leu
210 215 220
Val Lys Val Lys Gly Thr Pro Leu Ala Asp Asn Asp Asp Val Asp Ala
225 230 235 240
Phe Asp Phe Ile Arg Thr Ile Ala Val Ala Arg Ile Met Met Pro Thr
245 250 255
Ser Tyr Val Arg Leu Ser Ala Gly Arg Glu Gln Met Asn Glu Gln Thr
260 265 270
Gln Ala Met Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Cys
275 280 285
Lys Leu Leu Thr Thr Pro Asn Pro Glu Glu Asp Lys Asp Leu Gln Leu
290 295 300
Phe Arg Lys Leu Gly Leu Asn Pro Gln Gln Thr Ala Val Leu Ala Gly
305 310 315 320
Asp Asn Glu Gln Gln Gln Arg Leu Glu Gln Ala Leu Met Thr Pro Asp
325 330 335
Thr Asp Glu Tyr Tyr Asn Ala Ala Ala Leu
340 345
<210> 23
<211> 1040
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(222)
<223> Mutant bioB gene with frameshift encoding inactive truncated
biotin synthase (EC 2.8.1.6)
<400> 23
atg gcc acc gcc cac gct gga cat tgt cgc aag tca cag aat tat ttg 48
Met Ala Thr Ala His Ala Gly His Cys Arg Lys Ser Gln Asn Tyr Leu
1 5 10 15
aaa aac cgt tgc tgg atc tgc tgt ttg aag cgc agc agg tgc atc gcc 96
Lys Asn Arg Cys Trp Ile Cys Cys Leu Lys Arg Ser Arg Cys Ile Ala
20 25 30
agc att tcg atc ctc gtc agg tgc agg tca gca cgt tgc tgt cga tta 144
Ser Ile Ser Ile Leu Val Arg Cys Arg Ser Ala Arg Cys Cys Arg Leu
35 40 45
aga ccg gag ctt gtc cgg aag att gca aat act gcc cgc aaa gct cgc 192
Arg Pro Glu Leu Val Arg Lys Ile Ala Asn Thr Ala Arg Lys Ala Arg
50 55 60
gct aca aaa ccg ggc tgg aag ccg agc ggt tgatggaagt tgaacaggtg 242
Ala Thr Lys Pro Gly Trp Lys Pro Ser Gly
65 70
ctggagtcgg cgcgcaaagc gaaagcggca ggatcgacgc gcttctgtat gggcgcggcg 302
tggaagaatc cccacgaacg cgatatgccg tacctggaac aaatggtgca gggggtaaaa 362
gcgatggggc tggaggcgtg tatgacgctg ggcacgttga gtgaatctca ggcgcagcgc 422
ctcgcgaacg ccgggctgga ttactacaac cacaacctgg acacctcgcc ggagttttac 482
ggcaatatca tcaccacacg cacttatcag gaacgcctcg atacgctgga aaaagtgcgc 542
gatgccggga tcaaagtctg ttctggcggc attgtgggct taggcgaaac ggtaaaagat 602
cgcgccggat tattgctgca actggcaaac ctgccgacgc cgccggaaag cgtgccaatc 662
aacatgctgg tgaaggtgaa aggcacgccg cttgccgata acgatgatgt cgatgccttt 722
gattttattc gcaccattgc ggtcgcgcgg atcatgatgc caacctctta cgtgcgcctt 782
tctgccggac gcgagcagat gaacgaacag actcaggcga tgtgctttat ggcaggcgca 842
aactcgattt tctacggttg caaactgctg accacgccga atccggaaga agataaagac 902
ctgcaactgt tccgcaaact ggggctaaat ccgcagcaaa ctgccgtgct ggcaggggat 962
aacgaacaac agcaacgtct tgaacaggcg ctgatgaccc cggacaccga cgaatattac 1022
aacgcggcag cattatga 1040
<210> 24
<211> 74
<212> PRT
<213> Escherichia coli
<400> 24
Met Ala Thr Ala His Ala Gly His Cys Arg Lys Ser Gln Asn Tyr Leu
1 5 10 15
Lys Asn Arg Cys Trp Ile Cys Cys Leu Lys Arg Ser Arg Cys Ile Ala
20 25 30
Ser Ile Ser Ile Leu Val Arg Cys Arg Ser Ala Arg Cys Cys Arg Leu
35 40 45
Arg Pro Glu Leu Val Arg Lys Ile Ala Asn Thr Ala Arg Lys Ala Arg
50 55 60
Ala Thr Lys Pro Gly Trp Lys Pro Ser Gly
65 70
<210> 25
<211> 97
<212> DNA
<213> Escherichia coli
<220>
<221> promoter
<222> (1)..(97)
<223> T5 lacO repressed promoter
<400> 25
tcataaaaaa tttatttgct ttgtgagcgg ataacaatta taatagattc aaatcggagg 60
ttctctaact agtatctcta gagctaagga ggtaaat 97
<210> 26
<211> 1041
<212> DNA
<213> Candidatus Chloracidobacterium thermophilum B
<220>
<221> CDS
<222> (1)..(1041)
<223> bioB gene encoding biotin synthase from Candidatus
Chloracidobacterium thermophilum B
<400> 26
atg agc cag ccg ctg gtt cgt ttt gat tgg acc cgt gat gaa ctg cgt 48
Met Ser Gln Pro Leu Val Arg Phe Asp Trp Thr Arg Asp Glu Leu Arg
1 5 10 15
gca ctg cat gat ctg ccg ctg ctg gaa ctg att cat cgt gca gca acc 96
Ala Leu His Asp Leu Pro Leu Leu Glu Leu Ile His Arg Ala Ala Thr
20 25 30
gtt cat cgt acc tgt cat gat ccg cag gaa gtt cag gtt tgt cgt ctg 144
Val His Arg Thr Cys His Asp Pro Gln Glu Val Gln Val Cys Arg Leu
35 40 45
att agc att aaa acc ggc ggt tgt ccg gaa gat tgt ggt tat tgt agc 192
Ile Ser Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Gly Tyr Cys Ser
50 55 60
cag agc gca cat tat gaa acc ggt att gca gca cag ccg ctg ctg gat 240
Gln Ser Ala His Tyr Glu Thr Gly Ile Ala Ala Gln Pro Leu Leu Asp
65 70 75 80
aaa gcc acc gtt gtt gca att gca gaa cgt gca aaa gca cat ggt gtt 288
Lys Ala Thr Val Val Ala Ile Ala Glu Arg Ala Lys Ala His Gly Val
85 90 95
agc cgt gtt tgt ctg ggt gca gca tgg cgt aat gtt cgt gat gat gca 336
Ser Arg Val Cys Leu Gly Ala Ala Trp Arg Asn Val Arg Asp Asp Ala
100 105 110
cag ttt gaa gca gtt ctg gat att gtt cgt agc gtt aat gca ctg ggt 384
Gln Phe Glu Ala Val Leu Asp Ile Val Arg Ser Val Asn Ala Leu Gly
115 120 125
att gaa gtt tgt tgt acc ctg ggt atg ctg acc gaa gcc cag gca cgt 432
Ile Glu Val Cys Cys Thr Leu Gly Met Leu Thr Glu Ala Gln Ala Arg
130 135 140
cgt ctg gaa gaa gca ggt ctg tat gca tat aat cat aat ctg gat acc 480
Arg Leu Glu Glu Ala Gly Leu Tyr Ala Tyr Asn His Asn Leu Asp Thr
145 150 155 160
agc cgt gaa tat tat ggt cgt gtt gtt acc acc cgt acc tat gat gat 528
Ser Arg Glu Tyr Tyr Gly Arg Val Val Thr Thr Arg Thr Tyr Asp Asp
165 170 175
cgc ctg gaa acc ctg gca aat gtt cgc aaa acc ggt gtt acc ctg tgt 576
Arg Leu Glu Thr Leu Ala Asn Val Arg Lys Thr Gly Val Thr Leu Cys
180 185 190
acc ggt ggt att ctg ggt ctg ggt gaa agc acc gat gat cgt att ggt 624
Thr Gly Gly Ile Leu Gly Leu Gly Glu Ser Thr Asp Asp Arg Ile Gly
195 200 205
ctg ctg cat acc ctg gcc acc atg aat ccg cat ccg gaa agc gtt ccg 672
Leu Leu His Thr Leu Ala Thr Met Asn Pro His Pro Glu Ser Val Pro
210 215 220
att aat ctg ctg acc cgt gtt ccg ggt acc ccg atg gaa aat gaa gca 720
Ile Asn Leu Leu Thr Arg Val Pro Gly Thr Pro Met Glu Asn Glu Ala
225 230 235 240
gaa gtt agc gtt tgg gaa acc ctg cgt gtt att gca acc gca cgt att 768
Glu Val Ser Val Trp Glu Thr Leu Arg Val Ile Ala Thr Ala Arg Ile
245 250 255
gca atg ccg cgt agt gtt att cgt ctg agc gca ggt cgt acc cag ctg 816
Ala Met Pro Arg Ser Val Ile Arg Leu Ser Ala Gly Arg Thr Gln Leu
260 265 270
agc gaa gaa gca cag gca ctg tgt ttt ctg gcc ggt gca aat agc att 864
Ser Glu Glu Ala Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile
275 280 285
ttt agc agc gat gca cgt atg atg ctg acc cgc gtt agc ccg acc aat 912
Phe Ser Ser Asp Ala Arg Met Met Leu Thr Arg Val Ser Pro Thr Asn
290 295 300
gat tat gat gaa gat gcc cag ctg ctg aat aaa ctg ggt ctg cat ccg 960
Asp Tyr Asp Glu Asp Ala Gln Leu Leu Asn Lys Leu Gly Leu His Pro
305 310 315 320
cgt gtt ccg ttt aaa gat gca ccg aat gca aaa acc gca ggt tgt gca 1008
Arg Val Pro Phe Lys Asp Ala Pro Asn Ala Lys Thr Ala Gly Cys Ala
325 330 335
agc gca gcc acc gca acc ctg cag gaa aaa taa 1041
Ser Ala Ala Thr Ala Thr Leu Gln Glu Lys
340 345
<210> 27
<211> 346
<212> PRT
<213> Candidatus Chloracidobacterium thermophilum B
<400> 27
Met Ser Gln Pro Leu Val Arg Phe Asp Trp Thr Arg Asp Glu Leu Arg
1 5 10 15
Ala Leu His Asp Leu Pro Leu Leu Glu Leu Ile His Arg Ala Ala Thr
20 25 30
Val His Arg Thr Cys His Asp Pro Gln Glu Val Gln Val Cys Arg Leu
35 40 45
Ile Ser Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Gly Tyr Cys Ser
50 55 60
Gln Ser Ala His Tyr Glu Thr Gly Ile Ala Ala Gln Pro Leu Leu Asp
65 70 75 80
Lys Ala Thr Val Val Ala Ile Ala Glu Arg Ala Lys Ala His Gly Val
85 90 95
Ser Arg Val Cys Leu Gly Ala Ala Trp Arg Asn Val Arg Asp Asp Ala
100 105 110
Gln Phe Glu Ala Val Leu Asp Ile Val Arg Ser Val Asn Ala Leu Gly
115 120 125
Ile Glu Val Cys Cys Thr Leu Gly Met Leu Thr Glu Ala Gln Ala Arg
130 135 140
Arg Leu Glu Glu Ala Gly Leu Tyr Ala Tyr Asn His Asn Leu Asp Thr
145 150 155 160
Ser Arg Glu Tyr Tyr Gly Arg Val Val Thr Thr Arg Thr Tyr Asp Asp
165 170 175
Arg Leu Glu Thr Leu Ala Asn Val Arg Lys Thr Gly Val Thr Leu Cys
180 185 190
Thr Gly Gly Ile Leu Gly Leu Gly Glu Ser Thr Asp Asp Arg Ile Gly
195 200 205
Leu Leu His Thr Leu Ala Thr Met Asn Pro His Pro Glu Ser Val Pro
210 215 220
Ile Asn Leu Leu Thr Arg Val Pro Gly Thr Pro Met Glu Asn Glu Ala
225 230 235 240
Glu Val Ser Val Trp Glu Thr Leu Arg Val Ile Ala Thr Ala Arg Ile
245 250 255
Ala Met Pro Arg Ser Val Ile Arg Leu Ser Ala Gly Arg Thr Gln Leu
260 265 270
Ser Glu Glu Ala Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile
275 280 285
Phe Ser Ser Asp Ala Arg Met Met Leu Thr Arg Val Ser Pro Thr Asn
290 295 300
Asp Tyr Asp Glu Asp Ala Gln Leu Leu Asn Lys Leu Gly Leu His Pro
305 310 315 320
Arg Val Pro Phe Lys Asp Ala Pro Asn Ala Lys Thr Ala Gly Cys Ala
325 330 335
Ser Ala Ala Thr Ala Thr Leu Gln Glu Lys
340 345
<210> 28
<211> 1164
<212> DNA
<213> Streptomyces lydicus
<220>
<221> CDS
<222> (1)..(1164)
<223> bioB gene encoding biotin synthase from Streptomyces lydicus A02
<400> 28
atg ccg tat gtt cgt att aat gca atg gat ctg ctg aat acc ctg gtt 48
Met Pro Tyr Val Arg Ile Asn Ala Met Asp Leu Leu Asn Thr Leu Val
1 5 10 15
gat aaa ggt ctg cgt cgt gaa ctg ccg acc cgt gaa gaa gca ctg gca 96
Asp Lys Gly Leu Arg Arg Glu Leu Pro Thr Arg Glu Glu Ala Leu Ala
20 25 30
gtt ctg gca acc agc gat gat gaa ctg ctg gat gtt gtt gca gca gcg 144
Val Leu Ala Thr Ser Asp Asp Glu Leu Leu Asp Val Val Ala Ala Ala
35 40 45
ggt aaa gtt cgc cgt cag tgg ttt ggt cgt cgt gtt aaa ctg aat tat 192
Gly Lys Val Arg Arg Gln Trp Phe Gly Arg Arg Val Lys Leu Asn Tyr
50 55 60
ctg gtg aat ctg aaa agc ggt ctg tgt ccg gaa gat tgt agc tat tgt 240
Leu Val Asn Leu Lys Ser Gly Leu Cys Pro Glu Asp Cys Ser Tyr Cys
65 70 75 80
agc cag cgt ctg ggt agc aaa gca gaa att ctg aaa tat acc tgg ctg 288
Ser Gln Arg Leu Gly Ser Lys Ala Glu Ile Leu Lys Tyr Thr Trp Leu
85 90 95
aaa ccg gat gat gca agt aaa gca gcc gca gca ggc gtt gcc ggt ggt 336
Lys Pro Asp Asp Ala Ser Lys Ala Ala Ala Ala Gly Val Ala Gly Gly
100 105 110
gca aaa cgt gtt tgt ctg gtt gca agc ggt cgt ggt ccg acc gat aaa 384
Ala Lys Arg Val Cys Leu Val Ala Ser Gly Arg Gly Pro Thr Asp Lys
115 120 125
gat gtt gat cgt gtg agc gaa acc att agc gca att aaa gaa cag aat 432
Asp Val Asp Arg Val Ser Glu Thr Ile Ser Ala Ile Lys Glu Gln Asn
130 135 140
gaa ggt gtg gaa gtt tgt gca tgt ctg ggt ctg ctg agc gat ggt cag 480
Glu Gly Val Glu Val Cys Ala Cys Leu Gly Leu Leu Ser Asp Gly Gln
145 150 155 160
gca gat cgt ctg cgt gca gcc ggt gca gat gca tat aat cat aat ctg 528
Ala Asp Arg Leu Arg Ala Ala Gly Ala Asp Ala Tyr Asn His Asn Leu
165 170 175
aat acc agt gaa gca acc tat ggt gat att tgt acc acc cat gat ttt 576
Asn Thr Ser Glu Ala Thr Tyr Gly Asp Ile Cys Thr Thr His Asp Phe
180 185 190
agc gat cgt gtt agc acc gtt cag cag gca cag gca gca ggt atg agc 624
Ser Asp Arg Val Ser Thr Val Gln Gln Ala Gln Ala Ala Gly Met Ser
195 200 205
gca tgt agc ggt ctg att gca ggt atg ggt gaa agc gat gca gat ctg 672
Ala Cys Ser Gly Leu Ile Ala Gly Met Gly Glu Ser Asp Ala Asp Leu
210 215 220
gtg gat gtg gtt ttt gca ctg cgt gaa ctg gat ccg gat agc gtt ccg 720
Val Asp Val Val Phe Ala Leu Arg Glu Leu Asp Pro Asp Ser Val Pro
225 230 235 240
gtt aat ttt ctg att ccg ttt gaa ggt acc ccg ctg gca aaa gaa tgg 768
Val Asn Phe Leu Ile Pro Phe Glu Gly Thr Pro Leu Ala Lys Glu Trp
245 250 255
aat ctg acc ccg cag cgt gcc ctg cgt att ctg gca atg gtt cgt ttt 816
Asn Leu Thr Pro Gln Arg Ala Leu Arg Ile Leu Ala Met Val Arg Phe
260 265 270
gtt tgt ccg gat gtt gaa gtt cgt ctg gca ggc ggt cgt gaa gtt cat 864
Val Cys Pro Asp Val Glu Val Arg Leu Ala Gly Gly Arg Glu Val His
275 280 285
ctg cgt agc ctg cag ccg ctg gca ctg cat ctg gtt aat agc att ttt 912
Leu Arg Ser Leu Gln Pro Leu Ala Leu His Leu Val Asn Ser Ile Phe
290 295 300
ctg ggt gat tat ctg acc agc gaa ggt cag gcc ggt aaa gaa gat ctg 960
Leu Gly Asp Tyr Leu Thr Ser Glu Gly Gln Ala Gly Lys Glu Asp Leu
305 310 315 320
gcc atg att gcc gat gca ggt ttt gaa gtg gaa ggt aca gat acc acc 1008
Ala Met Ile Ala Asp Ala Gly Phe Glu Val Glu Gly Thr Asp Thr Thr
325 330 335
acc ctg ccg gaa cat cgt aca gat gct gca gtt cag ccg gca ccg gaa 1056
Thr Leu Pro Glu His Arg Thr Asp Ala Ala Val Gln Pro Ala Pro Glu
340 345 350
ccg gca gca gat gca gca gtt cct gca cct cct agt gaa gaa aca cgt 1104
Pro Ala Ala Asp Ala Ala Val Pro Ala Pro Pro Ser Glu Glu Thr Arg
355 360 365
cgt gat ctg gtt agc gtt cgt cgt cgt ggt gca ggt acc gaa ctg ccg 1152
Arg Asp Leu Val Ser Val Arg Arg Arg Gly Ala Gly Thr Glu Leu Pro
370 375 380
ccg aat gca taa 1164
Pro Asn Ala
385
<210> 29
<211> 387
<212> PRT
<213> Streptomyces lydicus
<400> 29
Met Pro Tyr Val Arg Ile Asn Ala Met Asp Leu Leu Asn Thr Leu Val
1 5 10 15
Asp Lys Gly Leu Arg Arg Glu Leu Pro Thr Arg Glu Glu Ala Leu Ala
20 25 30
Val Leu Ala Thr Ser Asp Asp Glu Leu Leu Asp Val Val Ala Ala Ala
35 40 45
Gly Lys Val Arg Arg Gln Trp Phe Gly Arg Arg Val Lys Leu Asn Tyr
50 55 60
Leu Val Asn Leu Lys Ser Gly Leu Cys Pro Glu Asp Cys Ser Tyr Cys
65 70 75 80
Ser Gln Arg Leu Gly Ser Lys Ala Glu Ile Leu Lys Tyr Thr Trp Leu
85 90 95
Lys Pro Asp Asp Ala Ser Lys Ala Ala Ala Ala Gly Val Ala Gly Gly
100 105 110
Ala Lys Arg Val Cys Leu Val Ala Ser Gly Arg Gly Pro Thr Asp Lys
115 120 125
Asp Val Asp Arg Val Ser Glu Thr Ile Ser Ala Ile Lys Glu Gln Asn
130 135 140
Glu Gly Val Glu Val Cys Ala Cys Leu Gly Leu Leu Ser Asp Gly Gln
145 150 155 160
Ala Asp Arg Leu Arg Ala Ala Gly Ala Asp Ala Tyr Asn His Asn Leu
165 170 175
Asn Thr Ser Glu Ala Thr Tyr Gly Asp Ile Cys Thr Thr His Asp Phe
180 185 190
Ser Asp Arg Val Ser Thr Val Gln Gln Ala Gln Ala Ala Gly Met Ser
195 200 205
Ala Cys Ser Gly Leu Ile Ala Gly Met Gly Glu Ser Asp Ala Asp Leu
210 215 220
Val Asp Val Val Phe Ala Leu Arg Glu Leu Asp Pro Asp Ser Val Pro
225 230 235 240
Val Asn Phe Leu Ile Pro Phe Glu Gly Thr Pro Leu Ala Lys Glu Trp
245 250 255
Asn Leu Thr Pro Gln Arg Ala Leu Arg Ile Leu Ala Met Val Arg Phe
260 265 270
Val Cys Pro Asp Val Glu Val Arg Leu Ala Gly Gly Arg Glu Val His
275 280 285
Leu Arg Ser Leu Gln Pro Leu Ala Leu His Leu Val Asn Ser Ile Phe
290 295 300
Leu Gly Asp Tyr Leu Thr Ser Glu Gly Gln Ala Gly Lys Glu Asp Leu
305 310 315 320
Ala Met Ile Ala Asp Ala Gly Phe Glu Val Glu Gly Thr Asp Thr Thr
325 330 335
Thr Leu Pro Glu His Arg Thr Asp Ala Ala Val Gln Pro Ala Pro Glu
340 345 350
Pro Ala Ala Asp Ala Ala Val Pro Ala Pro Pro Ser Glu Glu Thr Arg
355 360 365
Arg Asp Leu Val Ser Val Arg Arg Arg Gly Ala Gly Thr Glu Leu Pro
370 375 380
Pro Asn Ala
385
<210> 30
<211> 975
<212> DNA
<213> Paracoccus denitrificans
<220>
<221> CDS
<222> (1)..(975)
<223> bioB gene encoding biotin synthase from Paracoccus denitrificans
PD1222
<400> 30
atg att cgt acc gat tgg aca atg gca gaa gca tgg gca att cat gca 48
Met Ile Arg Thr Asp Trp Thr Met Ala Glu Ala Trp Ala Ile His Ala
1 5 10 15
ctg ccg ttt gca gat ctg atg cat cgc gca cag acc ctg cat cgt gca 96
Leu Pro Phe Ala Asp Leu Met His Arg Ala Gln Thr Leu His Arg Ala
20 25 30
cat ttt gat ccg aat gca att gaa acc gca agc ctg ctg agc att aaa 144
His Phe Asp Pro Asn Ala Ile Glu Thr Ala Ser Leu Leu Ser Ile Lys
35 40 45
acc ggt ggt tgt ccg gaa gat tgt ggt tat tgt agc cag agc gca cat 192
Thr Gly Gly Cys Pro Glu Asp Cys Gly Tyr Cys Ser Gln Ser Ala His
50 55 60
cat gat acc ggt gtt aaa gca acc aaa ctg atg ggt acc gaa gaa gtt 240
His Asp Thr Gly Val Lys Ala Thr Lys Leu Met Gly Thr Glu Glu Val
65 70 75 80
ctg gca gca gca cgt cgt gca aaa gca agc ggt gca cag cgt ttt tgt 288
Leu Ala Ala Ala Arg Arg Ala Lys Ala Ser Gly Ala Gln Arg Phe Cys
85 90 95
atg ggt gca gca tgg cgt agc ccg aaa gat cgt gat atg gat aaa ctg 336
Met Gly Ala Ala Trp Arg Ser Pro Lys Asp Arg Asp Met Asp Lys Leu
100 105 110
tgt gat atg gtt cgt ggt gtt gca gaa ctg ggt ctg gaa acc tgt atg 384
Cys Asp Met Val Arg Gly Val Ala Glu Leu Gly Leu Glu Thr Cys Met
115 120 125
acc ctg ggt atg ctg agc ccg gaa cag gtt gca cgt ctg aaa gca gca 432
Thr Leu Gly Met Leu Ser Pro Glu Gln Val Ala Arg Leu Lys Ala Ala
130 135 140
ggt ctg gat ttt tat aat cat aat att gat acc agc ccg gaa tat tat 480
Gly Leu Asp Phe Tyr Asn His Asn Ile Asp Thr Ser Pro Glu Tyr Tyr
145 150 155 160
gcc cag att gcc agc acc cgt aca atg gaa aat cgt ctg gat acc gtt 528
Ala Gln Ile Ala Ser Thr Arg Thr Met Glu Asn Arg Leu Asp Thr Val
165 170 175
gaa cag gtt cgt aaa ggt ggt att aaa gtt tgt tgt ggt ggt att ctg 576
Glu Gln Val Arg Lys Gly Gly Ile Lys Val Cys Cys Gly Gly Ile Leu
180 185 190
ggt atg ggt gaa gca gaa gaa gat cgt att gca atg ctg gtt acc ctg 624
Gly Met Gly Glu Ala Glu Glu Asp Arg Ile Ala Met Leu Val Thr Leu
195 200 205
gca acc ctg ccg gca cat ccg gat agc gtt ccg gtt aat ctg tgg aat 672
Ala Thr Leu Pro Ala His Pro Asp Ser Val Pro Val Asn Leu Trp Asn
210 215 220
gaa att gaa ggt gtt ccg gtt cag gca cgt gca cag gca gtt gat ccg 720
Glu Ile Glu Gly Val Pro Val Gln Ala Arg Ala Gln Ala Val Asp Pro
225 230 235 240
ttt gcc ctg gtt cgt att gtt gca ctg gca cgt att ctg atg ccg gca 768
Phe Ala Leu Val Arg Ile Val Ala Leu Ala Arg Ile Leu Met Pro Ala
245 250 255
agc gtt gtt cgt ctg agc gca ggt cgt acc ggt atg agc gat gaa ctg 816
Ser Val Val Arg Leu Ser Ala Gly Arg Thr Gly Met Ser Asp Glu Leu
260 265 270
cag gca ctg tgt ttt ctg gcg ggt gca aat agc att ttt gtt ggt gat 864
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Val Gly Asp
275 280 285
cag ctg ctg acc acc ggt aat ccg gca gca tgg aaa gat cag gat ctg 912
Gln Leu Leu Thr Thr Gly Asn Pro Ala Ala Trp Lys Asp Gln Asp Leu
290 295 300
ctg agt cgt ctg ggt atg cat att gca ccg gca cag gcc cgt ccg cgt 960
Leu Ser Arg Leu Gly Met His Ile Ala Pro Ala Gln Ala Arg Pro Arg
305 310 315 320
gtt gca gca gaa taa 975
Val Ala Ala Glu
<210> 31
<211> 324
<212> PRT
<213> Paracoccus denitrificans
<400> 31
Met Ile Arg Thr Asp Trp Thr Met Ala Glu Ala Trp Ala Ile His Ala
1 5 10 15
Leu Pro Phe Ala Asp Leu Met His Arg Ala Gln Thr Leu His Arg Ala
20 25 30
His Phe Asp Pro Asn Ala Ile Glu Thr Ala Ser Leu Leu Ser Ile Lys
35 40 45
Thr Gly Gly Cys Pro Glu Asp Cys Gly Tyr Cys Ser Gln Ser Ala His
50 55 60
His Asp Thr Gly Val Lys Ala Thr Lys Leu Met Gly Thr Glu Glu Val
65 70 75 80
Leu Ala Ala Ala Arg Arg Ala Lys Ala Ser Gly Ala Gln Arg Phe Cys
85 90 95
Met Gly Ala Ala Trp Arg Ser Pro Lys Asp Arg Asp Met Asp Lys Leu
100 105 110
Cys Asp Met Val Arg Gly Val Ala Glu Leu Gly Leu Glu Thr Cys Met
115 120 125
Thr Leu Gly Met Leu Ser Pro Glu Gln Val Ala Arg Leu Lys Ala Ala
130 135 140
Gly Leu Asp Phe Tyr Asn His Asn Ile Asp Thr Ser Pro Glu Tyr Tyr
145 150 155 160
Ala Gln Ile Ala Ser Thr Arg Thr Met Glu Asn Arg Leu Asp Thr Val
165 170 175
Glu Gln Val Arg Lys Gly Gly Ile Lys Val Cys Cys Gly Gly Ile Leu
180 185 190
Gly Met Gly Glu Ala Glu Glu Asp Arg Ile Ala Met Leu Val Thr Leu
195 200 205
Ala Thr Leu Pro Ala His Pro Asp Ser Val Pro Val Asn Leu Trp Asn
210 215 220
Glu Ile Glu Gly Val Pro Val Gln Ala Arg Ala Gln Ala Val Asp Pro
225 230 235 240
Phe Ala Leu Val Arg Ile Val Ala Leu Ala Arg Ile Leu Met Pro Ala
245 250 255
Ser Val Val Arg Leu Ser Ala Gly Arg Thr Gly Met Ser Asp Glu Leu
260 265 270
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Val Gly Asp
275 280 285
Gln Leu Leu Thr Thr Gly Asn Pro Ala Ala Trp Lys Asp Gln Asp Leu
290 295 300
Leu Ser Arg Leu Gly Met His Ile Ala Pro Ala Gln Ala Arg Pro Arg
305 310 315 320
Val Ala Ala Glu
<210> 32
<211> 963
<212> DNA
<213> Paracoccus denitrificans
<220>
<221> CDS
<222> (1)..(963)
<223> bioB gene encoding biotin synthase from Paracoccus denitrificans
PD1222(2)
<400> 32
atg ccg cag cag agc cgt agc gca gca gaa att tat cat cag ccg ctg 48
Met Pro Gln Gln Ser Arg Ser Ala Ala Glu Ile Tyr His Gln Pro Leu
1 5 10 15
atg gat ctg ctg ttt cag gca cag acc gtt cat cgt gca cat ttt gat 96
Met Asp Leu Leu Phe Gln Ala Gln Thr Val His Arg Ala His Phe Asp
20 25 30
ccg aat gtt gtt cag tgt agc aaa ctg ctg agc att aaa acc ggt ggt 144
Pro Asn Val Val Gln Cys Ser Lys Leu Leu Ser Ile Lys Thr Gly Gly
35 40 45
tgt ccg gaa gat tgt gca tat tgt agc cag agc gca cgt aat ggt agc 192
Cys Pro Glu Asp Cys Ala Tyr Cys Ser Gln Ser Ala Arg Asn Gly Ser
50 55 60
gaa ctg agc gca agc aaa ctg atg gaa gtt cag cgt gtt ctg gca gaa 240
Glu Leu Ser Ala Ser Lys Leu Met Glu Val Gln Arg Val Leu Ala Glu
65 70 75 80
gca cgt cgt gca aaa gaa gcc ggt gca acc cgt tat tgt atg ggt gca 288
Ala Arg Arg Ala Lys Glu Ala Gly Ala Thr Arg Tyr Cys Met Gly Ala
85 90 95
gca tgg cgt agc ccg aaa gaa cgt gat atg ccg gca gtt ctg gca atg 336
Ala Trp Arg Ser Pro Lys Glu Arg Asp Met Pro Ala Val Leu Ala Met
100 105 110
att cgt ggt gtt aaa gca atg ggt atg gaa acc tgt atg acc ctg ggt 384
Ile Arg Gly Val Lys Ala Met Gly Met Glu Thr Cys Met Thr Leu Gly
115 120 125
atg ctg gat gca gat cag gca ctg cgt ctg aaa gat gcc ggt ctg gat 432
Met Leu Asp Ala Asp Gln Ala Leu Arg Leu Lys Asp Ala Gly Leu Asp
130 135 140
tat tat aac cat aat att gat acc agc gaa cgc tat tat agc gaa att 480
Tyr Tyr Asn His Asn Ile Asp Thr Ser Glu Arg Tyr Tyr Ser Glu Ile
145 150 155 160
att acc acc cgc acc ttt cag gat cgc att gaa acc ctg gaa cgt gtt 528
Ile Thr Thr Arg Thr Phe Gln Asp Arg Ile Glu Thr Leu Glu Arg Val
165 170 175
cag gca gca ggt att aat gtt tgt gcc ggt ggt att gtt ggt atg ggt 576
Gln Ala Ala Gly Ile Asn Val Cys Ala Gly Gly Ile Val Gly Met Gly
180 185 190
gaa acc gca gaa gat cgt att agc atg ctg gaa acc ctg gca ggt ctg 624
Glu Thr Ala Glu Asp Arg Ile Ser Met Leu Glu Thr Leu Ala Gly Leu
195 200 205
gaa gtg ccg ccg cag agc gtt ccg att aat atg ctg atg ccg atg gca 672
Glu Val Pro Pro Gln Ser Val Pro Ile Asn Met Leu Met Pro Met Ala
210 215 220
ggt acc ccg ctg gca gat gtt ccg cgt ctg gat gca att gaa atg gtt 720
Gly Thr Pro Leu Ala Asp Val Pro Arg Leu Asp Ala Ile Glu Met Val
225 230 235 240
cgt acc att gca acc gca cgt att ctg atg ccg gca agc tat gtt cgt 768
Arg Thr Ile Ala Thr Ala Arg Ile Leu Met Pro Ala Ser Tyr Val Arg
245 250 255
ctg agc gca ggt cgt agc gaa atg agc gat gaa atg cag gca atg tgt 816
Leu Ser Ala Gly Arg Ser Glu Met Ser Asp Glu Met Gln Ala Met Cys
260 265 270
ttt ttt gca ggc gca aat agc att ttt gtt ggt gat acc ctg ctg acc 864
Phe Phe Ala Gly Ala Asn Ser Ile Phe Val Gly Asp Thr Leu Leu Thr
275 280 285
gca ggt aat ccg gat gaa gat aaa gat gca ctg ctg ttt gca aaa ctg 912
Ala Gly Asn Pro Asp Glu Asp Lys Asp Ala Leu Leu Phe Ala Lys Leu
290 295 300
ggt ctg cgt gca gaa gtt ccg gaa gca agc cag gaa ggt tgt gca gca 960
Gly Leu Arg Ala Glu Val Pro Glu Ala Ser Gln Glu Gly Cys Ala Ala
305 310 315 320
taa 963
<210> 33
<211> 320
<212> PRT
<213> Paracoccus denitrificans
<400> 33
Met Pro Gln Gln Ser Arg Ser Ala Ala Glu Ile Tyr His Gln Pro Leu
1 5 10 15
Met Asp Leu Leu Phe Gln Ala Gln Thr Val His Arg Ala His Phe Asp
20 25 30
Pro Asn Val Val Gln Cys Ser Lys Leu Leu Ser Ile Lys Thr Gly Gly
35 40 45
Cys Pro Glu Asp Cys Ala Tyr Cys Ser Gln Ser Ala Arg Asn Gly Ser
50 55 60
Glu Leu Ser Ala Ser Lys Leu Met Glu Val Gln Arg Val Leu Ala Glu
65 70 75 80
Ala Arg Arg Ala Lys Glu Ala Gly Ala Thr Arg Tyr Cys Met Gly Ala
85 90 95
Ala Trp Arg Ser Pro Lys Glu Arg Asp Met Pro Ala Val Leu Ala Met
100 105 110
Ile Arg Gly Val Lys Ala Met Gly Met Glu Thr Cys Met Thr Leu Gly
115 120 125
Met Leu Asp Ala Asp Gln Ala Leu Arg Leu Lys Asp Ala Gly Leu Asp
130 135 140
Tyr Tyr Asn His Asn Ile Asp Thr Ser Glu Arg Tyr Tyr Ser Glu Ile
145 150 155 160
Ile Thr Thr Arg Thr Phe Gln Asp Arg Ile Glu Thr Leu Glu Arg Val
165 170 175
Gln Ala Ala Gly Ile Asn Val Cys Ala Gly Gly Ile Val Gly Met Gly
180 185 190
Glu Thr Ala Glu Asp Arg Ile Ser Met Leu Glu Thr Leu Ala Gly Leu
195 200 205
Glu Val Pro Pro Gln Ser Val Pro Ile Asn Met Leu Met Pro Met Ala
210 215 220
Gly Thr Pro Leu Ala Asp Val Pro Arg Leu Asp Ala Ile Glu Met Val
225 230 235 240
Arg Thr Ile Ala Thr Ala Arg Ile Leu Met Pro Ala Ser Tyr Val Arg
245 250 255
Leu Ser Ala Gly Arg Ser Glu Met Ser Asp Glu Met Gln Ala Met Cys
260 265 270
Phe Phe Ala Gly Ala Asn Ser Ile Phe Val Gly Asp Thr Leu Leu Thr
275 280 285
Ala Gly Asn Pro Asp Glu Asp Lys Asp Ala Leu Leu Phe Ala Lys Leu
290 295 300
Gly Leu Arg Ala Glu Val Pro Glu Ala Ser Gln Glu Gly Cys Ala Ala
305 310 315 320
<210> 34
<211> 987
<212> DNA
<213> Agrobacterium vitis
<220>
<221> CDS
<222> (1)..(987)
<223> bioB gene encoding biotin synthase from Agrobacterium vitis S4
<400> 34
atg agc gaa gca gcc ggt gaa att cgt aat gat tgg agc gtt gaa gaa 48
Met Ser Glu Ala Ala Gly Glu Ile Arg Asn Asp Trp Ser Val Glu Glu
1 5 10 15
att gtg acc ctg cat aat ctg ccg ctg ctg gaa ctg att ggt cat gca 96
Ile Val Thr Leu His Asn Leu Pro Leu Leu Glu Leu Ile Gly His Ala
20 25 30
aat gca gtt cat ggt cgt cat cat aat ccg aat gtt gtt cag aaa gca 144
Asn Ala Val His Gly Arg His His Asn Pro Asn Val Val Gln Lys Ala
35 40 45
agc ctg ctg agc att aaa acc ggt ggt tgt ccg gaa gat tgt gca tat 192
Ser Leu Leu Ser Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Ala Tyr
50 55 60
tgt ccg cag agc gca cat cat cgt gaa gtt aaa ctg acc aaa gat cgt 240
Cys Pro Gln Ser Ala His His Arg Glu Val Lys Leu Thr Lys Asp Arg
65 70 75 80
ctg atg cag ccg gaa acc gtt ctg gca ctg gca aaa cgt gca aaa gat 288
Leu Met Gln Pro Glu Thr Val Leu Ala Leu Ala Lys Arg Ala Lys Asp
85 90 95
gcc ggt gca gaa cgt ttt tgt atg ggt gca gca tgg cgt cag gtt cgt 336
Ala Gly Ala Glu Arg Phe Cys Met Gly Ala Ala Trp Arg Gln Val Arg
100 105 110
gat ggt aaa gaa ttt gat gca gtt ctg aca atg gtt cgt ggt gtg cgt 384
Asp Gly Lys Glu Phe Asp Ala Val Leu Thr Met Val Arg Gly Val Arg
115 120 125
gat ctg ggt atg gaa gca tgt gtt acc ctg ggt atg ctg gaa aaa cat 432
Asp Leu Gly Met Glu Ala Cys Val Thr Leu Gly Met Leu Glu Lys His
130 135 140
cag gcc gaa aaa ctg gca gaa gca ggt ctg acc gca tat aat cat aat 480
Gln Ala Glu Lys Leu Ala Glu Ala Gly Leu Thr Ala Tyr Asn His Asn
145 150 155 160
ctg gat acc agc ccg gaa ttt tat ggc gaa att att acc acc cgt agc 528
Leu Asp Thr Ser Pro Glu Phe Tyr Gly Glu Ile Ile Thr Thr Arg Ser
165 170 175
tat gca gat cgt ctg gaa acc ctg agc att gtt cgt agc ttt ggt att 576
Tyr Ala Asp Arg Leu Glu Thr Leu Ser Ile Val Arg Ser Phe Gly Ile
180 185 190
gat ctg tgt tgt ggt ggt att att ggt atg ggt gaa acc att cgt gat 624
Asp Leu Cys Cys Gly Gly Ile Ile Gly Met Gly Glu Thr Ile Arg Asp
195 200 205
cgt gca agt atg ctg cag gtt ctg gca agc atg cgt ccg cat ccg gaa 672
Arg Ala Ser Met Leu Gln Val Leu Ala Ser Met Arg Pro His Pro Glu
210 215 220
agt gtt ccg att aat gca ctg gtt ccg gtt gaa ggt acc ccg ctg gca 720
Ser Val Pro Ile Asn Ala Leu Val Pro Val Glu Gly Thr Pro Leu Ala
225 230 235 240
gca atg ccg cgt att gat ccg ctg gaa ctg gtt cgt atg gtt gca acc 768
Ala Met Pro Arg Ile Asp Pro Leu Glu Leu Val Arg Met Val Ala Thr
245 250 255
gca cgt att gtt atg ccg aaa agc acc gtt cgt ctg agc gca ggt cgt 816
Ala Arg Ile Val Met Pro Lys Ser Thr Val Arg Leu Ser Ala Gly Arg
260 265 270
agc acc ctg aat cgt gaa gca cag att ctg tgt ctg gtt agc ggt gca 864
Ser Thr Leu Asn Arg Glu Ala Gln Ile Leu Cys Leu Val Ser Gly Ala
275 280 285
aat agc gtt ttt tat ggt gat acc ctg ctg acc acc ccg aat gca ggt 912
Asn Ser Val Phe Tyr Gly Asp Thr Leu Leu Thr Thr Pro Asn Ala Gly
290 295 300
att ggt gaa gat gaa gca ctg ttt gca gca att ggt gca ctg ccg cat 960
Ile Gly Glu Asp Glu Ala Leu Phe Ala Ala Ile Gly Ala Leu Pro His
305 310 315 320
gaa gca gca ccg ctg gcc gca gaa taa 987
Glu Ala Ala Pro Leu Ala Ala Glu
325
<210> 35
<211> 328
<212> PRT
<213> Agrobacterium vitis
<400> 35
Met Ser Glu Ala Ala Gly Glu Ile Arg Asn Asp Trp Ser Val Glu Glu
1 5 10 15
Ile Val Thr Leu His Asn Leu Pro Leu Leu Glu Leu Ile Gly His Ala
20 25 30
Asn Ala Val His Gly Arg His His Asn Pro Asn Val Val Gln Lys Ala
35 40 45
Ser Leu Leu Ser Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Ala Tyr
50 55 60
Cys Pro Gln Ser Ala His His Arg Glu Val Lys Leu Thr Lys Asp Arg
65 70 75 80
Leu Met Gln Pro Glu Thr Val Leu Ala Leu Ala Lys Arg Ala Lys Asp
85 90 95
Ala Gly Ala Glu Arg Phe Cys Met Gly Ala Ala Trp Arg Gln Val Arg
100 105 110
Asp Gly Lys Glu Phe Asp Ala Val Leu Thr Met Val Arg Gly Val Arg
115 120 125
Asp Leu Gly Met Glu Ala Cys Val Thr Leu Gly Met Leu Glu Lys His
130 135 140
Gln Ala Glu Lys Leu Ala Glu Ala Gly Leu Thr Ala Tyr Asn His Asn
145 150 155 160
Leu Asp Thr Ser Pro Glu Phe Tyr Gly Glu Ile Ile Thr Thr Arg Ser
165 170 175
Tyr Ala Asp Arg Leu Glu Thr Leu Ser Ile Val Arg Ser Phe Gly Ile
180 185 190
Asp Leu Cys Cys Gly Gly Ile Ile Gly Met Gly Glu Thr Ile Arg Asp
195 200 205
Arg Ala Ser Met Leu Gln Val Leu Ala Ser Met Arg Pro His Pro Glu
210 215 220
Ser Val Pro Ile Asn Ala Leu Val Pro Val Glu Gly Thr Pro Leu Ala
225 230 235 240
Ala Met Pro Arg Ile Asp Pro Leu Glu Leu Val Arg Met Val Ala Thr
245 250 255
Ala Arg Ile Val Met Pro Lys Ser Thr Val Arg Leu Ser Ala Gly Arg
260 265 270
Ser Thr Leu Asn Arg Glu Ala Gln Ile Leu Cys Leu Val Ser Gly Ala
275 280 285
Asn Ser Val Phe Tyr Gly Asp Thr Leu Leu Thr Thr Pro Asn Ala Gly
290 295 300
Ile Gly Glu Asp Glu Ala Leu Phe Ala Ala Ile Gly Ala Leu Pro His
305 310 315 320
Glu Ala Ala Pro Leu Ala Ala Glu
325
<210> 36
<211> 957
<212> DNA
<213> Ruegeria pomeroyi
<220>
<221> CDS
<222> (1)..(957)
<223> bioB gene encoding biotin synthase from Ruegeria pomeroyi DSS-3
<400> 36
atg gcc gaa gca att cgt agc gat tgg agc gtt gat gaa gtt gaa gca 48
Met Ala Glu Ala Ile Arg Ser Asp Trp Ser Val Asp Glu Val Glu Ala
1 5 10 15
ctg ctg cgt ctg ccg ctg ctg gat ctg gtt ggt cgt gca aat ggt gtt 96
Leu Leu Arg Leu Pro Leu Leu Asp Leu Val Gly Arg Ala Asn Gly Val
20 25 30
cat cgt gca cat cat gca ccg gat gat att cag aaa gca agc ctg ctg 144
His Arg Ala His His Ala Pro Asp Asp Ile Gln Lys Ala Ser Leu Leu
35 40 45
agc att aaa acc ggt ggt tgt ccg gaa gat tgt gca tat tgc ccg cag 192
Ser Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Ala Tyr Cys Pro Gln
50 55 60
agc gca cat cat cgt gaa gtg gaa ctg acc cgt gaa aaa ctg atg aat 240
Ser Ala His His Arg Glu Val Glu Leu Thr Arg Glu Lys Leu Met Asn
65 70 75 80
ccg gat cat gtt gtt agc ctg gca cgt cgt gcc cag cgt gcc ggt gcc 288
Pro Asp His Val Val Ser Leu Ala Arg Arg Ala Gln Arg Ala Gly Ala
85 90 95
gaa cgt ttt tgt atg ggt gca gca tgg cgt cag gtt cgt gat ggt gca 336
Glu Arg Phe Cys Met Gly Ala Ala Trp Arg Gln Val Arg Asp Gly Ala
100 105 110
gaa ttt gat aat gtt ctg gca atg gtt cgt ggt gtt cgt gca ctg ggt 384
Glu Phe Asp Asn Val Leu Ala Met Val Arg Gly Val Arg Ala Leu Gly
115 120 125
atg gaa gca tgt gtt acc ctg ggt atg ctg cgt ccg cat cag gca cag 432
Met Glu Ala Cys Val Thr Leu Gly Met Leu Arg Pro His Gln Ala Gln
130 135 140
cgt ctg gca gaa gca ggt ctg acc gca tat aat cat aat ctg gat acc 480
Arg Leu Ala Glu Ala Gly Leu Thr Ala Tyr Asn His Asn Leu Asp Thr
145 150 155 160
agc ccg gaa ttt tat ggt cag att att ggt acc cgt acc tat cag gat 528
Ser Pro Glu Phe Tyr Gly Gln Ile Ile Gly Thr Arg Thr Tyr Gln Asp
165 170 175
cgt ctg gat acc ctg gca tat tgt cgt gat gca ggt att gaa ctg tgt 576
Arg Leu Asp Thr Leu Ala Tyr Cys Arg Asp Ala Gly Ile Glu Leu Cys
180 185 190
tgt ggt ggt att att ggc atg ggt gaa agc ctg cgt gat cgt gca gca 624
Cys Gly Gly Ile Ile Gly Met Gly Glu Ser Leu Arg Asp Arg Ala Ala
195 200 205
atg ctg cag gtt ctg gcc aat ttt gca ccg cat ccg gaa agc gtt ccg 672
Met Leu Gln Val Leu Ala Asn Phe Ala Pro His Pro Glu Ser Val Pro
210 215 220
att aat gca ctg att ccg att gaa ggt acc ccg ctg gca cat cgt gaa 720
Ile Asn Ala Leu Ile Pro Ile Glu Gly Thr Pro Leu Ala His Arg Glu
225 230 235 240
cgt gtt ggt att ttt gat ctg gtt cgt atg gtt gca acc gca cgt att 768
Arg Val Gly Ile Phe Asp Leu Val Arg Met Val Ala Thr Ala Arg Ile
245 250 255
att atg ccg ctg acc cgt gtt cgt ctg agc gca ggt cgt agt gat ttt 816
Ile Met Pro Leu Thr Arg Val Arg Leu Ser Ala Gly Arg Ser Asp Phe
260 265 270
agc gcc gca gaa cag gca ctg tgt ttt ctg gcg ggt gca aat agc gtt 864
Ser Ala Ala Glu Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Val
275 280 285
ttt tat ggt gat gtt ctg ctg acc gca ccg aat gca ggt acc ggt gca 912
Phe Tyr Gly Asp Val Leu Leu Thr Ala Pro Asn Ala Gly Thr Gly Ala
290 295 300
gat gca gaa ctg ttt gca gca ctg ggt gca ctg gaa acc gca taa 957
Asp Ala Glu Leu Phe Ala Ala Leu Gly Ala Leu Glu Thr Ala
305 310 315
<210> 37
<211> 318
<212> PRT
<213> Ruegeria pomeroyi
<400> 37
Met Ala Glu Ala Ile Arg Ser Asp Trp Ser Val Asp Glu Val Glu Ala
1 5 10 15
Leu Leu Arg Leu Pro Leu Leu Asp Leu Val Gly Arg Ala Asn Gly Val
20 25 30
His Arg Ala His His Ala Pro Asp Asp Ile Gln Lys Ala Ser Leu Leu
35 40 45
Ser Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Ala Tyr Cys Pro Gln
50 55 60
Ser Ala His His Arg Glu Val Glu Leu Thr Arg Glu Lys Leu Met Asn
65 70 75 80
Pro Asp His Val Val Ser Leu Ala Arg Arg Ala Gln Arg Ala Gly Ala
85 90 95
Glu Arg Phe Cys Met Gly Ala Ala Trp Arg Gln Val Arg Asp Gly Ala
100 105 110
Glu Phe Asp Asn Val Leu Ala Met Val Arg Gly Val Arg Ala Leu Gly
115 120 125
Met Glu Ala Cys Val Thr Leu Gly Met Leu Arg Pro His Gln Ala Gln
130 135 140
Arg Leu Ala Glu Ala Gly Leu Thr Ala Tyr Asn His Asn Leu Asp Thr
145 150 155 160
Ser Pro Glu Phe Tyr Gly Gln Ile Ile Gly Thr Arg Thr Tyr Gln Asp
165 170 175
Arg Leu Asp Thr Leu Ala Tyr Cys Arg Asp Ala Gly Ile Glu Leu Cys
180 185 190
Cys Gly Gly Ile Ile Gly Met Gly Glu Ser Leu Arg Asp Arg Ala Ala
195 200 205
Met Leu Gln Val Leu Ala Asn Phe Ala Pro His Pro Glu Ser Val Pro
210 215 220
Ile Asn Ala Leu Ile Pro Ile Glu Gly Thr Pro Leu Ala His Arg Glu
225 230 235 240
Arg Val Gly Ile Phe Asp Leu Val Arg Met Val Ala Thr Ala Arg Ile
245 250 255
Ile Met Pro Leu Thr Arg Val Arg Leu Ser Ala Gly Arg Ser Asp Phe
260 265 270
Ser Ala Ala Glu Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Val
275 280 285
Phe Tyr Gly Asp Val Leu Leu Thr Ala Pro Asn Ala Gly Thr Gly Ala
290 295 300
Asp Ala Glu Leu Phe Ala Ala Leu Gly Ala Leu Glu Thr Ala
305 310 315
<210> 38
<211> 1020
<212> DNA
<213> Agrobacterium fabrum
<220>
<221> CDS
<222> (1)..(1020)
<223> bioB gene encoding biotin synthase from Agrobacterium fabrum str.
C58
<400> 38
atg gat cag ctg gca acc cag att gat ggt aaa ccg gca agc att ccg 48
Met Asp Gln Leu Ala Thr Gln Ile Asp Gly Lys Pro Ala Ser Ile Pro
1 5 10 15
gca gtt gaa acc agc agc agc ctg gaa gaa gcc aaa att att tat aat 96
Ala Val Glu Thr Ser Ser Ser Leu Glu Glu Ala Lys Ile Ile Tyr Asn
20 25 30
ctg ccg ttt aat gat ctg ctg ttt cgc gcc cag cag gtt cat cgt tgt 144
Leu Pro Phe Asn Asp Leu Leu Phe Arg Ala Gln Gln Val His Arg Cys
35 40 45
cat ttt gat gcc aat gca att cag atg agc cgt ctg ctg agc att aaa 192
His Phe Asp Ala Asn Ala Ile Gln Met Ser Arg Leu Leu Ser Ile Lys
50 55 60
acc ggc ggt tgt ccg gaa gat tgt agc tat tgt agc cag agc gca cgt 240
Thr Gly Gly Cys Pro Glu Asp Cys Ser Tyr Cys Ser Gln Ser Ala Arg
65 70 75 80
aat ccg acc ggt ctg aaa gca agc aaa ctg atg gaa gtt gaa cgt gtt 288
Asn Pro Thr Gly Leu Lys Ala Ser Lys Leu Met Glu Val Glu Arg Val
85 90 95
ctg gca gaa gca cgt aaa gca aaa gaa ggt ggt gca acc cgt tat tgt 336
Leu Ala Glu Ala Arg Lys Ala Lys Glu Gly Gly Ala Thr Arg Tyr Cys
100 105 110
atg ggt gca gca tgg cgt aat ccg aaa gaa cgt gat atg gaa gca gtt 384
Met Gly Ala Ala Trp Arg Asn Pro Lys Glu Arg Asp Met Glu Ala Val
115 120 125
gtt gca atg gtt gaa ggt gtt aaa gca ctg gat atg gaa acc tgt atg 432
Val Ala Met Val Glu Gly Val Lys Ala Leu Asp Met Glu Thr Cys Met
130 135 140
acc ctg ggt atg ctg acc ccg gaa cag agc gaa cgt ctg gca gat gca 480
Thr Leu Gly Met Leu Thr Pro Glu Gln Ser Glu Arg Leu Ala Asp Ala
145 150 155 160
ggt ctg gat tat tat aat cat aat gtg gat acc agc gaa cgt ttt tat 528
Gly Leu Asp Tyr Tyr Asn His Asn Val Asp Thr Ser Glu Arg Phe Tyr
165 170 175
agc gaa att att acc acc cgt acc ttt gaa gat cgc ctg gaa acc ctg 576
Ser Glu Ile Ile Thr Thr Arg Thr Phe Glu Asp Arg Leu Glu Thr Leu
180 185 190
gcc aat gtt cgt gat gca ggt att aaa gtt tgt gca ggc ggt att ctg 624
Ala Asn Val Arg Asp Ala Gly Ile Lys Val Cys Ala Gly Gly Ile Leu
195 200 205
ggt atg ggt gaa acc gtg gaa gat cgt att agc atg ctg gtt acc ctg 672
Gly Met Gly Glu Thr Val Glu Asp Arg Ile Ser Met Leu Val Thr Leu
210 215 220
gca aat ctg ccg gtt ccg ccg gaa agc gtt ccg att aat atg ctg att 720
Ala Asn Leu Pro Val Pro Pro Glu Ser Val Pro Ile Asn Met Leu Ile
225 230 235 240
ccg att ccg ggt agc aaa ctg gca aat gca gat ccg gtt gat ccg att 768
Pro Ile Pro Gly Ser Lys Leu Ala Asn Ala Asp Pro Val Asp Pro Ile
245 250 255
gat ttt gtt cgt acc att gca ctg gca cgt att ctg atg ccg cgt agc 816
Asp Phe Val Arg Thr Ile Ala Leu Ala Arg Ile Leu Met Pro Arg Ser
260 265 270
cat gtt cgt ctg agc gca ggt cgt acc gaa atg agc gat gaa acc cag 864
His Val Arg Leu Ser Ala Gly Arg Thr Glu Met Ser Asp Glu Thr Gln
275 280 285
gca ctg tgt ttt ctg gcc ggt gca aat agc att ttt att ggt gaa acc 912
Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Ile Gly Glu Thr
290 295 300
ctg ctg acc gca gat aat ccg ggt gaa gat cat gat acc gca ctg ttt 960
Leu Leu Thr Ala Asp Asn Pro Gly Glu Asp His Asp Thr Ala Leu Phe
305 310 315 320
cgt cgt ctg ggt ctg aaa ccg atg gaa ctg cag agc agc gaa gcc ggt 1008
Arg Arg Leu Gly Leu Lys Pro Met Glu Leu Gln Ser Ser Glu Ala Gly
325 330 335
ggt tgt cgt taa 1020
Gly Cys Arg
<210> 39
<211> 339
<212> PRT
<213> Agrobacterium fabrum
<400> 39
Met Asp Gln Leu Ala Thr Gln Ile Asp Gly Lys Pro Ala Ser Ile Pro
1 5 10 15
Ala Val Glu Thr Ser Ser Ser Leu Glu Glu Ala Lys Ile Ile Tyr Asn
20 25 30
Leu Pro Phe Asn Asp Leu Leu Phe Arg Ala Gln Gln Val His Arg Cys
35 40 45
His Phe Asp Ala Asn Ala Ile Gln Met Ser Arg Leu Leu Ser Ile Lys
50 55 60
Thr Gly Gly Cys Pro Glu Asp Cys Ser Tyr Cys Ser Gln Ser Ala Arg
65 70 75 80
Asn Pro Thr Gly Leu Lys Ala Ser Lys Leu Met Glu Val Glu Arg Val
85 90 95
Leu Ala Glu Ala Arg Lys Ala Lys Glu Gly Gly Ala Thr Arg Tyr Cys
100 105 110
Met Gly Ala Ala Trp Arg Asn Pro Lys Glu Arg Asp Met Glu Ala Val
115 120 125
Val Ala Met Val Glu Gly Val Lys Ala Leu Asp Met Glu Thr Cys Met
130 135 140
Thr Leu Gly Met Leu Thr Pro Glu Gln Ser Glu Arg Leu Ala Asp Ala
145 150 155 160
Gly Leu Asp Tyr Tyr Asn His Asn Val Asp Thr Ser Glu Arg Phe Tyr
165 170 175
Ser Glu Ile Ile Thr Thr Arg Thr Phe Glu Asp Arg Leu Glu Thr Leu
180 185 190
Ala Asn Val Arg Asp Ala Gly Ile Lys Val Cys Ala Gly Gly Ile Leu
195 200 205
Gly Met Gly Glu Thr Val Glu Asp Arg Ile Ser Met Leu Val Thr Leu
210 215 220
Ala Asn Leu Pro Val Pro Pro Glu Ser Val Pro Ile Asn Met Leu Ile
225 230 235 240
Pro Ile Pro Gly Ser Lys Leu Ala Asn Ala Asp Pro Val Asp Pro Ile
245 250 255
Asp Phe Val Arg Thr Ile Ala Leu Ala Arg Ile Leu Met Pro Arg Ser
260 265 270
His Val Arg Leu Ser Ala Gly Arg Thr Glu Met Ser Asp Glu Thr Gln
275 280 285
Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Ile Gly Glu Thr
290 295 300
Leu Leu Thr Ala Asp Asn Pro Gly Glu Asp His Asp Thr Ala Leu Phe
305 310 315 320
Arg Arg Leu Gly Leu Lys Pro Met Glu Leu Gln Ser Ser Glu Ala Gly
325 330 335
Gly Cys Arg
<210> 40
<211> 954
<212> DNA
<213> Wolbachia endosymbiont of Cimex lectularius
<220>
<221> CDS
<222> (1)..(954)
<223> bioB gene encoding biotin synthase from Wolbachia endosymbiont of
Cimex lectularius
<400> 40
atg acc gaa gaa tgg acc ttt gcc aaa gca gat cag att ttt aat ttt 48
Met Thr Glu Glu Trp Thr Phe Ala Lys Ala Asp Gln Ile Phe Asn Phe
1 5 10 15
ccg ttt ccg gaa ctg att tat att gca cag acc gaa cat cgc aaa cag 96
Pro Phe Pro Glu Leu Ile Tyr Ile Ala Gln Thr Glu His Arg Lys Gln
20 25 30
ttt aat ccg agc gaa gtg cag att agc acc ctg ctg agc att aaa acc 144
Phe Asn Pro Ser Glu Val Gln Ile Ser Thr Leu Leu Ser Ile Lys Thr
35 40 45
ggt agc tgt ccg gaa aat tgt agc tat tgt ccg cag agc gca cat tat 192
Gly Ser Cys Pro Glu Asn Cys Ser Tyr Cys Pro Gln Ser Ala His Tyr
50 55 60
aat acc ggt ctg cag aaa aaa ccg ctg ctg gaa att gca gaa gtt att 240
Asn Thr Gly Leu Gln Lys Lys Pro Leu Leu Glu Ile Ala Glu Val Ile
65 70 75 80
gaa gca gca aaa tgt gca aaa gaa gca ggt agc acc cgt ttt tgt atg 288
Glu Ala Ala Lys Cys Ala Lys Glu Ala Gly Ser Thr Arg Phe Cys Met
85 90 95
ggt gca gca tgg cgt ggt ccg cgt gat cag gat ctg aaa gtt gtt tgt 336
Gly Ala Ala Trp Arg Gly Pro Arg Asp Gln Asp Leu Lys Val Val Cys
100 105 110
gaa atg att cgt gaa gtt aaa aaa ctg ggt ctg gaa acc tgt gtt acc 384
Glu Met Ile Arg Glu Val Lys Lys Leu Gly Leu Glu Thr Cys Val Thr
115 120 125
ctg ggt ctg ctg aaa gat cat cag gcc aac atg ctg aaa gaa gcc ggt 432
Leu Gly Leu Leu Lys Asp His Gln Ala Asn Met Leu Lys Glu Ala Gly
130 135 140
ctg gat ttt tat aac cat aac atc gat acc agc gaa gaa tat tat aac 480
Leu Asp Phe Tyr Asn His Asn Ile Asp Thr Ser Glu Glu Tyr Tyr Asn
145 150 155 160
aaa gtg atc acc acc cgt acc ttt cag gat cgt ctg gaa acc ctg gaa 528
Lys Val Ile Thr Thr Arg Thr Phe Gln Asp Arg Leu Glu Thr Leu Glu
165 170 175
tgt gtt cgt gca agc ggt att aaa gtt tgt tgt ggt ggt att ctg ggt 576
Cys Val Arg Ala Ser Gly Ile Lys Val Cys Cys Gly Gly Ile Leu Gly
180 185 190
atg ggt gaa acc aat gaa gat cgt att aaa atg ctg gtt ctg ctg gca 624
Met Gly Glu Thr Asn Glu Asp Arg Ile Lys Met Leu Val Leu Leu Ala
195 200 205
aat ctg aat gat ccg ccg gaa agc gtt ccg att aat acc ctg att aaa 672
Asn Leu Asn Asp Pro Pro Glu Ser Val Pro Ile Asn Thr Leu Ile Lys
210 215 220
att ccg ggt acc ccg ctg gaa aat gtt gca gat gtt gat ccg ttt gat 720
Ile Pro Gly Thr Pro Leu Glu Asn Val Ala Asp Val Asp Pro Phe Asp
225 230 235 240
ttt gtt cgt acc att gca att gcc cgt att att atg ccg aaa agc tat 768
Phe Val Arg Thr Ile Ala Ile Ala Arg Ile Ile Met Pro Lys Ser Tyr
245 250 255
att cgt ctg agc gca ggt cgt gaa aaa atg agc gat gaa ctg cag gca 816
Ile Arg Leu Ser Ala Gly Arg Glu Lys Met Ser Asp Glu Leu Gln Ala
260 265 270
ctg tgt ttt ctg gcc ggt gca aat agc att ttt tat ggt gaa aaa ctg 864
Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr Gly Glu Lys Leu
275 280 285
ctg acc gca cag aat ccg att ccg gaa cag gat aat cat ctg ttt cag 912
Leu Thr Ala Gln Asn Pro Ile Pro Glu Gln Asp Asn His Leu Phe Gln
290 295 300
cgt ctg ggc ctg cag aaa ctg gca ctg ctg cgt gaa aat taa 954
Arg Leu Gly Leu Gln Lys Leu Ala Leu Leu Arg Glu Asn
305 310 315
<210> 41
<211> 317
<212> PRT
<213> Wolbachia endosymbiont of Cimex lectularius
<400> 41
Met Thr Glu Glu Trp Thr Phe Ala Lys Ala Asp Gln Ile Phe Asn Phe
1 5 10 15
Pro Phe Pro Glu Leu Ile Tyr Ile Ala Gln Thr Glu His Arg Lys Gln
20 25 30
Phe Asn Pro Ser Glu Val Gln Ile Ser Thr Leu Leu Ser Ile Lys Thr
35 40 45
Gly Ser Cys Pro Glu Asn Cys Ser Tyr Cys Pro Gln Ser Ala His Tyr
50 55 60
Asn Thr Gly Leu Gln Lys Lys Pro Leu Leu Glu Ile Ala Glu Val Ile
65 70 75 80
Glu Ala Ala Lys Cys Ala Lys Glu Ala Gly Ser Thr Arg Phe Cys Met
85 90 95
Gly Ala Ala Trp Arg Gly Pro Arg Asp Gln Asp Leu Lys Val Val Cys
100 105 110
Glu Met Ile Arg Glu Val Lys Lys Leu Gly Leu Glu Thr Cys Val Thr
115 120 125
Leu Gly Leu Leu Lys Asp His Gln Ala Asn Met Leu Lys Glu Ala Gly
130 135 140
Leu Asp Phe Tyr Asn His Asn Ile Asp Thr Ser Glu Glu Tyr Tyr Asn
145 150 155 160
Lys Val Ile Thr Thr Arg Thr Phe Gln Asp Arg Leu Glu Thr Leu Glu
165 170 175
Cys Val Arg Ala Ser Gly Ile Lys Val Cys Cys Gly Gly Ile Leu Gly
180 185 190
Met Gly Glu Thr Asn Glu Asp Arg Ile Lys Met Leu Val Leu Leu Ala
195 200 205
Asn Leu Asn Asp Pro Pro Glu Ser Val Pro Ile Asn Thr Leu Ile Lys
210 215 220
Ile Pro Gly Thr Pro Leu Glu Asn Val Ala Asp Val Asp Pro Phe Asp
225 230 235 240
Phe Val Arg Thr Ile Ala Ile Ala Arg Ile Ile Met Pro Lys Ser Tyr
245 250 255
Ile Arg Leu Ser Ala Gly Arg Glu Lys Met Ser Asp Glu Leu Gln Ala
260 265 270
Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr Gly Glu Lys Leu
275 280 285
Leu Thr Ala Gln Asn Pro Ile Pro Glu Gln Asp Asn His Leu Phe Gln
290 295 300
Arg Leu Gly Leu Gln Lys Leu Ala Leu Leu Arg Glu Asn
305 310 315
<210> 42
<211> 1026
<212> DNA
<213> Sphingomonas paucimobilis
<220>
<221> CDS
<222> (1)..(1026)
<223> bioB gene encoding biotin synthase from Sphingomonas paucimobilis
NBRC 13935
<400> 42
atg acc acc acc ccg gca ctg agc agt gaa gca acc ccg cgt acc gat 48
Met Thr Thr Thr Pro Ala Leu Ser Ser Glu Ala Thr Pro Arg Thr Asp
1 5 10 15
tgg acc cgt gca gaa att gca gca ctg ttt gat ctg ccg ttt acc gaa 96
Trp Thr Arg Ala Glu Ile Ala Ala Leu Phe Asp Leu Pro Phe Thr Glu
20 25 30
ctg ctg ttt cgt gcc gca gaa gtt cat cgt gca cat cat gca gca gat 144
Leu Leu Phe Arg Ala Ala Glu Val His Arg Ala His His Ala Ala Asp
35 40 45
cag gtt cag ctg agc acc ctg ctg agc att aaa acc ggt ggt tgt ccg 192
Gln Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr Gly Gly Cys Pro
50 55 60
gaa gat tgt ggt tat tgt agc cag agc acc cat gca gat acc ggt ctg 240
Glu Asp Cys Gly Tyr Cys Ser Gln Ser Thr His Ala Asp Thr Gly Leu
65 70 75 80
aaa gca acc aaa ctg atg gat ccg cgt gca gtt ctg cag gca gca gca 288
Lys Ala Thr Lys Leu Met Asp Pro Arg Ala Val Leu Gln Ala Ala Ala
85 90 95
cag gca aaa gat cat ggt agc acc cgt ttt tgt atg ggt gca gca tgg 336
Gln Ala Lys Asp His Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp
100 105 110
cgt aat ccg aaa gat cgt gat atg ccg gca att gtt gaa atg gtg aaa 384
Arg Asn Pro Lys Asp Arg Asp Met Pro Ala Ile Val Glu Met Val Lys
115 120 125
ggt gtt cgt gca atg ggt atg gaa acc tgt atg acc ctg ggt atg ctg 432
Gly Val Arg Ala Met Gly Met Glu Thr Cys Met Thr Leu Gly Met Leu
130 135 140
acc gat gca cag gca cag acc ctg gca gaa gca ggt ctg gat tat tat 480
Thr Asp Ala Gln Ala Gln Thr Leu Ala Glu Ala Gly Leu Asp Tyr Tyr
145 150 155 160
aat cat aat att gat acc agc ccg gaa cgt tat ggt gat gtg att acc 528
Asn His Asn Ile Asp Thr Ser Pro Glu Arg Tyr Gly Asp Val Ile Thr
165 170 175
acc cgt agc ttt ggt gaa cgt ctg gaa acc ctg gaa cat gtt cgt gat 576
Thr Arg Ser Phe Gly Glu Arg Leu Glu Thr Leu Glu His Val Arg Asp
180 185 190
gca ggt att aat gtt tgt tgt ggt ggt att gtt ggt atg ggt gaa acc 624
Ala Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met Gly Glu Thr
195 200 205
cgt ggt gat cgt gtt ggt ttt att cat gca ctg gca acc ctg ccg gtt 672
Arg Gly Asp Arg Val Gly Phe Ile His Ala Leu Ala Thr Leu Pro Val
210 215 220
cat ccg ggt agt gtg ccg gtt aat gca ctg gtt ccg gtt aaa ggt acc 720
His Pro Gly Ser Val Pro Val Asn Ala Leu Val Pro Val Lys Gly Thr
225 230 235 240
gtt ctg ggt gat atg ctg gca gat acc ccg ctg gca aaa att gat gat 768
Val Leu Gly Asp Met Leu Ala Asp Thr Pro Leu Ala Lys Ile Asp Asp
245 250 255
att gaa ttt gtt cgt acc gtt gca gtt gca cgt att acc atg ccg cat 816
Ile Glu Phe Val Arg Thr Val Ala Val Ala Arg Ile Thr Met Pro His
260 265 270
agc atg gtt cgt ctg agc gca ggt cgt gaa agc atg agc gat gca acc 864
Ser Met Val Arg Leu Ser Ala Gly Arg Glu Ser Met Ser Asp Ala Thr
275 280 285
cag gca ctg tgt ttt ctg gcc ggt gca aat agc att ttt acc ggt gat 912
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Thr Gly Asp
290 295 300
aaa ctg ctg acc gca ggt aat gca ggc gat gat aaa gat gca gcc ctg 960
Lys Leu Leu Thr Ala Gly Asn Ala Gly Asp Asp Lys Asp Ala Ala Leu
305 310 315 320
ttt gca cgt ctg ggt ctg acc ccg atg gca gca gaa tgt aaa gtt gaa 1008
Phe Ala Arg Leu Gly Leu Thr Pro Met Ala Ala Glu Cys Lys Val Glu
325 330 335
ctg gaa gca gca gaa taa 1026
Leu Glu Ala Ala Glu
340
<210> 43
<211> 341
<212> PRT
<213> Sphingomonas paucimobilis
<400> 43
Met Thr Thr Thr Pro Ala Leu Ser Ser Glu Ala Thr Pro Arg Thr Asp
1 5 10 15
Trp Thr Arg Ala Glu Ile Ala Ala Leu Phe Asp Leu Pro Phe Thr Glu
20 25 30
Leu Leu Phe Arg Ala Ala Glu Val His Arg Ala His His Ala Ala Asp
35 40 45
Gln Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr Gly Gly Cys Pro
50 55 60
Glu Asp Cys Gly Tyr Cys Ser Gln Ser Thr His Ala Asp Thr Gly Leu
65 70 75 80
Lys Ala Thr Lys Leu Met Asp Pro Arg Ala Val Leu Gln Ala Ala Ala
85 90 95
Gln Ala Lys Asp His Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp
100 105 110
Arg Asn Pro Lys Asp Arg Asp Met Pro Ala Ile Val Glu Met Val Lys
115 120 125
Gly Val Arg Ala Met Gly Met Glu Thr Cys Met Thr Leu Gly Met Leu
130 135 140
Thr Asp Ala Gln Ala Gln Thr Leu Ala Glu Ala Gly Leu Asp Tyr Tyr
145 150 155 160
Asn His Asn Ile Asp Thr Ser Pro Glu Arg Tyr Gly Asp Val Ile Thr
165 170 175
Thr Arg Ser Phe Gly Glu Arg Leu Glu Thr Leu Glu His Val Arg Asp
180 185 190
Ala Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met Gly Glu Thr
195 200 205
Arg Gly Asp Arg Val Gly Phe Ile His Ala Leu Ala Thr Leu Pro Val
210 215 220
His Pro Gly Ser Val Pro Val Asn Ala Leu Val Pro Val Lys Gly Thr
225 230 235 240
Val Leu Gly Asp Met Leu Ala Asp Thr Pro Leu Ala Lys Ile Asp Asp
245 250 255
Ile Glu Phe Val Arg Thr Val Ala Val Ala Arg Ile Thr Met Pro His
260 265 270
Ser Met Val Arg Leu Ser Ala Gly Arg Glu Ser Met Ser Asp Ala Thr
275 280 285
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Thr Gly Asp
290 295 300
Lys Leu Leu Thr Ala Gly Asn Ala Gly Asp Asp Lys Asp Ala Ala Leu
305 310 315 320
Phe Ala Arg Leu Gly Leu Thr Pro Met Ala Ala Glu Cys Lys Val Glu
325 330 335
Leu Glu Ala Ala Glu
340
<210> 44
<211> 951
<212> DNA
<213> Acidithiobacillus ferrivorans
<220>
<221> CDS
<222> (1)..(951)
<223> bioB gene encoding biotin synthase from Acidithiobacillus
ferrivorans SS3
<400> 44
atg aat acc acc gca ccg ccg cag acc ctg gat gca att ctg gaa att 48
Met Asn Thr Thr Ala Pro Pro Gln Thr Leu Asp Ala Ile Leu Glu Ile
1 5 10 15
tat gcc agc ccg ttt aat gat ctg att ttt gaa gca cag aaa gtg cat 96
Tyr Ala Ser Pro Phe Asn Asp Leu Ile Phe Glu Ala Gln Lys Val His
20 25 30
cgc ctg cat ttt gat ccg aat gcc att cag tgt agc acc ctg ctg agc 144
Arg Leu His Phe Asp Pro Asn Ala Ile Gln Cys Ser Thr Leu Leu Ser
35 40 45
att aaa acc ggt ggt tgt ccg gaa gat tgt ggt tat tgt agc cag agc 192
Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Gly Tyr Cys Ser Gln Ser
50 55 60
gca cat cat cag acc gca ctg aaa gca gaa gca ctg atg gat ctg gaa 240
Ala His His Gln Thr Ala Leu Lys Ala Glu Ala Leu Met Asp Leu Glu
65 70 75 80
cag gtt cgt gca gca gca cag gaa gca aaa gca aat ggt gca cag cgt 288
Gln Val Arg Ala Ala Ala Gln Glu Ala Lys Ala Asn Gly Ala Gln Arg
85 90 95
ctg tgt atg ggt gca gca tgg cgt agc ccg cat gat aaa gat att gaa 336
Leu Cys Met Gly Ala Ala Trp Arg Ser Pro His Asp Lys Asp Ile Glu
100 105 110
aaa gtt gca gca atg att ggt gtt gtt aaa gaa tat ggt ctg gaa agc 384
Lys Val Ala Ala Met Ile Gly Val Val Lys Glu Tyr Gly Leu Glu Ser
115 120 125
tgt gtt acc ctg ggt atg ctg aaa ccg ggt cag gca gaa cgt ctg cag 432
Cys Val Thr Leu Gly Met Leu Lys Pro Gly Gln Ala Glu Arg Leu Gln
130 135 140
aat gcg ggt ctg gat tat tat aat cat aat ctg gat acc agc ccg gaa 480
Asn Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu
145 150 155 160
ttt tat ggt gaa gtt att cat acc cgt agc tat cag gat cgt ctg gat 528
Phe Tyr Gly Glu Val Ile His Thr Arg Ser Tyr Gln Asp Arg Leu Asp
165 170 175
acc ctg gaa acc gtt cgt agc gca ggt att aaa att tgt agc ggt ggc 576
Thr Leu Glu Thr Val Arg Ser Ala Gly Ile Lys Ile Cys Ser Gly Gly
180 185 190
att ctg ggt atg ggt gaa agc cgt cgt gat cgt gca cgt atg ctg cag 624
Ile Leu Gly Met Gly Glu Ser Arg Arg Asp Arg Ala Arg Met Leu Gln
195 200 205
att ctg gca cat ctg ccg cag gca ccg gaa agc att ccg att aat gca 672
Ile Leu Ala His Leu Pro Gln Ala Pro Glu Ser Ile Pro Ile Asn Ala
210 215 220
ctg gtt ccg gtt ccg ggt acc ccg ctg gaa gca gca gaa ccg att gat 720
Leu Val Pro Val Pro Gly Thr Pro Leu Glu Ala Ala Glu Pro Ile Asp
225 230 235 240
ggt ttt gaa ttt gtt cgt acc att gca gtt gca cgt att ctg ttt ccg 768
Gly Phe Glu Phe Val Arg Thr Ile Ala Val Ala Arg Ile Leu Phe Pro
245 250 255
aaa gca tat gtt cgt ctg agc gca ggt cgt ggt gca atg agc gat gaa 816
Lys Ala Tyr Val Arg Leu Ser Ala Gly Arg Gly Ala Met Ser Asp Glu
260 265 270
ctg cag gca ctg gca ttt ctg gcc ggt gca aat agc att ttt ctg ggt 864
Leu Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn Ser Ile Phe Leu Gly
275 280 285
gat cgt ctg ctg acc acc gat aat gca agc atg ggt cat gat cag agc 912
Asp Arg Leu Leu Thr Thr Asp Asn Ala Ser Met Gly His Asp Gln Ser
290 295 300
ctg ttt agc cgt ctg ggt ctg cat cgt agc gaa gca taa 951
Leu Phe Ser Arg Leu Gly Leu His Arg Ser Glu Ala
305 310 315
<210> 45
<211> 316
<212> PRT
<213> Acidithiobacillus ferrivorans
<400> 45
Met Asn Thr Thr Ala Pro Pro Gln Thr Leu Asp Ala Ile Leu Glu Ile
1 5 10 15
Tyr Ala Ser Pro Phe Asn Asp Leu Ile Phe Glu Ala Gln Lys Val His
20 25 30
Arg Leu His Phe Asp Pro Asn Ala Ile Gln Cys Ser Thr Leu Leu Ser
35 40 45
Ile Lys Thr Gly Gly Cys Pro Glu Asp Cys Gly Tyr Cys Ser Gln Ser
50 55 60
Ala His His Gln Thr Ala Leu Lys Ala Glu Ala Leu Met Asp Leu Glu
65 70 75 80
Gln Val Arg Ala Ala Ala Gln Glu Ala Lys Ala Asn Gly Ala Gln Arg
85 90 95
Leu Cys Met Gly Ala Ala Trp Arg Ser Pro His Asp Lys Asp Ile Glu
100 105 110
Lys Val Ala Ala Met Ile Gly Val Val Lys Glu Tyr Gly Leu Glu Ser
115 120 125
Cys Val Thr Leu Gly Met Leu Lys Pro Gly Gln Ala Glu Arg Leu Gln
130 135 140
Asn Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu
145 150 155 160
Phe Tyr Gly Glu Val Ile His Thr Arg Ser Tyr Gln Asp Arg Leu Asp
165 170 175
Thr Leu Glu Thr Val Arg Ser Ala Gly Ile Lys Ile Cys Ser Gly Gly
180 185 190
Ile Leu Gly Met Gly Glu Ser Arg Arg Asp Arg Ala Arg Met Leu Gln
195 200 205
Ile Leu Ala His Leu Pro Gln Ala Pro Glu Ser Ile Pro Ile Asn Ala
210 215 220
Leu Val Pro Val Pro Gly Thr Pro Leu Glu Ala Ala Glu Pro Ile Asp
225 230 235 240
Gly Phe Glu Phe Val Arg Thr Ile Ala Val Ala Arg Ile Leu Phe Pro
245 250 255
Lys Ala Tyr Val Arg Leu Ser Ala Gly Arg Gly Ala Met Ser Asp Glu
260 265 270
Leu Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn Ser Ile Phe Leu Gly
275 280 285
Asp Arg Leu Leu Thr Thr Asp Asn Ala Ser Met Gly His Asp Gln Ser
290 295 300
Leu Phe Ser Arg Leu Gly Leu His Arg Ser Glu Ala
305 310 315
<210> 46
<211> 996
<212> DNA
<213> Gallionella capsiferriformans
<220>
<221> CDS
<222> (1)..(996)
<223> bioB gene encoding biotin synthase from Gallionella
capsiferriformans ES-2
<400> 46
atg aat acc cag acc att gcc ttt cat cat ccg gtt aaa cgt acc gca 48
Met Asn Thr Gln Thr Ile Ala Phe His His Pro Val Lys Arg Thr Ala
1 5 10 15
acc ccg gaa cgt tgg agc gtt gaa gca gtt gaa agc ctg ttt gca ctg 96
Thr Pro Glu Arg Trp Ser Val Glu Ala Val Glu Ser Leu Phe Ala Leu
20 25 30
ccg ttt gcc gat ctg ctg tat cgt gca cag cag gtt cat cgt gaa cat 144
Pro Phe Ala Asp Leu Leu Tyr Arg Ala Gln Gln Val His Arg Glu His
35 40 45
ttt gat ccg aat cag gtt cag ctg agc acc ctg ctg agc att aaa acc 192
Phe Asp Pro Asn Gln Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr
50 55 60
ggt ggt tgt agc gaa gat tgt ggt tat tgt ccg cag agc gca ttt cat 240
Gly Gly Cys Ser Glu Asp Cys Gly Tyr Cys Pro Gln Ser Ala Phe His
65 70 75 80
agc acc ggt gtt gaa gat cgt aaa atg ctg gca ctg gat gca gtt att 288
Ser Thr Gly Val Glu Asp Arg Lys Met Leu Ala Leu Asp Ala Val Ile
85 90 95
gaa gca gca aaa gca gca cag gca gca ggc gca gat cgt ttt tgt atg 336
Glu Ala Ala Lys Ala Ala Gln Ala Ala Gly Ala Asp Arg Phe Cys Met
100 105 110
ggt gca gca tgg cgt gaa ccg agc gaa gca gat atg ctg agc gtt gtt 384
Gly Ala Ala Trp Arg Glu Pro Ser Glu Ala Asp Met Leu Ser Val Val
115 120 125
gat atg gtt cag gca gtt cgt ggc ctg ggt atg gaa acc tgt gca acc 432
Asp Met Val Gln Ala Val Arg Gly Leu Gly Met Glu Thr Cys Ala Thr
130 135 140
ctg ggt atg ctg aat gat gca cag acc gaa cag ctg cgt gca gcc ggt 480
Leu Gly Met Leu Asn Asp Ala Gln Thr Glu Gln Leu Arg Ala Ala Gly
145 150 155 160
ctg gat tat tat aat cat aat ctg gat acc agc ccg gaa ttt tat ggc 528
Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe Tyr Gly
165 170 175
gat att att agc acc cgt gat tat cag gat cgc ctg gat acc ctg gaa 576
Asp Ile Ile Ser Thr Arg Asp Tyr Gln Asp Arg Leu Asp Thr Leu Glu
180 185 190
cgc gtt cgt cgt gca ggt atg cat gtt tgt agc ggt ggt att gtt ggt 624
Arg Val Arg Arg Ala Gly Met His Val Cys Ser Gly Gly Ile Val Gly
195 200 205
atg ggt gaa agt ctg acc gaa cgt gca ggt ctg gtt gcc cag ctg gca 672
Met Gly Glu Ser Leu Thr Glu Arg Ala Gly Leu Val Ala Gln Leu Ala
210 215 220
aat ctg aat ccg tat ccg gaa agc gtt ccg att aat aat ctg gtt aaa 720
Asn Leu Asn Pro Tyr Pro Glu Ser Val Pro Ile Asn Asn Leu Val Lys
225 230 235 240
gtt gaa ggt acc ccg ctg gca gat gca gca gaa ctg gat ccg ctg gat 768
Val Glu Gly Thr Pro Leu Ala Asp Ala Ala Glu Leu Asp Pro Leu Asp
245 250 255
ttt gtt cgt acc att gca gtt gca cgt att acc atg ccg acc gca cgt 816
Phe Val Arg Thr Ile Ala Val Ala Arg Ile Thr Met Pro Thr Ala Arg
260 265 270
gtt cgt ctg agc gca ggt cgt cag gca atg agc gat gca att cag gca 864
Val Arg Leu Ser Ala Gly Arg Gln Ala Met Ser Asp Ala Ile Gln Ala
275 280 285
ctg tgt ttt ctg gcc ggt gca aat agc att ttt tat ggt gaa cag ctg 912
Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr Gly Glu Gln Leu
290 295 300
ctg acc acc ggt aat ccg gaa gtt gaa cgt gat cgt gca ctg atg gat 960
Leu Thr Thr Gly Asn Pro Glu Val Glu Arg Asp Arg Ala Leu Met Asp
305 310 315 320
aaa ctg ggt atg tat ccg ttt gca gat aaa cat taa 996
Lys Leu Gly Met Tyr Pro Phe Ala Asp Lys His
325 330
<210> 47
<211> 331
<212> PRT
<213> Gallionella capsiferriformans
<400> 47
Met Asn Thr Gln Thr Ile Ala Phe His His Pro Val Lys Arg Thr Ala
1 5 10 15
Thr Pro Glu Arg Trp Ser Val Glu Ala Val Glu Ser Leu Phe Ala Leu
20 25 30
Pro Phe Ala Asp Leu Leu Tyr Arg Ala Gln Gln Val His Arg Glu His
35 40 45
Phe Asp Pro Asn Gln Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr
50 55 60
Gly Gly Cys Ser Glu Asp Cys Gly Tyr Cys Pro Gln Ser Ala Phe His
65 70 75 80
Ser Thr Gly Val Glu Asp Arg Lys Met Leu Ala Leu Asp Ala Val Ile
85 90 95
Glu Ala Ala Lys Ala Ala Gln Ala Ala Gly Ala Asp Arg Phe Cys Met
100 105 110
Gly Ala Ala Trp Arg Glu Pro Ser Glu Ala Asp Met Leu Ser Val Val
115 120 125
Asp Met Val Gln Ala Val Arg Gly Leu Gly Met Glu Thr Cys Ala Thr
130 135 140
Leu Gly Met Leu Asn Asp Ala Gln Thr Glu Gln Leu Arg Ala Ala Gly
145 150 155 160
Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe Tyr Gly
165 170 175
Asp Ile Ile Ser Thr Arg Asp Tyr Gln Asp Arg Leu Asp Thr Leu Glu
180 185 190
Arg Val Arg Arg Ala Gly Met His Val Cys Ser Gly Gly Ile Val Gly
195 200 205
Met Gly Glu Ser Leu Thr Glu Arg Ala Gly Leu Val Ala Gln Leu Ala
210 215 220
Asn Leu Asn Pro Tyr Pro Glu Ser Val Pro Ile Asn Asn Leu Val Lys
225 230 235 240
Val Glu Gly Thr Pro Leu Ala Asp Ala Ala Glu Leu Asp Pro Leu Asp
245 250 255
Phe Val Arg Thr Ile Ala Val Ala Arg Ile Thr Met Pro Thr Ala Arg
260 265 270
Val Arg Leu Ser Ala Gly Arg Gln Ala Met Ser Asp Ala Ile Gln Ala
275 280 285
Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr Gly Glu Gln Leu
290 295 300
Leu Thr Thr Gly Asn Pro Glu Val Glu Arg Asp Arg Ala Leu Met Asp
305 310 315 320
Lys Leu Gly Met Tyr Pro Phe Ala Asp Lys His
325 330
<210> 48
<211> 1029
<212> DNA
<213> Ralstonia eutropha
<220>
<221> CDS
<222> (1)..(1029)
<223> bioB gene encoding biotin synthase from Ralstonia eutropha JMP134
<400> 48
atg aat cag gca gca cag acc gtt gca acc att agc gca gaa gcc ctg 48
Met Asn Gln Ala Ala Gln Thr Val Ala Thr Ile Ser Ala Glu Ala Leu
1 5 10 15
cgt cag acc gca cgt aat acc cat gca ctg ccg gaa gat gca cgt tgg 96
Arg Gln Thr Ala Arg Asn Thr His Ala Leu Pro Glu Asp Ala Arg Trp
20 25 30
cgt gtt gat gat gtt gca gca ctg ttt gcc ctg ccg ttt aat gat ctg 144
Arg Val Asp Asp Val Ala Ala Leu Phe Ala Leu Pro Phe Asn Asp Leu
35 40 45
ctg ttt cgt gca cag cag gtt cat cgt gaa aat ttt gat gca aat acc 192
Leu Phe Arg Ala Gln Gln Val His Arg Glu Asn Phe Asp Ala Asn Thr
50 55 60
gtt cag ctg agc acc ctg ctg agc att aaa acc ggt ggt tgt gaa gaa 240
Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr Gly Gly Cys Glu Glu
65 70 75 80
gat tgt ggt tat tgt ccg cag agc gca cat cat gat gcg ggt gtt aaa 288
Asp Cys Gly Tyr Cys Pro Gln Ser Ala His His Asp Ala Gly Val Lys
85 90 95
gcc gaa aaa ctg atg gaa ctg gat gaa gtt ctg gaa gca gca cgt gca 336
Ala Glu Lys Leu Met Glu Leu Asp Glu Val Leu Glu Ala Ala Arg Ala
100 105 110
gca aaa gca aat ggt gca acc cgt ttt tgt atg ggt gca gca tgg cgt 384
Ala Lys Ala Asn Gly Ala Thr Arg Phe Cys Met Gly Ala Ala Trp Arg
115 120 125
agc ccg aaa gat cgt cat ctg gaa ccg gtt atg gat atg gtt cgt gaa 432
Ser Pro Lys Asp Arg His Leu Glu Pro Val Met Asp Met Val Arg Glu
130 135 140
gtt aaa gca atg ggt ctg gaa acc tgt gtt acc ctg ggt atg ctg aaa 480
Val Lys Ala Met Gly Leu Glu Thr Cys Val Thr Leu Gly Met Leu Lys
145 150 155 160
gca gaa cag gcc cag cag ctg aaa gat gcc ggt ctg gat tat tat aat 528
Ala Glu Gln Ala Gln Gln Leu Lys Asp Ala Gly Leu Asp Tyr Tyr Asn
165 170 175
cat aat ctg gat acc agc ccg gaa ttt tat ggc aaa att att acc acc 576
His Asn Leu Asp Thr Ser Pro Glu Phe Tyr Gly Lys Ile Ile Thr Thr
180 185 190
cgt acc tat cag gat cgc ctg gat acc att ggc cat gtt cgt gat gca 624
Arg Thr Tyr Gln Asp Arg Leu Asp Thr Ile Gly His Val Arg Asp Ala
195 200 205
ggt att aat gtt tgt tgt ggt ggt att gtt ggt atg ggt gaa agc cgt 672
Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met Gly Glu Ser Arg
210 215 220
gaa gcc cgt gca ggt ctg att gca cag ctg gca aat atg gat ccg tat 720
Glu Ala Arg Ala Gly Leu Ile Ala Gln Leu Ala Asn Met Asp Pro Tyr
225 230 235 240
ccg gaa agc gtt ccg att aat aat ctg gtt cag gtt gaa ggt acc ccg 768
Pro Glu Ser Val Pro Ile Asn Asn Leu Val Gln Val Glu Gly Thr Pro
245 250 255
ctg gca ggt acc gaa gca ctg gat ccg ttt gaa ttt gtt cgt acc att 816
Leu Ala Gly Thr Glu Ala Leu Asp Pro Phe Glu Phe Val Arg Thr Ile
260 265 270
gca gtt gca cgt att acc atg ccg ggt gca atg gtt cgt ctg agc gca 864
Ala Val Ala Arg Ile Thr Met Pro Gly Ala Met Val Arg Leu Ser Ala
275 280 285
ggt cgt gaa gca atg gat gaa gca ctg cag gca ctg tgt ttt atg gcc 912
Gly Arg Glu Ala Met Asp Glu Ala Leu Gln Ala Leu Cys Phe Met Ala
290 295 300
ggt gca aat agc att ttt tat ggt gaa aaa ctg ctg acc acc ggt aat 960
Gly Ala Asn Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Gly Asn
305 310 315 320
ccg cag gca gat cgt gat cgt gca ctg ctg gca cgt ctg gat att cgt 1008
Pro Gln Ala Asp Arg Asp Arg Ala Leu Leu Ala Arg Leu Asp Ile Arg
325 330 335
gca gaa ggt tat gca ggt taa 1029
Ala Glu Gly Tyr Ala Gly
340
<210> 49
<211> 342
<212> PRT
<213> Ralstonia eutropha
<400> 49
Met Asn Gln Ala Ala Gln Thr Val Ala Thr Ile Ser Ala Glu Ala Leu
1 5 10 15
Arg Gln Thr Ala Arg Asn Thr His Ala Leu Pro Glu Asp Ala Arg Trp
20 25 30
Arg Val Asp Asp Val Ala Ala Leu Phe Ala Leu Pro Phe Asn Asp Leu
35 40 45
Leu Phe Arg Ala Gln Gln Val His Arg Glu Asn Phe Asp Ala Asn Thr
50 55 60
Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr Gly Gly Cys Glu Glu
65 70 75 80
Asp Cys Gly Tyr Cys Pro Gln Ser Ala His His Asp Ala Gly Val Lys
85 90 95
Ala Glu Lys Leu Met Glu Leu Asp Glu Val Leu Glu Ala Ala Arg Ala
100 105 110
Ala Lys Ala Asn Gly Ala Thr Arg Phe Cys Met Gly Ala Ala Trp Arg
115 120 125
Ser Pro Lys Asp Arg His Leu Glu Pro Val Met Asp Met Val Arg Glu
130 135 140
Val Lys Ala Met Gly Leu Glu Thr Cys Val Thr Leu Gly Met Leu Lys
145 150 155 160
Ala Glu Gln Ala Gln Gln Leu Lys Asp Ala Gly Leu Asp Tyr Tyr Asn
165 170 175
His Asn Leu Asp Thr Ser Pro Glu Phe Tyr Gly Lys Ile Ile Thr Thr
180 185 190
Arg Thr Tyr Gln Asp Arg Leu Asp Thr Ile Gly His Val Arg Asp Ala
195 200 205
Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met Gly Glu Ser Arg
210 215 220
Glu Ala Arg Ala Gly Leu Ile Ala Gln Leu Ala Asn Met Asp Pro Tyr
225 230 235 240
Pro Glu Ser Val Pro Ile Asn Asn Leu Val Gln Val Glu Gly Thr Pro
245 250 255
Leu Ala Gly Thr Glu Ala Leu Asp Pro Phe Glu Phe Val Arg Thr Ile
260 265 270
Ala Val Ala Arg Ile Thr Met Pro Gly Ala Met Val Arg Leu Ser Ala
275 280 285
Gly Arg Glu Ala Met Asp Glu Ala Leu Gln Ala Leu Cys Phe Met Ala
290 295 300
Gly Ala Asn Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Gly Asn
305 310 315 320
Pro Gln Ala Asp Arg Asp Arg Ala Leu Leu Ala Arg Leu Asp Ile Arg
325 330 335
Ala Glu Gly Tyr Ala Gly
340
<210> 50
<211> 1008
<212> DNA
<213> Bordetella parapertussis
<220>
<221> CDS
<222> (1)..(1008)
<223> bioB gene encoding biotin synthase from Bordetella parapertussis
12822
<400> 50
atg cat acc gca tat att ccg gtt ccg aca ccg gtt cgt ccg ccg agt 48
Met His Thr Ala Tyr Ile Pro Val Pro Thr Pro Val Arg Pro Pro Ser
1 5 10 15
gca gaa cgt tgg ccg ctg gca gca gtt gca gaa ctg ttt gaa ctg ccg 96
Ala Glu Arg Trp Pro Leu Ala Ala Val Ala Glu Leu Phe Glu Leu Pro
20 25 30
ttt ctg gat ctg ctg cat cgt gca cag cag gtt cat cgt cag cat ttt 144
Phe Leu Asp Leu Leu His Arg Ala Gln Gln Val His Arg Gln His Phe
35 40 45
gat gca aat acc gtt cag ctg agc agc ctg ctg agc att aaa acc ggt 192
Asp Ala Asn Thr Val Gln Leu Ser Ser Leu Leu Ser Ile Lys Thr Gly
50 55 60
ggt tgt ccg gaa gat tgt gca tat tgt ccg cag agc gca cat tat gat 240
Gly Cys Pro Glu Asp Cys Ala Tyr Cys Pro Gln Ser Ala His Tyr Asp
65 70 75 80
acc ggt gtt gat gca gat aaa ctg atg ccg ctg gat gaa gtt gtt cgt 288
Thr Gly Val Asp Ala Asp Lys Leu Met Pro Leu Asp Glu Val Val Arg
85 90 95
gca gcc cgt gca gca cag gca aat ggt gca cag cgt ttt tgt atg ggt 336
Ala Ala Arg Ala Ala Gln Ala Asn Gly Ala Gln Arg Phe Cys Met Gly
100 105 110
gca gca tgg cgt agc ccg aaa ccg cat cat ctg gaa gca gtg gca gaa 384
Ala Ala Trp Arg Ser Pro Lys Pro His His Leu Glu Ala Val Ala Glu
115 120 125
atg att ggt gca gtt aaa gca ctg ggt atg gaa acc tgt gtt acc ctg 432
Met Ile Gly Ala Val Lys Ala Leu Gly Met Glu Thr Cys Val Thr Leu
130 135 140
ggt atg ctg cgt gat ggt cag gca gaa cag ctg aaa gca gca ggc ctg 480
Gly Met Leu Arg Asp Gly Gln Ala Glu Gln Leu Lys Ala Ala Gly Leu
145 150 155 160
gat tat tat aat cat aat ctg gat acc gca ccg gaa ttt tat ggt aaa 528
Asp Tyr Tyr Asn His Asn Leu Asp Thr Ala Pro Glu Phe Tyr Gly Lys
165 170 175
att att agc acc cgt acc tat cag gat cgt ctg gat acc ctg cag cag 576
Ile Ile Ser Thr Arg Thr Tyr Gln Asp Arg Leu Asp Thr Leu Gln Gln
180 185 190
gtt cgt gaa gca ggt att aat gtt tgt tgt ggt ggt att gtt ggt atg 624
Val Arg Glu Ala Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met
195 200 205
ggt gaa agc cgt cgt gat cgt gca ggt ctg gtt gca cag ctg gca aat 672
Gly Glu Ser Arg Arg Asp Arg Ala Gly Leu Val Ala Gln Leu Ala Asn
210 215 220
atg gaa ccg tat ccg gaa agc gtt ccg att aat aat ctg gtt cag gtg 720
Met Glu Pro Tyr Pro Glu Ser Val Pro Ile Asn Asn Leu Val Gln Val
225 230 235 240
gaa ggt acc ccg ctg gcg ggt gca gaa acc ctg gat ccg ttt gaa ttt 768
Glu Gly Thr Pro Leu Ala Gly Ala Glu Thr Leu Asp Pro Phe Glu Phe
245 250 255
att cgt acc att gca gtt gca cgt att acc atg ccg ctg gcc aaa gtt 816
Ile Arg Thr Ile Ala Val Ala Arg Ile Thr Met Pro Leu Ala Lys Val
260 265 270
cgt ctg agc gca ggt cgt gaa acc atg agc gat agc gaa cag gca ctg 864
Arg Leu Ser Ala Gly Arg Glu Thr Met Ser Asp Ser Glu Gln Ala Leu
275 280 285
tgt ttt atg gcc ggt gca aat agc att ttt tat ggt gat gtt ctg ctg 912
Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Asp Val Leu Leu
290 295 300
aca acc ggt aat ccg cag gtt gaa gca gat cgt cgt ctg ctg cag cgt 960
Thr Thr Gly Asn Pro Gln Val Glu Ala Asp Arg Arg Leu Leu Gln Arg
305 310 315 320
ctg ggt atg cgt gca gaa ggt ctg ccg tgt gca gcc ggt cag gca taa 1008
Leu Gly Met Arg Ala Glu Gly Leu Pro Cys Ala Ala Gly Gln Ala
325 330 335
<210> 51
<211> 335
<212> PRT
<213> Bordetella parapertussis
<400> 51
Met His Thr Ala Tyr Ile Pro Val Pro Thr Pro Val Arg Pro Pro Ser
1 5 10 15
Ala Glu Arg Trp Pro Leu Ala Ala Val Ala Glu Leu Phe Glu Leu Pro
20 25 30
Phe Leu Asp Leu Leu His Arg Ala Gln Gln Val His Arg Gln His Phe
35 40 45
Asp Ala Asn Thr Val Gln Leu Ser Ser Leu Leu Ser Ile Lys Thr Gly
50 55 60
Gly Cys Pro Glu Asp Cys Ala Tyr Cys Pro Gln Ser Ala His Tyr Asp
65 70 75 80
Thr Gly Val Asp Ala Asp Lys Leu Met Pro Leu Asp Glu Val Val Arg
85 90 95
Ala Ala Arg Ala Ala Gln Ala Asn Gly Ala Gln Arg Phe Cys Met Gly
100 105 110
Ala Ala Trp Arg Ser Pro Lys Pro His His Leu Glu Ala Val Ala Glu
115 120 125
Met Ile Gly Ala Val Lys Ala Leu Gly Met Glu Thr Cys Val Thr Leu
130 135 140
Gly Met Leu Arg Asp Gly Gln Ala Glu Gln Leu Lys Ala Ala Gly Leu
145 150 155 160
Asp Tyr Tyr Asn His Asn Leu Asp Thr Ala Pro Glu Phe Tyr Gly Lys
165 170 175
Ile Ile Ser Thr Arg Thr Tyr Gln Asp Arg Leu Asp Thr Leu Gln Gln
180 185 190
Val Arg Glu Ala Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met
195 200 205
Gly Glu Ser Arg Arg Asp Arg Ala Gly Leu Val Ala Gln Leu Ala Asn
210 215 220
Met Glu Pro Tyr Pro Glu Ser Val Pro Ile Asn Asn Leu Val Gln Val
225 230 235 240
Glu Gly Thr Pro Leu Ala Gly Ala Glu Thr Leu Asp Pro Phe Glu Phe
245 250 255
Ile Arg Thr Ile Ala Val Ala Arg Ile Thr Met Pro Leu Ala Lys Val
260 265 270
Arg Leu Ser Ala Gly Arg Glu Thr Met Ser Asp Ser Glu Gln Ala Leu
275 280 285
Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Asp Val Leu Leu
290 295 300
Thr Thr Gly Asn Pro Gln Val Glu Ala Asp Arg Arg Leu Leu Gln Arg
305 310 315 320
Leu Gly Met Arg Ala Glu Gly Leu Pro Cys Ala Ala Gly Gln Ala
325 330 335
<210> 52
<211> 1044
<212> DNA
<213> Pusillimonas sp. T7-7
<220>
<221> CDS
<222> (1)..(1044)
<223> bioB gene encoding biotin synthase from Pusillimonas sp. T7-7
<400> 52
atg gca gca atg aaa ccg gca att ccg agc cat acc ccg acc ccg gat 48
Met Ala Ala Met Lys Pro Ala Ile Pro Ser His Thr Pro Thr Pro Asp
1 5 10 15
cat gca ccg cag gca tgg ggt att gca cag att ctg cgt ctg tat gaa 96
His Ala Pro Gln Ala Trp Gly Ile Ala Gln Ile Leu Arg Leu Tyr Glu
20 25 30
ctg ccg ttt ctg gat ctg ctg cat cag gca cag gcc gtt cat cgt gca 144
Leu Pro Phe Leu Asp Leu Leu His Gln Ala Gln Ala Val His Arg Ala
35 40 45
cat cat cag ccg aat acc gtt cag ctg agc agc ctg ctg agc att aaa 192
His His Gln Pro Asn Thr Val Gln Leu Ser Ser Leu Leu Ser Ile Lys
50 55 60
acc ggt gca tgt ccg gaa gat tgt gca tat tgt ccg cag agc gca cgt 240
Thr Gly Ala Cys Pro Glu Asp Cys Ala Tyr Cys Pro Gln Ser Ala Arg
65 70 75 80
cat gat acc ggt ggt aaa cag gaa gca ctg atg ccg gtt gca gaa gtt 288
His Asp Thr Gly Gly Lys Gln Glu Ala Leu Met Pro Val Ala Glu Val
85 90 95
ctg gaa gca gca cgt aaa gca aaa gca aat ggt gca cag cgt ttt tgt 336
Leu Glu Ala Ala Arg Lys Ala Lys Ala Asn Gly Ala Gln Arg Phe Cys
100 105 110
atg ggt gca gca tgg cgt agc ccg acc gca cgt cag ctg gat agc gtt 384
Met Gly Ala Ala Trp Arg Ser Pro Thr Ala Arg Gln Leu Asp Ser Val
115 120 125
gtt gaa atg gtt ggt gca gtt aaa gca ctg ggt ctg gaa acc tgt gtt 432
Val Glu Met Val Gly Ala Val Lys Ala Leu Gly Leu Glu Thr Cys Val
130 135 140
acc ctg ggt atg ctg aaa gaa ggt cag gca gaa cgt ctg cgt gat gcg 480
Thr Leu Gly Met Leu Lys Glu Gly Gln Ala Glu Arg Leu Arg Asp Ala
145 150 155 160
ggt ctg gat tat tat aat cat aat ctg gat acc agc ccg gaa ttt tat 528
Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe Tyr
165 170 175
ggt aat att att acc acc cgt agc tat cag gat cgt ctg gat acc ctg 576
Gly Asn Ile Ile Thr Thr Arg Ser Tyr Gln Asp Arg Leu Asp Thr Leu
180 185 190
gaa cgt gtt cgt aat gcc ggt gtt cat gtt tgt tgt ggt ggt att gtt 624
Glu Arg Val Arg Asn Ala Gly Val His Val Cys Cys Gly Gly Ile Val
195 200 205
ggt ctg ggt gaa agc cgt aaa gaa cgt gca ggt ctg gtt gca cag ctg 672
Gly Leu Gly Glu Ser Arg Lys Glu Arg Ala Gly Leu Val Ala Gln Leu
210 215 220
gca aat ctg agc ccg tat ccg gaa agc gtt ccg gtt aat aat ctg gtt 720
Ala Asn Leu Ser Pro Tyr Pro Glu Ser Val Pro Val Asn Asn Leu Val
225 230 235 240
aaa gtt gca ggt acc ccg ctg gat gcc acc ccg gat att gat ccg ttt 768
Lys Val Ala Gly Thr Pro Leu Asp Ala Thr Pro Asp Ile Asp Pro Phe
245 250 255
gaa ttt gtt cgt acc att gca gtt gca cgt att acc atg ccg cgt gca 816
Glu Phe Val Arg Thr Ile Ala Val Ala Arg Ile Thr Met Pro Arg Ala
260 265 270
gtt gtt cgt ctg agc gca ggt cgt gaa gca atg agc gat gca att cag 864
Val Val Arg Leu Ser Ala Gly Arg Glu Ala Met Ser Asp Ala Ile Gln
275 280 285
gca ctg tgt ttt atg gcc ggt gca aat agc att ttt tat ggt gaa cag 912
Ala Leu Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Glu Gln
290 295 300
ctg ctg aca aca gca aat ccg cag ctg agt cag gat cag caa ctg ttt 960
Leu Leu Thr Thr Ala Asn Pro Gln Leu Ser Gln Asp Gln Gln Leu Phe
305 310 315 320
cag cgt ctg ggt ctg aca gca acc ccg gca gat ccg gca cgt ccg gca 1008
Gln Arg Leu Gly Leu Thr Ala Thr Pro Ala Asp Pro Ala Arg Pro Ala
325 330 335
cat ctg gaa cat cat cat gaa gca acc ctg gca taa 1044
His Leu Glu His His His Glu Ala Thr Leu Ala
340 345
<210> 53
<211> 347
<212> PRT
<213> Pusillimonas sp. T7-7
<400> 53
Met Ala Ala Met Lys Pro Ala Ile Pro Ser His Thr Pro Thr Pro Asp
1 5 10 15
His Ala Pro Gln Ala Trp Gly Ile Ala Gln Ile Leu Arg Leu Tyr Glu
20 25 30
Leu Pro Phe Leu Asp Leu Leu His Gln Ala Gln Ala Val His Arg Ala
35 40 45
His His Gln Pro Asn Thr Val Gln Leu Ser Ser Leu Leu Ser Ile Lys
50 55 60
Thr Gly Ala Cys Pro Glu Asp Cys Ala Tyr Cys Pro Gln Ser Ala Arg
65 70 75 80
His Asp Thr Gly Gly Lys Gln Glu Ala Leu Met Pro Val Ala Glu Val
85 90 95
Leu Glu Ala Ala Arg Lys Ala Lys Ala Asn Gly Ala Gln Arg Phe Cys
100 105 110
Met Gly Ala Ala Trp Arg Ser Pro Thr Ala Arg Gln Leu Asp Ser Val
115 120 125
Val Glu Met Val Gly Ala Val Lys Ala Leu Gly Leu Glu Thr Cys Val
130 135 140
Thr Leu Gly Met Leu Lys Glu Gly Gln Ala Glu Arg Leu Arg Asp Ala
145 150 155 160
Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe Tyr
165 170 175
Gly Asn Ile Ile Thr Thr Arg Ser Tyr Gln Asp Arg Leu Asp Thr Leu
180 185 190
Glu Arg Val Arg Asn Ala Gly Val His Val Cys Cys Gly Gly Ile Val
195 200 205
Gly Leu Gly Glu Ser Arg Lys Glu Arg Ala Gly Leu Val Ala Gln Leu
210 215 220
Ala Asn Leu Ser Pro Tyr Pro Glu Ser Val Pro Val Asn Asn Leu Val
225 230 235 240
Lys Val Ala Gly Thr Pro Leu Asp Ala Thr Pro Asp Ile Asp Pro Phe
245 250 255
Glu Phe Val Arg Thr Ile Ala Val Ala Arg Ile Thr Met Pro Arg Ala
260 265 270
Val Val Arg Leu Ser Ala Gly Arg Glu Ala Met Ser Asp Ala Ile Gln
275 280 285
Ala Leu Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Glu Gln
290 295 300
Leu Leu Thr Thr Ala Asn Pro Gln Leu Ser Gln Asp Gln Gln Leu Phe
305 310 315 320
Gln Arg Leu Gly Leu Thr Ala Thr Pro Ala Asp Pro Ala Arg Pro Ala
325 330 335
His Leu Glu His His His Glu Ala Thr Leu Ala
340 345
<210> 54
<211> 957
<212> DNA
<213> Cenarchaeum symbiosum A
<220>
<221> CDS
<222> (1)..(957)
<223> bioB gene encoding biotin synthase from Cenarchaeum symbiosum A
<400> 54
atg ggt att gca gaa tgt cgt gat aaa gtt ctg ggt ggt ggt gaa ctg 48
Met Gly Ile Ala Glu Cys Arg Asp Lys Val Leu Gly Gly Gly Glu Leu
1 5 10 15
acc aaa gat gaa gca cgt ggt ctg atg gaa gca gat gtt acc gaa ctg 96
Thr Lys Asp Glu Ala Arg Gly Leu Met Glu Ala Asp Val Thr Glu Leu
20 25 30
gcc gca gca gca gat gaa att acc cgt cgt ttt aat ggt gat ggt gtg 144
Ala Ala Ala Ala Asp Glu Ile Thr Arg Arg Phe Asn Gly Asp Gly Val
35 40 45
gat gtg gaa cag ctg aat aat att aaa cgt gat ggt tgc agc gaa gat 192
Asp Val Glu Gln Leu Asn Asn Ile Lys Arg Asp Gly Cys Ser Glu Asp
50 55 60
tgt acc ttt tgt ggt cag agc gcc ttt tat gat gca gat aaa gaa ccg 240
Cys Thr Phe Cys Gly Gln Ser Ala Phe Tyr Asp Ala Asp Lys Glu Pro
65 70 75 80
cat ccg ctg ccg gaa ccg gaa gaa gtt gtt cgt gca gcc ctg aaa gca 288
His Pro Leu Pro Glu Pro Glu Glu Val Val Arg Ala Ala Leu Lys Ala
85 90 95
aaa aaa gaa gaa gcc agc agc tat tgt ctg gtt gcc gca tgg cgt gaa 336
Lys Lys Glu Glu Ala Ser Ser Tyr Cys Leu Val Ala Ala Trp Arg Glu
100 105 110
ccg acc ccg gaa ggt ttt gaa aaa gtt tgt acc att att cag gaa att 384
Pro Thr Pro Glu Gly Phe Glu Lys Val Cys Thr Ile Ile Gln Glu Ile
115 120 125
aat acc cat gtt ggt att agc gtt gaa tgt agc ctg ggt ttt ctg acc 432
Asn Thr His Val Gly Ile Ser Val Glu Cys Ser Leu Gly Phe Leu Thr
130 135 140
cgt gaa cgt gca gca cgt ctg aaa ggt ctg ggt gtt aaa cgt tat aat 480
Arg Glu Arg Ala Ala Arg Leu Lys Gly Leu Gly Val Lys Arg Tyr Asn
145 150 155 160
cat aat ctg gaa acc gcc cgt agc aaa ttt ccg gaa att tgt agc acc 528
His Asn Leu Glu Thr Ala Arg Ser Lys Phe Pro Glu Ile Cys Ser Thr
165 170 175
cat acc tat gaa gat cgt ctg gat acc ctg gaa att gca cgt gaa gcc 576
His Thr Tyr Glu Asp Arg Leu Asp Thr Leu Glu Ile Ala Arg Glu Ala
180 185 190
ggt ctg gaa ctg tgt acc ggt ggt att att ggt atg ggt gaa agc cgt 624
Gly Leu Glu Leu Cys Thr Gly Gly Ile Ile Gly Met Gly Glu Ser Arg
195 200 205
ggt cag cgt att gaa ctg gca atg gaa ctg gca cgt att cgt ccg gaa 672
Gly Gln Arg Ile Glu Leu Ala Met Glu Leu Ala Arg Ile Arg Pro Glu
210 215 220
gaa gca acc gtt aat att ctg gtt ccg gtt cag ggt acc ccg atg gaa 720
Glu Ala Thr Val Asn Ile Leu Val Pro Val Gln Gly Thr Pro Met Glu
225 230 235 240
ctg cag gca ccg ctg ccg ccg ggt gaa gca gaa cgt ttt ttt gca ctg 768
Leu Gln Ala Pro Leu Pro Pro Gly Glu Ala Glu Arg Phe Phe Ala Leu
245 250 255
gtt cgt ttt ctg ctg ccg cgt agc gtt gtt aaa att agc ggt ggt cgt 816
Val Arg Phe Leu Leu Pro Arg Ser Val Val Lys Ile Ser Gly Gly Arg
260 265 270
gaa aaa gca ctg gat gat gat ggt cgt gca att ctg cgt ggt ggt gca 864
Glu Lys Ala Leu Asp Asp Asp Gly Arg Ala Ile Leu Arg Gly Gly Ala
275 280 285
aat ggt att att acc agc ggt tat ctg aca atg ggt ggt aat gat agc 912
Asn Gly Ile Ile Thr Ser Gly Tyr Leu Thr Met Gly Gly Asn Asp Ser
290 295 300
agc gca gat atg gaa atg att cgt gaa gca ggt ctg gaa gca taa 957
Ser Ala Asp Met Glu Met Ile Arg Glu Ala Gly Leu Glu Ala
305 310 315
<210> 55
<211> 318
<212> PRT
<213> Cenarchaeum symbiosum A
<400> 55
Met Gly Ile Ala Glu Cys Arg Asp Lys Val Leu Gly Gly Gly Glu Leu
1 5 10 15
Thr Lys Asp Glu Ala Arg Gly Leu Met Glu Ala Asp Val Thr Glu Leu
20 25 30
Ala Ala Ala Ala Asp Glu Ile Thr Arg Arg Phe Asn Gly Asp Gly Val
35 40 45
Asp Val Glu Gln Leu Asn Asn Ile Lys Arg Asp Gly Cys Ser Glu Asp
50 55 60
Cys Thr Phe Cys Gly Gln Ser Ala Phe Tyr Asp Ala Asp Lys Glu Pro
65 70 75 80
His Pro Leu Pro Glu Pro Glu Glu Val Val Arg Ala Ala Leu Lys Ala
85 90 95
Lys Lys Glu Glu Ala Ser Ser Tyr Cys Leu Val Ala Ala Trp Arg Glu
100 105 110
Pro Thr Pro Glu Gly Phe Glu Lys Val Cys Thr Ile Ile Gln Glu Ile
115 120 125
Asn Thr His Val Gly Ile Ser Val Glu Cys Ser Leu Gly Phe Leu Thr
130 135 140
Arg Glu Arg Ala Ala Arg Leu Lys Gly Leu Gly Val Lys Arg Tyr Asn
145 150 155 160
His Asn Leu Glu Thr Ala Arg Ser Lys Phe Pro Glu Ile Cys Ser Thr
165 170 175
His Thr Tyr Glu Asp Arg Leu Asp Thr Leu Glu Ile Ala Arg Glu Ala
180 185 190
Gly Leu Glu Leu Cys Thr Gly Gly Ile Ile Gly Met Gly Glu Ser Arg
195 200 205
Gly Gln Arg Ile Glu Leu Ala Met Glu Leu Ala Arg Ile Arg Pro Glu
210 215 220
Glu Ala Thr Val Asn Ile Leu Val Pro Val Gln Gly Thr Pro Met Glu
225 230 235 240
Leu Gln Ala Pro Leu Pro Pro Gly Glu Ala Glu Arg Phe Phe Ala Leu
245 250 255
Val Arg Phe Leu Leu Pro Arg Ser Val Val Lys Ile Ser Gly Gly Arg
260 265 270
Glu Lys Ala Leu Asp Asp Asp Gly Arg Ala Ile Leu Arg Gly Gly Ala
275 280 285
Asn Gly Ile Ile Thr Ser Gly Tyr Leu Thr Met Gly Gly Asn Asp Ser
290 295 300
Ser Ala Asp Met Glu Met Ile Arg Glu Ala Gly Leu Glu Ala
305 310 315
<210> 56
<211> 999
<212> DNA
<213> Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446
<220>
<221> CDS
<222> (1)..(999)
<223> bioB gene encoding biotin synthase from Alicyclobacillus
acidocaldarius subsp. acidocaldarius DSM 446
<400> 56
atg atg aaa att gat tat cag acc aat tgg att gat ctg gcc cgt cgt 48
Met Met Lys Ile Asp Tyr Gln Thr Asn Trp Ile Asp Leu Ala Arg Arg
1 5 10 15
gtt ctg gat ggt cgt ggt gtt acc cgt gaa gaa gca ctg gat att ctg 96
Val Leu Asp Gly Arg Gly Val Thr Arg Glu Glu Ala Leu Asp Ile Leu
20 25 30
cgt tct agc gat gat gaa ctg ctg gat ctg ctg gca gca gcc ttt ctg 144
Arg Ser Ser Asp Asp Glu Leu Leu Asp Leu Leu Ala Ala Ala Phe Leu
35 40 45
att cgt cgc cgc tat ttt ggc aaa aaa gtg aaa ctg aat atg att att 192
Ile Arg Arg Arg Tyr Phe Gly Lys Lys Val Lys Leu Asn Met Ile Ile
50 55 60
aat gcc aaa agc aaa atg tgc ccg gaa gat tgc gcc tat tgc agc cag 240
Asn Ala Lys Ser Lys Met Cys Pro Glu Asp Cys Ala Tyr Cys Ser Gln
65 70 75 80
agc gcc att agc aaa gca ccg gtt agc aaa tat ccg ctg gtt agt aaa 288
Ser Ala Ile Ser Lys Ala Pro Val Ser Lys Tyr Pro Leu Val Ser Lys
85 90 95
gaa gaa att att gcc ggt gca cgt gaa gca gaa cgt cgt aaa gca ggt 336
Glu Glu Ile Ile Ala Gly Ala Arg Glu Ala Glu Arg Arg Lys Ala Gly
100 105 110
acc tat tgt att gtt att agc ggt cgt cgt ccg agc gat cgt gaa att 384
Thr Tyr Cys Ile Val Ile Ser Gly Arg Arg Pro Ser Asp Arg Glu Ile
115 120 125
gaa cgt att gca gaa gca gtt gaa gaa att cgt gca acc acc acc ctg 432
Glu Arg Ile Ala Glu Ala Val Glu Glu Ile Arg Ala Thr Thr Thr Leu
130 135 140
aaa att tgt tgt tgt ctg ggt ctg ctg acc ccg gca cag gca gat cgt 480
Lys Ile Cys Cys Cys Leu Gly Leu Leu Thr Pro Ala Gln Ala Asp Arg
145 150 155 160
ctg gca cgt gcg ggt gtt cat cgt tat aat cat aat ctg aat acc agc 528
Leu Ala Arg Ala Gly Val His Arg Tyr Asn His Asn Leu Asn Thr Ser
165 170 175
cgt gat cgt tat ggt gat att tgt acc acc cat acc tat gat gat cgc 576
Arg Asp Arg Tyr Gly Asp Ile Cys Thr Thr His Thr Tyr Asp Asp Arg
180 185 190
gtt cgt acc ctg gaa cat gtg aaa gaa gca ggt att agc ccg tgt agc 624
Val Arg Thr Leu Glu His Val Lys Glu Ala Gly Ile Ser Pro Cys Ser
195 200 205
ggt gtt att ttt ggt atg ggt gaa agc gat gaa gaa gcc gtt gat atg 672
Gly Val Ile Phe Gly Met Gly Glu Ser Asp Glu Glu Ala Val Asp Met
210 215 220
gcc ttt gcc ctg aaa gaa atg gat gca gat agc att ccg tgt aat ttt 720
Ala Phe Ala Leu Lys Glu Met Asp Ala Asp Ser Ile Pro Cys Asn Phe
225 230 235 240
ctg aat ccg att ccg ggt acc ccg ctg gaa ggt atg gaa acc ctg aat 768
Leu Asn Pro Ile Pro Gly Thr Pro Leu Glu Gly Met Glu Thr Leu Asn
245 250 255
ccg cgt cgt tgt ctg aaa ctg ctg tgt atg atg cgt ttt gtt aat ccg 816
Pro Arg Arg Cys Leu Lys Leu Leu Cys Met Met Arg Phe Val Asn Pro
260 265 270
agc aaa gaa att cgt att gcg ggt ggt cgt gaa cgt aat ctg cgt agc 864
Ser Lys Glu Ile Arg Ile Ala Gly Gly Arg Glu Arg Asn Leu Arg Ser
275 280 285
ctg cag gtt ctg ggt ctg tat ccg gca aat agc att ttt gtt ggt gat 912
Leu Gln Val Leu Gly Leu Tyr Pro Ala Asn Ser Ile Phe Val Gly Asp
290 295 300
tat ctg acc acc ccg ggt cag gca ccg acc gaa gat tgg gca atg att 960
Tyr Leu Thr Thr Pro Gly Gln Ala Pro Thr Glu Asp Trp Ala Met Ile
305 310 315 320
gaa gat ctg ggt ttt gaa att gaa gaa tgt gca ctg taa 999
Glu Asp Leu Gly Phe Glu Ile Glu Glu Cys Ala Leu
325 330
<210> 57
<211> 332
<212> PRT
<213> Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446
<400> 57
Met Met Lys Ile Asp Tyr Gln Thr Asn Trp Ile Asp Leu Ala Arg Arg
1 5 10 15
Val Leu Asp Gly Arg Gly Val Thr Arg Glu Glu Ala Leu Asp Ile Leu
20 25 30
Arg Ser Ser Asp Asp Glu Leu Leu Asp Leu Leu Ala Ala Ala Phe Leu
35 40 45
Ile Arg Arg Arg Tyr Phe Gly Lys Lys Val Lys Leu Asn Met Ile Ile
50 55 60
Asn Ala Lys Ser Lys Met Cys Pro Glu Asp Cys Ala Tyr Cys Ser Gln
65 70 75 80
Ser Ala Ile Ser Lys Ala Pro Val Ser Lys Tyr Pro Leu Val Ser Lys
85 90 95
Glu Glu Ile Ile Ala Gly Ala Arg Glu Ala Glu Arg Arg Lys Ala Gly
100 105 110
Thr Tyr Cys Ile Val Ile Ser Gly Arg Arg Pro Ser Asp Arg Glu Ile
115 120 125
Glu Arg Ile Ala Glu Ala Val Glu Glu Ile Arg Ala Thr Thr Thr Leu
130 135 140
Lys Ile Cys Cys Cys Leu Gly Leu Leu Thr Pro Ala Gln Ala Asp Arg
145 150 155 160
Leu Ala Arg Ala Gly Val His Arg Tyr Asn His Asn Leu Asn Thr Ser
165 170 175
Arg Asp Arg Tyr Gly Asp Ile Cys Thr Thr His Thr Tyr Asp Asp Arg
180 185 190
Val Arg Thr Leu Glu His Val Lys Glu Ala Gly Ile Ser Pro Cys Ser
195 200 205
Gly Val Ile Phe Gly Met Gly Glu Ser Asp Glu Glu Ala Val Asp Met
210 215 220
Ala Phe Ala Leu Lys Glu Met Asp Ala Asp Ser Ile Pro Cys Asn Phe
225 230 235 240
Leu Asn Pro Ile Pro Gly Thr Pro Leu Glu Gly Met Glu Thr Leu Asn
245 250 255
Pro Arg Arg Cys Leu Lys Leu Leu Cys Met Met Arg Phe Val Asn Pro
260 265 270
Ser Lys Glu Ile Arg Ile Ala Gly Gly Arg Glu Arg Asn Leu Arg Ser
275 280 285
Leu Gln Val Leu Gly Leu Tyr Pro Ala Asn Ser Ile Phe Val Gly Asp
290 295 300
Tyr Leu Thr Thr Pro Gly Gln Ala Pro Thr Glu Asp Trp Ala Met Ile
305 310 315 320
Glu Asp Leu Gly Phe Glu Ile Glu Glu Cys Ala Leu
325 330
<210> 58
<211> 996
<212> DNA
<213> Geobacillus thermoglucosidasius C56-YS93
<220>
<221> CDS
<222> (1)..(996)
<223> bioB gene encoding biotin synthase from Geobacillus
thermoglucosidasius C56-YS93
<400> 58
atg att aat tgg ctg gcc ctg gca gat cgt gtt att gca ggt cat gaa 48
Met Ile Asn Trp Leu Ala Leu Ala Asp Arg Val Ile Ala Gly His Glu
1 5 10 15
ctg acc gat gaa gaa gca ctg gca att ctg gat tgt ccg gat gaa gaa 96
Leu Thr Asp Glu Glu Ala Leu Ala Ile Leu Asp Cys Pro Asp Glu Glu
20 25 30
ctg ctg ctg ctg atg cag ggt gcc tat aat att cgt cgc acc tat tat 144
Leu Leu Leu Leu Met Gln Gly Ala Tyr Asn Ile Arg Arg Thr Tyr Tyr
35 40 45
ggc aat aaa gtt aaa ctg aat atg att att aat gcc aaa agc ggt ctg 192
Gly Asn Lys Val Lys Leu Asn Met Ile Ile Asn Ala Lys Ser Gly Leu
50 55 60
tgc ccg gaa aat tgc ggc tat tgc gca cag agc gca gtt agc acc gca 240
Cys Pro Glu Asn Cys Gly Tyr Cys Ala Gln Ser Ala Val Ser Thr Ala
65 70 75 80
ccg gtt aaa acc tat aaa atg gtt gat aaa gaa acc ctg att cgt ggt 288
Pro Val Lys Thr Tyr Lys Met Val Asp Lys Glu Thr Leu Ile Arg Gly
85 90 95
gca gaa gaa gca tat cgt atg cgt att ggt acc tat tgt att gtt gca 336
Ala Glu Glu Ala Tyr Arg Met Arg Ile Gly Thr Tyr Cys Ile Val Ala
100 105 110
agc ggt cgt ggt ccg agc gaa aaa gaa att gat acc gtt gtg agc gcc 384
Ser Gly Arg Gly Pro Ser Glu Lys Glu Ile Asp Thr Val Val Ser Ala
115 120 125
gtt aaa gaa att aaa gaa cgt ttt ggt ctg aaa att tgt gca tgt ctg 432
Val Lys Glu Ile Lys Glu Arg Phe Gly Leu Lys Ile Cys Ala Cys Leu
130 135 140
ggt att ctg aaa ccg gaa cag gca gca cgt ctg aaa gaa gcc ggt gtt 480
Gly Ile Leu Lys Pro Glu Gln Ala Ala Arg Leu Lys Glu Ala Gly Val
145 150 155 160
gat cgc tat aat cat aat att aat acc agc aaa gaa cat cat ccg aat 528
Asp Arg Tyr Asn His Asn Ile Asn Thr Ser Lys Glu His His Pro Asn
165 170 175
att acc acc agc cat acc tat gat gat cgc gtg cgt acc gtt gaa acc 576
Ile Thr Thr Ser His Thr Tyr Asp Asp Arg Val Arg Thr Val Glu Thr
180 185 190
gtt aaa cag gca ggt att agc ccg tgt agc ggt gtt att att ggt atg 624
Val Lys Gln Ala Gly Ile Ser Pro Cys Ser Gly Val Ile Ile Gly Met
195 200 205
cgt gaa acc aaa cag gat gtt att aat atg gca cgt agt ctg cgc att 672
Arg Glu Thr Lys Gln Asp Val Ile Asn Met Ala Arg Ser Leu Arg Ile
210 215 220
ctg gat gca gat agc att ccg gtt aat ttt ctg cat gca att gat ggt 720
Leu Asp Ala Asp Ser Ile Pro Val Asn Phe Leu His Ala Ile Asp Gly
225 230 235 240
acc ccg ctg gca ggt acc aat gaa ctg gat ccg cgt tat tgt ctg aaa 768
Thr Pro Leu Ala Gly Thr Asn Glu Leu Asp Pro Arg Tyr Cys Leu Lys
245 250 255
gtt ctg gca ctg ttt cgt tat atg aat ccg acc aaa gaa att cgt att 816
Val Leu Ala Leu Phe Arg Tyr Met Asn Pro Thr Lys Glu Ile Arg Ile
260 265 270
gcc ggt ggt cgt gaa gtt aat ctg cgt agc ctg cag ccg ctg ggt ctg 864
Ala Gly Gly Arg Glu Val Asn Leu Arg Ser Leu Gln Pro Leu Gly Leu
275 280 285
tat gca gca aat agc att ttt gtt ggt gat tat ctg acc acc gca ggt 912
Tyr Ala Ala Asn Ser Ile Phe Val Gly Asp Tyr Leu Thr Thr Ala Gly
290 295 300
cag gaa aaa agc gaa gat tat cgc atg ctg gaa gat ctg ggt ttt gaa 960
Gln Glu Lys Ser Glu Asp Tyr Arg Met Leu Glu Asp Leu Gly Phe Glu
305 310 315 320
att gat ttt gcc gaa gaa cag cag gtt gtt tgt taa 996
Ile Asp Phe Ala Glu Glu Gln Gln Val Val Cys
325 330
<210> 59
<211> 331
<212> PRT
<213> Geobacillus thermoglucosidasius C56-YS93
<400> 59
Met Ile Asn Trp Leu Ala Leu Ala Asp Arg Val Ile Ala Gly His Glu
1 5 10 15
Leu Thr Asp Glu Glu Ala Leu Ala Ile Leu Asp Cys Pro Asp Glu Glu
20 25 30
Leu Leu Leu Leu Met Gln Gly Ala Tyr Asn Ile Arg Arg Thr Tyr Tyr
35 40 45
Gly Asn Lys Val Lys Leu Asn Met Ile Ile Asn Ala Lys Ser Gly Leu
50 55 60
Cys Pro Glu Asn Cys Gly Tyr Cys Ala Gln Ser Ala Val Ser Thr Ala
65 70 75 80
Pro Val Lys Thr Tyr Lys Met Val Asp Lys Glu Thr Leu Ile Arg Gly
85 90 95
Ala Glu Glu Ala Tyr Arg Met Arg Ile Gly Thr Tyr Cys Ile Val Ala
100 105 110
Ser Gly Arg Gly Pro Ser Glu Lys Glu Ile Asp Thr Val Val Ser Ala
115 120 125
Val Lys Glu Ile Lys Glu Arg Phe Gly Leu Lys Ile Cys Ala Cys Leu
130 135 140
Gly Ile Leu Lys Pro Glu Gln Ala Ala Arg Leu Lys Glu Ala Gly Val
145 150 155 160
Asp Arg Tyr Asn His Asn Ile Asn Thr Ser Lys Glu His His Pro Asn
165 170 175
Ile Thr Thr Ser His Thr Tyr Asp Asp Arg Val Arg Thr Val Glu Thr
180 185 190
Val Lys Gln Ala Gly Ile Ser Pro Cys Ser Gly Val Ile Ile Gly Met
195 200 205
Arg Glu Thr Lys Gln Asp Val Ile Asn Met Ala Arg Ser Leu Arg Ile
210 215 220
Leu Asp Ala Asp Ser Ile Pro Val Asn Phe Leu His Ala Ile Asp Gly
225 230 235 240
Thr Pro Leu Ala Gly Thr Asn Glu Leu Asp Pro Arg Tyr Cys Leu Lys
245 250 255
Val Leu Ala Leu Phe Arg Tyr Met Asn Pro Thr Lys Glu Ile Arg Ile
260 265 270
Ala Gly Gly Arg Glu Val Asn Leu Arg Ser Leu Gln Pro Leu Gly Leu
275 280 285
Tyr Ala Ala Asn Ser Ile Phe Val Gly Asp Tyr Leu Thr Thr Ala Gly
290 295 300
Gln Glu Lys Ser Glu Asp Tyr Arg Met Leu Glu Asp Leu Gly Phe Glu
305 310 315 320
Ile Asp Phe Ala Glu Glu Gln Gln Val Val Cys
325 330
<210> 60
<211> 1008
<212> DNA
<213> Bacillus subtilis subsp. subtilis str. 168
<220>
<221> CDS
<222> (1)..(1008)
<223> bioB gene encoding biotin synthase from Bacillus subtilis subsp.
subtilis str. 168
<400> 60
atg aat cag tgg atg gaa ctg gca gat cgt gtt ctg gcc ggt gca gaa 48
Met Asn Gln Trp Met Glu Leu Ala Asp Arg Val Leu Ala Gly Ala Glu
1 5 10 15
gtt acc gat gaa gaa gca ctg agc att ctg cat tgc ccg gat gaa gat 96
Val Thr Asp Glu Glu Ala Leu Ser Ile Leu His Cys Pro Asp Glu Asp
20 25 30
atc ctg ctg ctg atg cat ggt gcc ttt cat att cgc aaa cat ttt tat 144
Ile Leu Leu Leu Met His Gly Ala Phe His Ile Arg Lys His Phe Tyr
35 40 45
ggc aaa aaa gtg aaa ctg aat atg att atg aac gcc aaa agc ggt ctg 192
Gly Lys Lys Val Lys Leu Asn Met Ile Met Asn Ala Lys Ser Gly Leu
50 55 60
tgt ccg gaa aat tgc ggt tat tgc agc cag agc gca att agc aaa gca 240
Cys Pro Glu Asn Cys Gly Tyr Cys Ser Gln Ser Ala Ile Ser Lys Ala
65 70 75 80
ccg att gaa agc tat cgt atg gtt aat aaa gaa acc ctg ctg gaa ggt 288
Pro Ile Glu Ser Tyr Arg Met Val Asn Lys Glu Thr Leu Leu Glu Gly
85 90 95
gca aaa cgt gca cat gat ctg aat att ggt acc tat tgt att gtt gca 336
Ala Lys Arg Ala His Asp Leu Asn Ile Gly Thr Tyr Cys Ile Val Ala
100 105 110
agc ggt cgt ggt ccg agc aat cgc gaa gtt gat cag gtt gtt gat gca 384
Ser Gly Arg Gly Pro Ser Asn Arg Glu Val Asp Gln Val Val Asp Ala
115 120 125
gtt cag gaa att aaa gaa acc tat ggt ctg aaa att tgt gca tgt ctg 432
Val Gln Glu Ile Lys Glu Thr Tyr Gly Leu Lys Ile Cys Ala Cys Leu
130 135 140
ggt ctg ctg aaa ccg gaa cag gca aaa cgt ctg aaa gat gca ggc gtt 480
Gly Leu Leu Lys Pro Glu Gln Ala Lys Arg Leu Lys Asp Ala Gly Val
145 150 155 160
gat cgt tat aat cat aat ctg aat acc agc cag cgt aac cat agc aat 528
Asp Arg Tyr Asn His Asn Leu Asn Thr Ser Gln Arg Asn His Ser Asn
165 170 175
att acc acc agc cat acc tat gat gat cgt gtg aat acc gtt gaa att 576
Ile Thr Thr Ser His Thr Tyr Asp Asp Arg Val Asn Thr Val Glu Ile
180 185 190
gcc aaa gaa agt ggt ctg agc ccg tgt agc ggt gca att att ggt atg 624
Ala Lys Glu Ser Gly Leu Ser Pro Cys Ser Gly Ala Ile Ile Gly Met
195 200 205
aaa gaa acc aaa cag gat gtt att gat att gcg aaa agc ctg aaa gca 672
Lys Glu Thr Lys Gln Asp Val Ile Asp Ile Ala Lys Ser Leu Lys Ala
210 215 220
ctg gat gca gat agc att ccg gtg aat ttt ctg cat gca att gat ggt 720
Leu Asp Ala Asp Ser Ile Pro Val Asn Phe Leu His Ala Ile Asp Gly
225 230 235 240
acc ccg ctg gaa ggt gtt aat gaa ctg aat ccg ctg tat tgt ctg aaa 768
Thr Pro Leu Glu Gly Val Asn Glu Leu Asn Pro Leu Tyr Cys Leu Lys
245 250 255
gtt ctg gca ctg ttt cgt ttt att aat ccg agc aaa gaa att cgt att 816
Val Leu Ala Leu Phe Arg Phe Ile Asn Pro Ser Lys Glu Ile Arg Ile
260 265 270
agc ggt ggt cgt gaa gtt aat ctg cgt acc ctg cag ccg ctg ggt ctg 864
Ser Gly Gly Arg Glu Val Asn Leu Arg Thr Leu Gln Pro Leu Gly Leu
275 280 285
tat gca gca aat agc att ttt gtt ggt gat tat ctg acc acc gca ggt 912
Tyr Ala Ala Asn Ser Ile Phe Val Gly Asp Tyr Leu Thr Thr Ala Gly
290 295 300
cag gaa gaa acc gaa gat cat aaa atg ctg agc gat ctg ggt ttt gaa 960
Gln Glu Glu Thr Glu Asp His Lys Met Leu Ser Asp Leu Gly Phe Glu
305 310 315 320
gtt gaa agc gtt gaa gaa atg aaa gca agc ctg agc gca aaa agc taa 1008
Val Glu Ser Val Glu Glu Met Lys Ala Ser Leu Ser Ala Lys Ser
325 330 335
<210> 61
<211> 335
<212> PRT
<213> Bacillus subtilis subsp. subtilis str. 168
<400> 61
Met Asn Gln Trp Met Glu Leu Ala Asp Arg Val Leu Ala Gly Ala Glu
1 5 10 15
Val Thr Asp Glu Glu Ala Leu Ser Ile Leu His Cys Pro Asp Glu Asp
20 25 30
Ile Leu Leu Leu Met His Gly Ala Phe His Ile Arg Lys His Phe Tyr
35 40 45
Gly Lys Lys Val Lys Leu Asn Met Ile Met Asn Ala Lys Ser Gly Leu
50 55 60
Cys Pro Glu Asn Cys Gly Tyr Cys Ser Gln Ser Ala Ile Ser Lys Ala
65 70 75 80
Pro Ile Glu Ser Tyr Arg Met Val Asn Lys Glu Thr Leu Leu Glu Gly
85 90 95
Ala Lys Arg Ala His Asp Leu Asn Ile Gly Thr Tyr Cys Ile Val Ala
100 105 110
Ser Gly Arg Gly Pro Ser Asn Arg Glu Val Asp Gln Val Val Asp Ala
115 120 125
Val Gln Glu Ile Lys Glu Thr Tyr Gly Leu Lys Ile Cys Ala Cys Leu
130 135 140
Gly Leu Leu Lys Pro Glu Gln Ala Lys Arg Leu Lys Asp Ala Gly Val
145 150 155 160
Asp Arg Tyr Asn His Asn Leu Asn Thr Ser Gln Arg Asn His Ser Asn
165 170 175
Ile Thr Thr Ser His Thr Tyr Asp Asp Arg Val Asn Thr Val Glu Ile
180 185 190
Ala Lys Glu Ser Gly Leu Ser Pro Cys Ser Gly Ala Ile Ile Gly Met
195 200 205
Lys Glu Thr Lys Gln Asp Val Ile Asp Ile Ala Lys Ser Leu Lys Ala
210 215 220
Leu Asp Ala Asp Ser Ile Pro Val Asn Phe Leu His Ala Ile Asp Gly
225 230 235 240
Thr Pro Leu Glu Gly Val Asn Glu Leu Asn Pro Leu Tyr Cys Leu Lys
245 250 255
Val Leu Ala Leu Phe Arg Phe Ile Asn Pro Ser Lys Glu Ile Arg Ile
260 265 270
Ser Gly Gly Arg Glu Val Asn Leu Arg Thr Leu Gln Pro Leu Gly Leu
275 280 285
Tyr Ala Ala Asn Ser Ile Phe Val Gly Asp Tyr Leu Thr Thr Ala Gly
290 295 300
Gln Glu Glu Thr Glu Asp His Lys Met Leu Ser Asp Leu Gly Phe Glu
305 310 315 320
Val Glu Ser Val Glu Glu Met Lys Ala Ser Leu Ser Ala Lys Ser
325 330 335
<210> 62
<211> 996
<212> DNA
<213> Lysinibacillus sphaericus
<220>
<221> CDS
<222> (1)..(996)
<223> bioB gene encoding biotin synthase from Lysinibacillus sphaericus
<400> 62
atg aat ttt ctg cag gtt gcc cag gaa gtt att gat ggc aaa att att 48
Met Asn Phe Leu Gln Val Ala Gln Glu Val Ile Asp Gly Lys Ile Ile
1 5 10 15
agc aat gaa gaa gcc ctg gcc att ctg aat agc aaa gat gat gaa ctg 96
Ser Asn Glu Glu Ala Leu Ala Ile Leu Asn Ser Lys Asp Asp Glu Leu
20 25 30
ctg cag ctg atg gat ggt gca ttt gcc att cgc cgc cat tat tat ggt 144
Leu Gln Leu Met Asp Gly Ala Phe Ala Ile Arg Arg His Tyr Tyr Gly
35 40 45
aaa aaa gtg aaa ctg aac atg att atg aac gcc aaa agc ggt tat tgc 192
Lys Lys Val Lys Leu Asn Met Ile Met Asn Ala Lys Ser Gly Tyr Cys
50 55 60
ccg gaa gat tgt ggt tat tgt agc cag agc agc aaa agc acc gca ccg 240
Pro Glu Asp Cys Gly Tyr Cys Ser Gln Ser Ser Lys Ser Thr Ala Pro
65 70 75 80
att gaa aaa tat ccg ttt att acc aaa gaa gaa att ctg gcc ggt gcc 288
Ile Glu Lys Tyr Pro Phe Ile Thr Lys Glu Glu Ile Leu Ala Gly Ala
85 90 95
aaa cgt gcc ttt gat aat aaa att ggt acc tat tgc att gtt gcc agc 336
Lys Arg Ala Phe Asp Asn Lys Ile Gly Thr Tyr Cys Ile Val Ala Ser
100 105 110
ggt cgt ggt ccg acc cgt aaa gat gtt aat gtt gtg agc gaa gca gtt 384
Gly Arg Gly Pro Thr Arg Lys Asp Val Asn Val Val Ser Glu Ala Val
115 120 125
acc gaa att aaa gaa aaa tat ggc ctg aaa gtt tgt gca tgt ctg ggt 432
Thr Glu Ile Lys Glu Lys Tyr Gly Leu Lys Val Cys Ala Cys Leu Gly
130 135 140
ctg ctg aaa gaa gaa cag gca cag cag ctg aaa gaa gcc ggt gtt gat 480
Leu Leu Lys Glu Glu Gln Ala Gln Gln Leu Lys Glu Ala Gly Val Asp
145 150 155 160
cgc tat aat cat aat ctg aac acc agc gaa cgt cat cat agc ttt att 528
Arg Tyr Asn His Asn Leu Asn Thr Ser Glu Arg His His Ser Phe Ile
165 170 175
acc acc agc cat acc tat gaa gat cgt gtg aac acc gtg gaa att gtg 576
Thr Thr Ser His Thr Tyr Glu Asp Arg Val Asn Thr Val Glu Ile Val
180 185 190
aaa aaa cat ggt att agc ccg tgt agc ggt gca att att ggt atg aaa 624
Lys Lys His Gly Ile Ser Pro Cys Ser Gly Ala Ile Ile Gly Met Lys
195 200 205
gaa acc cgt gaa gat gtt gtt aat att gca cgt gca ctg cat cag ctg 672
Glu Thr Arg Glu Asp Val Val Asn Ile Ala Arg Ala Leu His Gln Leu
210 215 220
gat gca gat agc att ccg gtt aat ttt ctg aat gca att gat ggt acc 720
Asp Ala Asp Ser Ile Pro Val Asn Phe Leu Asn Ala Ile Asp Gly Thr
225 230 235 240
aaa ctg gaa ggt acc cgt gat ctg aat ccg cgt tat tgt ctg aaa gtg 768
Lys Leu Glu Gly Thr Arg Asp Leu Asn Pro Arg Tyr Cys Leu Lys Val
245 250 255
ctg gca ctg ttt cgt tat att aat ccg acc aaa gaa att cgt att agc 816
Leu Ala Leu Phe Arg Tyr Ile Asn Pro Thr Lys Glu Ile Arg Ile Ser
260 265 270
ggt ggt cgt gaa att aat ctg ggt agc ctg cag ccg ctg ggt ctg tat 864
Gly Gly Arg Glu Ile Asn Leu Gly Ser Leu Gln Pro Leu Gly Leu Tyr
275 280 285
gca gca aat agc att ttt gtt ggt gat tat ctg acc acc gca ggt cag 912
Ala Ala Asn Ser Ile Phe Val Gly Asp Tyr Leu Thr Thr Ala Gly Gln
290 295 300
gaa gcc aat agc gat tat cgt atg ctg gaa gat ctg ggt ttt gaa att 960
Glu Ala Asn Ser Asp Tyr Arg Met Leu Glu Asp Leu Gly Phe Glu Ile
305 310 315 320
gaa ctg acc cag aaa cag gaa gca gca ttt tgt taa 996
Glu Leu Thr Gln Lys Gln Glu Ala Ala Phe Cys
325 330
<210> 63
<211> 331
<212> PRT
<213> Lysinibacillus sphaericus
<400> 63
Met Asn Phe Leu Gln Val Ala Gln Glu Val Ile Asp Gly Lys Ile Ile
1 5 10 15
Ser Asn Glu Glu Ala Leu Ala Ile Leu Asn Ser Lys Asp Asp Glu Leu
20 25 30
Leu Gln Leu Met Asp Gly Ala Phe Ala Ile Arg Arg His Tyr Tyr Gly
35 40 45
Lys Lys Val Lys Leu Asn Met Ile Met Asn Ala Lys Ser Gly Tyr Cys
50 55 60
Pro Glu Asp Cys Gly Tyr Cys Ser Gln Ser Ser Lys Ser Thr Ala Pro
65 70 75 80
Ile Glu Lys Tyr Pro Phe Ile Thr Lys Glu Glu Ile Leu Ala Gly Ala
85 90 95
Lys Arg Ala Phe Asp Asn Lys Ile Gly Thr Tyr Cys Ile Val Ala Ser
100 105 110
Gly Arg Gly Pro Thr Arg Lys Asp Val Asn Val Val Ser Glu Ala Val
115 120 125
Thr Glu Ile Lys Glu Lys Tyr Gly Leu Lys Val Cys Ala Cys Leu Gly
130 135 140
Leu Leu Lys Glu Glu Gln Ala Gln Gln Leu Lys Glu Ala Gly Val Asp
145 150 155 160
Arg Tyr Asn His Asn Leu Asn Thr Ser Glu Arg His His Ser Phe Ile
165 170 175
Thr Thr Ser His Thr Tyr Glu Asp Arg Val Asn Thr Val Glu Ile Val
180 185 190
Lys Lys His Gly Ile Ser Pro Cys Ser Gly Ala Ile Ile Gly Met Lys
195 200 205
Glu Thr Arg Glu Asp Val Val Asn Ile Ala Arg Ala Leu His Gln Leu
210 215 220
Asp Ala Asp Ser Ile Pro Val Asn Phe Leu Asn Ala Ile Asp Gly Thr
225 230 235 240
Lys Leu Glu Gly Thr Arg Asp Leu Asn Pro Arg Tyr Cys Leu Lys Val
245 250 255
Leu Ala Leu Phe Arg Tyr Ile Asn Pro Thr Lys Glu Ile Arg Ile Ser
260 265 270
Gly Gly Arg Glu Ile Asn Leu Gly Ser Leu Gln Pro Leu Gly Leu Tyr
275 280 285
Ala Ala Asn Ser Ile Phe Val Gly Asp Tyr Leu Thr Thr Ala Gly Gln
290 295 300
Glu Ala Asn Ser Asp Tyr Arg Met Leu Glu Asp Leu Gly Phe Glu Ile
305 310 315 320
Glu Leu Thr Gln Lys Gln Glu Ala Ala Phe Cys
325 330
<210> 64
<211> 1002
<212> DNA
<213> Methylococcus capsulatus str. Bath
<220>
<221> CDS
<222> (1)..(1002)
<223> bioB gene encoding biotin synthase from Methylococcus capsulatus
str. Bath
<400> 64
atg cat gca gaa gtt gca gtt atg acc aat cag gaa cgt gca gaa gaa 48
Met His Ala Glu Val Ala Val Met Thr Asn Gln Glu Arg Ala Glu Glu
1 5 10 15
ccg gtt ctg cgt cat gat tgg acc cag ggt gaa gca gaa gca ctg ttt 96
Pro Val Leu Arg His Asp Trp Thr Gln Gly Glu Ala Glu Ala Leu Phe
20 25 30
gca ctg ccg ttt aat gaa ctg ctg ttt cag gca cag acc att cat cgt 144
Ala Leu Pro Phe Asn Glu Leu Leu Phe Gln Ala Gln Thr Ile His Arg
35 40 45
cgt cat ttt gat ccg aat gaa gtt cag gtt agc agc ctg ctg agc att 192
Arg His Phe Asp Pro Asn Glu Val Gln Val Ser Ser Leu Leu Ser Ile
50 55 60
aaa acc ggt gca tgt agc gaa gat tgt gca tat tgt ccg cag agc gca 240
Lys Thr Gly Ala Cys Ser Glu Asp Cys Ala Tyr Cys Pro Gln Ser Ala
65 70 75 80
cat tat gaa acc ggt gtt aaa cgt gaa agc ctg atg agc ctg gaa gat 288
His Tyr Glu Thr Gly Val Lys Arg Glu Ser Leu Met Ser Leu Glu Asp
85 90 95
gtt ctg gaa gca gca cag cgt gca cgt gaa gaa ggt gca acc cgt ttt 336
Val Leu Glu Ala Ala Gln Arg Ala Arg Glu Glu Gly Ala Thr Arg Phe
100 105 110
tgt atg ggt gca gca tgg cgt agc ccg cgt gat ggt gat ctg gaa gca 384
Cys Met Gly Ala Ala Trp Arg Ser Pro Arg Asp Gly Asp Leu Glu Ala
115 120 125
att gca gca atg gtt gaa ggt gtt aaa gca ctg ggt atg gaa acc tgt 432
Ile Ala Ala Met Val Glu Gly Val Lys Ala Leu Gly Met Glu Thr Cys
130 135 140
gtt acc gcc ggt atg ctg agt gat gaa cag gcc cgt cgt ctg aaa gaa 480
Val Thr Ala Gly Met Leu Ser Asp Glu Gln Ala Arg Arg Leu Lys Glu
145 150 155 160
gcc ggt ctg gat tat tat aat cat aat ctg gat acc agc gaa agc tat 528
Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Glu Ser Tyr
165 170 175
tat ggc gaa att att acc acc cgc acc tat cag gat cgt ctg gat acc 576
Tyr Gly Glu Ile Ile Thr Thr Arg Thr Tyr Gln Asp Arg Leu Asp Thr
180 185 190
ctg cag cgt gtt cgt gat gca ggt atg cat gtt tgt tgt ggt ggt att 624
Leu Gln Arg Val Arg Asp Ala Gly Met His Val Cys Cys Gly Gly Ile
195 200 205
gtt ggt atg ggt gaa agc gca gca gat cgt gca ggt ctg ctg att ggt 672
Val Gly Met Gly Glu Ser Ala Ala Asp Arg Ala Gly Leu Leu Ile Gly
210 215 220
ctg gca aat ctg ccg cgt cat ccg gaa agc gtt ccg att aat ctg ctg 720
Leu Ala Asn Leu Pro Arg His Pro Glu Ser Val Pro Ile Asn Leu Leu
225 230 235 240
gtt cgt gtt gaa ggt acc ccg ctg gca gat acc gca gca ctg gat ccg 768
Val Arg Val Glu Gly Thr Pro Leu Ala Asp Thr Ala Ala Leu Asp Pro
245 250 255
ttt gat ttt gtt cgt acc gtt gca gtt gca cgt att atg atg ccg gca 816
Phe Asp Phe Val Arg Thr Val Ala Val Ala Arg Ile Met Met Pro Ala
260 265 270
agc cgt gtt cgt ctg agc gca ggt cgt agc gat atg agc gat gaa atg 864
Ser Arg Val Arg Leu Ser Ala Gly Arg Ser Asp Met Ser Asp Glu Met
275 280 285
cag gca ctg tgt ttt ctg gcc ggt gca aat agc att ttt tat ggt gat 912
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr Gly Asp
290 295 300
cgt ctg ctg acc acc gaa aat ccg cag gcc cag cgt gat cgc cgt ctg 960
Arg Leu Leu Thr Thr Glu Asn Pro Gln Ala Gln Arg Asp Arg Arg Leu
305 310 315 320
ttt gcc cgt ctg ggt ctg cgt atg gca ggt ctg ggt tgt taa 1002
Phe Ala Arg Leu Gly Leu Arg Met Ala Gly Leu Gly Cys
325 330
<210> 65
<211> 333
<212> PRT
<213> Methylococcus capsulatus str. Bath
<400> 65
Met His Ala Glu Val Ala Val Met Thr Asn Gln Glu Arg Ala Glu Glu
1 5 10 15
Pro Val Leu Arg His Asp Trp Thr Gln Gly Glu Ala Glu Ala Leu Phe
20 25 30
Ala Leu Pro Phe Asn Glu Leu Leu Phe Gln Ala Gln Thr Ile His Arg
35 40 45
Arg His Phe Asp Pro Asn Glu Val Gln Val Ser Ser Leu Leu Ser Ile
50 55 60
Lys Thr Gly Ala Cys Ser Glu Asp Cys Ala Tyr Cys Pro Gln Ser Ala
65 70 75 80
His Tyr Glu Thr Gly Val Lys Arg Glu Ser Leu Met Ser Leu Glu Asp
85 90 95
Val Leu Glu Ala Ala Gln Arg Ala Arg Glu Glu Gly Ala Thr Arg Phe
100 105 110
Cys Met Gly Ala Ala Trp Arg Ser Pro Arg Asp Gly Asp Leu Glu Ala
115 120 125
Ile Ala Ala Met Val Glu Gly Val Lys Ala Leu Gly Met Glu Thr Cys
130 135 140
Val Thr Ala Gly Met Leu Ser Asp Glu Gln Ala Arg Arg Leu Lys Glu
145 150 155 160
Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Glu Ser Tyr
165 170 175
Tyr Gly Glu Ile Ile Thr Thr Arg Thr Tyr Gln Asp Arg Leu Asp Thr
180 185 190
Leu Gln Arg Val Arg Asp Ala Gly Met His Val Cys Cys Gly Gly Ile
195 200 205
Val Gly Met Gly Glu Ser Ala Ala Asp Arg Ala Gly Leu Leu Ile Gly
210 215 220
Leu Ala Asn Leu Pro Arg His Pro Glu Ser Val Pro Ile Asn Leu Leu
225 230 235 240
Val Arg Val Glu Gly Thr Pro Leu Ala Asp Thr Ala Ala Leu Asp Pro
245 250 255
Phe Asp Phe Val Arg Thr Val Ala Val Ala Arg Ile Met Met Pro Ala
260 265 270
Ser Arg Val Arg Leu Ser Ala Gly Arg Ser Asp Met Ser Asp Glu Met
275 280 285
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr Gly Asp
290 295 300
Arg Leu Leu Thr Thr Glu Asn Pro Gln Ala Gln Arg Asp Arg Arg Leu
305 310 315 320
Phe Ala Arg Leu Gly Leu Arg Met Ala Gly Leu Gly Cys
325 330
<210> 66
<211> 1041
<212> DNA
<213> Leclercia adecarboxylata
<220>
<221> CDS
<222> (1)..(1041)
<223> bioB gene encoding biotin synthase from Leclercia adecarboxylata
<400> 66
atg gca cat cag acc cgt tgg acc ctg agc cag gtt acc gca ctg ttt 48
Met Ala His Gln Thr Arg Trp Thr Leu Ser Gln Val Thr Ala Leu Phe
1 5 10 15
gaa aaa ccg ctg ctg gaa ctg ctg ttt gaa gca cag cag att cat cgt 96
Glu Lys Pro Leu Leu Glu Leu Leu Phe Glu Ala Gln Gln Ile His Arg
20 25 30
cag cat ttt gat ccg cag cag att cag gtt agc acc ctg ctg agc att 144
Gln His Phe Asp Pro Gln Gln Ile Gln Val Ser Thr Leu Leu Ser Ile
35 40 45
aaa acc ggt gca tgt ccg gaa gat tgt aaa tat tgt ccg cag agc gca 192
Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys Tyr Cys Pro Gln Ser Ala
50 55 60
cgt tat aaa acc ggt ctg gaa tca gaa cgt ctg atg gaa gtt gaa cag 240
Arg Tyr Lys Thr Gly Leu Glu Ser Glu Arg Leu Met Glu Val Glu Gln
65 70 75 80
gtt ctg gaa agc gca cgt cag gca aaa aat gca ggt agc acc cgt ttt 288
Val Leu Glu Ser Ala Arg Gln Ala Lys Asn Ala Gly Ser Thr Arg Phe
85 90 95
tgt atg ggt gca gca tgg aaa aat ccg cat gaa cgt gat atg ccg tat 336
Cys Met Gly Ala Ala Trp Lys Asn Pro His Glu Arg Asp Met Pro Tyr
100 105 110
ctg gaa cag atg gtt cag ggt gtt aaa gca atg ggt ctg gaa gca tgt 384
Leu Glu Gln Met Val Gln Gly Val Lys Ala Met Gly Leu Glu Ala Cys
115 120 125
atg acc ctg ggt acc ctg gat gat acc cag gca cag cgt ctg gca agc 432
Met Thr Leu Gly Thr Leu Asp Asp Thr Gln Ala Gln Arg Leu Ala Ser
130 135 140
gca ggt ctg gat tat tat aat cat aat ctg gat acc agc ccg gaa ttt 480
Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe
145 150 155 160
tat ggc aat att att acc acc cgc acc tat cag gaa cgc ctg gat acc 528
Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr Gln Glu Arg Leu Asp Thr
165 170 175
ctg gat aaa gtt cgt gat gca ggt att aaa gtt tgt agc ggt ggt att 576
Leu Asp Lys Val Arg Asp Ala Gly Ile Lys Val Cys Ser Gly Gly Ile
180 185 190
gtt ggt ctg ggt gaa acc gtt acc gat cgt gca ggt ctg ctg ctg cag 624
Val Gly Leu Gly Glu Thr Val Thr Asp Arg Ala Gly Leu Leu Leu Gln
195 200 205
ctg gca aat ctg ccg acc ccg ccg gaa agc gtt ccg att aat atg ctg 672
Leu Ala Asn Leu Pro Thr Pro Pro Glu Ser Val Pro Ile Asn Met Leu
210 215 220
gtt aaa gtt aaa ggt acc ccg ctg gcc gat aat gat gat gtt gat gca 720
Val Lys Val Lys Gly Thr Pro Leu Ala Asp Asn Asp Asp Val Asp Ala
225 230 235 240
ttt gat ttt att cgt acc att gca gtg gcc cgt gtg atg atg ccg acc 768
Phe Asp Phe Ile Arg Thr Ile Ala Val Ala Arg Val Met Met Pro Thr
245 250 255
agc ttt gtt cgt ctg agc gcc ggt cgt gaa cag atg aat gaa cag acc 816
Ser Phe Val Arg Leu Ser Ala Gly Arg Glu Gln Met Asn Glu Gln Thr
260 265 270
cag gcc atg tgt ttt atg gcc ggt gca aat agc att ttt tat ggt tgt 864
Gln Ala Met Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Cys
275 280 285
aaa ctg ctg acc acc ccg aat ccg gaa gaa gat aaa gat gtt cag ctg 912
Lys Leu Leu Thr Thr Pro Asn Pro Glu Glu Asp Lys Asp Val Gln Leu
290 295 300
ttt cgt aaa ctg ggt ctg aat ccg cag cag acc gca gtt ctg acc ggt 960
Phe Arg Lys Leu Gly Leu Asn Pro Gln Gln Thr Ala Val Leu Thr Gly
305 310 315 320
gat aat gaa cag cag cat cag ctg gaa cag cag ctg att aat gca gat 1008
Asp Asn Glu Gln Gln His Gln Leu Glu Gln Gln Leu Ile Asn Ala Asp
325 330 335
acc gat cag ttt tat aat gca gca acc gtt taa 1041
Thr Asp Gln Phe Tyr Asn Ala Ala Thr Val
340 345
<210> 67
<211> 346
<212> PRT
<213> Leclercia adecarboxylata
<400> 67
Met Ala His Gln Thr Arg Trp Thr Leu Ser Gln Val Thr Ala Leu Phe
1 5 10 15
Glu Lys Pro Leu Leu Glu Leu Leu Phe Glu Ala Gln Gln Ile His Arg
20 25 30
Gln His Phe Asp Pro Gln Gln Ile Gln Val Ser Thr Leu Leu Ser Ile
35 40 45
Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys Tyr Cys Pro Gln Ser Ala
50 55 60
Arg Tyr Lys Thr Gly Leu Glu Ser Glu Arg Leu Met Glu Val Glu Gln
65 70 75 80
Val Leu Glu Ser Ala Arg Gln Ala Lys Asn Ala Gly Ser Thr Arg Phe
85 90 95
Cys Met Gly Ala Ala Trp Lys Asn Pro His Glu Arg Asp Met Pro Tyr
100 105 110
Leu Glu Gln Met Val Gln Gly Val Lys Ala Met Gly Leu Glu Ala Cys
115 120 125
Met Thr Leu Gly Thr Leu Asp Asp Thr Gln Ala Gln Arg Leu Ala Ser
130 135 140
Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro Glu Phe
145 150 155 160
Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr Gln Glu Arg Leu Asp Thr
165 170 175
Leu Asp Lys Val Arg Asp Ala Gly Ile Lys Val Cys Ser Gly Gly Ile
180 185 190
Val Gly Leu Gly Glu Thr Val Thr Asp Arg Ala Gly Leu Leu Leu Gln
195 200 205
Leu Ala Asn Leu Pro Thr Pro Pro Glu Ser Val Pro Ile Asn Met Leu
210 215 220
Val Lys Val Lys Gly Thr Pro Leu Ala Asp Asn Asp Asp Val Asp Ala
225 230 235 240
Phe Asp Phe Ile Arg Thr Ile Ala Val Ala Arg Val Met Met Pro Thr
245 250 255
Ser Phe Val Arg Leu Ser Ala Gly Arg Glu Gln Met Asn Glu Gln Thr
260 265 270
Gln Ala Met Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Tyr Gly Cys
275 280 285
Lys Leu Leu Thr Thr Pro Asn Pro Glu Glu Asp Lys Asp Val Gln Leu
290 295 300
Phe Arg Lys Leu Gly Leu Asn Pro Gln Gln Thr Ala Val Leu Thr Gly
305 310 315 320
Asp Asn Glu Gln Gln His Gln Leu Glu Gln Gln Leu Ile Asn Ala Asp
325 330 335
Thr Asp Gln Phe Tyr Asn Ala Ala Thr Val
340 345
<210> 68
<211> 1113
<212> DNA
<213> Chromohalobacter salexigens DSM 3043
<220>
<221> CDS
<222> (1)..(1113)
<223> bioB gene encoding biotin synthase from Chromohalobacter
salexigens DSM 3043
<400> 68
atg acc gca cag agc cgt gat ccg gca tgg acc gat gca agc ccg acc 48
Met Thr Ala Gln Ser Arg Asp Pro Ala Trp Thr Asp Ala Ser Pro Thr
1 5 10 15
ttt cag ccg acc atg cgt cat gat tgg agc ctg gaa gaa att gaa gca 96
Phe Gln Pro Thr Met Arg His Asp Trp Ser Leu Glu Glu Ile Glu Ala
20 25 30
ctg ttt gca ctg ccg ttt aat gat ctg ctg ttt cgt gca cag cag gtt 144
Leu Phe Ala Leu Pro Phe Asn Asp Leu Leu Phe Arg Ala Gln Gln Val
35 40 45
cat cgt gca cat ttt gat ccg aat gca gtt cag gtt agc acc ctg ctg 192
His Arg Ala His Phe Asp Pro Asn Ala Val Gln Val Ser Thr Leu Leu
50 55 60
agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa tat tgt ccg cag 240
Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys Tyr Cys Pro Gln
65 70 75 80
agc ggt cat tat aat acc ggc ctg ggt aaa gaa aaa ctg ctg gaa att 288
Ser Gly His Tyr Asn Thr Gly Leu Gly Lys Glu Lys Leu Leu Glu Ile
85 90 95
gaa aaa gtt gtt gaa cag gcc cgt gca gca aaa gca gca ggc gca agc 336
Glu Lys Val Val Glu Gln Ala Arg Ala Ala Lys Ala Ala Gly Ala Ser
100 105 110
cgt ttt tgt atg ggt gca gca tgg cgt agc ccg cgt gaa aaa gat ctg 384
Arg Phe Cys Met Gly Ala Ala Trp Arg Ser Pro Arg Glu Lys Asp Leu
115 120 125
cgt gtt gtt acc gaa atg gtt ggt cgt gtt aaa gca ctg ggc ctg gaa 432
Arg Val Val Thr Glu Met Val Gly Arg Val Lys Ala Leu Gly Leu Glu
130 135 140
acc tgt atg acc ctg ggt atg gtt gat gtt gat cag gca cgt cgt ctg 480
Thr Cys Met Thr Leu Gly Met Val Asp Val Asp Gln Ala Arg Arg Leu
145 150 155 160
gca gaa gcc ggt ctg gat tat tat aat cat aat ctg gat acc agc ccg 528
Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro
165 170 175
gat tat tat ggt gaa att att acc acc cgt acc tat gca gat cgt ctg 576
Asp Tyr Tyr Gly Glu Ile Ile Thr Thr Arg Thr Tyr Ala Asp Arg Leu
180 185 190
gaa acc ctg gcc aat gtt cgt gaa gca ggt atg aaa gtt tgt agt ggt 624
Glu Thr Leu Ala Asn Val Arg Glu Ala Gly Met Lys Val Cys Ser Gly
195 200 205
ggt att ctg ggt atg ggt gaa gca ccg cgt gat cgc gca gca ctg ctg 672
Gly Ile Leu Gly Met Gly Glu Ala Pro Arg Asp Arg Ala Ala Leu Leu
210 215 220
cag cag ctg gtt cgt ctg gat ccg cat ccg gaa agc gtt ccg att aat 720
Gln Gln Leu Val Arg Leu Asp Pro His Pro Glu Ser Val Pro Ile Asn
225 230 235 240
atg ctg gtt aaa gtt ccg ggt acc ccg atg gaa aat gtt gaa gat atg 768
Met Leu Val Lys Val Pro Gly Thr Pro Met Glu Asn Val Glu Asp Met
245 250 255
gat ccg ctg acc ttt att cgt gca att gca gtt gca cgt att ctg atg 816
Asp Pro Leu Thr Phe Ile Arg Ala Ile Ala Val Ala Arg Ile Leu Met
260 265 270
ccg aaa agc cat gtt cgt ctg agc gca ggt cgt gaa cag atg gat gaa 864
Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu Gln Met Asp Glu
275 280 285
agc acc cag gca ctg gca ttt ctg gcc ggt gca aat agc att ttt tat 912
Ser Thr Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr
290 295 300
ggt gat acc ctg ctg acc acc ggt aat ccg cag gtt gaa cgt gat cgt 960
Gly Asp Thr Leu Leu Thr Thr Gly Asn Pro Gln Val Glu Arg Asp Arg
305 310 315 320
gcc ctg ttt gat aaa ctg ggt ctg cat ccg gaa ccg agc gat ccg cat 1008
Ala Leu Phe Asp Lys Leu Gly Leu His Pro Glu Pro Ser Asp Pro His
325 330 335
gca gat gat gcc cat cgt gat gat gaa cag gca gaa att gca ctg gca 1056
Ala Asp Asp Ala His Arg Asp Asp Glu Gln Ala Glu Ile Ala Leu Ala
340 345 350
cat gca att cag cgt cag cgt gat gat gca ctg ttt tat gat gca acc 1104
His Ala Ile Gln Arg Gln Arg Asp Asp Ala Leu Phe Tyr Asp Ala Thr
355 360 365
cgt ggt taa 1113
Arg Gly
370
<210> 69
<211> 370
<212> PRT
<213> Chromohalobacter salexigens DSM 3043
<400> 69
Met Thr Ala Gln Ser Arg Asp Pro Ala Trp Thr Asp Ala Ser Pro Thr
1 5 10 15
Phe Gln Pro Thr Met Arg His Asp Trp Ser Leu Glu Glu Ile Glu Ala
20 25 30
Leu Phe Ala Leu Pro Phe Asn Asp Leu Leu Phe Arg Ala Gln Gln Val
35 40 45
His Arg Ala His Phe Asp Pro Asn Ala Val Gln Val Ser Thr Leu Leu
50 55 60
Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys Tyr Cys Pro Gln
65 70 75 80
Ser Gly His Tyr Asn Thr Gly Leu Gly Lys Glu Lys Leu Leu Glu Ile
85 90 95
Glu Lys Val Val Glu Gln Ala Arg Ala Ala Lys Ala Ala Gly Ala Ser
100 105 110
Arg Phe Cys Met Gly Ala Ala Trp Arg Ser Pro Arg Glu Lys Asp Leu
115 120 125
Arg Val Val Thr Glu Met Val Gly Arg Val Lys Ala Leu Gly Leu Glu
130 135 140
Thr Cys Met Thr Leu Gly Met Val Asp Val Asp Gln Ala Arg Arg Leu
145 150 155 160
Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu Asp Thr Ser Pro
165 170 175
Asp Tyr Tyr Gly Glu Ile Ile Thr Thr Arg Thr Tyr Ala Asp Arg Leu
180 185 190
Glu Thr Leu Ala Asn Val Arg Glu Ala Gly Met Lys Val Cys Ser Gly
195 200 205
Gly Ile Leu Gly Met Gly Glu Ala Pro Arg Asp Arg Ala Ala Leu Leu
210 215 220
Gln Gln Leu Val Arg Leu Asp Pro His Pro Glu Ser Val Pro Ile Asn
225 230 235 240
Met Leu Val Lys Val Pro Gly Thr Pro Met Glu Asn Val Glu Asp Met
245 250 255
Asp Pro Leu Thr Phe Ile Arg Ala Ile Ala Val Ala Arg Ile Leu Met
260 265 270
Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu Gln Met Asp Glu
275 280 285
Ser Thr Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn Ser Ile Phe Tyr
290 295 300
Gly Asp Thr Leu Leu Thr Thr Gly Asn Pro Gln Val Glu Arg Asp Arg
305 310 315 320
Ala Leu Phe Asp Lys Leu Gly Leu His Pro Glu Pro Ser Asp Pro His
325 330 335
Ala Asp Asp Ala His Arg Asp Asp Glu Gln Ala Glu Ile Ala Leu Ala
340 345 350
His Ala Ile Gln Arg Gln Arg Asp Asp Ala Leu Phe Tyr Asp Ala Thr
355 360 365
Arg Gly
370
<210> 70
<211> 1059
<212> DNA
<213> Pseudomonas caeni
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from Pseudomonas caeni
<400> 70
atg acc acc agc ccg cat gca gat acc cgt cat gat tgg acc ctg gca 48
Met Thr Thr Ser Pro His Ala Asp Thr Arg His Asp Trp Thr Leu Ala
1 5 10 15
gaa gtt acc gca ctg ctg cag cag ccg ttt aat gat ctg att ttt cag 96
Glu Val Thr Ala Leu Leu Gln Gln Pro Phe Asn Asp Leu Ile Phe Gln
20 25 30
gca cag agc gtt cat cgt cag cat ttt aat gca aat cgt gtt cag gtt 144
Ala Gln Ser Val His Arg Gln His Phe Asn Ala Asn Arg Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggt cat tat aat acc ggt ctg gat aaa gaa aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Asp Lys Glu Lys
65 70 75 80
ctg atg gaa gtg cag aaa gtt ctg gat gaa gcc aaa cgt gcc aaa gaa 288
Leu Met Glu Val Gln Lys Val Leu Asp Glu Ala Lys Arg Ala Lys Glu
85 90 95
att ggt agc acc cgt ttt tgc atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat ctg ccg tat gtt ctg gaa atg gtt aaa ggt gtt aaa gca 384
Ala Lys Asp Leu Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Ala
115 120 125
atg ggt atg gaa acc tgt atg acc ctg ggt aaa ctg gat gaa gca cag 432
Met Gly Met Glu Thr Cys Met Thr Leu Gly Lys Leu Asp Glu Ala Gln
130 135 140
acc aaa gcg ctg gca gat gcg ggt ctg gat tat tat aat cat aat ctg 480
Thr Lys Ala Leu Ala Asp Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggt aat att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
gca gaa cgt ctg cag acc ctg agc tat gtt cgt gat gca ggt atg aaa 576
Ala Glu Arg Leu Gln Thr Leu Ser Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc att gca gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Ile Ala Asp Arg
195 200 205
gca ggt ctg ctg att cag ctg gca aat ctg ccg gaa cat ccg gaa agc 672
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt gaa ggt acc ccg ctg gaa aat 720
Val Pro Ile Asn Met Leu Val Lys Val Glu Gly Thr Pro Leu Glu Asn
225 230 235 240
gca gaa gat gtt gat ccg ttt gat ttt att cgt atg ctg gca gtt gca 768
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
cgc att atg atg ccg aaa agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag agc ctg gca ttt ctg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ser Leu Ala Phe Leu Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc gca aat ccg cag gca 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
gat aaa gat atg cag ctg ttt gca cgt ctg ggt att aaa ccg gaa gca 960
Asp Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Ala
305 310 315 320
cgt gaa gaa tat gca gat gaa gtt cat cag gca gca att gaa cat gca 1008
Arg Glu Glu Tyr Ala Asp Glu Val His Gln Ala Ala Ile Glu His Ala
325 330 335
att att gaa cag cgt gat gcc agc ctg ttt tat gat gca gca acc cat 1056
Ile Ile Glu Gln Arg Asp Ala Ser Leu Phe Tyr Asp Ala Ala Thr His
340 345 350
taa 1059
<210> 71
<211> 352
<212> PRT
<213> Pseudomonas caeni
<400> 71
Met Thr Thr Ser Pro His Ala Asp Thr Arg His Asp Trp Thr Leu Ala
1 5 10 15
Glu Val Thr Ala Leu Leu Gln Gln Pro Phe Asn Asp Leu Ile Phe Gln
20 25 30
Ala Gln Ser Val His Arg Gln His Phe Asn Ala Asn Arg Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Asp Lys Glu Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Leu Asp Glu Ala Lys Arg Ala Lys Glu
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Leu Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Ala
115 120 125
Met Gly Met Glu Thr Cys Met Thr Leu Gly Lys Leu Asp Glu Ala Gln
130 135 140
Thr Lys Ala Leu Ala Asp Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Ala Glu Arg Leu Gln Thr Leu Ser Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Ile Ala Asp Arg
195 200 205
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Glu Gly Thr Pro Leu Glu Asn
225 230 235 240
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ser Leu Ala Phe Leu Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
Asp Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Ala
305 310 315 320
Arg Glu Glu Tyr Ala Asp Glu Val His Gln Ala Ala Ile Glu His Ala
325 330 335
Ile Ile Glu Gln Arg Asp Ala Ser Leu Phe Tyr Asp Ala Ala Thr His
340 345 350
<210> 72
<211> 1059
<212> DNA
<213> Pseudomonas monteilii
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from Pseudomonas monteilii
<400> 72
atg agc gca agc acc att gca acc acc cgt cat gat tgg acc ctg gcc 48
Met Ser Ala Ser Thr Ile Ala Thr Thr Arg His Asp Trp Thr Leu Ala
1 5 10 15
gaa gtt cgt gca ctg ttt cag cag ccg ttt aat gat ctg ctg ttt cag 96
Glu Val Arg Ala Leu Phe Gln Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
gca cag agc gtt cat cgt gca cat ttt gat gca aat cgt gtt cag gtt 144
Ala Gln Ser Val His Arg Ala His Phe Asp Ala Asn Arg Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgc ccg cag agc ggt cat tat aat acc ggt ctg gaa aaa cag aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Gln Lys
65 70 75 80
ctg atg gaa gtt cag aaa gtt ctg gaa gaa gca gca cgt gcc aaa gca 288
Leu Met Glu Val Gln Lys Val Leu Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
att ggt agc acc cgt ttt tgc atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat atg ccg tat gtt ctg cag atg gtt cag ggt gtt aaa gca 384
Ala Lys Asp Met Pro Tyr Val Leu Gln Met Val Gln Gly Val Lys Ala
115 120 125
atg ggt ctg gaa acc tgt atg acc ctg ggt cgt ctg gat cgc gaa cag 432
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Arg Glu Gln
130 135 140
acc gca gca ctg gca gaa gca ggt ctg gat tat tat aat cat aat ctg 480
Thr Ala Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggt agc att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Ser Ile Ile Thr Thr Arg Thr Tyr
165 170 175
gca gaa cgt ctg cag acc ctg gca tat gtt cgt gat gca ggt atg aaa 576
Ala Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc ctg gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
gca aat ctg ctg att cag ctg gcc aat ctg ccg gaa cat ccg gaa agc 672
Ala Asn Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt gcc ggt acc ccg ctg gca aat 720
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Ala Asn
225 230 235 240
gaa gaa gat gtt gat ccg ttt gat ttt att cgt atg ctg gca gtt gca 768
Glu Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
cgt att ctg atg ccg cag agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Leu Met Pro Gln Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag gca ctg gca ttt ctg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc ggt aat ccg cag gca 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Gly Asn Pro Gln Ala
290 295 300
gat cgt gat atg cag ctg ttt gca cgt ctg ggt att cag ccg gaa gcc 960
Asp Arg Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Gln Pro Glu Ala
305 310 315 320
ggt gaa ggt cat gca gat gaa gtt cat cag gca gca att gaa cag gca 1008
Gly Glu Gly His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
gtt att gaa cag cgt aat ggt gaa ctg ttt tat gat gca gtt agc gca 1056
Val Ile Glu Gln Arg Asn Gly Glu Leu Phe Tyr Asp Ala Val Ser Ala
340 345 350
taa 1059
<210> 73
<211> 352
<212> PRT
<213> Pseudomonas monteilii
<400> 73
Met Ser Ala Ser Thr Ile Ala Thr Thr Arg His Asp Trp Thr Leu Ala
1 5 10 15
Glu Val Arg Ala Leu Phe Gln Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
Ala Gln Ser Val His Arg Ala His Phe Asp Ala Asn Arg Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Gln Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Leu Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Met Pro Tyr Val Leu Gln Met Val Gln Gly Val Lys Ala
115 120 125
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Arg Glu Gln
130 135 140
Thr Ala Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Ser Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Ala Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
Ala Asn Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Ala Asn
225 230 235 240
Glu Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
Arg Ile Leu Met Pro Gln Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Gly Asn Pro Gln Ala
290 295 300
Asp Arg Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Gln Pro Glu Ala
305 310 315 320
Gly Glu Gly His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Val Ile Glu Gln Arg Asn Gly Glu Leu Phe Tyr Asp Ala Val Ser Ala
340 345 350
<210> 74
<211> 1059
<212> DNA
<213> Pseudomonas massiliensis CB1
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from Pseudomonas massiliensis
CB1
<400> 74
atg agc gca agc ctg aat agc ccg ctg cgt cat gat tgg acc ctg agc 48
Met Ser Ala Ser Leu Asn Ser Pro Leu Arg His Asp Trp Thr Leu Ser
1 5 10 15
gaa gtt aaa gcc ctg ttt acc cag ccg ttt aat gat ctg ctg ttt cat 96
Glu Val Lys Ala Leu Phe Thr Gln Pro Phe Asn Asp Leu Leu Phe His
20 25 30
gca atg agc gtt cat cgt gca cat ttt gat ccg aat cag gtt cag gtt 144
Ala Met Ser Val His Arg Ala His Phe Asp Pro Asn Gln Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggt cat tat aat acc ggc ctg gaa aaa gaa aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Glu Lys
65 70 75 80
ctg ctg gaa gtg cag aaa gtg att gaa gaa gca gca cgt gcc aaa gca 288
Leu Leu Glu Val Gln Lys Val Ile Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
att ggt agc acc cgt ttt tgt atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat atg ccg tat gtt ctg gaa atg gtt cgt ggt gtt aaa gca 384
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Arg Gly Val Lys Ala
115 120 125
ctg ggt ctg gaa acc tgt atg acc ctg ggt cgt ctg gat cgt gat cag 432
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Arg Asp Gln
130 135 140
acc gtt gca ctg gca gaa gca ggt ctg gat tat tat aat cat aat ctg 480
Thr Val Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggc aat att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
ggt gaa cgt ctg cag acc ctg gcc tat gtt cgt gat gca ggt atg aaa 576
Gly Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc ctg gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
gca ggt ctg ctg att cag ctg gca aat ctg ccg gaa cat ccg gaa agc 672
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt gcc ggt acc ccg ctg gaa aat 720
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Glu Asn
225 230 235 240
gca gaa gat gtt gat ccg ttt gat ttt att cgt atg ctg gca gtt gca 768
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
cgt att ctg atg ccg cgt agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Leu Met Pro Arg Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag gcc ctg gca ttt atg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc gca aat ccg cag gca 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
gat aaa gat atg cgt ctg ttt gca cgt ctg ggt att cgt ccg gaa gca 960
Asp Lys Asp Met Arg Leu Phe Ala Arg Leu Gly Ile Arg Pro Glu Ala
305 310 315 320
cgt gaa gaa cat gat gat gaa gtt cat cag gca gca att gaa cag gca 1008
Arg Glu Glu His Asp Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
ctg gtt gaa cag cgt agc ggt gaa ctg ttt tat gat gca gca gca gtt 1056
Leu Val Glu Gln Arg Ser Gly Glu Leu Phe Tyr Asp Ala Ala Ala Val
340 345 350
taa 1059
<210> 75
<211> 352
<212> PRT
<213> Pseudomonas massiliensis CB1
<400> 75
Met Ser Ala Ser Leu Asn Ser Pro Leu Arg His Asp Trp Thr Leu Ser
1 5 10 15
Glu Val Lys Ala Leu Phe Thr Gln Pro Phe Asn Asp Leu Leu Phe His
20 25 30
Ala Met Ser Val His Arg Ala His Phe Asp Pro Asn Gln Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Glu Lys
65 70 75 80
Leu Leu Glu Val Gln Lys Val Ile Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Arg Gly Val Lys Ala
115 120 125
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Arg Asp Gln
130 135 140
Thr Val Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Gly Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Glu Asn
225 230 235 240
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
Arg Ile Leu Met Pro Arg Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
Asp Lys Asp Met Arg Leu Phe Ala Arg Leu Gly Ile Arg Pro Glu Ala
305 310 315 320
Arg Glu Glu His Asp Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Leu Val Glu Gln Arg Ser Gly Glu Leu Phe Tyr Asp Ala Ala Ala Val
340 345 350
<210> 76
<211> 1059
<212> DNA
<213> Pseudomonas putida F1
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from Pseudomonas putida F1
<400> 76
atg agc gca agc acc acc gca acc acc cgt cat gat tgg agc ctg gca 48
Met Ser Ala Ser Thr Thr Ala Thr Thr Arg His Asp Trp Ser Leu Ala
1 5 10 15
gaa gtt aaa gcc ctg ttt cag cag ccg ttt aat gat ctg ctg ttt cag 96
Glu Val Lys Ala Leu Phe Gln Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
gca cag acc gtt cat cgt gca cat ttt aat ccg aat cgt gtt cag gtt 144
Ala Gln Thr Val His Arg Ala His Phe Asn Pro Asn Arg Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgc ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggt cat tat aat acc ggt ctg gaa aaa cag aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Gln Lys
65 70 75 80
ctg atg gaa gtt cag aaa gtt ctg gaa gaa gca gcc cgt gca aaa gca 288
Leu Met Glu Val Gln Lys Val Leu Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
att ggt agc acc cgt ttt tgc atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat atg ccg tat gtt ctg gaa atg gtt aaa ggt gtt aaa gca 384
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Ala
115 120 125
atg ggc ctg gaa acc tgt atg acc ctg ggt aaa ctg gat cag gaa cag 432
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Lys Leu Asp Gln Glu Gln
130 135 140
acc aaa gca ctg gca cat gcg ggt ctg gat tat tat aat cat aat ctg 480
Thr Lys Ala Leu Ala His Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggc agc att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Ser Ile Ile Thr Thr Arg Thr Tyr
165 170 175
agc gaa cgt ctg cag acc ctg gca tat gtt cgt gat gca ggt atg aaa 576
Ser Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc ctg gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
gca ggt ctg ctg att cag ctg gca aat ctg ccg gaa cat ccg gaa agc 672
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt gca ggt acc ccg ctg gcc gaa 720
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Ala Glu
225 230 235 240
gaa gaa gat gtt gat ccg ttt gat ttt att cgt atg ctg gca gtt gca 768
Glu Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
cgt att ctg atg ccg aaa agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Leu Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag gcg ctg gca ttt atg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc gcc aat ccg cag gca 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
gat aaa gat atg cag ctg ttt gca cgt ctg ggt att aaa ccg gaa gca 960
Asp Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Ala
305 310 315 320
cgt gaa gaa cat gca gat gaa gtt cat cag gca gca att gaa cag gca 1008
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
ctg gtt gaa cag cgt agc agc gaa atg ttt tat gat gca gca acc gca 1056
Leu Val Glu Gln Arg Ser Ser Glu Met Phe Tyr Asp Ala Ala Thr Ala
340 345 350
taa 1059
<210> 77
<211> 352
<212> PRT
<213> Pseudomonas putida F1
<400> 77
Met Ser Ala Ser Thr Thr Ala Thr Thr Arg His Asp Trp Ser Leu Ala
1 5 10 15
Glu Val Lys Ala Leu Phe Gln Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
Ala Gln Thr Val His Arg Ala His Phe Asn Pro Asn Arg Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Gln Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Leu Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Ala
115 120 125
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Lys Leu Asp Gln Glu Gln
130 135 140
Thr Lys Ala Leu Ala His Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Ser Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Ser Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Ala Glu
225 230 235 240
Glu Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
Arg Ile Leu Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
Asp Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Ala
305 310 315 320
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Leu Val Glu Gln Arg Ser Ser Glu Met Phe Tyr Asp Ala Ala Thr Ala
340 345 350
<210> 78
<211> 1059
<212> DNA
<213> Pseudomonas thermotolerans
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from Pseudomonas
thermotolerans
<400> 78
atg aat gca agc gtt gca gca gca att cgt cat gat tgg acc ctg gca 48
Met Asn Ala Ser Val Ala Ala Ala Ile Arg His Asp Trp Thr Leu Ala
1 5 10 15
gaa gtt aaa gcc ctg ttt gca ctg ccg ttt aat gat ctg ctg tat cag 96
Glu Val Lys Ala Leu Phe Ala Leu Pro Phe Asn Asp Leu Leu Tyr Gln
20 25 30
gca cag acc gtt cat cgt cag tat ttt gat gca aat cgt gtt cag gtt 144
Ala Gln Thr Val His Arg Gln Tyr Phe Asp Ala Asn Arg Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggt cat tat aat acc ggc ctg gaa aaa cag aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Gln Lys
65 70 75 80
ctg atg gaa gtt cag aaa gtt ctg cag gca gca gca gaa gca aaa gca 288
Leu Met Glu Val Gln Lys Val Leu Gln Ala Ala Ala Glu Ala Lys Ala
85 90 95
atg ggt agc acc cgt ttt tgt atg ggt gca gca tgg aaa cat ccg agc 336
Met Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat ttt ccg tat gtt ctg gaa atg gtt aaa ggt gtt aaa gca 384
Ala Lys Asp Phe Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Ala
115 120 125
ctg ggt ctg gaa acc tgt atg acc ctg ggt cgt ctg tca cgt gaa cag 432
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Ser Arg Glu Gln
130 135 140
acc cag gca ctg gcg gaa gcc ggt ctg gat tat tat aat cat aat ctg 480
Thr Gln Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggt cgt att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Arg Ile Ile Thr Thr Arg Thr Tyr
165 170 175
gca gaa cgt ctg cag acc ctg gca tat gtt cgt gaa gca ggt atg aaa 576
Ala Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Glu Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc ctg gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
gca ggt ctg ctg att cag ctg gca aat ctg ccg gaa cat ccg gaa agc 672
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt cag ggt acc ccg ctg gca gat 720
Val Pro Ile Asn Met Leu Val Lys Val Gln Gly Thr Pro Leu Ala Asp
225 230 235 240
gca gaa gat gtt gat ccg ttt gat ttt att cgt acc ctg gcg gtt gca 768
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Thr Leu Ala Val Ala
245 250 255
cgt att atg atg ccg aaa agc cat gtt cgt ctg agc gca ggt cgc gaa 816
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag gcg ctg gca ttt ctg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc ggt aat ccg cag gca 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Gly Asn Pro Gln Ala
290 295 300
gaa aaa gat ctg cag ctg ttt cgt cgt ctg ggt att cag ccg gaa gaa 960
Glu Lys Asp Leu Gln Leu Phe Arg Arg Leu Gly Ile Gln Pro Glu Glu
305 310 315 320
cgt gaa gaa cat gca gat gaa gtt cat cag gcc gca att gaa cag gcc 1008
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
ctg gcc gaa cag cgt gat agc cag ctg ttt tat gat gca gca agc gca 1056
Leu Ala Glu Gln Arg Asp Ser Gln Leu Phe Tyr Asp Ala Ala Ser Ala
340 345 350
taa 1059
<210> 79
<211> 352
<212> PRT
<213> Pseudomonas thermotolerans
<400> 79
Met Asn Ala Ser Val Ala Ala Ala Ile Arg His Asp Trp Thr Leu Ala
1 5 10 15
Glu Val Lys Ala Leu Phe Ala Leu Pro Phe Asn Asp Leu Leu Tyr Gln
20 25 30
Ala Gln Thr Val His Arg Gln Tyr Phe Asp Ala Asn Arg Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Gln Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Leu Gln Ala Ala Ala Glu Ala Lys Ala
85 90 95
Met Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Phe Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Ala
115 120 125
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Ser Arg Glu Gln
130 135 140
Thr Gln Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Arg Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Ala Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Glu Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Gln Gly Thr Pro Leu Ala Asp
225 230 235 240
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Thr Leu Ala Val Ala
245 250 255
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Leu Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Gly Asn Pro Gln Ala
290 295 300
Glu Lys Asp Leu Gln Leu Phe Arg Arg Leu Gly Ile Gln Pro Glu Glu
305 310 315 320
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Leu Ala Glu Gln Arg Asp Ser Gln Leu Phe Tyr Asp Ala Ala Ser Ala
340 345 350
<210> 80
<211> 1059
<212> DNA
<213> pseudomonad ancestor
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from pseudomonad ancestor
<400> 80
atg agc gca agc acc aat agc ccg ctg cgt cat gat tgg acc ctg agc 48
Met Ser Ala Ser Thr Asn Ser Pro Leu Arg His Asp Trp Thr Leu Ser
1 5 10 15
gaa gtt aaa gca ctg ttt acc cag ccg ttt aat gat ctg ctg ttt cat 96
Glu Val Lys Ala Leu Phe Thr Gln Pro Phe Asn Asp Leu Leu Phe His
20 25 30
gca atg acc gtt cat cgt gca cat ttt gat ccg aat cag gtt cag gtt 144
Ala Met Thr Val His Arg Ala His Phe Asp Pro Asn Gln Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggc cat tat aat acc ggc ctg gaa aaa gaa aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Glu Lys
65 70 75 80
ctg atg gaa gtg cag aaa gtt att gaa gaa gca gca cgt gcc aaa gca 288
Leu Met Glu Val Gln Lys Val Ile Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
att ggt agc acc cgt ttt tgc atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat atg ccg tat gtt ctg gaa atg gtt cgt ggt gtt aaa gca 384
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Arg Gly Val Lys Ala
115 120 125
atg ggt ctg gaa acc tgt atg acc ctg ggt cgt ctg gat cag gat cag 432
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Gln Asp Gln
130 135 140
acc gtt gca ctg gca gaa gca ggt ctg gat tat tat aat cat aat ctg 480
Thr Val Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggt aat att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
ggt gaa cgt ctg cag acc ctg gcc tat gtt cgt gat gca ggt atg aaa 576
Gly Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc ctg gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
gca ggt ctg ctg att cag ctg gca aat ctg ccg gaa cat ccg gaa agc 672
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt gca ggt acc ccg ctg gaa aat 720
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Glu Asn
225 230 235 240
gca gaa gat gtt gat ccg ttt gat ttt att cgt atg ctg gca gtt gca 768
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
cgt att ctg atg ccg cgt agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Leu Met Pro Arg Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag gcc ctg gca ttt atg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc gca aat ccg cag gca 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
gat aaa gat atg cag ctg ttt gca cgt ctg ggt att cgt ccg gaa gca 960
Asp Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Arg Pro Glu Ala
305 310 315 320
cgt gaa gaa cat gat gat gaa gtt cat cag gca gca att gaa cag gca 1008
Arg Glu Glu His Asp Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
ctg gtt gaa cag cgt agc agc gaa atg ttt tat gat gca gca gca gtt 1056
Leu Val Glu Gln Arg Ser Ser Glu Met Phe Tyr Asp Ala Ala Ala Val
340 345 350
taa 1059
<210> 81
<211> 352
<212> PRT
<213> pseudomonad ancestor
<400> 81
Met Ser Ala Ser Thr Asn Ser Pro Leu Arg His Asp Trp Thr Leu Ser
1 5 10 15
Glu Val Lys Ala Leu Phe Thr Gln Pro Phe Asn Asp Leu Leu Phe His
20 25 30
Ala Met Thr Val His Arg Ala His Phe Asp Pro Asn Gln Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Glu Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Ile Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Arg Gly Val Lys Ala
115 120 125
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Gln Asp Gln
130 135 140
Thr Val Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Gly Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Glu Asn
225 230 235 240
Ala Glu Asp Val Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
Arg Ile Leu Met Pro Arg Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
Asp Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Arg Pro Glu Ala
305 310 315 320
Arg Glu Glu His Asp Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Leu Val Glu Gln Arg Ser Ser Glu Met Phe Tyr Asp Ala Ala Ala Val
340 345 350
<210> 82
<211> 1059
<212> DNA
<213> Pseudomonas aeruginosa PAO1
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from Pseudomonas aeruginosa
PAO1
<400> 82
atg agc gca acc gca agc gtt gca acc cgt cat gat tgg agc ctg gca 48
Met Ser Ala Thr Ala Ser Val Ala Thr Arg His Asp Trp Ser Leu Ala
1 5 10 15
gaa gtt cgt gca ctg ttt gaa cag ccg ttt aat gat ctg ctg ttt cag 96
Glu Val Arg Ala Leu Phe Glu Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
gca cag acc gtt cat cgt gca cat ttt gat ccg aat cgt gtt cag gtt 144
Ala Gln Thr Val His Arg Ala His Phe Asp Pro Asn Arg Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggt cat tat aat acc ggc ctg gat aaa gaa aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Asp Lys Glu Lys
65 70 75 80
ctg atg gaa gtt cag aaa gtt ctg gaa gca gca gca gaa gca aaa gca 288
Leu Met Glu Val Gln Lys Val Leu Glu Ala Ala Ala Glu Ala Lys Ala
85 90 95
att ggt agc acc cgt ttt tgc atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat atg ccg tat gtt ctg gaa atg gtt aaa ggt gtt aaa aaa 384
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Lys
115 120 125
ctg ggt ctg gaa acc tgt atg acc ctg ggt cgt ctg acc cag gaa cag 432
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Thr Gln Glu Gln
130 135 140
acc cag gcg ctg gca gat gcc ggt ctg gat tat tat aat cat aat ctg 480
Thr Gln Ala Leu Ala Asp Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggt aat att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
agc gaa cgt ctg cag acc ctg gcc tat gtt cgt gaa gca ggt atg aaa 576
Ser Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Glu Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc gtt gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Val Asp Asp Arg
195 200 205
gca ggt ctg ctg att cag ctg gca aat ctg ccg gaa cat ccg gaa agc 672
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt aaa ggt acc ccg ctg gcc gaa 720
Val Pro Ile Asn Met Leu Val Lys Val Lys Gly Thr Pro Leu Ala Glu
225 230 235 240
gaa aaa gat gtt gat ccg ttt gat ttt att cgt acc ctg gca gtt gcc 768
Glu Lys Asp Val Asp Pro Phe Asp Phe Ile Arg Thr Leu Ala Val Ala
245 250 255
cgt att atg atg ccg aaa agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag gca ctg gca ttt atg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc aaa aat ccg cag gcc 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Lys Asn Pro Gln Ala
290 295 300
gaa aaa gat atg cag ctg ttt gca cgt ctg ggt att aaa ccg gaa gaa 960
Glu Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Glu
305 310 315 320
cgt gaa gaa cat gca gat gaa gtt cat cag gca gca att gaa cag gcc 1008
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
ctg gtt gaa cag cgt gaa agc aaa ctg ttt tat aat gca gca agc gca 1056
Leu Val Glu Gln Arg Glu Ser Lys Leu Phe Tyr Asn Ala Ala Ser Ala
340 345 350
taa 1059
<210> 83
<211> 352
<212> PRT
<213> Pseudomonas aeruginosa PAO1
<400> 83
Met Ser Ala Thr Ala Ser Val Ala Thr Arg His Asp Trp Ser Leu Ala
1 5 10 15
Glu Val Arg Ala Leu Phe Glu Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
Ala Gln Thr Val His Arg Ala His Phe Asp Pro Asn Arg Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Asp Lys Glu Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Leu Glu Ala Ala Ala Glu Ala Lys Ala
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Lys
115 120 125
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Thr Gln Glu Gln
130 135 140
Thr Gln Ala Leu Ala Asp Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Ser Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Glu Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Val Asp Asp Arg
195 200 205
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Lys Gly Thr Pro Leu Ala Glu
225 230 235 240
Glu Lys Asp Val Asp Pro Phe Asp Phe Ile Arg Thr Leu Ala Val Ala
245 250 255
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Lys Asn Pro Gln Ala
290 295 300
Glu Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Glu
305 310 315 320
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Leu Val Glu Gln Arg Glu Ser Lys Leu Phe Tyr Asn Ala Ala Ser Ala
340 345 350
<210> 84
<211> 1059
<212> DNA
<213> Pseudomonas balearica
<220>
<221> CDS
<222> (1)..(1059)
<223> bioB gene encoding biotin synthase from Pseudomonas balearica
<400> 84
atg agc gca acc gca agc att gca acc cgt cat gat tgg agc ctg gcg 48
Met Ser Ala Thr Ala Ser Ile Ala Thr Arg His Asp Trp Ser Leu Ala
1 5 10 15
gaa gtt aaa gca ctg ttt gaa cag ccg ttt aat gat ctg ctg ttt cag 96
Glu Val Lys Ala Leu Phe Glu Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
gca cag acc gtt cat cgt gca cat ttt gat ccg aat cgt gtt cag gtt 144
Ala Gln Thr Val His Arg Ala His Phe Asp Pro Asn Arg Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgc aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggt cat tat aat acc ggt ctg gat aaa gaa aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Asp Lys Glu Lys
65 70 75 80
ctg atg gaa gtt cag aaa gtt ctg gaa gca gca gca gaa gcc aaa gca 288
Leu Met Glu Val Gln Lys Val Leu Glu Ala Ala Ala Glu Ala Lys Ala
85 90 95
att ggt agc acc cgt ttt tgc atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat atg ccg tat gtt ctg gaa atg gtt aaa ggt gtt aaa aaa 384
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Lys
115 120 125
ctg ggt ctg gaa acc tgt atg acc ctg ggt cgt ctg gat cag gaa cag 432
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Gln Glu Gln
130 135 140
acc cag gcc ctg gca gaa gca ggc ctg gat tat tat aat cat aat ctg 480
Thr Gln Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggt aat att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
agc gaa cgc ctg cag acc ctg gca tat gtt cgt gaa gca ggt atg aaa 576
Ser Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Glu Ala Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc gtt gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Val Asp Asp Arg
195 200 205
gca ggt ctg ctg att cag ctg gca aat ctg ccg gaa cat ccg gaa agc 672
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtg aaa ggt acc ccg ctg gcc gaa 720
Val Pro Ile Asn Met Leu Val Lys Val Lys Gly Thr Pro Leu Ala Glu
225 230 235 240
gaa aaa gat gtt gat ccg ttt gat ttt att cgt acc ctg gcc gtt gca 768
Glu Lys Asp Val Asp Pro Phe Asp Phe Ile Arg Thr Leu Ala Val Ala
245 250 255
cgt att atg atg ccg aaa agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
cag atg aat gaa cag atg cag gcg ctg gca ttt atg gcc ggt gca aat 864
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gaa aaa ctg ctg acc acc gca aat ccg cag gca 912
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
gaa aaa gat atg cag ctg ttt gca cgt ctg ggt att aaa ccg gaa gaa 960
Glu Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Glu
305 310 315 320
cgt gaa gaa cat gca gat gaa gtt cat cag gca gca att gaa cag gca 1008
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
ctg gtt gaa cag cgt gat agc cag ctg ttt tat gat gca gca agc gca 1056
Leu Val Glu Gln Arg Asp Ser Gln Leu Phe Tyr Asp Ala Ala Ser Ala
340 345 350
taa 1059
<210> 85
<211> 352
<212> PRT
<213> Pseudomonas balearica
<400> 85
Met Ser Ala Thr Ala Ser Ile Ala Thr Arg His Asp Trp Ser Leu Ala
1 5 10 15
Glu Val Lys Ala Leu Phe Glu Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
Ala Gln Thr Val His Arg Ala His Phe Asp Pro Asn Arg Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Asp Lys Glu Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Leu Glu Ala Ala Ala Glu Ala Lys Ala
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Met Pro Tyr Val Leu Glu Met Val Lys Gly Val Lys Lys
115 120 125
Leu Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Gln Glu Gln
130 135 140
Thr Gln Ala Leu Ala Glu Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Asn Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Ser Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Glu Ala Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Val Asp Asp Arg
195 200 205
Ala Gly Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Lys Gly Thr Pro Leu Ala Glu
225 230 235 240
Glu Lys Asp Val Asp Pro Phe Asp Phe Ile Arg Thr Leu Ala Val Ala
245 250 255
Arg Ile Met Met Pro Lys Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Gln Met Asn Glu Gln Met Gln Ala Leu Ala Phe Met Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Glu Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
Glu Lys Asp Met Gln Leu Phe Ala Arg Leu Gly Ile Lys Pro Glu Glu
305 310 315 320
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Leu Val Glu Gln Arg Asp Ser Gln Leu Phe Tyr Asp Ala Ala Ser Ala
340 345 350
<210> 86
<211> 1056
<212> DNA
<213> Pseudomonas fluorescens SBW25
<220>
<221> CDS
<222> (1)..(1056)
<223> bioB gene encoding biotin synthase from Pseudomonas fluorescens
SBW25
<400> 86
atg agc gca agc acc acc gcc acc ctg cgt cat gat tgg agc ctg gca 48
Met Ser Ala Ser Thr Thr Ala Thr Leu Arg His Asp Trp Ser Leu Ala
1 5 10 15
gaa gtt aaa gca ctg ttt gtt cag ccg ttt aat gat ctg ctg ttt cag 96
Glu Val Lys Ala Leu Phe Val Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
gca cag acc gtt cat cgt gca cat ttt gat gca aat cgt gtt cag gtt 144
Ala Gln Thr Val His Arg Ala His Phe Asp Ala Asn Arg Val Gln Val
35 40 45
agc acc ctg ctg agc att aaa acc ggt gca tgt ccg gaa gat tgt aaa 192
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
tat tgt ccg cag agc ggt cat tat aat acc ggc ctg gaa aaa gaa aaa 240
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Glu Lys
65 70 75 80
ctg atg gaa gtt cag aaa gtt ctg gaa gaa gca gca cgt gca aaa gca 288
Leu Met Glu Val Gln Lys Val Leu Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
att ggt agc acc cgt ttt tgc atg ggt gca gca tgg aaa cat ccg agc 336
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
gca aaa gat atg ccg tat gtt ctg cag atg gtt aaa ggt gtt aaa gca 384
Ala Lys Asp Met Pro Tyr Val Leu Gln Met Val Lys Gly Val Lys Ala
115 120 125
atg ggt ctg gaa acc tgt atg acc ctg ggt cgt ctg gat cag gat cag 432
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Gln Asp Gln
130 135 140
acc gaa gca ctg gca cag gca ggt ctg gat tat tat aat cat aat ctg 480
Thr Glu Ala Leu Ala Gln Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
gat acc agc ccg gaa ttt tat ggt agc att att acc acc cgt acc tat 528
Asp Thr Ser Pro Glu Phe Tyr Gly Ser Ile Ile Thr Thr Arg Thr Tyr
165 170 175
ggt gaa cgt ctg cag acc ctg gca tat gtt cgt gat agc ggt atg aaa 576
Gly Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ser Gly Met Lys
180 185 190
att tgt agc ggt ggt att ctg ggt atg ggt gaa agc ctg gat gat cgt 624
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
gca aat ctg ctg att cag ctg gcc aat ctg ccg gaa cat ccg gaa agc 672
Ala Asn Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
gtt ccg att aat atg ctg gtt aaa gtt gca ggt acc ccg ctg gaa aat 720
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Glu Asn
225 230 235 240
gcc gaa gat att gat ccg ttt gat ttt att cgt atg ctg gca gtt gca 768
Ala Glu Asp Ile Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
cgt att ctg atg ccg cgt agc cat gtt cgt ctg agc gca ggt cgt gaa 816
Arg Ile Leu Met Pro Arg Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
gca atg aat gaa cag atg cag gca ctg gcc ttt ttt gcc ggt gca aat 864
Ala Met Asn Glu Gln Met Gln Ala Leu Ala Phe Phe Ala Gly Ala Asn
275 280 285
agc att ttt tat ggt gat aaa ctg ctg acc acc gca aat ccg cag gca 912
Ser Ile Phe Tyr Gly Asp Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
gat aaa gat atg cag ctg ttt agc cgt ctg ggt att ctg ccg gaa gca 960
Asp Lys Asp Met Gln Leu Phe Ser Arg Leu Gly Ile Leu Pro Glu Ala
305 310 315 320
cgt gaa gaa cat gca gat gaa gtt cat cag gca gca att gaa cag gcc 1008
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
ctg gtt gaa cag aaa agc agc gaa cag ttt tat aat gca gca gtt taa 1056
Leu Val Glu Gln Lys Ser Ser Glu Gln Phe Tyr Asn Ala Ala Val
340 345 350
<210> 87
<211> 351
<212> PRT
<213> Pseudomonas fluorescens SBW25
<400> 87
Met Ser Ala Ser Thr Thr Ala Thr Leu Arg His Asp Trp Ser Leu Ala
1 5 10 15
Glu Val Lys Ala Leu Phe Val Gln Pro Phe Asn Asp Leu Leu Phe Gln
20 25 30
Ala Gln Thr Val His Arg Ala His Phe Asp Ala Asn Arg Val Gln Val
35 40 45
Ser Thr Leu Leu Ser Ile Lys Thr Gly Ala Cys Pro Glu Asp Cys Lys
50 55 60
Tyr Cys Pro Gln Ser Gly His Tyr Asn Thr Gly Leu Glu Lys Glu Lys
65 70 75 80
Leu Met Glu Val Gln Lys Val Leu Glu Glu Ala Ala Arg Ala Lys Ala
85 90 95
Ile Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Lys His Pro Ser
100 105 110
Ala Lys Asp Met Pro Tyr Val Leu Gln Met Val Lys Gly Val Lys Ala
115 120 125
Met Gly Leu Glu Thr Cys Met Thr Leu Gly Arg Leu Asp Gln Asp Gln
130 135 140
Thr Glu Ala Leu Ala Gln Ala Gly Leu Asp Tyr Tyr Asn His Asn Leu
145 150 155 160
Asp Thr Ser Pro Glu Phe Tyr Gly Ser Ile Ile Thr Thr Arg Thr Tyr
165 170 175
Gly Glu Arg Leu Gln Thr Leu Ala Tyr Val Arg Asp Ser Gly Met Lys
180 185 190
Ile Cys Ser Gly Gly Ile Leu Gly Met Gly Glu Ser Leu Asp Asp Arg
195 200 205
Ala Asn Leu Leu Ile Gln Leu Ala Asn Leu Pro Glu His Pro Glu Ser
210 215 220
Val Pro Ile Asn Met Leu Val Lys Val Ala Gly Thr Pro Leu Glu Asn
225 230 235 240
Ala Glu Asp Ile Asp Pro Phe Asp Phe Ile Arg Met Leu Ala Val Ala
245 250 255
Arg Ile Leu Met Pro Arg Ser His Val Arg Leu Ser Ala Gly Arg Glu
260 265 270
Ala Met Asn Glu Gln Met Gln Ala Leu Ala Phe Phe Ala Gly Ala Asn
275 280 285
Ser Ile Phe Tyr Gly Asp Lys Leu Leu Thr Thr Ala Asn Pro Gln Ala
290 295 300
Asp Lys Asp Met Gln Leu Phe Ser Arg Leu Gly Ile Leu Pro Glu Ala
305 310 315 320
Arg Glu Glu His Ala Asp Glu Val His Gln Ala Ala Ile Glu Gln Ala
325 330 335
Leu Val Glu Gln Lys Ser Ser Glu Gln Phe Tyr Asn Ala Ala Val
340 345 350
<210> 88
<211> 756
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(756)
<223> bioC gene encoding SAM (S-adenosylmethionine)-dependent
methyltransferase (BioC)
<400> 88
atg gca acg gtt aat aaa caa gcc att gca gcg gca ttt ggt cgg gca 48
Met Ala Thr Val Asn Lys Gln Ala Ile Ala Ala Ala Phe Gly Arg Ala
1 5 10 15
gcc gca cac tat gag caa cat gca gat cta cag cgc cag agt gct gac 96
Ala Ala His Tyr Glu Gln His Ala Asp Leu Gln Arg Gln Ser Ala Asp
20 25 30
gcc tta ctg gca atg ctt cca cag cgt aaa tac acc cac gta ctg gac 144
Ala Leu Leu Ala Met Leu Pro Gln Arg Lys Tyr Thr His Val Leu Asp
35 40 45
gcg ggt tgt gga cct ggc tgg atg agc cgc cac tgg cgg gaa cgt cac 192
Ala Gly Cys Gly Pro Gly Trp Met Ser Arg His Trp Arg Glu Arg His
50 55 60
gcg cag gtg acg gcc tta gat ctc tcg ccg cca atg ctt gtt cag gca 240
Ala Gln Val Thr Ala Leu Asp Leu Ser Pro Pro Met Leu Val Gln Ala
65 70 75 80
cgc cag aag gat gcc gca gac cat tat ctg gcg gga gat atc gaa tcc 288
Arg Gln Lys Asp Ala Ala Asp His Tyr Leu Ala Gly Asp Ile Glu Ser
85 90 95
ctg ccg tta gcg act gcg acg ttc gat ctt gca tgg agc aat ctc gca 336
Leu Pro Leu Ala Thr Ala Thr Phe Asp Leu Ala Trp Ser Asn Leu Ala
100 105 110
gtg cag tgg tgc ggt aat tta tcc acg gca ctc cgc gag ctg tat cgg 384
Val Gln Trp Cys Gly Asn Leu Ser Thr Ala Leu Arg Glu Leu Tyr Arg
115 120 125
gtg gtg cgc ccc aaa ggc gtg gtc gcg ttt acc acg ctg gtg cag gga 432
Val Val Arg Pro Lys Gly Val Val Ala Phe Thr Thr Leu Val Gln Gly
130 135 140
tcg tta ccc gaa ctg cat cag gcg tgg cag gcg gtg gac gag cgt ccg 480
Ser Leu Pro Glu Leu His Gln Ala Trp Gln Ala Val Asp Glu Arg Pro
145 150 155 160
cat gct aat cgc ttt tta ccg cca gat gaa atc gaa cag tcg ctg aac 528
His Ala Asn Arg Phe Leu Pro Pro Asp Glu Ile Glu Gln Ser Leu Asn
165 170 175
ggc gtg cat tat caa cat cat att cag ccc atc acg ctg tgg ttt gat 576
Gly Val His Tyr Gln His His Ile Gln Pro Ile Thr Leu Trp Phe Asp
180 185 190
gat gcg ctc agt gcc atg cgt tcg ctg aaa ggc atc ggt gcc acg cat 624
Asp Ala Leu Ser Ala Met Arg Ser Leu Lys Gly Ile Gly Ala Thr His
195 200 205
ctt cat gaa ggg cgc gac ccg cga ata tta acg cgt tcg cag ttg cag 672
Leu His Glu Gly Arg Asp Pro Arg Ile Leu Thr Arg Ser Gln Leu Gln
210 215 220
cga ttg caa ctg gcc tgg ccg caa cag cag ggg cga tat cct ctg acg 720
Arg Leu Gln Leu Ala Trp Pro Gln Gln Gln Gly Arg Tyr Pro Leu Thr
225 230 235 240
tat cat ctt ttt ttg gga gtg att gct cgt gag taa 756
Tyr His Leu Phe Leu Gly Val Ile Ala Arg Glu
245 250
<210> 89
<211> 251
<212> PRT
<213> Escherichia coli
<400> 89
Met Ala Thr Val Asn Lys Gln Ala Ile Ala Ala Ala Phe Gly Arg Ala
1 5 10 15
Ala Ala His Tyr Glu Gln His Ala Asp Leu Gln Arg Gln Ser Ala Asp
20 25 30
Ala Leu Leu Ala Met Leu Pro Gln Arg Lys Tyr Thr His Val Leu Asp
35 40 45
Ala Gly Cys Gly Pro Gly Trp Met Ser Arg His Trp Arg Glu Arg His
50 55 60
Ala Gln Val Thr Ala Leu Asp Leu Ser Pro Pro Met Leu Val Gln Ala
65 70 75 80
Arg Gln Lys Asp Ala Ala Asp His Tyr Leu Ala Gly Asp Ile Glu Ser
85 90 95
Leu Pro Leu Ala Thr Ala Thr Phe Asp Leu Ala Trp Ser Asn Leu Ala
100 105 110
Val Gln Trp Cys Gly Asn Leu Ser Thr Ala Leu Arg Glu Leu Tyr Arg
115 120 125
Val Val Arg Pro Lys Gly Val Val Ala Phe Thr Thr Leu Val Gln Gly
130 135 140
Ser Leu Pro Glu Leu His Gln Ala Trp Gln Ala Val Asp Glu Arg Pro
145 150 155 160
His Ala Asn Arg Phe Leu Pro Pro Asp Glu Ile Glu Gln Ser Leu Asn
165 170 175
Gly Val His Tyr Gln His His Ile Gln Pro Ile Thr Leu Trp Phe Asp
180 185 190
Asp Ala Leu Ser Ala Met Arg Ser Leu Lys Gly Ile Gly Ala Thr His
195 200 205
Leu His Glu Gly Arg Asp Pro Arg Ile Leu Thr Arg Ser Gln Leu Gln
210 215 220
Arg Leu Gln Leu Ala Trp Pro Gln Gln Gln Gly Arg Tyr Pro Leu Thr
225 230 235 240
Tyr His Leu Phe Leu Gly Val Ile Ala Arg Glu
245 250
<210> 90
<211> 1155
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1155)
<223> bioF gene encoding (Bio F) 7-keto-8-aminopelargonic acid (KAPA)
synthase
<400> 90
atg agc tgg cag gag aaa atc aac gcg gcg ctc gat gcg cgg cgt gct 48
Met Ser Trp Gln Glu Lys Ile Asn Ala Ala Leu Asp Ala Arg Arg Ala
1 5 10 15
gcc gat gcc ctg cgt cgc cgt tat ccg gtg gcg caa gga gcc gga cgc 96
Ala Asp Ala Leu Arg Arg Arg Tyr Pro Val Ala Gln Gly Ala Gly Arg
20 25 30
tgg ctg gtg gcg gat gat cgc cag tat ctg aac ttt tcc agt aac gat 144
Trp Leu Val Ala Asp Asp Arg Gln Tyr Leu Asn Phe Ser Ser Asn Asp
35 40 45
tat tta ggt tta agc cat cat ccg caa att atc cgt gcc tgg cag cag 192
Tyr Leu Gly Leu Ser His His Pro Gln Ile Ile Arg Ala Trp Gln Gln
50 55 60
ggg gcg gag caa ttt ggc atc ggt agc ggc ggc tcc ggt cac gtc agc 240
Gly Ala Glu Gln Phe Gly Ile Gly Ser Gly Gly Ser Gly His Val Ser
65 70 75 80
ggt tat agc gtg gtg cat cag gca ctg gaa gaa gag ctg gcc gag tgg 288
Gly Tyr Ser Val Val His Gln Ala Leu Glu Glu Glu Leu Ala Glu Trp
85 90 95
ctt ggc tat tcg cgg gca ctg ctg ttt atc tct ggt ttc gcc gct aat 336
Leu Gly Tyr Ser Arg Ala Leu Leu Phe Ile Ser Gly Phe Ala Ala Asn
100 105 110
cag gca gtt att gcc gcg atg atg gcg aaa gag gac cgt att gct gcc 384
Gln Ala Val Ile Ala Ala Met Met Ala Lys Glu Asp Arg Ile Ala Ala
115 120 125
gac cgg ctt agc cat gcc tca ttg ctg gaa gct gcc agt tta agc ccg 432
Asp Arg Leu Ser His Ala Ser Leu Leu Glu Ala Ala Ser Leu Ser Pro
130 135 140
tcg cag ctt cgc cgt ttt gct cat aac gat gtc act cat ttg gcg cga 480
Ser Gln Leu Arg Arg Phe Ala His Asn Asp Val Thr His Leu Ala Arg
145 150 155 160
ttg ctt gct tcc ccc tgt ccg ggg cag caa atg gtg gtg aca gaa ggc 528
Leu Leu Ala Ser Pro Cys Pro Gly Gln Gln Met Val Val Thr Glu Gly
165 170 175
gtg ttc agc atg gac ggc gat agt gcg cca ctg gcg gaa atc cag cag 576
Val Phe Ser Met Asp Gly Asp Ser Ala Pro Leu Ala Glu Ile Gln Gln
180 185 190
gta acg caa cag cac aat ggc tgg ttg atg gtc gat gat gcc cac ggc 624
Val Thr Gln Gln His Asn Gly Trp Leu Met Val Asp Asp Ala His Gly
195 200 205
acg ggc gtt atc ggg gag cag ggg cgc ggc agc tgc tgg ctg caa aag 672
Thr Gly Val Ile Gly Glu Gln Gly Arg Gly Ser Cys Trp Leu Gln Lys
210 215 220
gta aaa cca gaa ttg ctg gta gtg act ttt ggc aaa gga ttt ggc gtc 720
Val Lys Pro Glu Leu Leu Val Val Thr Phe Gly Lys Gly Phe Gly Val
225 230 235 240
agc ggg gca gcg gtg ctt tgc tcc agt acg gtg gcg gat tat ctg ctg 768
Ser Gly Ala Ala Val Leu Cys Ser Ser Thr Val Ala Asp Tyr Leu Leu
245 250 255
caa ttc gcc cgc cac ctt atc tac agc acc agt atg ccg ccc gct cag 816
Gln Phe Ala Arg His Leu Ile Tyr Ser Thr Ser Met Pro Pro Ala Gln
260 265 270
gcg cag gca tta cgt gcg tcg ctg gcg gtc att cgc agt gat gag ggt 864
Ala Gln Ala Leu Arg Ala Ser Leu Ala Val Ile Arg Ser Asp Glu Gly
275 280 285
gat gca cgg cgc gaa aaa ctg gcg gca ctc att acg cgt ttt cgt gcc 912
Asp Ala Arg Arg Glu Lys Leu Ala Ala Leu Ile Thr Arg Phe Arg Ala
290 295 300
gga gta cag gat ttg ccg ttt acg ctt gct gat tca tgc agc gcc atc 960
Gly Val Gln Asp Leu Pro Phe Thr Leu Ala Asp Ser Cys Ser Ala Ile
305 310 315 320
cag cca ttg att gtc ggt gat aac agc cgt gcg tta caa ctg gca gaa 1008
Gln Pro Leu Ile Val Gly Asp Asn Ser Arg Ala Leu Gln Leu Ala Glu
325 330 335
aaa ctg cgt cag caa ggc tgc tgg gtc acg gcg att cgc ccg cca acc 1056
Lys Leu Arg Gln Gln Gly Cys Trp Val Thr Ala Ile Arg Pro Pro Thr
340 345 350
gta ccc gct ggt act gcg cga ctg cgc tta acg cta acc gct gcg cat 1104
Val Pro Ala Gly Thr Ala Arg Leu Arg Leu Thr Leu Thr Ala Ala His
355 360 365
gaa atg cag gat atc gac cgt ctg ctg gag gtg ctg cat ggc aac ggt 1152
Glu Met Gln Asp Ile Asp Arg Leu Leu Glu Val Leu His Gly Asn Gly
370 375 380
taa 1155
<210> 91
<211> 384
<212> PRT
<213> Escherichia coli
<400> 91
Met Ser Trp Gln Glu Lys Ile Asn Ala Ala Leu Asp Ala Arg Arg Ala
1 5 10 15
Ala Asp Ala Leu Arg Arg Arg Tyr Pro Val Ala Gln Gly Ala Gly Arg
20 25 30
Trp Leu Val Ala Asp Asp Arg Gln Tyr Leu Asn Phe Ser Ser Asn Asp
35 40 45
Tyr Leu Gly Leu Ser His His Pro Gln Ile Ile Arg Ala Trp Gln Gln
50 55 60
Gly Ala Glu Gln Phe Gly Ile Gly Ser Gly Gly Ser Gly His Val Ser
65 70 75 80
Gly Tyr Ser Val Val His Gln Ala Leu Glu Glu Glu Leu Ala Glu Trp
85 90 95
Leu Gly Tyr Ser Arg Ala Leu Leu Phe Ile Ser Gly Phe Ala Ala Asn
100 105 110
Gln Ala Val Ile Ala Ala Met Met Ala Lys Glu Asp Arg Ile Ala Ala
115 120 125
Asp Arg Leu Ser His Ala Ser Leu Leu Glu Ala Ala Ser Leu Ser Pro
130 135 140
Ser Gln Leu Arg Arg Phe Ala His Asn Asp Val Thr His Leu Ala Arg
145 150 155 160
Leu Leu Ala Ser Pro Cys Pro Gly Gln Gln Met Val Val Thr Glu Gly
165 170 175
Val Phe Ser Met Asp Gly Asp Ser Ala Pro Leu Ala Glu Ile Gln Gln
180 185 190
Val Thr Gln Gln His Asn Gly Trp Leu Met Val Asp Asp Ala His Gly
195 200 205
Thr Gly Val Ile Gly Glu Gln Gly Arg Gly Ser Cys Trp Leu Gln Lys
210 215 220
Val Lys Pro Glu Leu Leu Val Val Thr Phe Gly Lys Gly Phe Gly Val
225 230 235 240
Ser Gly Ala Ala Val Leu Cys Ser Ser Thr Val Ala Asp Tyr Leu Leu
245 250 255
Gln Phe Ala Arg His Leu Ile Tyr Ser Thr Ser Met Pro Pro Ala Gln
260 265 270
Ala Gln Ala Leu Arg Ala Ser Leu Ala Val Ile Arg Ser Asp Glu Gly
275 280 285
Asp Ala Arg Arg Glu Lys Leu Ala Ala Leu Ile Thr Arg Phe Arg Ala
290 295 300
Gly Val Gln Asp Leu Pro Phe Thr Leu Ala Asp Ser Cys Ser Ala Ile
305 310 315 320
Gln Pro Leu Ile Val Gly Asp Asn Ser Arg Ala Leu Gln Leu Ala Glu
325 330 335
Lys Leu Arg Gln Gln Gly Cys Trp Val Thr Ala Ile Arg Pro Pro Thr
340 345 350
Val Pro Ala Gly Thr Ala Arg Leu Arg Leu Thr Leu Thr Ala Ala His
355 360 365
Glu Met Gln Asp Ile Asp Arg Leu Leu Glu Val Leu His Gly Asn Gly
370 375 380
<210> 92
<211> 1290
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1290)
<223> bioA gene encoding 7,8-Diaminopelargonic Acid (DAPA) Synthase
(BioA)
<400> 92
atg aca acg gac gat ctt gcc ttt gac caa cgc cat atc tgg cac cca 48
Met Thr Thr Asp Asp Leu Ala Phe Asp Gln Arg His Ile Trp His Pro
1 5 10 15
tac aca tcc atg acc tcc cct ctg ccg gtt tat ccg gtg gtg agc gcc 96
Tyr Thr Ser Met Thr Ser Pro Leu Pro Val Tyr Pro Val Val Ser Ala
20 25 30
gaa ggt tgc gag ctg att ttg tct gac ggc aga cgc ctg gtt gac ggt 144
Glu Gly Cys Glu Leu Ile Leu Ser Asp Gly Arg Arg Leu Val Asp Gly
35 40 45
atg tcg tcc tgg tgg gcg gcg atc cac ggc tac aat cac ccg cag ctt 192
Met Ser Ser Trp Trp Ala Ala Ile His Gly Tyr Asn His Pro Gln Leu
50 55 60
aat gcg gcg atg aag tcg caa att gat gcc atg tcg cat gtg atg ttt 240
Asn Ala Ala Met Lys Ser Gln Ile Asp Ala Met Ser His Val Met Phe
65 70 75 80
ggc ggt atc acc cat gcg cca gcc att gag ctg tgc cgc aaa ctg gtg 288
Gly Gly Ile Thr His Ala Pro Ala Ile Glu Leu Cys Arg Lys Leu Val
85 90 95
gcg atg acg ccg caa ccg ctg gag tgc gtt ttt ctc gcg gac tcc ggt 336
Ala Met Thr Pro Gln Pro Leu Glu Cys Val Phe Leu Ala Asp Ser Gly
100 105 110
tcc gta gcg gtg gaa gtg gcg atg aaa atg gcg ttg cag tac tgg caa 384
Ser Val Ala Val Glu Val Ala Met Lys Met Ala Leu Gln Tyr Trp Gln
115 120 125
gcc aaa ggc gaa gcg cgc cag cgt ttt ctg acc ttc cgc aat ggt tat 432
Ala Lys Gly Glu Ala Arg Gln Arg Phe Leu Thr Phe Arg Asn Gly Tyr
130 135 140
cat ggc gat acc ttt ggc gcg atg tcg gtg tgc gat ccg gat aac tca 480
His Gly Asp Thr Phe Gly Ala Met Ser Val Cys Asp Pro Asp Asn Ser
145 150 155 160
atg cac agt ctg tgg aaa ggc tac ctg cca gaa aac ctg ttt gct ccc 528
Met His Ser Leu Trp Lys Gly Tyr Leu Pro Glu Asn Leu Phe Ala Pro
165 170 175
gcc ccg caa agc cgc atg gat ggc gaa tgg gat gag cgc gat atg gtg 576
Ala Pro Gln Ser Arg Met Asp Gly Glu Trp Asp Glu Arg Asp Met Val
180 185 190
ggc ttt gcc cgc ctg atg gcg gcg cat cgt cat gaa atc gcg gcg gtg 624
Gly Phe Ala Arg Leu Met Ala Ala His Arg His Glu Ile Ala Ala Val
195 200 205
atc att gag ccg att gtc cag ggc gca ggc ggg atg cgc atg tac cat 672
Ile Ile Glu Pro Ile Val Gln Gly Ala Gly Gly Met Arg Met Tyr His
210 215 220
ccg gaa tgg tta aaa cga atc cgc aaa ata tgc gat cgc gaa ggt atc 720
Pro Glu Trp Leu Lys Arg Ile Arg Lys Ile Cys Asp Arg Glu Gly Ile
225 230 235 240
ttg ctg att gcc gac gag atc gcc act gga ttt ggt cgt acc ggg aaa 768
Leu Leu Ile Ala Asp Glu Ile Ala Thr Gly Phe Gly Arg Thr Gly Lys
245 250 255
ctg ttt gcc tgt gaa cat gca gaa atc gcg ccg gac att ttg tgc ctc 816
Leu Phe Ala Cys Glu His Ala Glu Ile Ala Pro Asp Ile Leu Cys Leu
260 265 270
ggt aaa gcc tta acc ggc ggc aca atg acc ctt tcc gcc aca ctc acc 864
Gly Lys Ala Leu Thr Gly Gly Thr Met Thr Leu Ser Ala Thr Leu Thr
275 280 285
acg cgc gag gtt gca gaa acc atc agt aac ggt gaa gcc ggt tgc ttt 912
Thr Arg Glu Val Ala Glu Thr Ile Ser Asn Gly Glu Ala Gly Cys Phe
290 295 300
atg cat ggg cca act ttt atg ggc aat ccg ctg gcc tgc gcg gca gca 960
Met His Gly Pro Thr Phe Met Gly Asn Pro Leu Ala Cys Ala Ala Ala
305 310 315 320
aac gcc agc ctg gcg att ctc gaa tct ggc gac tgg cag caa cag gtg 1008
Asn Ala Ser Leu Ala Ile Leu Glu Ser Gly Asp Trp Gln Gln Gln Val
325 330 335
gcg gat att gaa gta cag ctg cgc gag caa ctt gcc ccc gcc cgt gat 1056
Ala Asp Ile Glu Val Gln Leu Arg Glu Gln Leu Ala Pro Ala Arg Asp
340 345 350
gcc gaa atg gtt gcc gat gtg cgc gta ctg ggg gcc att ggc gtg gtc 1104
Ala Glu Met Val Ala Asp Val Arg Val Leu Gly Ala Ile Gly Val Val
355 360 365
gaa acc act cat ccg gtg aat atg gcg gcg ctg caa aaa ttc ttt gtc 1152
Glu Thr Thr His Pro Val Asn Met Ala Ala Leu Gln Lys Phe Phe Val
370 375 380
gaa cag ggt gtc tgg atc cgg cct ttt ggc aaa ctg att tac ctg atg 1200
Glu Gln Gly Val Trp Ile Arg Pro Phe Gly Lys Leu Ile Tyr Leu Met
385 390 395 400
ccg ccc tat att att ctc ccg caa cag ttg cag cgt ctg acc gca gcg 1248
Pro Pro Tyr Ile Ile Leu Pro Gln Gln Leu Gln Arg Leu Thr Ala Ala
405 410 415
gtt aac cgc gcg gta cag gat gaa aca ttt ttt tgc caa taa 1290
Val Asn Arg Ala Val Gln Asp Glu Thr Phe Phe Cys Gln
420 425
<210> 93
<211> 429
<212> PRT
<213> Escherichia coli
<400> 93
Met Thr Thr Asp Asp Leu Ala Phe Asp Gln Arg His Ile Trp His Pro
1 5 10 15
Tyr Thr Ser Met Thr Ser Pro Leu Pro Val Tyr Pro Val Val Ser Ala
20 25 30
Glu Gly Cys Glu Leu Ile Leu Ser Asp Gly Arg Arg Leu Val Asp Gly
35 40 45
Met Ser Ser Trp Trp Ala Ala Ile His Gly Tyr Asn His Pro Gln Leu
50 55 60
Asn Ala Ala Met Lys Ser Gln Ile Asp Ala Met Ser His Val Met Phe
65 70 75 80
Gly Gly Ile Thr His Ala Pro Ala Ile Glu Leu Cys Arg Lys Leu Val
85 90 95
Ala Met Thr Pro Gln Pro Leu Glu Cys Val Phe Leu Ala Asp Ser Gly
100 105 110
Ser Val Ala Val Glu Val Ala Met Lys Met Ala Leu Gln Tyr Trp Gln
115 120 125
Ala Lys Gly Glu Ala Arg Gln Arg Phe Leu Thr Phe Arg Asn Gly Tyr
130 135 140
His Gly Asp Thr Phe Gly Ala Met Ser Val Cys Asp Pro Asp Asn Ser
145 150 155 160
Met His Ser Leu Trp Lys Gly Tyr Leu Pro Glu Asn Leu Phe Ala Pro
165 170 175
Ala Pro Gln Ser Arg Met Asp Gly Glu Trp Asp Glu Arg Asp Met Val
180 185 190
Gly Phe Ala Arg Leu Met Ala Ala His Arg His Glu Ile Ala Ala Val
195 200 205
Ile Ile Glu Pro Ile Val Gln Gly Ala Gly Gly Met Arg Met Tyr His
210 215 220
Pro Glu Trp Leu Lys Arg Ile Arg Lys Ile Cys Asp Arg Glu Gly Ile
225 230 235 240
Leu Leu Ile Ala Asp Glu Ile Ala Thr Gly Phe Gly Arg Thr Gly Lys
245 250 255
Leu Phe Ala Cys Glu His Ala Glu Ile Ala Pro Asp Ile Leu Cys Leu
260 265 270
Gly Lys Ala Leu Thr Gly Gly Thr Met Thr Leu Ser Ala Thr Leu Thr
275 280 285
Thr Arg Glu Val Ala Glu Thr Ile Ser Asn Gly Glu Ala Gly Cys Phe
290 295 300
Met His Gly Pro Thr Phe Met Gly Asn Pro Leu Ala Cys Ala Ala Ala
305 310 315 320
Asn Ala Ser Leu Ala Ile Leu Glu Ser Gly Asp Trp Gln Gln Gln Val
325 330 335
Ala Asp Ile Glu Val Gln Leu Arg Glu Gln Leu Ala Pro Ala Arg Asp
340 345 350
Ala Glu Met Val Ala Asp Val Arg Val Leu Gly Ala Ile Gly Val Val
355 360 365
Glu Thr Thr His Pro Val Asn Met Ala Ala Leu Gln Lys Phe Phe Val
370 375 380
Glu Gln Gly Val Trp Ile Arg Pro Phe Gly Lys Leu Ile Tyr Leu Met
385 390 395 400
Pro Pro Tyr Ile Ile Leu Pro Gln Gln Leu Gln Arg Leu Thr Ala Ala
405 410 415
Val Asn Arg Ala Val Gln Asp Glu Thr Phe Phe Cys Gln
420 425
<210> 94
<211> 678
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(678)
<223> bioD gene encoding Dethiobiotin (DTB) Synthetase (BioD)
<400> 94
gtg agt aaa cgt tat ttt gtc acc gga acg gat acc gaa gtg ggg aaa 48
Val Ser Lys Arg Tyr Phe Val Thr Gly Thr Asp Thr Glu Val Gly Lys
1 5 10 15
act gtc gcc agt tgt gca ctt tta caa gcc gca aag gca gca ggc tac 96
Thr Val Ala Ser Cys Ala Leu Leu Gln Ala Ala Lys Ala Ala Gly Tyr
20 25 30
cgg acg gca ggt tat aaa ccg gtc gcc tct ggc agc gaa aag acc ccg 144
Arg Thr Ala Gly Tyr Lys Pro Val Ala Ser Gly Ser Glu Lys Thr Pro
35 40 45
gaa ggt tta cgc aat agc gac gcg ctg gcg tta cag cgc aac agc agc 192
Glu Gly Leu Arg Asn Ser Asp Ala Leu Ala Leu Gln Arg Asn Ser Ser
50 55 60
ctg cag ctg gat tac gca aca gta aat cct tac acc ttc gca gaa ccc 240
Leu Gln Leu Asp Tyr Ala Thr Val Asn Pro Tyr Thr Phe Ala Glu Pro
65 70 75 80
act tcg ccg cac atc atc agc gcg caa gag ggc aga ccg ata gaa tca 288
Thr Ser Pro His Ile Ile Ser Ala Gln Glu Gly Arg Pro Ile Glu Ser
85 90 95
ttg gta atg agc gcc gga tta cgc gcg ctt gaa caa cag gct gac tgg 336
Leu Val Met Ser Ala Gly Leu Arg Ala Leu Glu Gln Gln Ala Asp Trp
100 105 110
gtg tta gtg gaa ggt gct ggc ggc tgg ttt acg ccg ctt tct gac act 384
Val Leu Val Glu Gly Ala Gly Gly Trp Phe Thr Pro Leu Ser Asp Thr
115 120 125
ttc act ttt gca gat tgg gta aca cag gaa caa ctg ccg gtg ata ctg 432
Phe Thr Phe Ala Asp Trp Val Thr Gln Glu Gln Leu Pro Val Ile Leu
130 135 140
gta gtt ggt gtg aaa ctc ggc tgt att aat cac gcg atg ttg act gca 480
Val Val Gly Val Lys Leu Gly Cys Ile Asn His Ala Met Leu Thr Ala
145 150 155 160
cag gta ata caa cac gcc gga ctg act ctg gcg ggt tgg gtg gcg aac 528
Gln Val Ile Gln His Ala Gly Leu Thr Leu Ala Gly Trp Val Ala Asn
165 170 175
gat gtt acg cct ccg gga aaa cgt cac gct gaa tat atg acc acg ctc 576
Asp Val Thr Pro Pro Gly Lys Arg His Ala Glu Tyr Met Thr Thr Leu
180 185 190
acc cgc atg att ccc gcg ccg ctg ctg gga gag atc ccc tgg ctt gca 624
Thr Arg Met Ile Pro Ala Pro Leu Leu Gly Glu Ile Pro Trp Leu Ala
195 200 205
gaa aat cca gaa aat gcg gca acc gga aag tac ata aac ctt gcc ttg 672
Glu Asn Pro Glu Asn Ala Ala Thr Gly Lys Tyr Ile Asn Leu Ala Leu
210 215 220
ttg tag 678
Leu
225
<210> 95
<211> 225
<212> PRT
<213> Escherichia coli
<400> 95
Val Ser Lys Arg Tyr Phe Val Thr Gly Thr Asp Thr Glu Val Gly Lys
1 5 10 15
Thr Val Ala Ser Cys Ala Leu Leu Gln Ala Ala Lys Ala Ala Gly Tyr
20 25 30
Arg Thr Ala Gly Tyr Lys Pro Val Ala Ser Gly Ser Glu Lys Thr Pro
35 40 45
Glu Gly Leu Arg Asn Ser Asp Ala Leu Ala Leu Gln Arg Asn Ser Ser
50 55 60
Leu Gln Leu Asp Tyr Ala Thr Val Asn Pro Tyr Thr Phe Ala Glu Pro
65 70 75 80
Thr Ser Pro His Ile Ile Ser Ala Gln Glu Gly Arg Pro Ile Glu Ser
85 90 95
Leu Val Met Ser Ala Gly Leu Arg Ala Leu Glu Gln Gln Ala Asp Trp
100 105 110
Val Leu Val Glu Gly Ala Gly Gly Trp Phe Thr Pro Leu Ser Asp Thr
115 120 125
Phe Thr Phe Ala Asp Trp Val Thr Gln Glu Gln Leu Pro Val Ile Leu
130 135 140
Val Val Gly Val Lys Leu Gly Cys Ile Asn His Ala Met Leu Thr Ala
145 150 155 160
Gln Val Ile Gln His Ala Gly Leu Thr Leu Ala Gly Trp Val Ala Asn
165 170 175
Asp Val Thr Pro Pro Gly Lys Arg His Ala Glu Tyr Met Thr Thr Leu
180 185 190
Thr Arg Met Ile Pro Ala Pro Leu Leu Gly Glu Ile Pro Trp Leu Ala
195 200 205
Glu Asn Pro Glu Asn Ala Ala Thr Gly Lys Tyr Ile Asn Leu Ala Leu
210 215 220
Leu
225
<210> 96
<211> 1347
<212> DNA
<213> Bacillus subtilis
<220>
<221> CDS
<222> (1)..(1347)
<223> bioK gene encoding L-lysine:8-amino-7-oxononanoate
aminotransferase (BioK)
<400> 96
atg act cac gat tta atc gaa aaa agc aaa aag cac ttg tgg ctg ccc 48
Met Thr His Asp Leu Ile Glu Lys Ser Lys Lys His Leu Trp Leu Pro
1 5 10 15
ttc act cag atg aaa gat tat gac gaa aac cct ttg atc att gaa agc 96
Phe Thr Gln Met Lys Asp Tyr Asp Glu Asn Pro Leu Ile Ile Glu Ser
20 25 30
ggc aca ggg att aaa gta aaa gat atc aat ggg aag gaa tat tac gac 144
Gly Thr Gly Ile Lys Val Lys Asp Ile Asn Gly Lys Glu Tyr Tyr Asp
35 40 45
ggc ttc agt tct gtt tgg ctg aac gtt cac ggg cac cgc aag aag gag 192
Gly Phe Ser Ser Val Trp Leu Asn Val His Gly His Arg Lys Lys Glu
50 55 60
ctt gat gac gca atc aag aag caa tta ggt aag att gcg cat agt acg 240
Leu Asp Asp Ala Ile Lys Lys Gln Leu Gly Lys Ile Ala His Ser Thr
65 70 75 80
ctt tta gga atg acg aat gtc cct gca act caa tta gct gaa aca ctt 288
Leu Leu Gly Met Thr Asn Val Pro Ala Thr Gln Leu Ala Glu Thr Leu
85 90 95
att gat att tcc cct aaa aag ctg acc cgt gtt ttt tat tct gat tcc 336
Ile Asp Ile Ser Pro Lys Lys Leu Thr Arg Val Phe Tyr Ser Asp Ser
100 105 110
ggt gca gaa gct atg gag att gcg ctt aaa atg gcc ttt cag tat tgg 384
Gly Ala Glu Ala Met Glu Ile Ala Leu Lys Met Ala Phe Gln Tyr Trp
115 120 125
aaa aat att ggc aaa cca gaa aag caa aaa ttc atc gcc atg aag aat 432
Lys Asn Ile Gly Lys Pro Glu Lys Gln Lys Phe Ile Ala Met Lys Asn
130 135 140
gga tac cac ggt gat acc atc gga gca gta agc gta ggc tca att gag 480
Gly Tyr His Gly Asp Thr Ile Gly Ala Val Ser Val Gly Ser Ile Glu
145 150 155 160
ttg ttt cac cac gta tat ggt cca ttg atg ttt gag tct tac aag gcg 528
Leu Phe His His Val Tyr Gly Pro Leu Met Phe Glu Ser Tyr Lys Ala
165 170 175
cct att ccc tat gtt tac cgc tcg gag tca ggt gac cca gat gag tgc 576
Pro Ile Pro Tyr Val Tyr Arg Ser Glu Ser Gly Asp Pro Asp Glu Cys
180 185 190
cgc gac cag tgc ctt cgc gaa ttg gcc cag ctt ttg gag gaa cac cat 624
Arg Asp Gln Cys Leu Arg Glu Leu Ala Gln Leu Leu Glu Glu His His
195 200 205
gag gag atc gcg gca ctg agt att gaa tca atg gtt caa ggg gcg agt 672
Glu Glu Ile Ala Ala Leu Ser Ile Glu Ser Met Val Gln Gly Ala Ser
210 215 220
gga atg att gta atg cca gaa ggc tac tta gca ggc gta cgc gaa ctt 720
Gly Met Ile Val Met Pro Glu Gly Tyr Leu Ala Gly Val Arg Glu Leu
225 230 235 240
tgc acg act tac gat gtc ttg atg att gtg gat gaa gtt gca aca gga 768
Cys Thr Thr Tyr Asp Val Leu Met Ile Val Asp Glu Val Ala Thr Gly
245 250 255
ttc ggt cgc acc ggg aaa atg ttt gca tgc gaa cat gaa aac gtg caa 816
Phe Gly Arg Thr Gly Lys Met Phe Ala Cys Glu His Glu Asn Val Gln
260 265 270
ccg gat ctg atg gcc gca ggc aag ggt atc acg ggc gga tac ctt ccg 864
Pro Asp Leu Met Ala Ala Gly Lys Gly Ile Thr Gly Gly Tyr Leu Pro
275 280 285
att gcg gtt act ttt gcc acc gaa gac att tat aag gca ttt tac gat 912
Ile Ala Val Thr Phe Ala Thr Glu Asp Ile Tyr Lys Ala Phe Tyr Asp
290 295 300
gat tat gaa aac ttg aag acc ttt ttt cat gga cac tct tac aca gga 960
Asp Tyr Glu Asn Leu Lys Thr Phe Phe His Gly His Ser Tyr Thr Gly
305 310 315 320
aat caa ctg ggt tgt gca gtc gca ctg gag aat ctg gca ctg ttt gaa 1008
Asn Gln Leu Gly Cys Ala Val Ala Leu Glu Asn Leu Ala Leu Phe Glu
325 330 335
agc gaa aac att gtt gag cag gtc gct gaa aaa tcg aag aaa tta cat 1056
Ser Glu Asn Ile Val Glu Gln Val Ala Glu Lys Ser Lys Lys Leu His
340 345 350
ttt tta tta caa gat tta cat gcc ttg cca cat gta ggc gac atc cgt 1104
Phe Leu Leu Gln Asp Leu His Ala Leu Pro His Val Gly Asp Ile Arg
355 360 365
caa tta gga ttc atg tgt ggt gcg gag tta gtc cgt agc aaa gaa aca 1152
Gln Leu Gly Phe Met Cys Gly Ala Glu Leu Val Arg Ser Lys Glu Thr
370 375 380
aaa gag ccc tat ccc gct gat cgt cgc atc ggt tac aaa gtc agt ctt 1200
Lys Glu Pro Tyr Pro Ala Asp Arg Arg Ile Gly Tyr Lys Val Ser Leu
385 390 395 400
aaa atg cgt gaa tta ggg atg ttg aca cgc ccg ttg gga gat gtt att 1248
Lys Met Arg Glu Leu Gly Met Leu Thr Arg Pro Leu Gly Asp Val Ile
405 410 415
gca ttt ttg cct ccg tta gcg tct acc gcg gag gag ctg agt gag atg 1296
Ala Phe Leu Pro Pro Leu Ala Ser Thr Ala Glu Glu Leu Ser Glu Met
420 425 430
gta gca att atg aag caa gcc att cac gaa gtt act tcc ttg gaa gac 1344
Val Ala Ile Met Lys Gln Ala Ile His Glu Val Thr Ser Leu Glu Asp
435 440 445
tga 1347
<210> 97
<211> 448
<212> PRT
<213> Bacillus subtilis
<400> 97
Met Thr His Asp Leu Ile Glu Lys Ser Lys Lys His Leu Trp Leu Pro
1 5 10 15
Phe Thr Gln Met Lys Asp Tyr Asp Glu Asn Pro Leu Ile Ile Glu Ser
20 25 30
Gly Thr Gly Ile Lys Val Lys Asp Ile Asn Gly Lys Glu Tyr Tyr Asp
35 40 45
Gly Phe Ser Ser Val Trp Leu Asn Val His Gly His Arg Lys Lys Glu
50 55 60
Leu Asp Asp Ala Ile Lys Lys Gln Leu Gly Lys Ile Ala His Ser Thr
65 70 75 80
Leu Leu Gly Met Thr Asn Val Pro Ala Thr Gln Leu Ala Glu Thr Leu
85 90 95
Ile Asp Ile Ser Pro Lys Lys Leu Thr Arg Val Phe Tyr Ser Asp Ser
100 105 110
Gly Ala Glu Ala Met Glu Ile Ala Leu Lys Met Ala Phe Gln Tyr Trp
115 120 125
Lys Asn Ile Gly Lys Pro Glu Lys Gln Lys Phe Ile Ala Met Lys Asn
130 135 140
Gly Tyr His Gly Asp Thr Ile Gly Ala Val Ser Val Gly Ser Ile Glu
145 150 155 160
Leu Phe His His Val Tyr Gly Pro Leu Met Phe Glu Ser Tyr Lys Ala
165 170 175
Pro Ile Pro Tyr Val Tyr Arg Ser Glu Ser Gly Asp Pro Asp Glu Cys
180 185 190
Arg Asp Gln Cys Leu Arg Glu Leu Ala Gln Leu Leu Glu Glu His His
195 200 205
Glu Glu Ile Ala Ala Leu Ser Ile Glu Ser Met Val Gln Gly Ala Ser
210 215 220
Gly Met Ile Val Met Pro Glu Gly Tyr Leu Ala Gly Val Arg Glu Leu
225 230 235 240
Cys Thr Thr Tyr Asp Val Leu Met Ile Val Asp Glu Val Ala Thr Gly
245 250 255
Phe Gly Arg Thr Gly Lys Met Phe Ala Cys Glu His Glu Asn Val Gln
260 265 270
Pro Asp Leu Met Ala Ala Gly Lys Gly Ile Thr Gly Gly Tyr Leu Pro
275 280 285
Ile Ala Val Thr Phe Ala Thr Glu Asp Ile Tyr Lys Ala Phe Tyr Asp
290 295 300
Asp Tyr Glu Asn Leu Lys Thr Phe Phe His Gly His Ser Tyr Thr Gly
305 310 315 320
Asn Gln Leu Gly Cys Ala Val Ala Leu Glu Asn Leu Ala Leu Phe Glu
325 330 335
Ser Glu Asn Ile Val Glu Gln Val Ala Glu Lys Ser Lys Lys Leu His
340 345 350
Phe Leu Leu Gln Asp Leu His Ala Leu Pro His Val Gly Asp Ile Arg
355 360 365
Gln Leu Gly Phe Met Cys Gly Ala Glu Leu Val Arg Ser Lys Glu Thr
370 375 380
Lys Glu Pro Tyr Pro Ala Asp Arg Arg Ile Gly Tyr Lys Val Ser Leu
385 390 395 400
Lys Met Arg Glu Leu Gly Met Leu Thr Arg Pro Leu Gly Asp Val Ile
405 410 415
Ala Phe Leu Pro Pro Leu Ala Ser Thr Ala Glu Glu Leu Ser Glu Met
420 425 430
Val Ala Ile Met Lys Gln Ala Ile His Glu Val Thr Ser Leu Glu Asp
435 440 445
<210> 98
<211> 771
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(771)
<223> bioH gene encoding Pimeloyl-[acyl-carrier protein] methyl ester
esterase (BioH)
<400> 98
atg aat aac atc tgg tgg cag acc aaa ggt cag ggg aat gtt cat ctt 48
Met Asn Asn Ile Trp Trp Gln Thr Lys Gly Gln Gly Asn Val His Leu
1 5 10 15
gtg ctg ctg cac gga tgg gga ctg aat gcc gaa gtg tgg cgt tgc att 96
Val Leu Leu His Gly Trp Gly Leu Asn Ala Glu Val Trp Arg Cys Ile
20 25 30
gac gag gaa ctt agc tcg cat ttt acg ctg cac ctt gtt gac ctg ccc 144
Asp Glu Glu Leu Ser Ser His Phe Thr Leu His Leu Val Asp Leu Pro
35 40 45
ggc ttc ggg cgt agc cgg gga ttt ggt gcg ctg tca ctt gct gat atg 192
Gly Phe Gly Arg Ser Arg Gly Phe Gly Ala Leu Ser Leu Ala Asp Met
50 55 60
gcc gaa gcc gtg ctg caa cag gca cct gat aaa gcc att tgg tta ggc 240
Ala Glu Ala Val Leu Gln Gln Ala Pro Asp Lys Ala Ile Trp Leu Gly
65 70 75 80
tgg agt ctg ggc ggg ctg gtg gca agc cag att gcg tta acc cat ccc 288
Trp Ser Leu Gly Gly Leu Val Ala Ser Gln Ile Ala Leu Thr His Pro
85 90 95
gag cgt gtt cag gcg ctg gtc acc gtg gcg tcg tca cct tgt ttt agt 336
Glu Arg Val Gln Ala Leu Val Thr Val Ala Ser Ser Pro Cys Phe Ser
100 105 110
gct cgt gac gag tgg ccg ggg ata aaa ccg gac gtg ctg gcg gga ttt 384
Ala Arg Asp Glu Trp Pro Gly Ile Lys Pro Asp Val Leu Ala Gly Phe
115 120 125
cag cag caa ctc agt gat gat ttt cag cgt aca gtg gag cgg ttc ctg 432
Gln Gln Gln Leu Ser Asp Asp Phe Gln Arg Thr Val Glu Arg Phe Leu
130 135 140
gcg tta caa acc atg ggg act gaa acg gcg cgc cag gat gcg cgg gcg 480
Ala Leu Gln Thr Met Gly Thr Glu Thr Ala Arg Gln Asp Ala Arg Ala
145 150 155 160
ttg aag aaa acc gtt ctg gcg tta ccg atg ccg gag gtt gac gtg ctt 528
Leu Lys Lys Thr Val Leu Ala Leu Pro Met Pro Glu Val Asp Val Leu
165 170 175
aat ggc ggg ctg gaa atc ctg aaa acg gtc gat ctc cgt cag ccg ctg 576
Asn Gly Gly Leu Glu Ile Leu Lys Thr Val Asp Leu Arg Gln Pro Leu
180 185 190
caa aac gtg tcc atg ccg ttt ttg cga ttg tat ggc tat ctc gac ggt 624
Gln Asn Val Ser Met Pro Phe Leu Arg Leu Tyr Gly Tyr Leu Asp Gly
195 200 205
ctg gtg ccg cgc aaa gtg gtg ccg atg ctg gat aaa ctt tgg cct cac 672
Leu Val Pro Arg Lys Val Val Pro Met Leu Asp Lys Leu Trp Pro His
210 215 220
agc gaa tca tat atc ttc gcc aaa gcg gcc cat gcg cca ttt att tcg 720
Ser Glu Ser Tyr Ile Phe Ala Lys Ala Ala His Ala Pro Phe Ile Ser
225 230 235 240
cat ccg gcc gag ttt tgt cac ctg ctg gtg gcg ttg aag cag agg gtg 768
His Pro Ala Glu Phe Cys His Leu Leu Val Ala Leu Lys Gln Arg Val
245 250 255
tag 771
<210> 99
<211> 256
<212> PRT
<213> Escherichia coli
<400> 99
Met Asn Asn Ile Trp Trp Gln Thr Lys Gly Gln Gly Asn Val His Leu
1 5 10 15
Val Leu Leu His Gly Trp Gly Leu Asn Ala Glu Val Trp Arg Cys Ile
20 25 30
Asp Glu Glu Leu Ser Ser His Phe Thr Leu His Leu Val Asp Leu Pro
35 40 45
Gly Phe Gly Arg Ser Arg Gly Phe Gly Ala Leu Ser Leu Ala Asp Met
50 55 60
Ala Glu Ala Val Leu Gln Gln Ala Pro Asp Lys Ala Ile Trp Leu Gly
65 70 75 80
Trp Ser Leu Gly Gly Leu Val Ala Ser Gln Ile Ala Leu Thr His Pro
85 90 95
Glu Arg Val Gln Ala Leu Val Thr Val Ala Ser Ser Pro Cys Phe Ser
100 105 110
Ala Arg Asp Glu Trp Pro Gly Ile Lys Pro Asp Val Leu Ala Gly Phe
115 120 125
Gln Gln Gln Leu Ser Asp Asp Phe Gln Arg Thr Val Glu Arg Phe Leu
130 135 140
Ala Leu Gln Thr Met Gly Thr Glu Thr Ala Arg Gln Asp Ala Arg Ala
145 150 155 160
Leu Lys Lys Thr Val Leu Ala Leu Pro Met Pro Glu Val Asp Val Leu
165 170 175
Asn Gly Gly Leu Glu Ile Leu Lys Thr Val Asp Leu Arg Gln Pro Leu
180 185 190
Gln Asn Val Ser Met Pro Phe Leu Arg Leu Tyr Gly Tyr Leu Asp Gly
195 200 205
Leu Val Pro Arg Lys Val Val Pro Met Leu Asp Lys Leu Trp Pro His
210 215 220
Ser Glu Ser Tyr Ile Phe Ala Lys Ala Ala His Ala Pro Phe Ile Ser
225 230 235 240
His Pro Ala Glu Phe Cys His Leu Leu Val Ala Leu Lys Gln Arg Val
245 250 255
<210> 100
<211> 777
<212> DNA
<213> Bacillus subtilis
<220>
<221> CDS
<222> (1)..(777)
<223> bioW gene encoding 6-carboxyhexanoate-CoA ligase (BioW)
<400> 100
atg caa gaa gag acg ttc tat tca gtg cgt atg cgc gct tca atg aat 48
Met Gln Glu Glu Thr Phe Tyr Ser Val Arg Met Arg Ala Ser Met Asn
1 5 10 15
ggc tcc cac gaa gat gga ggt aag cac atc tcc ggg ggt gag cgc ctt 96
Gly Ser His Glu Asp Gly Gly Lys His Ile Ser Gly Gly Glu Arg Leu
20 25 30
atc ccg ttc cac gag atg aaa cat acc gtc aac gct ttg ctt gag aag 144
Ile Pro Phe His Glu Met Lys His Thr Val Asn Ala Leu Leu Glu Lys
35 40 45
ggt ctt tct cat tct cgt ggg aaa cct gat ttt atg caa att cag ttt 192
Gly Leu Ser His Ser Arg Gly Lys Pro Asp Phe Met Gln Ile Gln Phe
50 55 60
gaa gag gtt cac gag tca atc aag aca atc cag ccc tta cct gtg cac 240
Glu Glu Val His Glu Ser Ile Lys Thr Ile Gln Pro Leu Pro Val His
65 70 75 80
acc aac gaa gtt agc tgc ccc gaa gaa gga caa aaa ctt gca cgc ttg 288
Thr Asn Glu Val Ser Cys Pro Glu Glu Gly Gln Lys Leu Ala Arg Leu
85 90 95
tta ctg gag aaa gaa ggg gtg agc cgc gac gtt att gaa aag gct tac 336
Leu Leu Glu Lys Glu Gly Val Ser Arg Asp Val Ile Glu Lys Ala Tyr
100 105 110
gaa caa att ccc gag tgg tcg gat gtc cgt ggt gcc gta ttg ttt gat 384
Glu Gln Ile Pro Glu Trp Ser Asp Val Arg Gly Ala Val Leu Phe Asp
115 120 125
att cat acg ggc aag cgt atg gat cag acg aaa gaa aag ggg gtg cgc 432
Ile His Thr Gly Lys Arg Met Asp Gln Thr Lys Glu Lys Gly Val Arg
130 135 140
gtc tct cgt atg gac tgg ccc gac gct aac ttt gag aaa tgg gcc tta 480
Val Ser Arg Met Asp Trp Pro Asp Ala Asn Phe Glu Lys Trp Ala Leu
145 150 155 160
cac agc cac gtg cca gca cat tca cgc atc aag gag gca ctg gca ctt 528
His Ser His Val Pro Ala His Ser Arg Ile Lys Glu Ala Leu Ala Leu
165 170 175
gct agc aag gtg tcc cgt cac ccg gca gtc gtt gcc gaa ttg tgt tgg 576
Ala Ser Lys Val Ser Arg His Pro Ala Val Val Ala Glu Leu Cys Trp
180 185 190
agc gac gat cca gat tac atc acc gga tat gta gct ggt aaa aaa atg 624
Ser Asp Asp Pro Asp Tyr Ile Thr Gly Tyr Val Ala Gly Lys Lys Met
195 200 205
ggg tac caa cgc att acc gca atg aag gag tac ggg acc gag gag gga 672
Gly Tyr Gln Arg Ile Thr Ala Met Lys Glu Tyr Gly Thr Glu Glu Gly
210 215 220
tgt cgt gtc ttc ttc atc gac ggc tcg aac gat gtt aat act tac att 720
Cys Arg Val Phe Phe Ile Asp Gly Ser Asn Asp Val Asn Thr Tyr Ile
225 230 235 240
cac gac ttg gag aaa cag ccg atc ctg att gaa tgg gaa gaa gac cac 768
His Asp Leu Glu Lys Gln Pro Ile Leu Ile Glu Trp Glu Glu Asp His
245 250 255
gat agc tga 777
Asp Ser
<210> 101
<211> 258
<212> PRT
<213> Bacillus subtilis
<400> 101
Met Gln Glu Glu Thr Phe Tyr Ser Val Arg Met Arg Ala Ser Met Asn
1 5 10 15
Gly Ser His Glu Asp Gly Gly Lys His Ile Ser Gly Gly Glu Arg Leu
20 25 30
Ile Pro Phe His Glu Met Lys His Thr Val Asn Ala Leu Leu Glu Lys
35 40 45
Gly Leu Ser His Ser Arg Gly Lys Pro Asp Phe Met Gln Ile Gln Phe
50 55 60
Glu Glu Val His Glu Ser Ile Lys Thr Ile Gln Pro Leu Pro Val His
65 70 75 80
Thr Asn Glu Val Ser Cys Pro Glu Glu Gly Gln Lys Leu Ala Arg Leu
85 90 95
Leu Leu Glu Lys Glu Gly Val Ser Arg Asp Val Ile Glu Lys Ala Tyr
100 105 110
Glu Gln Ile Pro Glu Trp Ser Asp Val Arg Gly Ala Val Leu Phe Asp
115 120 125
Ile His Thr Gly Lys Arg Met Asp Gln Thr Lys Glu Lys Gly Val Arg
130 135 140
Val Ser Arg Met Asp Trp Pro Asp Ala Asn Phe Glu Lys Trp Ala Leu
145 150 155 160
His Ser His Val Pro Ala His Ser Arg Ile Lys Glu Ala Leu Ala Leu
165 170 175
Ala Ser Lys Val Ser Arg His Pro Ala Val Val Ala Glu Leu Cys Trp
180 185 190
Ser Asp Asp Pro Asp Tyr Ile Thr Gly Tyr Val Ala Gly Lys Lys Met
195 200 205
Gly Tyr Gln Arg Ile Thr Ala Met Lys Glu Tyr Gly Thr Glu Glu Gly
210 215 220
Cys Arg Val Phe Phe Ile Asp Gly Ser Asn Asp Val Asn Thr Tyr Ile
225 230 235 240
His Asp Leu Glu Lys Gln Pro Ile Leu Ile Glu Trp Glu Glu Asp His
245 250 255
Asp Ser
<210> 102
<211> 966
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(966)
<223> lipA gene encoding lipoic acid synthase (LipA)
<400> 102
atg agt aaa ccc att gtg atg gaa cgc ggt gtt aaa tac cgc gat gcc 48
Met Ser Lys Pro Ile Val Met Glu Arg Gly Val Lys Tyr Arg Asp Ala
1 5 10 15
gat aag atg gcc ctt atc ccg gtt aaa aac gtg gca aca gag cgc gaa 96
Asp Lys Met Ala Leu Ile Pro Val Lys Asn Val Ala Thr Glu Arg Glu
20 25 30
gcc ctg ctg cgc aag ccg gaa tgg atg aaa atc aag ctt cca gcg gac 144
Ala Leu Leu Arg Lys Pro Glu Trp Met Lys Ile Lys Leu Pro Ala Asp
35 40 45
tct aca cgt atc cag ggc atc aaa gcc gca atg cgc aaa aat ggc ctg 192
Ser Thr Arg Ile Gln Gly Ile Lys Ala Ala Met Arg Lys Asn Gly Leu
50 55 60
cat tct gtc tgc gag gaa gcc tcc tgc cct aac ctg gcg gaa tgc ttc 240
His Ser Val Cys Glu Glu Ala Ser Cys Pro Asn Leu Ala Glu Cys Phe
65 70 75 80
aac cac ggc aca gca acg ttt atg atc ctc ggc gct att tgt acc cgc 288
Asn His Gly Thr Ala Thr Phe Met Ile Leu Gly Ala Ile Cys Thr Arg
85 90 95
cgt tgt ccg ttc tgt gat gtt gcc cac ggt cgc ccg gta gct cct gat 336
Arg Cys Pro Phe Cys Asp Val Ala His Gly Arg Pro Val Ala Pro Asp
100 105 110
gcc aat gaa cca gtg aaa ctg gcg cag acc att gcc gat atg gcg ctg 384
Ala Asn Glu Pro Val Lys Leu Ala Gln Thr Ile Ala Asp Met Ala Leu
115 120 125
cgt tat gtg gtt atc acc tcc gtt gac cgt gat gac ctg cgc gat ggc 432
Arg Tyr Val Val Ile Thr Ser Val Asp Arg Asp Asp Leu Arg Asp Gly
130 135 140
ggt gcc cag cac ttt gcg gat tgc att act gcc att cgg gaa aaa agc 480
Gly Ala Gln His Phe Ala Asp Cys Ile Thr Ala Ile Arg Glu Lys Ser
145 150 155 160
ccg caa atc aaa att gaa act ctg gtg ccg gat ttc cgc ggt cgt atg 528
Pro Gln Ile Lys Ile Glu Thr Leu Val Pro Asp Phe Arg Gly Arg Met
165 170 175
gat cgt gct ctg gat att ctg act gca acg cca cca gat gtg ttc aac 576
Asp Arg Ala Leu Asp Ile Leu Thr Ala Thr Pro Pro Asp Val Phe Asn
180 185 190
cat aac ctg gaa aac gta ccg cgt att tac cgt cag gta cgg cct ggt 624
His Asn Leu Glu Asn Val Pro Arg Ile Tyr Arg Gln Val Arg Pro Gly
195 200 205
gca gat tac aac tgg tcg ctg aag ctg ctg gaa cgc ttt aaa gaa gcg 672
Ala Asp Tyr Asn Trp Ser Leu Lys Leu Leu Glu Arg Phe Lys Glu Ala
210 215 220
cat ccg gaa atc ccg acc aag tct ggt ctg atg gtg gga ctg ggt gaa 720
His Pro Glu Ile Pro Thr Lys Ser Gly Leu Met Val Gly Leu Gly Glu
225 230 235 240
acc aat gaa gaa att att gag gta atg cgc gac ctg cgc cgt cat ggt 768
Thr Asn Glu Glu Ile Ile Glu Val Met Arg Asp Leu Arg Arg His Gly
245 250 255
gtg acg atg tta acg ctg ggg caa tat ttg cag cca agc cgc cat cac 816
Val Thr Met Leu Thr Leu Gly Gln Tyr Leu Gln Pro Ser Arg His His
260 265 270
ctg ccg gtt caa cgt tac gtt agc ccg gat gag ttc gac gaa atg aaa 864
Leu Pro Val Gln Arg Tyr Val Ser Pro Asp Glu Phe Asp Glu Met Lys
275 280 285
gcc gaa gcg ctg gcg atg ggc ttt acc cat gct gca tgc ggt ccg ttt 912
Ala Glu Ala Leu Ala Met Gly Phe Thr His Ala Ala Cys Gly Pro Phe
290 295 300
gtc cgc tct tct tac cac gcc gat ttg cag gcg aaa ggg atg gaa gtt 960
Val Arg Ser Ser Tyr His Ala Asp Leu Gln Ala Lys Gly Met Glu Val
305 310 315 320
aag taa 966
Lys
<210> 103
<211> 321
<212> PRT
<213> Escherichia coli
<400> 103
Met Ser Lys Pro Ile Val Met Glu Arg Gly Val Lys Tyr Arg Asp Ala
1 5 10 15
Asp Lys Met Ala Leu Ile Pro Val Lys Asn Val Ala Thr Glu Arg Glu
20 25 30
Ala Leu Leu Arg Lys Pro Glu Trp Met Lys Ile Lys Leu Pro Ala Asp
35 40 45
Ser Thr Arg Ile Gln Gly Ile Lys Ala Ala Met Arg Lys Asn Gly Leu
50 55 60
His Ser Val Cys Glu Glu Ala Ser Cys Pro Asn Leu Ala Glu Cys Phe
65 70 75 80
Asn His Gly Thr Ala Thr Phe Met Ile Leu Gly Ala Ile Cys Thr Arg
85 90 95
Arg Cys Pro Phe Cys Asp Val Ala His Gly Arg Pro Val Ala Pro Asp
100 105 110
Ala Asn Glu Pro Val Lys Leu Ala Gln Thr Ile Ala Asp Met Ala Leu
115 120 125
Arg Tyr Val Val Ile Thr Ser Val Asp Arg Asp Asp Leu Arg Asp Gly
130 135 140
Gly Ala Gln His Phe Ala Asp Cys Ile Thr Ala Ile Arg Glu Lys Ser
145 150 155 160
Pro Gln Ile Lys Ile Glu Thr Leu Val Pro Asp Phe Arg Gly Arg Met
165 170 175
Asp Arg Ala Leu Asp Ile Leu Thr Ala Thr Pro Pro Asp Val Phe Asn
180 185 190
His Asn Leu Glu Asn Val Pro Arg Ile Tyr Arg Gln Val Arg Pro Gly
195 200 205
Ala Asp Tyr Asn Trp Ser Leu Lys Leu Leu Glu Arg Phe Lys Glu Ala
210 215 220
His Pro Glu Ile Pro Thr Lys Ser Gly Leu Met Val Gly Leu Gly Glu
225 230 235 240
Thr Asn Glu Glu Ile Ile Glu Val Met Arg Asp Leu Arg Arg His Gly
245 250 255
Val Thr Met Leu Thr Leu Gly Gln Tyr Leu Gln Pro Ser Arg His His
260 265 270
Leu Pro Val Gln Arg Tyr Val Ser Pro Asp Glu Phe Asp Glu Met Lys
275 280 285
Ala Glu Ala Leu Ala Met Gly Phe Thr His Ala Ala Cys Gly Pro Phe
290 295 300
Val Arg Ser Ser Tyr His Ala Asp Leu Gln Ala Lys Gly Met Glu Val
305 310 315 320
Lys
<210> 104
<211> 897
<212> DNA
<213> Bacillus subtilis subsp. Subtilis, str. 168
<220>
<221> CDS
<222> (1)..(897)
<223> lipA gene encoding a lipoic acid synthase (LipA)
<400> 104
atg gcg aag aag gat gaa cac ctg aga aag cca gaa tgg ctt aaa att 48
Met Ala Lys Lys Asp Glu His Leu Arg Lys Pro Glu Trp Leu Lys Ile
1 5 10 15
aag ctg aat aca aac gaa aac tac act ggc tta aaa aag tta atg cgt 96
Lys Leu Asn Thr Asn Glu Asn Tyr Thr Gly Leu Lys Lys Leu Met Arg
20 25 30
gag aat aac tta cat act gtc tgt gag gag gca aaa tgt cca aat ata 144
Glu Asn Asn Leu His Thr Val Cys Glu Glu Ala Lys Cys Pro Asn Ile
35 40 45
cac gaa tgc tgg gcc gtt cgg cgt acc gcg acg ttt atg ata ctg ggc 192
His Glu Cys Trp Ala Val Arg Arg Thr Ala Thr Phe Met Ile Leu Gly
50 55 60
tcc gtc tgc acg aga gca tgt cgt ttt tgc gcg gtt aaa acc ggc ctg 240
Ser Val Cys Thr Arg Ala Cys Arg Phe Cys Ala Val Lys Thr Gly Leu
65 70 75 80
ccg act gag ctt gac ttg caa gag cca gag cgc gtg gct gat tca gtt 288
Pro Thr Glu Leu Asp Leu Gln Glu Pro Glu Arg Val Ala Asp Ser Val
85 90 95
gcc ctt atg aac ctg aaa cac gcc gtt atc acg gcg gtc gcc cgt gac 336
Ala Leu Met Asn Leu Lys His Ala Val Ile Thr Ala Val Ala Arg Asp
100 105 110
gat caa aaa gat ggt gga gcg gga ata ttc gca gaa acg gta cgt gct 384
Asp Gln Lys Asp Gly Gly Ala Gly Ile Phe Ala Glu Thr Val Arg Ala
115 120 125
atc cgc cgg aag tct cca ttt acc acg att gaa gtg ctg ccg agc gat 432
Ile Arg Arg Lys Ser Pro Phe Thr Thr Ile Glu Val Leu Pro Ser Asp
130 135 140
atg ggc ggt aat tat gat aac ctt aag acc ttg atg gac aca cgg ccg 480
Met Gly Gly Asn Tyr Asp Asn Leu Lys Thr Leu Met Asp Thr Arg Pro
145 150 155 160
gat att ctg aat cac aac atc gag act gta cgg cgt tta aca cca aga 528
Asp Ile Leu Asn His Asn Ile Glu Thr Val Arg Arg Leu Thr Pro Arg
165 170 175
gtc aga gca cgc gct acc tat gat cgc agc ctt gaa ttc ttg cgt cgc 576
Val Arg Ala Arg Ala Thr Tyr Asp Arg Ser Leu Glu Phe Leu Arg Arg
180 185 190
gcc aaa gag atg cag ccc gac ata cca aca aaa agt agt ata atg atc 624
Ala Lys Glu Met Gln Pro Asp Ile Pro Thr Lys Ser Ser Ile Met Ile
195 200 205
ggc ttg gga gaa aca aaa gaa gaa atc atc gag gtc atg gat gac ctt 672
Gly Leu Gly Glu Thr Lys Glu Glu Ile Ile Glu Val Met Asp Asp Leu
210 215 220
ttg gca aac aac gtg gac ata atg gcc att ggg caa tac ttg caa cca 720
Leu Ala Asn Asn Val Asp Ile Met Ala Ile Gly Gln Tyr Leu Gln Pro
225 230 235 240
act aaa aag cac tta aaa gtt cag aaa tac tat cat cct gat gaa ttt 768
Thr Lys Lys His Leu Lys Val Gln Lys Tyr Tyr His Pro Asp Glu Phe
245 250 255
gcc gag ttg aag gaa atc gcc atg cag aag ggg ttt tca cat tgc gag 816
Ala Glu Leu Lys Glu Ile Ala Met Gln Lys Gly Phe Ser His Cys Glu
260 265 270
gcg ggt ccg ttg gtc cgt tca agt tac cac gcg gac gaa cag gtg aat 864
Ala Gly Pro Leu Val Arg Ser Ser Tyr His Ala Asp Glu Gln Val Asn
275 280 285
gaa gcg tca aag aaa cgc caa gca caa gct taa 897
Glu Ala Ser Lys Lys Arg Gln Ala Gln Ala
290 295
<210> 105
<211> 298
<212> PRT
<213> Bacillus subtilis subsp. Subtilis, str. 168
<400> 105
Met Ala Lys Lys Asp Glu His Leu Arg Lys Pro Glu Trp Leu Lys Ile
1 5 10 15
Lys Leu Asn Thr Asn Glu Asn Tyr Thr Gly Leu Lys Lys Leu Met Arg
20 25 30
Glu Asn Asn Leu His Thr Val Cys Glu Glu Ala Lys Cys Pro Asn Ile
35 40 45
His Glu Cys Trp Ala Val Arg Arg Thr Ala Thr Phe Met Ile Leu Gly
50 55 60
Ser Val Cys Thr Arg Ala Cys Arg Phe Cys Ala Val Lys Thr Gly Leu
65 70 75 80
Pro Thr Glu Leu Asp Leu Gln Glu Pro Glu Arg Val Ala Asp Ser Val
85 90 95
Ala Leu Met Asn Leu Lys His Ala Val Ile Thr Ala Val Ala Arg Asp
100 105 110
Asp Gln Lys Asp Gly Gly Ala Gly Ile Phe Ala Glu Thr Val Arg Ala
115 120 125
Ile Arg Arg Lys Ser Pro Phe Thr Thr Ile Glu Val Leu Pro Ser Asp
130 135 140
Met Gly Gly Asn Tyr Asp Asn Leu Lys Thr Leu Met Asp Thr Arg Pro
145 150 155 160
Asp Ile Leu Asn His Asn Ile Glu Thr Val Arg Arg Leu Thr Pro Arg
165 170 175
Val Arg Ala Arg Ala Thr Tyr Asp Arg Ser Leu Glu Phe Leu Arg Arg
180 185 190
Ala Lys Glu Met Gln Pro Asp Ile Pro Thr Lys Ser Ser Ile Met Ile
195 200 205
Gly Leu Gly Glu Thr Lys Glu Glu Ile Ile Glu Val Met Asp Asp Leu
210 215 220
Leu Ala Asn Asn Val Asp Ile Met Ala Ile Gly Gln Tyr Leu Gln Pro
225 230 235 240
Thr Lys Lys His Leu Lys Val Gln Lys Tyr Tyr His Pro Asp Glu Phe
245 250 255
Ala Glu Leu Lys Glu Ile Ala Met Gln Lys Gly Phe Ser His Cys Glu
260 265 270
Ala Gly Pro Leu Val Arg Ser Ser Tyr His Ala Asp Glu Gln Val Asn
275 280 285
Glu Ala Ser Lys Lys Arg Gln Ala Gln Ala
290 295
<210> 106
<211> 1245
<212> DNA
<213> Saccharomyces cerevisiae, S288C
<220>
<221> CDS
<222> (1)..(1245)
<223> lipA gene encoding a lipoic acid synthase (LipA)
<400> 106
atg tac aga cgt agt gta ggg gtg ctt ttc gta ggg cgt aac act cgg 48
Met Tyr Arg Arg Ser Val Gly Val Leu Phe Val Gly Arg Asn Thr Arg
1 5 10 15
tgg atc agc agc acg atc cgg tgt ggc act agc gca acc cgc cct att 96
Trp Ile Ser Ser Thr Ile Arg Cys Gly Thr Ser Ala Thr Arg Pro Ile
20 25 30
cgt agt aac gcg ttg aac act gac tca gat aac gct agt gtg cgc gta 144
Arg Ser Asn Ala Leu Asn Thr Asp Ser Asp Asn Ala Ser Val Arg Val
35 40 45
ccc gta ggg aac agc acg gag gta gaa aat gcg acc tcg caa ctg aca 192
Pro Val Gly Asn Ser Thr Glu Val Glu Asn Ala Thr Ser Gln Leu Thr
50 55 60
ggt act tcg ggc aag aga cgg aag gga aat aga aag cgg ata acg gaa 240
Gly Thr Ser Gly Lys Arg Arg Lys Gly Asn Arg Lys Arg Ile Thr Glu
65 70 75 80
ttt aaa gat gcc tta aac ctg ggt ccc tcg ttc gcc gat ttc gta tca 288
Phe Lys Asp Ala Leu Asn Leu Gly Pro Ser Phe Ala Asp Phe Val Ser
85 90 95
ggt aag gct tcg aaa atg att ttg gat cct ttg gag aaa gcc cgc cag 336
Gly Lys Ala Ser Lys Met Ile Leu Asp Pro Leu Glu Lys Ala Arg Gln
100 105 110
aac aca gag gag gct aag aag tta cct aga tgg ttg aaa gtg cct att 384
Asn Thr Glu Glu Ala Lys Lys Leu Pro Arg Trp Leu Lys Val Pro Ile
115 120 125
cct aaa ggc acg aac tac cac aaa ttg aaa ggt gat gta aaa gaa tta 432
Pro Lys Gly Thr Asn Tyr His Lys Leu Lys Gly Asp Val Lys Glu Leu
130 135 140
gga ctt tcc act gtc tgt gag gaa gct cgc tgc cca aat atc gga gag 480
Gly Leu Ser Thr Val Cys Glu Glu Ala Arg Cys Pro Asn Ile Gly Glu
145 150 155 160
tgc tgg gga gga aag gac aaa tcc aag gct acg gcg acg ata atg ttg 528
Cys Trp Gly Gly Lys Asp Lys Ser Lys Ala Thr Ala Thr Ile Met Leu
165 170 175
ctt ggc gat acg tgc acg cgc ggt tgc cgt ttt tgc tct gta aag act 576
Leu Gly Asp Thr Cys Thr Arg Gly Cys Arg Phe Cys Ser Val Lys Thr
180 185 190
aac cgc aca ccc tcg aaa cct gat cct atg gag ccc gaa aat aca gca 624
Asn Arg Thr Pro Ser Lys Pro Asp Pro Met Glu Pro Glu Asn Thr Ala
195 200 205
gag gcg att aaa cgt tgg ggc ctg ggc tat gtt gta tta act aca gta 672
Glu Ala Ile Lys Arg Trp Gly Leu Gly Tyr Val Val Leu Thr Thr Val
210 215 220
gat cgt gat gat tta gtc gac ggt ggg gca aat cat ttg gcg gag act 720
Asp Arg Asp Asp Leu Val Asp Gly Gly Ala Asn His Leu Ala Glu Thr
225 230 235 240
gtc aga aag atc aag caa aaa gcg ccg aat act ctt gtg gag act ctt 768
Val Arg Lys Ile Lys Gln Lys Ala Pro Asn Thr Leu Val Glu Thr Leu
245 250 255
tcc ggt gac ttt aga ggg gac tta aag atg gtt gac atc atg gca caa 816
Ser Gly Asp Phe Arg Gly Asp Leu Lys Met Val Asp Ile Met Ala Gln
260 265 270
tgc ggc tta gat gtt tac gct cat aac tta gaa acc gtt gaa agt tta 864
Cys Gly Leu Asp Val Tyr Ala His Asn Leu Glu Thr Val Glu Ser Leu
275 280 285
act ccc cac gta aga gac cgg cgg gcc aca tac aga caa agt ctg agc 912
Thr Pro His Val Arg Asp Arg Arg Ala Thr Tyr Arg Gln Ser Leu Ser
290 295 300
gtt ttg gag cgt gcc aaa gcc acg gtg cca tca tta atc act aaa acc 960
Val Leu Glu Arg Ala Lys Ala Thr Val Pro Ser Leu Ile Thr Lys Thr
305 310 315 320
tct ata atg tta ggc tta ggg gaa acg gac gaa caa ata acg caa acc 1008
Ser Ile Met Leu Gly Leu Gly Glu Thr Asp Glu Gln Ile Thr Gln Thr
325 330 335
tta aaa gat tta cgg aac att caa tgc gac gtc gtg acc ttt ggt caa 1056
Leu Lys Asp Leu Arg Asn Ile Gln Cys Asp Val Val Thr Phe Gly Gln
340 345 350
tat atg cgg cca aca aag cgg cac atg aag gtg gtc gag tac gta aaa 1104
Tyr Met Arg Pro Thr Lys Arg His Met Lys Val Val Glu Tyr Val Lys
355 360 365
ccc gaa aaa ttt gat tac tgg aag gag cgg gct ctt gag atg ggt ttc 1152
Pro Glu Lys Phe Asp Tyr Trp Lys Glu Arg Ala Leu Glu Met Gly Phe
370 375 380
tta tat tgc gcc tct ggg cca ctt gta cgc tct agc tat aaa gcg ggt 1200
Leu Tyr Cys Ala Ser Gly Pro Leu Val Arg Ser Ser Tyr Lys Ala Gly
385 390 395 400
gag gca ttt atc gag aat gtt tta aaa aaa aga aac atg aag tga 1245
Glu Ala Phe Ile Glu Asn Val Leu Lys Lys Arg Asn Met Lys
405 410
<210> 107
<211> 414
<212> PRT
<213> Saccharomyces cerevisiae, S288C
<400> 107
Met Tyr Arg Arg Ser Val Gly Val Leu Phe Val Gly Arg Asn Thr Arg
1 5 10 15
Trp Ile Ser Ser Thr Ile Arg Cys Gly Thr Ser Ala Thr Arg Pro Ile
20 25 30
Arg Ser Asn Ala Leu Asn Thr Asp Ser Asp Asn Ala Ser Val Arg Val
35 40 45
Pro Val Gly Asn Ser Thr Glu Val Glu Asn Ala Thr Ser Gln Leu Thr
50 55 60
Gly Thr Ser Gly Lys Arg Arg Lys Gly Asn Arg Lys Arg Ile Thr Glu
65 70 75 80
Phe Lys Asp Ala Leu Asn Leu Gly Pro Ser Phe Ala Asp Phe Val Ser
85 90 95
Gly Lys Ala Ser Lys Met Ile Leu Asp Pro Leu Glu Lys Ala Arg Gln
100 105 110
Asn Thr Glu Glu Ala Lys Lys Leu Pro Arg Trp Leu Lys Val Pro Ile
115 120 125
Pro Lys Gly Thr Asn Tyr His Lys Leu Lys Gly Asp Val Lys Glu Leu
130 135 140
Gly Leu Ser Thr Val Cys Glu Glu Ala Arg Cys Pro Asn Ile Gly Glu
145 150 155 160
Cys Trp Gly Gly Lys Asp Lys Ser Lys Ala Thr Ala Thr Ile Met Leu
165 170 175
Leu Gly Asp Thr Cys Thr Arg Gly Cys Arg Phe Cys Ser Val Lys Thr
180 185 190
Asn Arg Thr Pro Ser Lys Pro Asp Pro Met Glu Pro Glu Asn Thr Ala
195 200 205
Glu Ala Ile Lys Arg Trp Gly Leu Gly Tyr Val Val Leu Thr Thr Val
210 215 220
Asp Arg Asp Asp Leu Val Asp Gly Gly Ala Asn His Leu Ala Glu Thr
225 230 235 240
Val Arg Lys Ile Lys Gln Lys Ala Pro Asn Thr Leu Val Glu Thr Leu
245 250 255
Ser Gly Asp Phe Arg Gly Asp Leu Lys Met Val Asp Ile Met Ala Gln
260 265 270
Cys Gly Leu Asp Val Tyr Ala His Asn Leu Glu Thr Val Glu Ser Leu
275 280 285
Thr Pro His Val Arg Asp Arg Arg Ala Thr Tyr Arg Gln Ser Leu Ser
290 295 300
Val Leu Glu Arg Ala Lys Ala Thr Val Pro Ser Leu Ile Thr Lys Thr
305 310 315 320
Ser Ile Met Leu Gly Leu Gly Glu Thr Asp Glu Gln Ile Thr Gln Thr
325 330 335
Leu Lys Asp Leu Arg Asn Ile Gln Cys Asp Val Val Thr Phe Gly Gln
340 345 350
Tyr Met Arg Pro Thr Lys Arg His Met Lys Val Val Glu Tyr Val Lys
355 360 365
Pro Glu Lys Phe Asp Tyr Trp Lys Glu Arg Ala Leu Glu Met Gly Phe
370 375 380
Leu Tyr Cys Ala Ser Gly Pro Leu Val Arg Ser Ser Tyr Lys Ala Gly
385 390 395 400
Glu Ala Phe Ile Glu Asn Val Leu Lys Lys Arg Asn Met Lys
405 410
<210> 108
<211> 1017
<212> DNA
<213> Pseudomonas putida KT2440
<220>
<221> CDS
<222> (1)..(1017)
<223> lipA gene encoding a lipoic acid synthase (LipA)
<400> 108
atg acg acc gtt cag gaa gcc gtc ccg aat ttg atc ccc acg caa gat 48
Met Thr Thr Val Gln Glu Ala Val Pro Asn Leu Ile Pro Thr Gln Asp
1 5 10 15
gct acg ccg cgc ccg gct ccc aaa aag gtt gaa gct ggt gtg aag ttg 96
Ala Thr Pro Arg Pro Ala Pro Lys Lys Val Glu Ala Gly Val Lys Leu
20 25 30
cgc gga gcg gat aaa gtt gcc cgc atc cct gtg aaa att att ccg aca 144
Arg Gly Ala Asp Lys Val Ala Arg Ile Pro Val Lys Ile Ile Pro Thr
35 40 45
gat gag tta ccg aaa aaa cct gac tgg atc cgc gtc cgc atc cct gta 192
Asp Glu Leu Pro Lys Lys Pro Asp Trp Ile Arg Val Arg Ile Pro Val
50 55 60
tcg ccc gag gtt gat cgt atc aaa caa ttg ttg cgt aag cac aaa ttg 240
Ser Pro Glu Val Asp Arg Ile Lys Gln Leu Leu Arg Lys His Lys Leu
65 70 75 80
cat agc gtc tgc gag gag gcc tcc tgc cca aat ttg ggc gag tgc ttt 288
His Ser Val Cys Glu Glu Ala Ser Cys Pro Asn Leu Gly Glu Cys Phe
85 90 95
agc ggc ggc acg gct act ttc atg att atg ggc gac atc tgt aca cgt 336
Ser Gly Gly Thr Ala Thr Phe Met Ile Met Gly Asp Ile Cys Thr Arg
100 105 110
cgt tgt cct ttt tgc gac gtg gga cac gga cgc cca aag ccg ctg gat 384
Arg Cys Pro Phe Cys Asp Val Gly His Gly Arg Pro Lys Pro Leu Asp
115 120 125
ttg gat gag cca aaa aat ctt gca gtt gcg att gca gac ttg cgc tta 432
Leu Asp Glu Pro Lys Asn Leu Ala Val Ala Ile Ala Asp Leu Arg Leu
130 135 140
aag tac gtg gtt atc aca tcg gtt gat cgt gac gat tta cgt gac ggg 480
Lys Tyr Val Val Ile Thr Ser Val Asp Arg Asp Asp Leu Arg Asp Gly
145 150 155 160
ggc gct caa cat ttt gcc gac tgc atc cgt gag atc cgc gca ctg tcc 528
Gly Ala Gln His Phe Ala Asp Cys Ile Arg Glu Ile Arg Ala Leu Ser
165 170 175
ccg ggg gtg cag ctg gag act ttg gtt ccg gac tac cgt gga cgc atg 576
Pro Gly Val Gln Leu Glu Thr Leu Val Pro Asp Tyr Arg Gly Arg Met
180 185 190
gat gtt gca ttg gag att acc gcc cag gag cct cca gat gtg ttc aac 624
Asp Val Ala Leu Glu Ile Thr Ala Gln Glu Pro Pro Asp Val Phe Asn
195 200 205
cat aat ctt gag aca gtc cca cgc tta tat aag gct gct cgt ccg ggt 672
His Asn Leu Glu Thr Val Pro Arg Leu Tyr Lys Ala Ala Arg Pro Gly
210 215 220
tct gat tac gac tgg agt tta gac ttg ttg caa aaa ttt aag cag ctg 720
Ser Asp Tyr Asp Trp Ser Leu Asp Leu Leu Gln Lys Phe Lys Gln Leu
225 230 235 240
gtt ccg cat gtg cca act aag tct gga ctg atg tta gga tta gga gaa 768
Val Pro His Val Pro Thr Lys Ser Gly Leu Met Leu Gly Leu Gly Glu
245 250 255
aca gat gag gaa gtc att gaa gta atg cat cgt atg cgt gag cat gat 816
Thr Asp Glu Glu Val Ile Glu Val Met His Arg Met Arg Glu His Asp
260 265 270
atc gac atg ttg act ctg gga cag tac ctt cag ccc tcg cgt tcg cat 864
Ile Asp Met Leu Thr Leu Gly Gln Tyr Leu Gln Pro Ser Arg Ser His
275 280 285
ctt cct gtt cag cgt ttt gtt cat ccc gat act ttt gct tgg ttc gcg 912
Leu Pro Val Gln Arg Phe Val His Pro Asp Thr Phe Ala Trp Phe Ala
290 295 300
gaa gag gga tac aag atg ggg ttc aaa aat gtc gct tct gga cca ttg 960
Glu Glu Gly Tyr Lys Met Gly Phe Lys Asn Val Ala Ser Gly Pro Leu
305 310 315 320
gta cgc tcg tca tat cac gca gac cag cag gct cat gag gcc aaa att 1008
Val Arg Ser Ser Tyr His Ala Asp Gln Gln Ala His Glu Ala Lys Ile
325 330 335
aag ctt tga 1017
Lys Leu
<210> 109
<211> 338
<212> PRT
<213> Pseudomonas putida KT2440
<400> 109
Met Thr Thr Val Gln Glu Ala Val Pro Asn Leu Ile Pro Thr Gln Asp
1 5 10 15
Ala Thr Pro Arg Pro Ala Pro Lys Lys Val Glu Ala Gly Val Lys Leu
20 25 30
Arg Gly Ala Asp Lys Val Ala Arg Ile Pro Val Lys Ile Ile Pro Thr
35 40 45
Asp Glu Leu Pro Lys Lys Pro Asp Trp Ile Arg Val Arg Ile Pro Val
50 55 60
Ser Pro Glu Val Asp Arg Ile Lys Gln Leu Leu Arg Lys His Lys Leu
65 70 75 80
His Ser Val Cys Glu Glu Ala Ser Cys Pro Asn Leu Gly Glu Cys Phe
85 90 95
Ser Gly Gly Thr Ala Thr Phe Met Ile Met Gly Asp Ile Cys Thr Arg
100 105 110
Arg Cys Pro Phe Cys Asp Val Gly His Gly Arg Pro Lys Pro Leu Asp
115 120 125
Leu Asp Glu Pro Lys Asn Leu Ala Val Ala Ile Ala Asp Leu Arg Leu
130 135 140
Lys Tyr Val Val Ile Thr Ser Val Asp Arg Asp Asp Leu Arg Asp Gly
145 150 155 160
Gly Ala Gln His Phe Ala Asp Cys Ile Arg Glu Ile Arg Ala Leu Ser
165 170 175
Pro Gly Val Gln Leu Glu Thr Leu Val Pro Asp Tyr Arg Gly Arg Met
180 185 190
Asp Val Ala Leu Glu Ile Thr Ala Gln Glu Pro Pro Asp Val Phe Asn
195 200 205
His Asn Leu Glu Thr Val Pro Arg Leu Tyr Lys Ala Ala Arg Pro Gly
210 215 220
Ser Asp Tyr Asp Trp Ser Leu Asp Leu Leu Gln Lys Phe Lys Gln Leu
225 230 235 240
Val Pro His Val Pro Thr Lys Ser Gly Leu Met Leu Gly Leu Gly Glu
245 250 255
Thr Asp Glu Glu Val Ile Glu Val Met His Arg Met Arg Glu His Asp
260 265 270
Ile Asp Met Leu Thr Leu Gly Gln Tyr Leu Gln Pro Ser Arg Ser His
275 280 285
Leu Pro Val Gln Arg Phe Val His Pro Asp Thr Phe Ala Trp Phe Ala
290 295 300
Glu Glu Gly Tyr Lys Met Gly Phe Lys Asn Val Ala Ser Gly Pro Leu
305 310 315 320
Val Arg Ser Ser Tyr His Ala Asp Gln Gln Ala His Glu Ala Lys Ile
325 330 335
Lys Leu
<210> 110
<211> 867
<212> DNA
<213> Bacteroides fragilis 638R
<220>
<221> CDS
<222> (1)..(867)
<223> lipA gene encoding a lipoic acid synthase (LipA)
<400> 110
atg ggg aac gac aag cgc gtt cgc aag cct gag tgg tta aaa att tct 48
Met Gly Asn Asp Lys Arg Val Arg Lys Pro Glu Trp Leu Lys Ile Ser
1 5 10 15
att ggt gca aat gag cgc tac acc gag act aaa cgt atc gtg gaa agc 96
Ile Gly Ala Asn Glu Arg Tyr Thr Glu Thr Lys Arg Ile Val Glu Ser
20 25 30
cat tgt ctt cac acc atc tgc agt tct ggg cgc tgc ccg aat atg ggg 144
His Cys Leu His Thr Ile Cys Ser Ser Gly Arg Cys Pro Asn Met Gly
35 40 45
gaa tgt tgg ggg aaa ggg aca gca acc ttt atg atc gct ggt gac atc 192
Glu Cys Trp Gly Lys Gly Thr Ala Thr Phe Met Ile Ala Gly Asp Ile
50 55 60
tgc act cgc tct tgc aag ttc tgt aat acc caa acc ggg cgc ccc tta 240
Cys Thr Arg Ser Cys Lys Phe Cys Asn Thr Gln Thr Gly Arg Pro Leu
65 70 75 80
cct tta gac ccg gat gaa ccc acc cac gtt gcc gaa tct att gca tta 288
Pro Leu Asp Pro Asp Glu Pro Thr His Val Ala Glu Ser Ile Ala Leu
85 90 95
atg aag ctg tca cat gca gtc att aca agc gta gac cgt gac gac ctt 336
Met Lys Leu Ser His Ala Val Ile Thr Ser Val Asp Arg Asp Asp Leu
100 105 110
ccg gac tta gga gca gca cat tgg gct cag act atc cgc gag atc aag 384
Pro Asp Leu Gly Ala Ala His Trp Ala Gln Thr Ile Arg Glu Ile Lys
115 120 125
cgt ttg aat ccg gaa act acc aca gag gtt tta att cct gac ttt cag 432
Arg Leu Asn Pro Glu Thr Thr Thr Glu Val Leu Ile Pro Asp Phe Gln
130 135 140
gga cgt aag gaa ctt atc gac caa gtc att aag gcg tgt ccc gaa att 480
Gly Arg Lys Glu Leu Ile Asp Gln Val Ile Lys Ala Cys Pro Glu Ile
145 150 155 160
att tca cat aac atg gaa acg gtc aaa cgc att tcg ccg cag gtt cgt 528
Ile Ser His Asn Met Glu Thr Val Lys Arg Ile Ser Pro Gln Val Arg
165 170 175
tct gca gcg aat tac cac act agt ctt gaa gtc att cgt cag att gct 576
Ser Ala Ala Asn Tyr His Thr Ser Leu Glu Val Ile Arg Gln Ile Ala
180 185 190
gaa agc ggg atc act gct aaa tcg ggc att atg gtt ggg ttg ggt gag 624
Glu Ser Gly Ile Thr Ala Lys Ser Gly Ile Met Val Gly Leu Gly Glu
195 200 205
act ccc gcc gaa gtc gaa gag ctt atg gac gac ttg atc tca gtc ggt 672
Thr Pro Ala Glu Val Glu Glu Leu Met Asp Asp Leu Ile Ser Val Gly
210 215 220
tgc aaa atc ctg acc atc ggt caa tat ctt caa cct aca cat aag cat 720
Cys Lys Ile Leu Thr Ile Gly Gln Tyr Leu Gln Pro Thr His Lys His
225 230 235 240
ttc ccg gtt gct gct tac att acc cca gaa cag ttc gcc gtc tat aag 768
Phe Pro Val Ala Ala Tyr Ile Thr Pro Glu Gln Phe Ala Val Tyr Lys
245 250 255
gag acg ggc ttg aag aaa ggt ttt gag cag gtg gag tca gcg ccc ctt 816
Glu Thr Gly Leu Lys Lys Gly Phe Glu Gln Val Glu Ser Ala Pro Leu
260 265 270
gtg cgc tct tct tat cac gca gaa aaa cac atc cgc ttt aat aac aag 864
Val Arg Ser Ser Tyr His Ala Glu Lys His Ile Arg Phe Asn Asn Lys
275 280 285
taa 867
<210> 111
<211> 288
<212> PRT
<213> Bacteroides fragilis 638R
<400> 111
Met Gly Asn Asp Lys Arg Val Arg Lys Pro Glu Trp Leu Lys Ile Ser
1 5 10 15
Ile Gly Ala Asn Glu Arg Tyr Thr Glu Thr Lys Arg Ile Val Glu Ser
20 25 30
His Cys Leu His Thr Ile Cys Ser Ser Gly Arg Cys Pro Asn Met Gly
35 40 45
Glu Cys Trp Gly Lys Gly Thr Ala Thr Phe Met Ile Ala Gly Asp Ile
50 55 60
Cys Thr Arg Ser Cys Lys Phe Cys Asn Thr Gln Thr Gly Arg Pro Leu
65 70 75 80
Pro Leu Asp Pro Asp Glu Pro Thr His Val Ala Glu Ser Ile Ala Leu
85 90 95
Met Lys Leu Ser His Ala Val Ile Thr Ser Val Asp Arg Asp Asp Leu
100 105 110
Pro Asp Leu Gly Ala Ala His Trp Ala Gln Thr Ile Arg Glu Ile Lys
115 120 125
Arg Leu Asn Pro Glu Thr Thr Thr Glu Val Leu Ile Pro Asp Phe Gln
130 135 140
Gly Arg Lys Glu Leu Ile Asp Gln Val Ile Lys Ala Cys Pro Glu Ile
145 150 155 160
Ile Ser His Asn Met Glu Thr Val Lys Arg Ile Ser Pro Gln Val Arg
165 170 175
Ser Ala Ala Asn Tyr His Thr Ser Leu Glu Val Ile Arg Gln Ile Ala
180 185 190
Glu Ser Gly Ile Thr Ala Lys Ser Gly Ile Met Val Gly Leu Gly Glu
195 200 205
Thr Pro Ala Glu Val Glu Glu Leu Met Asp Asp Leu Ile Ser Val Gly
210 215 220
Cys Lys Ile Leu Thr Ile Gly Gln Tyr Leu Gln Pro Thr His Lys His
225 230 235 240
Phe Pro Val Ala Ala Tyr Ile Thr Pro Glu Gln Phe Ala Val Tyr Lys
245 250 255
Glu Thr Gly Leu Lys Lys Gly Phe Glu Gln Val Glu Ser Ala Pro Leu
260 265 270
Val Arg Ser Ser Tyr His Ala Glu Lys His Ile Arg Phe Asn Asn Lys
275 280 285
<210> 112
<211> 954
<212> DNA
<213> Streptomyces coelicolor A3(2)
<220>
<221> CDS
<222> (1)..(954)
<223> lipA gene encoding a lipoic acid synthase (LipA)
<400> 112
atg agc gcg gta gct cct gac ggc cgc aag atg ctg cgt ttg gag gtt 48
Met Ser Ala Val Ala Pro Asp Gly Arg Lys Met Leu Arg Leu Glu Val
1 5 10 15
cgt aac agc caa acg cct atc gaa cgc aag ccc gag tgg atc aag act 96
Arg Asn Ser Gln Thr Pro Ile Glu Arg Lys Pro Glu Trp Ile Lys Thr
20 25 30
cgt gct aag atg ggt ccc gaa tac acg aag atg cag aac ttg gtc aag 144
Arg Ala Lys Met Gly Pro Glu Tyr Thr Lys Met Gln Asn Leu Val Lys
35 40 45
tcg gag ggc tta cat acg gta tgc cag gaa gct ggg tgc ccc aac att 192
Ser Glu Gly Leu His Thr Val Cys Gln Glu Ala Gly Cys Pro Asn Ile
50 55 60
tat gag tgc tgg gag gat cgt gaa gca acg ttt ttg atc gga ggc gat 240
Tyr Glu Cys Trp Glu Asp Arg Glu Ala Thr Phe Leu Ile Gly Gly Asp
65 70 75 80
cag tgc act cgt cgc tgc gac ttt tgt caa atc gat aca gga aaa cct 288
Gln Cys Thr Arg Arg Cys Asp Phe Cys Gln Ile Asp Thr Gly Lys Pro
85 90 95
gaa gcc ctt gat cgt gat gag cca cgt cgt gta gga gaa tcg gtc gtc 336
Glu Ala Leu Asp Arg Asp Glu Pro Arg Arg Val Gly Glu Ser Val Val
100 105 110
aca atg gat ctt aac tat gct acc att act ggc gtt gca cgt gat gat 384
Thr Met Asp Leu Asn Tyr Ala Thr Ile Thr Gly Val Ala Arg Asp Asp
115 120 125
ctg ccc gat ggt gga gct tgg ctt tac gct gaa act gtt cgt cag atc 432
Leu Pro Asp Gly Gly Ala Trp Leu Tyr Ala Glu Thr Val Arg Gln Ile
130 135 140
cat gaa cag act gcg ggt cgc gaa gcc ggg cgt acc aaa gtt gaa ctt 480
His Glu Gln Thr Ala Gly Arg Glu Ala Gly Arg Thr Lys Val Glu Leu
145 150 155 160
ctt gca cct gat ttt aac gcg gtg cca gag ctg tta cgc gaa gtc ttt 528
Leu Ala Pro Asp Phe Asn Ala Val Pro Glu Leu Leu Arg Glu Val Phe
165 170 175
gaa tcg cgc cct gaa gtc ttc gct cac aat gta gag aca gta cca cgc 576
Glu Ser Arg Pro Glu Val Phe Ala His Asn Val Glu Thr Val Pro Arg
180 185 190
atc ttt aag cgt att cgt cca ggg ttt cgc tat gag cgc agt ctt aag 624
Ile Phe Lys Arg Ile Arg Pro Gly Phe Arg Tyr Glu Arg Ser Leu Lys
195 200 205
gtt atc act gat gcg cgt gat ttt gga ttg gtc acc aaa tcg aac ctt 672
Val Ile Thr Asp Ala Arg Asp Phe Gly Leu Val Thr Lys Ser Asn Leu
210 215 220
atc ttg gga atg ggg gaa aca cgc gag gaa att tcg gaa gcc tta aaa 720
Ile Leu Gly Met Gly Glu Thr Arg Glu Glu Ile Ser Glu Ala Leu Lys
225 230 235 240
cag ctg cat gaa gcg ggt tgc gag ctt att aca atc acc cag tat ctt 768
Gln Leu His Glu Ala Gly Cys Glu Leu Ile Thr Ile Thr Gln Tyr Leu
245 250 255
cgt cct agt gtc cgt cat cat ccg gtc gaa cgc tgg gta aaa ccc cag 816
Arg Pro Ser Val Arg His His Pro Val Glu Arg Trp Val Lys Pro Gln
260 265 270
gaa ttt gtg gaa ctt aag gag gag gcg gag caa att ggc ttt tcg ggt 864
Glu Phe Val Glu Leu Lys Glu Glu Ala Glu Gln Ile Gly Phe Ser Gly
275 280 285
gta atg tca ggc ccc ctt gtg cgt tcc tcg tac cgt gcg ggg cgc ttg 912
Val Met Ser Gly Pro Leu Val Arg Ser Ser Tyr Arg Ala Gly Arg Leu
290 295 300
tac gga atg gct atg gaa cag cgc cgt tcc gca acc gtt tga 954
Tyr Gly Met Ala Met Glu Gln Arg Arg Ser Ala Thr Val
305 310 315
<210> 113
<211> 317
<212> PRT
<213> Streptomyces coelicolor A3(2)
<400> 113
Met Ser Ala Val Ala Pro Asp Gly Arg Lys Met Leu Arg Leu Glu Val
1 5 10 15
Arg Asn Ser Gln Thr Pro Ile Glu Arg Lys Pro Glu Trp Ile Lys Thr
20 25 30
Arg Ala Lys Met Gly Pro Glu Tyr Thr Lys Met Gln Asn Leu Val Lys
35 40 45
Ser Glu Gly Leu His Thr Val Cys Gln Glu Ala Gly Cys Pro Asn Ile
50 55 60
Tyr Glu Cys Trp Glu Asp Arg Glu Ala Thr Phe Leu Ile Gly Gly Asp
65 70 75 80
Gln Cys Thr Arg Arg Cys Asp Phe Cys Gln Ile Asp Thr Gly Lys Pro
85 90 95
Glu Ala Leu Asp Arg Asp Glu Pro Arg Arg Val Gly Glu Ser Val Val
100 105 110
Thr Met Asp Leu Asn Tyr Ala Thr Ile Thr Gly Val Ala Arg Asp Asp
115 120 125
Leu Pro Asp Gly Gly Ala Trp Leu Tyr Ala Glu Thr Val Arg Gln Ile
130 135 140
His Glu Gln Thr Ala Gly Arg Glu Ala Gly Arg Thr Lys Val Glu Leu
145 150 155 160
Leu Ala Pro Asp Phe Asn Ala Val Pro Glu Leu Leu Arg Glu Val Phe
165 170 175
Glu Ser Arg Pro Glu Val Phe Ala His Asn Val Glu Thr Val Pro Arg
180 185 190
Ile Phe Lys Arg Ile Arg Pro Gly Phe Arg Tyr Glu Arg Ser Leu Lys
195 200 205
Val Ile Thr Asp Ala Arg Asp Phe Gly Leu Val Thr Lys Ser Asn Leu
210 215 220
Ile Leu Gly Met Gly Glu Thr Arg Glu Glu Ile Ser Glu Ala Leu Lys
225 230 235 240
Gln Leu His Glu Ala Gly Cys Glu Leu Ile Thr Ile Thr Gln Tyr Leu
245 250 255
Arg Pro Ser Val Arg His His Pro Val Glu Arg Trp Val Lys Pro Gln
260 265 270
Glu Phe Val Glu Leu Lys Glu Glu Ala Glu Gln Ile Gly Phe Ser Gly
275 280 285
Val Met Ser Gly Pro Leu Val Arg Ser Ser Tyr Arg Ala Gly Arg Leu
290 295 300
Tyr Gly Met Ala Met Glu Gln Arg Arg Ser Ala Thr Val
305 310 315
<210> 114
<211> 642
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(642)
<223> lipB encoding octanoyltransferase (LipB)
<400> 114
atg tat cag gat aaa att ctt gtc cgc cag ctc ggt ctt cag cct tac 48
Met Tyr Gln Asp Lys Ile Leu Val Arg Gln Leu Gly Leu Gln Pro Tyr
1 5 10 15
gag cca atc tcc cag gct atg cat gaa ttc acc gat acc cgc gat gat 96
Glu Pro Ile Ser Gln Ala Met His Glu Phe Thr Asp Thr Arg Asp Asp
20 25 30
agt acc ctt gat gaa atc tgg ctg gtc gag cac tat ccg gta ttc acc 144
Ser Thr Leu Asp Glu Ile Trp Leu Val Glu His Tyr Pro Val Phe Thr
35 40 45
caa ggt cag gca gga aaa gcg gag cac att tta atg ccg ggt gat att 192
Gln Gly Gln Ala Gly Lys Ala Glu His Ile Leu Met Pro Gly Asp Ile
50 55 60
ccg gtg atc cag agc gat cgc ggt ggg cag gtg act tat cac ggg ccg 240
Pro Val Ile Gln Ser Asp Arg Gly Gly Gln Val Thr Tyr His Gly Pro
65 70 75 80
ggg caa cag gtg atg tat gtg ttg ctt aac ctg aaa cgc cgt aaa ctc 288
Gly Gln Gln Val Met Tyr Val Leu Leu Asn Leu Lys Arg Arg Lys Leu
85 90 95
ggt gtg cgt gaa ctg gtg acc ttg ctt gag caa aca gtg gtg aat acc 336
Gly Val Arg Glu Leu Val Thr Leu Leu Glu Gln Thr Val Val Asn Thr
100 105 110
ctg gct gaa ctg ggt ata gaa gcg cat cct cgg gct gac gcg cca ggt 384
Leu Ala Glu Leu Gly Ile Glu Ala His Pro Arg Ala Asp Ala Pro Gly
115 120 125
gtc tat gtt ggg gaa aag aaa att tgc tca ctg ggt tta cgt att cga 432
Val Tyr Val Gly Glu Lys Lys Ile Cys Ser Leu Gly Leu Arg Ile Arg
130 135 140
cgc ggt tgt tca ttc cac ggt ctg gca tta aac gtc aat atg gat ctt 480
Arg Gly Cys Ser Phe His Gly Leu Ala Leu Asn Val Asn Met Asp Leu
145 150 155 160
tca cca ttt tta cgt att aat cct tgt ggg tat gcc gga atg gaa atg 528
Ser Pro Phe Leu Arg Ile Asn Pro Cys Gly Tyr Ala Gly Met Glu Met
165 170 175
gct aaa ata tca caa tgg aaa ccc gaa gcg acg act aat aat att gct 576
Ala Lys Ile Ser Gln Trp Lys Pro Glu Ala Thr Thr Asn Asn Ile Ala
180 185 190
cca cgt tta ctg gaa aat att tta gcg cta cta aac aat ccg gac ttc 624
Pro Arg Leu Leu Glu Asn Ile Leu Ala Leu Leu Asn Asn Pro Asp Phe
195 200 205
gaa tat att acc gct taa 642
Glu Tyr Ile Thr Ala
210
<210> 115
<211> 213
<212> PRT
<213> Escherichia coli
<400> 115
Met Tyr Gln Asp Lys Ile Leu Val Arg Gln Leu Gly Leu Gln Pro Tyr
1 5 10 15
Glu Pro Ile Ser Gln Ala Met His Glu Phe Thr Asp Thr Arg Asp Asp
20 25 30
Ser Thr Leu Asp Glu Ile Trp Leu Val Glu His Tyr Pro Val Phe Thr
35 40 45
Gln Gly Gln Ala Gly Lys Ala Glu His Ile Leu Met Pro Gly Asp Ile
50 55 60
Pro Val Ile Gln Ser Asp Arg Gly Gly Gln Val Thr Tyr His Gly Pro
65 70 75 80
Gly Gln Gln Val Met Tyr Val Leu Leu Asn Leu Lys Arg Arg Lys Leu
85 90 95
Gly Val Arg Glu Leu Val Thr Leu Leu Glu Gln Thr Val Val Asn Thr
100 105 110
Leu Ala Glu Leu Gly Ile Glu Ala His Pro Arg Ala Asp Ala Pro Gly
115 120 125
Val Tyr Val Gly Glu Lys Lys Ile Cys Ser Leu Gly Leu Arg Ile Arg
130 135 140
Arg Gly Cys Ser Phe His Gly Leu Ala Leu Asn Val Asn Met Asp Leu
145 150 155 160
Ser Pro Phe Leu Arg Ile Asn Pro Cys Gly Tyr Ala Gly Met Glu Met
165 170 175
Ala Lys Ile Ser Gln Trp Lys Pro Glu Ala Thr Thr Asn Asn Ile Ala
180 185 190
Pro Arg Leu Leu Glu Asn Ile Leu Ala Leu Leu Asn Asn Pro Asp Phe
195 200 205
Glu Tyr Ile Thr Ala
210
<210> 116
<211> 576
<212> DNA
<213> Shigella flexneri
<220>
<221> CDS
<222> (1)..(576)
<223> lipB gene encoding a octanoyltransferase (LipB)
<400> 116
atg cat gaa ttc acc gat acc cgc gat aat agt acc ctt gat gaa atc 48
Met His Glu Phe Thr Asp Thr Arg Asp Asn Ser Thr Leu Asp Glu Ile
1 5 10 15
tgg ctg gtc gag cac tat ccg gta ttc acc caa ggt cag gca gga aaa 96
Trp Leu Val Glu His Tyr Pro Val Phe Thr Gln Gly Gln Ala Gly Lys
20 25 30
gcg gag cac att tta atg ccg ggt gat att ccg gtg atc cag agc gat 144
Ala Glu His Ile Leu Met Pro Gly Asp Ile Pro Val Ile Gln Ser Asp
35 40 45
cgc ggt ggg cag gtg act tat cac ggg ccg gga caa cag gtg atg tat 192
Arg Gly Gly Gln Val Thr Tyr His Gly Pro Gly Gln Gln Val Met Tyr
50 55 60
gtg ttg ctt aac ctg aaa cgc cgt aaa ctc ggt gtg cgt gaa ctg gtg 240
Val Leu Leu Asn Leu Lys Arg Arg Lys Leu Gly Val Arg Glu Leu Val
65 70 75 80
acc ttg ctt gag caa aca gtg gtg aat acc ctg gct gaa ctg ggt ata 288
Thr Leu Leu Glu Gln Thr Val Val Asn Thr Leu Ala Glu Leu Gly Ile
85 90 95
gaa gcg cat cct cgg gct gac gcg cct ggt gtc tat gtc ggg gaa aag 336
Glu Ala His Pro Arg Ala Asp Ala Pro Gly Val Tyr Val Gly Glu Lys
100 105 110
aaa att tgc tca ctg ggt tta cga att cga cgc ggt tgt tca ttc cac 384
Lys Ile Cys Ser Leu Gly Leu Arg Ile Arg Arg Gly Cys Ser Phe His
115 120 125
ggt ctg gca tta aac gtc aat atg gat ctt tca cca ttt tta cgt att 432
Gly Leu Ala Leu Asn Val Asn Met Asp Leu Ser Pro Phe Leu Arg Ile
130 135 140
aat cct tgt ggg tat gcc gga atg gaa atg gct aaa ata tca caa tgg 480
Asn Pro Cys Gly Tyr Ala Gly Met Glu Met Ala Lys Ile Ser Gln Trp
145 150 155 160
aaa ccc gaa gcg acg act aat aat att gct cca cgt tta ctg gaa aat 528
Lys Pro Glu Ala Thr Thr Asn Asn Ile Ala Pro Arg Leu Leu Glu Asn
165 170 175
att tta gcg cta cta aac aat ccg gac ttc gaa tat att acc gct taa 576
Ile Leu Ala Leu Leu Asn Asn Pro Asp Phe Glu Tyr Ile Thr Ala
180 185 190
<210> 117
<211> 191
<212> PRT
<213> Shigella flexneri
<400> 117
Met His Glu Phe Thr Asp Thr Arg Asp Asn Ser Thr Leu Asp Glu Ile
1 5 10 15
Trp Leu Val Glu His Tyr Pro Val Phe Thr Gln Gly Gln Ala Gly Lys
20 25 30
Ala Glu His Ile Leu Met Pro Gly Asp Ile Pro Val Ile Gln Ser Asp
35 40 45
Arg Gly Gly Gln Val Thr Tyr His Gly Pro Gly Gln Gln Val Met Tyr
50 55 60
Val Leu Leu Asn Leu Lys Arg Arg Lys Leu Gly Val Arg Glu Leu Val
65 70 75 80
Thr Leu Leu Glu Gln Thr Val Val Asn Thr Leu Ala Glu Leu Gly Ile
85 90 95
Glu Ala His Pro Arg Ala Asp Ala Pro Gly Val Tyr Val Gly Glu Lys
100 105 110
Lys Ile Cys Ser Leu Gly Leu Arg Ile Arg Arg Gly Cys Ser Phe His
115 120 125
Gly Leu Ala Leu Asn Val Asn Met Asp Leu Ser Pro Phe Leu Arg Ile
130 135 140
Asn Pro Cys Gly Tyr Ala Gly Met Glu Met Ala Lys Ile Ser Gln Trp
145 150 155 160
Lys Pro Glu Ala Thr Thr Asn Asn Ile Ala Pro Arg Leu Leu Glu Asn
165 170 175
Ile Leu Ala Leu Leu Asn Asn Pro Asp Phe Glu Tyr Ile Thr Ala
180 185 190
<210> 118
<211> 1893
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1893)
<223> aceF gene encoding dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase (E2)
<400> 118
atg gct atc gaa atc aaa gta ccg gac atc ggg gct gat gaa gtt gaa 48
Met Ala Ile Glu Ile Lys Val Pro Asp Ile Gly Ala Asp Glu Val Glu
1 5 10 15
atc acc gag atc ctg gtc aaa gtg ggc gac aaa gtt gaa gcc gaa cag 96
Ile Thr Glu Ile Leu Val Lys Val Gly Asp Lys Val Glu Ala Glu Gln
20 25 30
tcg ctg atc acc gta gaa ggc gac aaa gcc tct atg gaa gtt ccg tct 144
Ser Leu Ile Thr Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ser
35 40 45
ccg cag gcg ggt atc gtt aaa gag atc aaa gtc tct gtt ggc gat aaa 192
Pro Gln Ala Gly Ile Val Lys Glu Ile Lys Val Ser Val Gly Asp Lys
50 55 60
acc cag acc ggc gca ctg att atg att ttc gat tcc gcc gac ggt gca 240
Thr Gln Thr Gly Ala Leu Ile Met Ile Phe Asp Ser Ala Asp Gly Ala
65 70 75 80
gca gac gct gca cct gct cag gca gaa gag aag aaa gaa gca gct ccg 288
Ala Asp Ala Ala Pro Ala Gln Ala Glu Glu Lys Lys Glu Ala Ala Pro
85 90 95
gca gca gca cca gcg gct gcg gcg gca aaa gac gtt aac gtt ccg gat 336
Ala Ala Ala Pro Ala Ala Ala Ala Ala Lys Asp Val Asn Val Pro Asp
100 105 110
atc ggc agc gac gaa gtt gaa gtg acc gaa atc ctg gtg aaa gtt ggc 384
Ile Gly Ser Asp Glu Val Glu Val Thr Glu Ile Leu Val Lys Val Gly
115 120 125
gat aaa gtt gaa gct gaa cag tcg ctg atc acc gta gaa ggc gac aag 432
Asp Lys Val Glu Ala Glu Gln Ser Leu Ile Thr Val Glu Gly Asp Lys
130 135 140
gct tct atg gaa gtt ccg gct ccg ttt gct ggc acc gtg aaa gag atc 480
Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly Thr Val Lys Glu Ile
145 150 155 160
aaa gtg aac gtg ggt gac aaa gtg tct acc ggc tcg ctg att atg gtc 528
Lys Val Asn Val Gly Asp Lys Val Ser Thr Gly Ser Leu Ile Met Val
165 170 175
ttc gaa gtc gcg ggt gaa gca ggc gcg gca gct ccg gcc gct aaa cag 576
Phe Glu Val Ala Gly Glu Ala Gly Ala Ala Ala Pro Ala Ala Lys Gln
180 185 190
gaa gca gct ccg gca gcg gcc cct gca cca gcg gct ggc gtg aaa gaa 624
Glu Ala Ala Pro Ala Ala Ala Pro Ala Pro Ala Ala Gly Val Lys Glu
195 200 205
gtt aac gtt ccg gat atc ggc ggt gac gaa gtt gaa gtg act gaa gtg 672
Val Asn Val Pro Asp Ile Gly Gly Asp Glu Val Glu Val Thr Glu Val
210 215 220
atg gtg aaa gtg ggc gac aaa gtt gcc gct gaa cag tca ctg atc acc 720
Met Val Lys Val Gly Asp Lys Val Ala Ala Glu Gln Ser Leu Ile Thr
225 230 235 240
gta gaa ggc gac aaa gct tct atg gaa gtt ccg gcg ccg ttt gca ggc 768
Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly
245 250 255
gtc gtg aag gaa ctg aaa gtc aac gtt ggc gat aaa gtg aaa act ggc 816
Val Val Lys Glu Leu Lys Val Asn Val Gly Asp Lys Val Lys Thr Gly
260 265 270
tcg ctg att atg atc ttc gaa gtt gaa ggc gca gcg cct gcg gca gct 864
Ser Leu Ile Met Ile Phe Glu Val Glu Gly Ala Ala Pro Ala Ala Ala
275 280 285
cct gcg aaa cag gaa gcg gca gcg ccg gca ccg gca gca aaa gct gaa 912
Pro Ala Lys Gln Glu Ala Ala Ala Pro Ala Pro Ala Ala Lys Ala Glu
290 295 300
gcc ccg gca gca gca cca gct gcg aaa gcg gaa ggc aaa tct gaa ttt 960
Ala Pro Ala Ala Ala Pro Ala Ala Lys Ala Glu Gly Lys Ser Glu Phe
305 310 315 320
gct gaa aac gac gct tat gtt cac gcg act ccg ctg atc cgc cgt ctg 1008
Ala Glu Asn Asp Ala Tyr Val His Ala Thr Pro Leu Ile Arg Arg Leu
325 330 335
gca cgc gag ttt ggt gtt aac ctt gcg aaa gtg aag ggc act ggc cgt 1056
Ala Arg Glu Phe Gly Val Asn Leu Ala Lys Val Lys Gly Thr Gly Arg
340 345 350
aaa ggt cgt atc ctg cgc gaa gac gtt cag gct tac gtg aaa gaa gct 1104
Lys Gly Arg Ile Leu Arg Glu Asp Val Gln Ala Tyr Val Lys Glu Ala
355 360 365
atc aaa cgt gca gaa gca gct ccg gca gcg act ggc ggt ggt atc cct 1152
Ile Lys Arg Ala Glu Ala Ala Pro Ala Ala Thr Gly Gly Gly Ile Pro
370 375 380
ggc atg ctg ccg tgg ccg aag gtg gac ttc agc aag ttt ggt gaa atc 1200
Gly Met Leu Pro Trp Pro Lys Val Asp Phe Ser Lys Phe Gly Glu Ile
385 390 395 400
gaa gaa gtg gaa ctg ggc cgc atc cag aaa atc tct ggt gcg aac ctg 1248
Glu Glu Val Glu Leu Gly Arg Ile Gln Lys Ile Ser Gly Ala Asn Leu
405 410 415
agc cgt aac tgg gta atg atc ccg cat gtt act cac ttc gac aaa acc 1296
Ser Arg Asn Trp Val Met Ile Pro His Val Thr His Phe Asp Lys Thr
420 425 430
gat atc acc gag ttg gaa gcg ttc cgt aaa cag cag aac gaa gaa gcg 1344
Asp Ile Thr Glu Leu Glu Ala Phe Arg Lys Gln Gln Asn Glu Glu Ala
435 440 445
gcg aaa cgt aag ctg gat gtg aag atc acc ccg gtt gtc ttc atc atg 1392
Ala Lys Arg Lys Leu Asp Val Lys Ile Thr Pro Val Val Phe Ile Met
450 455 460
aaa gcc gtt gct gca gct ctt gag cag atg cct cgc ttc aat agt tcg 1440
Lys Ala Val Ala Ala Ala Leu Glu Gln Met Pro Arg Phe Asn Ser Ser
465 470 475 480
ctg tcg gaa gac ggt cag cgt ctg acc ctg aag aaa tac atc aac atc 1488
Leu Ser Glu Asp Gly Gln Arg Leu Thr Leu Lys Lys Tyr Ile Asn Ile
485 490 495
ggt gtg gcg gtg gat acc ccg aac ggt ctg gtt gtt ccg gta ttc aaa 1536
Gly Val Ala Val Asp Thr Pro Asn Gly Leu Val Val Pro Val Phe Lys
500 505 510
gac gtc aac aag aaa ggc atc atc gag ctg tct cgc gag ctg atg act 1584
Asp Val Asn Lys Lys Gly Ile Ile Glu Leu Ser Arg Glu Leu Met Thr
515 520 525
att tct aag aaa gcg cgt gac ggt aag ctg act gcg ggc gaa atg cag 1632
Ile Ser Lys Lys Ala Arg Asp Gly Lys Leu Thr Ala Gly Glu Met Gln
530 535 540
ggc ggt tgc ttc acc atc tcc agc atc ggc ggc ctg ggt act acc cac 1680
Gly Gly Cys Phe Thr Ile Ser Ser Ile Gly Gly Leu Gly Thr Thr His
545 550 555 560
ttc gcg ccg att gtg aac gcg ccg gaa gtg gct atc ctc ggc gtt tcc 1728
Phe Ala Pro Ile Val Asn Ala Pro Glu Val Ala Ile Leu Gly Val Ser
565 570 575
aag tcc gcg atg gag ccg gtg tgg aat ggt aaa gag ttc gtg ccg cgt 1776
Lys Ser Ala Met Glu Pro Val Trp Asn Gly Lys Glu Phe Val Pro Arg
580 585 590
ctg atg ctg ccg att tct ctc tcc ttc gac cac cgc gtg atc gac ggt 1824
Leu Met Leu Pro Ile Ser Leu Ser Phe Asp His Arg Val Ile Asp Gly
595 600 605
gct gat ggt gcc cgt ttc att acc atc att aac aac acg ctg tct gac 1872
Ala Asp Gly Ala Arg Phe Ile Thr Ile Ile Asn Asn Thr Leu Ser Asp
610 615 620
att cgc cgt ctg gtg atg taa 1893
Ile Arg Arg Leu Val Met
625 630
<210> 119
<211> 630
<212> PRT
<213> Escherichia coli
<400> 119
Met Ala Ile Glu Ile Lys Val Pro Asp Ile Gly Ala Asp Glu Val Glu
1 5 10 15
Ile Thr Glu Ile Leu Val Lys Val Gly Asp Lys Val Glu Ala Glu Gln
20 25 30
Ser Leu Ile Thr Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ser
35 40 45
Pro Gln Ala Gly Ile Val Lys Glu Ile Lys Val Ser Val Gly Asp Lys
50 55 60
Thr Gln Thr Gly Ala Leu Ile Met Ile Phe Asp Ser Ala Asp Gly Ala
65 70 75 80
Ala Asp Ala Ala Pro Ala Gln Ala Glu Glu Lys Lys Glu Ala Ala Pro
85 90 95
Ala Ala Ala Pro Ala Ala Ala Ala Ala Lys Asp Val Asn Val Pro Asp
100 105 110
Ile Gly Ser Asp Glu Val Glu Val Thr Glu Ile Leu Val Lys Val Gly
115 120 125
Asp Lys Val Glu Ala Glu Gln Ser Leu Ile Thr Val Glu Gly Asp Lys
130 135 140
Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly Thr Val Lys Glu Ile
145 150 155 160
Lys Val Asn Val Gly Asp Lys Val Ser Thr Gly Ser Leu Ile Met Val
165 170 175
Phe Glu Val Ala Gly Glu Ala Gly Ala Ala Ala Pro Ala Ala Lys Gln
180 185 190
Glu Ala Ala Pro Ala Ala Ala Pro Ala Pro Ala Ala Gly Val Lys Glu
195 200 205
Val Asn Val Pro Asp Ile Gly Gly Asp Glu Val Glu Val Thr Glu Val
210 215 220
Met Val Lys Val Gly Asp Lys Val Ala Ala Glu Gln Ser Leu Ile Thr
225 230 235 240
Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly
245 250 255
Val Val Lys Glu Leu Lys Val Asn Val Gly Asp Lys Val Lys Thr Gly
260 265 270
Ser Leu Ile Met Ile Phe Glu Val Glu Gly Ala Ala Pro Ala Ala Ala
275 280 285
Pro Ala Lys Gln Glu Ala Ala Ala Pro Ala Pro Ala Ala Lys Ala Glu
290 295 300
Ala Pro Ala Ala Ala Pro Ala Ala Lys Ala Glu Gly Lys Ser Glu Phe
305 310 315 320
Ala Glu Asn Asp Ala Tyr Val His Ala Thr Pro Leu Ile Arg Arg Leu
325 330 335
Ala Arg Glu Phe Gly Val Asn Leu Ala Lys Val Lys Gly Thr Gly Arg
340 345 350
Lys Gly Arg Ile Leu Arg Glu Asp Val Gln Ala Tyr Val Lys Glu Ala
355 360 365
Ile Lys Arg Ala Glu Ala Ala Pro Ala Ala Thr Gly Gly Gly Ile Pro
370 375 380
Gly Met Leu Pro Trp Pro Lys Val Asp Phe Ser Lys Phe Gly Glu Ile
385 390 395 400
Glu Glu Val Glu Leu Gly Arg Ile Gln Lys Ile Ser Gly Ala Asn Leu
405 410 415
Ser Arg Asn Trp Val Met Ile Pro His Val Thr His Phe Asp Lys Thr
420 425 430
Asp Ile Thr Glu Leu Glu Ala Phe Arg Lys Gln Gln Asn Glu Glu Ala
435 440 445
Ala Lys Arg Lys Leu Asp Val Lys Ile Thr Pro Val Val Phe Ile Met
450 455 460
Lys Ala Val Ala Ala Ala Leu Glu Gln Met Pro Arg Phe Asn Ser Ser
465 470 475 480
Leu Ser Glu Asp Gly Gln Arg Leu Thr Leu Lys Lys Tyr Ile Asn Ile
485 490 495
Gly Val Ala Val Asp Thr Pro Asn Gly Leu Val Val Pro Val Phe Lys
500 505 510
Asp Val Asn Lys Lys Gly Ile Ile Glu Leu Ser Arg Glu Leu Met Thr
515 520 525
Ile Ser Lys Lys Ala Arg Asp Gly Lys Leu Thr Ala Gly Glu Met Gln
530 535 540
Gly Gly Cys Phe Thr Ile Ser Ser Ile Gly Gly Leu Gly Thr Thr His
545 550 555 560
Phe Ala Pro Ile Val Asn Ala Pro Glu Val Ala Ile Leu Gly Val Ser
565 570 575
Lys Ser Ala Met Glu Pro Val Trp Asn Gly Lys Glu Phe Val Pro Arg
580 585 590
Leu Met Leu Pro Ile Ser Leu Ser Phe Asp His Arg Val Ile Asp Gly
595 600 605
Ala Asp Gly Ala Arg Phe Ile Thr Ile Ile Asn Asn Thr Leu Ser Asp
610 615 620
Ile Arg Arg Leu Val Met
625 630
<210> 120
<211> 1890
<212> DNA
<213> Klebsiella oxytoca
<220>
<221> CDS
<222> (1)..(1890)
<223> aceF gene encoding dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase (E2)
<400> 120
atg gct atc gag atc aag gtg ccc gac atc ggc gct gac gaa gta gaa 48
Met Ala Ile Glu Ile Lys Val Pro Asp Ile Gly Ala Asp Glu Val Glu
1 5 10 15
att acc gag atc ctg gtt aaa gtc ggt gat aag gta gag gca gaa cag 96
Ile Thr Glu Ile Leu Val Lys Val Gly Asp Lys Val Glu Ala Glu Gln
20 25 30
agt tta atc act gta gaa ggc gat aag gca tcc atg gag gta cca tca 144
Ser Leu Ile Thr Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ser
35 40 45
ccg caa gct ggt gta gtt aaa gag atc aaa gtg agt gta ggc gac aaa 192
Pro Gln Ala Gly Val Val Lys Glu Ile Lys Val Ser Val Gly Asp Lys
50 55 60
act gaa acc ggt aag tta att atg atc ttc gat tca gcc gac ggg gca 240
Thr Glu Thr Gly Lys Leu Ile Met Ile Phe Asp Ser Ala Asp Gly Ala
65 70 75 80
gcc gcc gct gct ccc gca cag gaa gag aaa aag gag gcg gca cct gcc 288
Ala Ala Ala Ala Pro Ala Gln Glu Glu Lys Lys Glu Ala Ala Pro Ala
85 90 95
gct gca gca cca gca gcc gct tct gca aaa gag gtg cat gta cct gac 336
Ala Ala Ala Pro Ala Ala Ala Ser Ala Lys Glu Val His Val Pro Asp
100 105 110
att gga ggc gac gaa gta gaa gta aca gag att atg gtc aag gtt ggc 384
Ile Gly Gly Asp Glu Val Glu Val Thr Glu Ile Met Val Lys Val Gly
115 120 125
gat aca atc gca gcg gaa caa agc tta att acg gta gaa ggc gat aaa 432
Asp Thr Ile Ala Ala Glu Gln Ser Leu Ile Thr Val Glu Gly Asp Lys
130 135 140
gca agc atg gaa gtt ccc gct ccc ttc gct ggg act gta aaa gaa atc 480
Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly Thr Val Lys Glu Ile
145 150 155 160
aag att aac acc ggc gac aag gtt tcc acc ggc tca tta att atg atc 528
Lys Ile Asn Thr Gly Asp Lys Val Ser Thr Gly Ser Leu Ile Met Ile
165 170 175
ttc gaa gta gca gga gct gca cct gcg gca gcg cct gcg aaa gcg gag 576
Phe Glu Val Ala Gly Ala Ala Pro Ala Ala Ala Pro Ala Lys Ala Glu
180 185 190
gct gca cct gca gcg gcg gct ccc gcc gct agt ggt agt aaa gaa gtg 624
Ala Ala Pro Ala Ala Ala Ala Pro Ala Ala Ser Gly Ser Lys Glu Val
195 200 205
cac gtt ccg gac atc gga ggt gac gag gtc gaa gtc act gaa gtg atg 672
His Val Pro Asp Ile Gly Gly Asp Glu Val Glu Val Thr Glu Val Met
210 215 220
gta aaa gca ggg gat aaa atc gca gcc gag cag agt tta att aca gtc 720
Val Lys Ala Gly Asp Lys Ile Ala Ala Glu Gln Ser Leu Ile Thr Val
225 230 235 240
gaa ggc gat aag gcg tct atg gaa gtt cca gcg cca ttc gcc ggt aca 768
Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly Thr
245 250 255
gta aag gaa att aag atc agc act gga gat aaa gtc tca act ggt tca 816
Val Lys Glu Ile Lys Ile Ser Thr Gly Asp Lys Val Ser Thr Gly Ser
260 265 270
ttg atc atg gtc ttc gaa gtc gaa ggc gcc gca cct gcg gcg gca ccg 864
Leu Ile Met Val Phe Glu Val Glu Gly Ala Ala Pro Ala Ala Ala Pro
275 280 285
gca gcg gct gcc gct cca gca cca gct gct gcg ccc gca caa gct gca 912
Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Gln Ala Ala
290 295 300
aaa cca gct gcc ccc gca gcg aag gcc gaa ggc aag agc gag ttc gca 960
Lys Pro Ala Ala Pro Ala Ala Lys Ala Glu Gly Lys Ser Glu Phe Ala
305 310 315 320
gag aat gat gcg tac gta cat gcg aca cca ttg att cgc cgt ttg gca 1008
Glu Asn Asp Ala Tyr Val His Ala Thr Pro Leu Ile Arg Arg Leu Ala
325 330 335
cgc gaa ttc ggt gtg aac ctg gct aaa gtg aag gga acg ggg cgc aaa 1056
Arg Glu Phe Gly Val Asn Leu Ala Lys Val Lys Gly Thr Gly Arg Lys
340 345 350
ggc cgc att ttg cgt gag gac gtc cag gct tat gtt aaa gaa gca gtc 1104
Gly Arg Ile Leu Arg Glu Asp Val Gln Ala Tyr Val Lys Glu Ala Val
355 360 365
aag cgt gcc gaa gca gcg cca gcc gca acc ggc ggg ggg atc cca ggc 1152
Lys Arg Ala Glu Ala Ala Pro Ala Ala Thr Gly Gly Gly Ile Pro Gly
370 375 380
atg tta ccc tgg cca aag gta gac ttt tca aaa ttt ggg gag gta gaa 1200
Met Leu Pro Trp Pro Lys Val Asp Phe Ser Lys Phe Gly Glu Val Glu
385 390 395 400
gag gtt gag ttg gga cgc atc cag aag atc tcc ggt gca aat ttg tcg 1248
Glu Val Glu Leu Gly Arg Ile Gln Lys Ile Ser Gly Ala Asn Leu Ser
405 410 415
cgc aac tgg gtc atg att ccg cac gtc act cac ttt gac aaa acg gac 1296
Arg Asn Trp Val Met Ile Pro His Val Thr His Phe Asp Lys Thr Asp
420 425 430
att acc gat ttg gag gct ttt cgt aag caa caa aat gct gag gcg gag 1344
Ile Thr Asp Leu Glu Ala Phe Arg Lys Gln Gln Asn Ala Glu Ala Glu
435 440 445
aag cgt aaa ttg gac gtg aag ttc acc ccg gtg gtg ttc att atg aag 1392
Lys Arg Lys Leu Asp Val Lys Phe Thr Pro Val Val Phe Ile Met Lys
450 455 460
gca gtg gcc gca gca ctt gaa cag atg ccg cgt ttc aac tca tcc ctg 1440
Ala Val Ala Ala Ala Leu Glu Gln Met Pro Arg Phe Asn Ser Ser Leu
465 470 475 480
tca gag gat gct caa cgt ctt acc tta aag aag tac atc aac att ggt 1488
Ser Glu Asp Ala Gln Arg Leu Thr Leu Lys Lys Tyr Ile Asn Ile Gly
485 490 495
gtt gct gtg gac acg cca aat ggg ttg gta gtc ccc gtg ttt aaa gac 1536
Val Ala Val Asp Thr Pro Asn Gly Leu Val Val Pro Val Phe Lys Asp
500 505 510
gtt aat aag aag tcc att aca gag tta tcg cgc gaa tta act gtt atc 1584
Val Asn Lys Lys Ser Ile Thr Glu Leu Ser Arg Glu Leu Thr Val Ile
515 520 525
agc aag aaa gca cgc gat ggg aag ctg act gcc ggc gaa atg caa ggc 1632
Ser Lys Lys Ala Arg Asp Gly Lys Leu Thr Ala Gly Glu Met Gln Gly
530 535 540
gga tgt ttt acc atc tcg agt att ggg gga tta ggg acc aca cat ttt 1680
Gly Cys Phe Thr Ile Ser Ser Ile Gly Gly Leu Gly Thr Thr His Phe
545 550 555 560
gca ccc atc gtc aat gca cct gaa gta gct atc tta ggg gta tca aaa 1728
Ala Pro Ile Val Asn Ala Pro Glu Val Ala Ile Leu Gly Val Ser Lys
565 570 575
tca gcg atg gag ccg gtt tgg aat ggg aaa gag ttc gta ccc cgt ctg 1776
Ser Ala Met Glu Pro Val Trp Asn Gly Lys Glu Phe Val Pro Arg Leu
580 585 590
atg atg cca att tca ctg tca ttc gac cat cgc gtc att gat ggc gcg 1824
Met Met Pro Ile Ser Leu Ser Phe Asp His Arg Val Ile Asp Gly Ala
595 600 605
gat ggc gca cgc ttt atc aca att atc aat aat atg ctt tca gat att 1872
Asp Gly Ala Arg Phe Ile Thr Ile Ile Asn Asn Met Leu Ser Asp Ile
610 615 620
cgc cgt tta gtc atg taa 1890
Arg Arg Leu Val Met
625
<210> 121
<211> 629
<212> PRT
<213> Klebsiella oxytoca
<400> 121
Met Ala Ile Glu Ile Lys Val Pro Asp Ile Gly Ala Asp Glu Val Glu
1 5 10 15
Ile Thr Glu Ile Leu Val Lys Val Gly Asp Lys Val Glu Ala Glu Gln
20 25 30
Ser Leu Ile Thr Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ser
35 40 45
Pro Gln Ala Gly Val Val Lys Glu Ile Lys Val Ser Val Gly Asp Lys
50 55 60
Thr Glu Thr Gly Lys Leu Ile Met Ile Phe Asp Ser Ala Asp Gly Ala
65 70 75 80
Ala Ala Ala Ala Pro Ala Gln Glu Glu Lys Lys Glu Ala Ala Pro Ala
85 90 95
Ala Ala Ala Pro Ala Ala Ala Ser Ala Lys Glu Val His Val Pro Asp
100 105 110
Ile Gly Gly Asp Glu Val Glu Val Thr Glu Ile Met Val Lys Val Gly
115 120 125
Asp Thr Ile Ala Ala Glu Gln Ser Leu Ile Thr Val Glu Gly Asp Lys
130 135 140
Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly Thr Val Lys Glu Ile
145 150 155 160
Lys Ile Asn Thr Gly Asp Lys Val Ser Thr Gly Ser Leu Ile Met Ile
165 170 175
Phe Glu Val Ala Gly Ala Ala Pro Ala Ala Ala Pro Ala Lys Ala Glu
180 185 190
Ala Ala Pro Ala Ala Ala Ala Pro Ala Ala Ser Gly Ser Lys Glu Val
195 200 205
His Val Pro Asp Ile Gly Gly Asp Glu Val Glu Val Thr Glu Val Met
210 215 220
Val Lys Ala Gly Asp Lys Ile Ala Ala Glu Gln Ser Leu Ile Thr Val
225 230 235 240
Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly Thr
245 250 255
Val Lys Glu Ile Lys Ile Ser Thr Gly Asp Lys Val Ser Thr Gly Ser
260 265 270
Leu Ile Met Val Phe Glu Val Glu Gly Ala Ala Pro Ala Ala Ala Pro
275 280 285
Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Gln Ala Ala
290 295 300
Lys Pro Ala Ala Pro Ala Ala Lys Ala Glu Gly Lys Ser Glu Phe Ala
305 310 315 320
Glu Asn Asp Ala Tyr Val His Ala Thr Pro Leu Ile Arg Arg Leu Ala
325 330 335
Arg Glu Phe Gly Val Asn Leu Ala Lys Val Lys Gly Thr Gly Arg Lys
340 345 350
Gly Arg Ile Leu Arg Glu Asp Val Gln Ala Tyr Val Lys Glu Ala Val
355 360 365
Lys Arg Ala Glu Ala Ala Pro Ala Ala Thr Gly Gly Gly Ile Pro Gly
370 375 380
Met Leu Pro Trp Pro Lys Val Asp Phe Ser Lys Phe Gly Glu Val Glu
385 390 395 400
Glu Val Glu Leu Gly Arg Ile Gln Lys Ile Ser Gly Ala Asn Leu Ser
405 410 415
Arg Asn Trp Val Met Ile Pro His Val Thr His Phe Asp Lys Thr Asp
420 425 430
Ile Thr Asp Leu Glu Ala Phe Arg Lys Gln Gln Asn Ala Glu Ala Glu
435 440 445
Lys Arg Lys Leu Asp Val Lys Phe Thr Pro Val Val Phe Ile Met Lys
450 455 460
Ala Val Ala Ala Ala Leu Glu Gln Met Pro Arg Phe Asn Ser Ser Leu
465 470 475 480
Ser Glu Asp Ala Gln Arg Leu Thr Leu Lys Lys Tyr Ile Asn Ile Gly
485 490 495
Val Ala Val Asp Thr Pro Asn Gly Leu Val Val Pro Val Phe Lys Asp
500 505 510
Val Asn Lys Lys Ser Ile Thr Glu Leu Ser Arg Glu Leu Thr Val Ile
515 520 525
Ser Lys Lys Ala Arg Asp Gly Lys Leu Thr Ala Gly Glu Met Gln Gly
530 535 540
Gly Cys Phe Thr Ile Ser Ser Ile Gly Gly Leu Gly Thr Thr His Phe
545 550 555 560
Ala Pro Ile Val Asn Ala Pro Glu Val Ala Ile Leu Gly Val Ser Lys
565 570 575
Ser Ala Met Glu Pro Val Trp Asn Gly Lys Glu Phe Val Pro Arg Leu
580 585 590
Met Met Pro Ile Ser Leu Ser Phe Asp His Arg Val Ile Asp Gly Ala
595 600 605
Asp Gly Ala Arg Phe Ile Thr Ile Ile Asn Asn Met Leu Ser Asp Ile
610 615 620
Arg Arg Leu Val Met
625
<210> 122
<211> 1017
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1017)
<223> lplA gene encoding lipoate-protein ligase A (LplA)
<400> 122
atg tcc aca tta cgc ctg ctc atc tct gac tct tac gac ccg tgg ttt 48
Met Ser Thr Leu Arg Leu Leu Ile Ser Asp Ser Tyr Asp Pro Trp Phe
1 5 10 15
aac ctg gcg gtg gaa gag tgt att ttt cgc caa atg ccc gcc acg cag 96
Asn Leu Ala Val Glu Glu Cys Ile Phe Arg Gln Met Pro Ala Thr Gln
20 25 30
cgc gtt ctg ttt ctc tgg cgc aat gcc gac acg gta gta att ggt cgc 144
Arg Val Leu Phe Leu Trp Arg Asn Ala Asp Thr Val Val Ile Gly Arg
35 40 45
gcg cag aac ccg tgg aaa gag tgt aat acc cgg cgg atg gaa gaa gat 192
Ala Gln Asn Pro Trp Lys Glu Cys Asn Thr Arg Arg Met Glu Glu Asp
50 55 60
aac gtc cgc ctg gcg cgg cgc agt agc ggt ggc ggc gcg gtg ttc cac 240
Asn Val Arg Leu Ala Arg Arg Ser Ser Gly Gly Gly Ala Val Phe His
65 70 75 80
gat ctc ggc aat acc tgc ttt acc ttt atg gct ggc aag ccg gag tac 288
Asp Leu Gly Asn Thr Cys Phe Thr Phe Met Ala Gly Lys Pro Glu Tyr
85 90 95
gat aaa act atc tcc acg tcg att gtg ctc aat gcg ctg aac gcg ctc 336
Asp Lys Thr Ile Ser Thr Ser Ile Val Leu Asn Ala Leu Asn Ala Leu
100 105 110
ggc gtc agc gcc gaa gcg tcc gga cgt aac gat ctg gtg gtg aaa acc 384
Gly Val Ser Ala Glu Ala Ser Gly Arg Asn Asp Leu Val Val Lys Thr
115 120 125
gtc gaa ggc gac cgc aaa gtc tca ggc tcg gcc tat cgc gaa acc aaa 432
Val Glu Gly Asp Arg Lys Val Ser Gly Ser Ala Tyr Arg Glu Thr Lys
130 135 140
gat cgc ggc ttc cac cac ggc acc ttg cta ctc aat gcc gac ctc agc 480
Asp Arg Gly Phe His His Gly Thr Leu Leu Leu Asn Ala Asp Leu Ser
145 150 155 160
cgc ctg gca aac tat ctc aat ccg gat aaa aag aaa ctg gcg gcg aaa 528
Arg Leu Ala Asn Tyr Leu Asn Pro Asp Lys Lys Lys Leu Ala Ala Lys
165 170 175
ggc att acg tcg gta cgt tcc cgc gtg acc aac ctc acc gag ctg ttg 576
Gly Ile Thr Ser Val Arg Ser Arg Val Thr Asn Leu Thr Glu Leu Leu
180 185 190
ccg ggg atc acc cat gag cag gtt tgc gag gcc ata acc gag gcc ttt 624
Pro Gly Ile Thr His Glu Gln Val Cys Glu Ala Ile Thr Glu Ala Phe
195 200 205
ttc gcc cat tat ggc gag cgc gtg gaa gcg gaa atc atc tcc ccg aac 672
Phe Ala His Tyr Gly Glu Arg Val Glu Ala Glu Ile Ile Ser Pro Asn
210 215 220
aaa acg cca gac ttg cca aac ttc gcc gaa acc ttt gcc cgc cag agt 720
Lys Thr Pro Asp Leu Pro Asn Phe Ala Glu Thr Phe Ala Arg Gln Ser
225 230 235 240
agc tgg gaa tgg aac ttc ggt cag gct ccg gca ttc tcg cat ctg ctg 768
Ser Trp Glu Trp Asn Phe Gly Gln Ala Pro Ala Phe Ser His Leu Leu
245 250 255
gat gaa cgc ttt acc tgg ggc ggc gtg gaa ctg cat ttc gac gtt gaa 816
Asp Glu Arg Phe Thr Trp Gly Gly Val Glu Leu His Phe Asp Val Glu
260 265 270
aaa ggc cat atc acc cgc gcc cag gtg ttt acc gac agc ctc aac ccc 864
Lys Gly His Ile Thr Arg Ala Gln Val Phe Thr Asp Ser Leu Asn Pro
275 280 285
gcg ccg ctg gaa gcc ctc gcc gga cga ctg caa ggc tgc ctg tac cgc 912
Ala Pro Leu Glu Ala Leu Ala Gly Arg Leu Gln Gly Cys Leu Tyr Arg
290 295 300
gca gat atg ctg caa cag gag tgc gaa gcg ctg ttg gtt gac ttc ccg 960
Ala Asp Met Leu Gln Gln Glu Cys Glu Ala Leu Leu Val Asp Phe Pro
305 310 315 320
gaa cag gaa aaa gag cta cgg gag tta tcg gca tgg atg gcg ggg gct 1008
Glu Gln Glu Lys Glu Leu Arg Glu Leu Ser Ala Trp Met Ala Gly Ala
325 330 335
gta agg tag 1017
Val Arg
<210> 123
<211> 338
<212> PRT
<213> Escherichia coli
<400> 123
Met Ser Thr Leu Arg Leu Leu Ile Ser Asp Ser Tyr Asp Pro Trp Phe
1 5 10 15
Asn Leu Ala Val Glu Glu Cys Ile Phe Arg Gln Met Pro Ala Thr Gln
20 25 30
Arg Val Leu Phe Leu Trp Arg Asn Ala Asp Thr Val Val Ile Gly Arg
35 40 45
Ala Gln Asn Pro Trp Lys Glu Cys Asn Thr Arg Arg Met Glu Glu Asp
50 55 60
Asn Val Arg Leu Ala Arg Arg Ser Ser Gly Gly Gly Ala Val Phe His
65 70 75 80
Asp Leu Gly Asn Thr Cys Phe Thr Phe Met Ala Gly Lys Pro Glu Tyr
85 90 95
Asp Lys Thr Ile Ser Thr Ser Ile Val Leu Asn Ala Leu Asn Ala Leu
100 105 110
Gly Val Ser Ala Glu Ala Ser Gly Arg Asn Asp Leu Val Val Lys Thr
115 120 125
Val Glu Gly Asp Arg Lys Val Ser Gly Ser Ala Tyr Arg Glu Thr Lys
130 135 140
Asp Arg Gly Phe His His Gly Thr Leu Leu Leu Asn Ala Asp Leu Ser
145 150 155 160
Arg Leu Ala Asn Tyr Leu Asn Pro Asp Lys Lys Lys Leu Ala Ala Lys
165 170 175
Gly Ile Thr Ser Val Arg Ser Arg Val Thr Asn Leu Thr Glu Leu Leu
180 185 190
Pro Gly Ile Thr His Glu Gln Val Cys Glu Ala Ile Thr Glu Ala Phe
195 200 205
Phe Ala His Tyr Gly Glu Arg Val Glu Ala Glu Ile Ile Ser Pro Asn
210 215 220
Lys Thr Pro Asp Leu Pro Asn Phe Ala Glu Thr Phe Ala Arg Gln Ser
225 230 235 240
Ser Trp Glu Trp Asn Phe Gly Gln Ala Pro Ala Phe Ser His Leu Leu
245 250 255
Asp Glu Arg Phe Thr Trp Gly Gly Val Glu Leu His Phe Asp Val Glu
260 265 270
Lys Gly His Ile Thr Arg Ala Gln Val Phe Thr Asp Ser Leu Asn Pro
275 280 285
Ala Pro Leu Glu Ala Leu Ala Gly Arg Leu Gln Gly Cys Leu Tyr Arg
290 295 300
Ala Asp Met Leu Gln Gln Glu Cys Glu Ala Leu Leu Val Asp Phe Pro
305 310 315 320
Glu Gln Glu Lys Glu Leu Arg Glu Leu Ser Ala Trp Met Ala Gly Ala
325 330 335
Val Arg
<210> 124
<211> 1017
<212> DNA
<213> Klebsiella oxytoca
<220>
<221> CDS
<222> (1)..(1017)
<223> lplA gene encoding lipoate-protein ligase A (LplA)
<400> 124
atg tca acc ttg cgt ttg ctg tta tcc gac agt tat gac cca tgg ttt 48
Met Ser Thr Leu Arg Leu Leu Leu Ser Asp Ser Tyr Asp Pro Trp Phe
1 5 10 15
aac ctt gcc gta gag gag tcc att ttc cgc cag atg cca gcg aca cag 96
Asn Leu Ala Val Glu Glu Ser Ile Phe Arg Gln Met Pro Ala Thr Gln
20 25 30
cgt gtc ttg ttt ttg tgg cgt aac gcc gat acc gta gtt atc gga cgc 144
Arg Val Leu Phe Leu Trp Arg Asn Ala Asp Thr Val Val Ile Gly Arg
35 40 45
gca cag aat cca tgg aag gag tgt aac aca cgt cgc atg gag gag gac 192
Ala Gln Asn Pro Trp Lys Glu Cys Asn Thr Arg Arg Met Glu Glu Asp
50 55 60
aat gtg cgt ctg gct cgt cgc tct tcc ggg gga ggt gct gtg ttt cat 240
Asn Val Arg Leu Ala Arg Arg Ser Ser Gly Gly Gly Ala Val Phe His
65 70 75 80
gac ctt ggc aat acc tgc ttt aca ttc atg gcg ggg aag cct gaa tac 288
Asp Leu Gly Asn Thr Cys Phe Thr Phe Met Ala Gly Lys Pro Glu Tyr
85 90 95
gac aaa aca gtg tct acg aac atc gtc ctg act gcg ctg aac gcg tta 336
Asp Lys Thr Val Ser Thr Asn Ile Val Leu Thr Ala Leu Asn Ala Leu
100 105 110
ggg gtt gct gca gaa gcg tct ggg cgt aat gat tta gta gtc aag act 384
Gly Val Ala Ala Glu Ala Ser Gly Arg Asn Asp Leu Val Val Lys Thr
115 120 125
gct gag gga gac cgc aag gtt tcg ggt tca gcg tac cgc gaa aca atg 432
Ala Glu Gly Asp Arg Lys Val Ser Gly Ser Ala Tyr Arg Glu Thr Met
130 135 140
gat cgt ggc ttt cat cat ggt act ttg ctg tta aat gcg gat ctt tcc 480
Asp Arg Gly Phe His His Gly Thr Leu Leu Leu Asn Ala Asp Leu Ser
145 150 155 160
cgc ctg gcg aac tac ttg aac ccg gac aag aaa aaa ctt caa gca aag 528
Arg Leu Ala Asn Tyr Leu Asn Pro Asp Lys Lys Lys Leu Gln Ala Lys
165 170 175
ggc atc aca tcg gtc cgc ggc cgt gtc gct aat ctt gtc gag ctg tta 576
Gly Ile Thr Ser Val Arg Gly Arg Val Ala Asn Leu Val Glu Leu Leu
180 185 190
cca ggc atc acc cac cag caa gta tgc gag gcc atc cag gaa gca ttc 624
Pro Gly Ile Thr His Gln Gln Val Cys Glu Ala Ile Gln Glu Ala Phe
195 200 205
ttc gcc cac tat ggc gag cgc gtg gag gca gag gta atc tca cct gaa 672
Phe Ala His Tyr Gly Glu Arg Val Glu Ala Glu Val Ile Ser Pro Glu
210 215 220
aaa atg cct gac ctg cct aat ttc gcc gct acc ttc gct cgc cag tcg 720
Lys Met Pro Asp Leu Pro Asn Phe Ala Ala Thr Phe Ala Arg Gln Ser
225 230 235 240
tcg tgg gaa tgg aac ttc ggc caa gct cca gcc ttt agt cac ctg ctt 768
Ser Trp Glu Trp Asn Phe Gly Gln Ala Pro Ala Phe Ser His Leu Leu
245 250 255
gac gaa cgc ttc aca tgg gga ggc gtt gag ctt cat ttc gac gtg gag 816
Asp Glu Arg Phe Thr Trp Gly Gly Val Glu Leu His Phe Asp Val Glu
260 265 270
aag gga cac atc aca cgc acc cag att ttt acc gac agc ctt aac cca 864
Lys Gly His Ile Thr Arg Thr Gln Ile Phe Thr Asp Ser Leu Asn Pro
275 280 285
gca ccg ctg gag gct ttg gcc gcc cgc tta caa ggt tgc ctt tat cgc 912
Ala Pro Leu Glu Ala Leu Ala Ala Arg Leu Gln Gly Cys Leu Tyr Arg
290 295 300
gcc gac atg ctg caa caa gag tgc gat gct ctg tta gta gac ttt cca 960
Ala Asp Met Leu Gln Gln Glu Cys Asp Ala Leu Leu Val Asp Phe Pro
305 310 315 320
gag cag gag aaa gca tta cgt gag ctg tca gcc tgg att gct ggt gca 1008
Glu Gln Glu Lys Ala Leu Arg Glu Leu Ser Ala Trp Ile Ala Gly Ala
325 330 335
gta cgt taa 1017
Val Arg
<210> 125
<211> 338
<212> PRT
<213> Klebsiella oxytoca
<400> 125
Met Ser Thr Leu Arg Leu Leu Leu Ser Asp Ser Tyr Asp Pro Trp Phe
1 5 10 15
Asn Leu Ala Val Glu Glu Ser Ile Phe Arg Gln Met Pro Ala Thr Gln
20 25 30
Arg Val Leu Phe Leu Trp Arg Asn Ala Asp Thr Val Val Ile Gly Arg
35 40 45
Ala Gln Asn Pro Trp Lys Glu Cys Asn Thr Arg Arg Met Glu Glu Asp
50 55 60
Asn Val Arg Leu Ala Arg Arg Ser Ser Gly Gly Gly Ala Val Phe His
65 70 75 80
Asp Leu Gly Asn Thr Cys Phe Thr Phe Met Ala Gly Lys Pro Glu Tyr
85 90 95
Asp Lys Thr Val Ser Thr Asn Ile Val Leu Thr Ala Leu Asn Ala Leu
100 105 110
Gly Val Ala Ala Glu Ala Ser Gly Arg Asn Asp Leu Val Val Lys Thr
115 120 125
Ala Glu Gly Asp Arg Lys Val Ser Gly Ser Ala Tyr Arg Glu Thr Met
130 135 140
Asp Arg Gly Phe His His Gly Thr Leu Leu Leu Asn Ala Asp Leu Ser
145 150 155 160
Arg Leu Ala Asn Tyr Leu Asn Pro Asp Lys Lys Lys Leu Gln Ala Lys
165 170 175
Gly Ile Thr Ser Val Arg Gly Arg Val Ala Asn Leu Val Glu Leu Leu
180 185 190
Pro Gly Ile Thr His Gln Gln Val Cys Glu Ala Ile Gln Glu Ala Phe
195 200 205
Phe Ala His Tyr Gly Glu Arg Val Glu Ala Glu Val Ile Ser Pro Glu
210 215 220
Lys Met Pro Asp Leu Pro Asn Phe Ala Ala Thr Phe Ala Arg Gln Ser
225 230 235 240
Ser Trp Glu Trp Asn Phe Gly Gln Ala Pro Ala Phe Ser His Leu Leu
245 250 255
Asp Glu Arg Phe Thr Trp Gly Gly Val Glu Leu His Phe Asp Val Glu
260 265 270
Lys Gly His Ile Thr Arg Thr Gln Ile Phe Thr Asp Ser Leu Asn Pro
275 280 285
Ala Pro Leu Glu Ala Leu Ala Ala Arg Leu Gln Gly Cys Leu Tyr Arg
290 295 300
Ala Asp Met Leu Gln Gln Glu Cys Asp Ala Leu Leu Val Asp Phe Pro
305 310 315 320
Glu Gln Glu Lys Ala Leu Arg Glu Leu Ser Ala Trp Ile Ala Gly Ala
325 330 335
Val Arg
<210> 126
<211> 1854
<212> DNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (1)..(1854)
<223> Arabidopsis thaliana gene encoding TMP phosphatase [AT5G32470.1]
<400> 126
atg cgc ttc ctc ttc ccc acg cgc ctc atc aac aac tca tct ctc ggt 48
Met Arg Phe Leu Phe Pro Thr Arg Leu Ile Asn Asn Ser Ser Leu Gly
1 5 10 15
ctc ctc cga tct cca cac acc acc gcg ccg atc cgt tct ctc tgg ttt 96
Leu Leu Arg Ser Pro His Thr Thr Ala Pro Ile Arg Ser Leu Trp Phe
20 25 30
cgc acc aag tct ccg gtc ttc cga tcg gcg act act cca ata atg acg 144
Arg Thr Lys Ser Pro Val Phe Arg Ser Ala Thr Thr Pro Ile Met Thr
35 40 45
gcg gtc gct ttc tct tca tcg ttg tcg att ccc cct acc tcg gaa gaa 192
Ala Val Ala Phe Ser Ser Ser Leu Ser Ile Pro Pro Thr Ser Glu Glu
50 55 60
gca ctt cca ggg aag cta tgg atc aag ttt aac aga gag tgt ctc ttc 240
Ala Leu Pro Gly Lys Leu Trp Ile Lys Phe Asn Arg Glu Cys Leu Phe
65 70 75 80
tct atc tat agc ccc ttc gcc gtc tgt tta gcc gcc gga aat ctc aag 288
Ser Ile Tyr Ser Pro Phe Ala Val Cys Leu Ala Ala Gly Asn Leu Lys
85 90 95
atc gac aca ttt cgt cag tat att gca cag gat gtt cat ttc ctt aag 336
Ile Asp Thr Phe Arg Gln Tyr Ile Ala Gln Asp Val His Phe Leu Lys
100 105 110
gcc ttt gct cac gcg tat gaa ctg gcc gca gat tgt gct gat gac gat 384
Ala Phe Ala His Ala Tyr Glu Leu Ala Ala Asp Cys Ala Asp Asp Asp
115 120 125
gat gat aaa ttg gca att tct gat ttg agg aaa agc gtg atg gaa gaa 432
Asp Asp Lys Leu Ala Ile Ser Asp Leu Arg Lys Ser Val Met Glu Glu
130 135 140
ttg aaa atg cac gac tca ttt gta cag gat tgg gat tta gac atc aac 480
Leu Lys Met His Asp Ser Phe Val Gln Asp Trp Asp Leu Asp Ile Asn
145 150 155 160
aaa gaa gta agt gtt aac tca gca act ttg aga tac act gag ttc ttg 528
Lys Glu Val Ser Val Asn Ser Ala Thr Leu Arg Tyr Thr Glu Phe Leu
165 170 175
tta gct aca gca tcc gga aaa gta gaa gga tgc aaa gct ccc ggc atg 576
Leu Ala Thr Ala Ser Gly Lys Val Glu Gly Cys Lys Ala Pro Gly Met
180 185 190
ctt gat act cca ttt gaa aaa aca aaa gtt gct gcc tac acg ctt ggt 624
Leu Asp Thr Pro Phe Glu Lys Thr Lys Val Ala Ala Tyr Thr Leu Gly
195 200 205
gct gtg aca cct tgc atg cgg ttg tat gcc ttt ctc ggt aag gag ttt 672
Ala Val Thr Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Lys Glu Phe
210 215 220
gga tca ctt ctt gat ctg agt gat gtg aac cat ccc tac aag aaa tgg 720
Gly Ser Leu Leu Asp Leu Ser Asp Val Asn His Pro Tyr Lys Lys Trp
225 230 235 240
atc gat aat tat tct agt gat gct ttc cag gca tca gcc aag caa act 768
Ile Asp Asn Tyr Ser Ser Asp Ala Phe Gln Ala Ser Ala Lys Gln Thr
245 250 255
gaa gac ttg ctt gag aag ctt agt gtc tct atg act ggt gaa gaa ttg 816
Glu Asp Leu Leu Glu Lys Leu Ser Val Ser Met Thr Gly Glu Glu Leu
260 265 270
gac ata att gaa aaa ttg tat caa cag gct atg aaa ctt gaa gta gag 864
Asp Ile Ile Glu Lys Leu Tyr Gln Gln Ala Met Lys Leu Glu Val Glu
275 280 285
ttc ttc cat gcc cag cca ctt gcc cag cct acc ata gtt cca ctg ctc 912
Phe Phe His Ala Gln Pro Leu Ala Gln Pro Thr Ile Val Pro Leu Leu
290 295 300
aag aac cac tca aaa gat gat ctg gtg atc ttt tct gat ttt gat ctg 960
Lys Asn His Ser Lys Asp Asp Leu Val Ile Phe Ser Asp Phe Asp Leu
305 310 315 320
act tgc acc gtt gtg gat tct tct gct att tta gcg gaa ata gca att 1008
Thr Cys Thr Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala Ile
325 330 335
gta act gcc cca aaa gat gaa caa agt cga tct gga caa caa att cat 1056
Val Thr Ala Pro Lys Asp Glu Gln Ser Arg Ser Gly Gln Gln Ile His
340 345 350
cgg atg ctc tca tct gac ctt aag aac acc tgg aat cta ctt tct aaa 1104
Arg Met Leu Ser Ser Asp Leu Lys Asn Thr Trp Asn Leu Leu Ser Lys
355 360 365
caa tac aca gag cat tat gaa gaa tgc ata gag agt att ctg aat aaa 1152
Gln Tyr Thr Glu His Tyr Glu Glu Cys Ile Glu Ser Ile Leu Asn Lys
370 375 380
aag aaa gcg gac aag ttt gac tat gaa ggt tta tgt aaa gca cta gag 1200
Lys Lys Ala Asp Lys Phe Asp Tyr Glu Gly Leu Cys Lys Ala Leu Glu
385 390 395 400
cag ctt tca gat ttt gag aaa gag gca aat aat cga gtg att gag tct 1248
Gln Leu Ser Asp Phe Glu Lys Glu Ala Asn Asn Arg Val Ile Glu Ser
405 410 415
ggt gta ctc aaa ggc ctg aat ctt gaa gac att aag cgc gct ggg gaa 1296
Gly Val Leu Lys Gly Leu Asn Leu Glu Asp Ile Lys Arg Ala Gly Glu
420 425 430
agg tta atc ctt caa gat gga tgc atc aat gtc ttc cag aaa att tta 1344
Arg Leu Ile Leu Gln Asp Gly Cys Ile Asn Val Phe Gln Lys Ile Leu
435 440 445
aag act gag aat ctg aat gca gaa ctt cat gtg ctt tcc tat tgt tgg 1392
Lys Thr Glu Asn Leu Asn Ala Glu Leu His Val Leu Ser Tyr Cys Trp
450 455 460
tgt ggt gac ctc atc agg gca gcc ttt tct gca ggc gga gta gat gca 1440
Cys Gly Asp Leu Ile Arg Ala Ala Phe Ser Ala Gly Gly Val Asp Ala
465 470 475 480
gtg gaa gta cat gca aat gaa ttc aca ttt gag gaa tcc atc tcg act 1488
Val Glu Val His Ala Asn Glu Phe Thr Phe Glu Glu Ser Ile Ser Thr
485 490 495
ggt gag atc gaa aga aag gtg gaa tcc cca att aac aaa gct caa cag 1536
Gly Glu Ile Glu Arg Lys Val Glu Ser Pro Ile Asn Lys Ala Gln Gln
500 505 510
ttc aaa agt atc cta caa aac aga aag aat gag aac aat aag aaa agt 1584
Phe Lys Ser Ile Leu Gln Asn Arg Lys Asn Glu Asn Asn Lys Lys Ser
515 520 525
ttc ttg agt gtg tat att gga gat tcg gta ggt gac ttg ctg tgt ctc 1632
Phe Leu Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu
530 535 540
ctc gaa gca gat ata gga ata gtg gtt agc tct agc tcg agt ctc agg 1680
Leu Glu Ala Asp Ile Gly Ile Val Val Ser Ser Ser Ser Ser Leu Arg
545 550 555 560
aga gtt gga agc cat ttt ggg gtc tca ttt gtg cct ttg ttt tct gga 1728
Arg Val Gly Ser His Phe Gly Val Ser Phe Val Pro Leu Phe Ser Gly
565 570 575
atc gtc cag aaa cag aaa caa cac act gaa gaa tca tca tca tca gca 1776
Ile Val Gln Lys Gln Lys Gln His Thr Glu Glu Ser Ser Ser Ser Ala
580 585 590
tgg aaa gga ctc tct ggc aca ctt tac aca gtt tca agc tgg gcc gaa 1824
Trp Lys Gly Leu Ser Gly Thr Leu Tyr Thr Val Ser Ser Trp Ala Glu
595 600 605
att cat tca ttc gct ctt gga tgg gag taa 1854
Ile His Ser Phe Ala Leu Gly Trp Glu
610 615
<210> 127
<211> 617
<212> PRT
<213> Arabidopsis thaliana
<400> 127
Met Arg Phe Leu Phe Pro Thr Arg Leu Ile Asn Asn Ser Ser Leu Gly
1 5 10 15
Leu Leu Arg Ser Pro His Thr Thr Ala Pro Ile Arg Ser Leu Trp Phe
20 25 30
Arg Thr Lys Ser Pro Val Phe Arg Ser Ala Thr Thr Pro Ile Met Thr
35 40 45
Ala Val Ala Phe Ser Ser Ser Leu Ser Ile Pro Pro Thr Ser Glu Glu
50 55 60
Ala Leu Pro Gly Lys Leu Trp Ile Lys Phe Asn Arg Glu Cys Leu Phe
65 70 75 80
Ser Ile Tyr Ser Pro Phe Ala Val Cys Leu Ala Ala Gly Asn Leu Lys
85 90 95
Ile Asp Thr Phe Arg Gln Tyr Ile Ala Gln Asp Val His Phe Leu Lys
100 105 110
Ala Phe Ala His Ala Tyr Glu Leu Ala Ala Asp Cys Ala Asp Asp Asp
115 120 125
Asp Asp Lys Leu Ala Ile Ser Asp Leu Arg Lys Ser Val Met Glu Glu
130 135 140
Leu Lys Met His Asp Ser Phe Val Gln Asp Trp Asp Leu Asp Ile Asn
145 150 155 160
Lys Glu Val Ser Val Asn Ser Ala Thr Leu Arg Tyr Thr Glu Phe Leu
165 170 175
Leu Ala Thr Ala Ser Gly Lys Val Glu Gly Cys Lys Ala Pro Gly Met
180 185 190
Leu Asp Thr Pro Phe Glu Lys Thr Lys Val Ala Ala Tyr Thr Leu Gly
195 200 205
Ala Val Thr Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Lys Glu Phe
210 215 220
Gly Ser Leu Leu Asp Leu Ser Asp Val Asn His Pro Tyr Lys Lys Trp
225 230 235 240
Ile Asp Asn Tyr Ser Ser Asp Ala Phe Gln Ala Ser Ala Lys Gln Thr
245 250 255
Glu Asp Leu Leu Glu Lys Leu Ser Val Ser Met Thr Gly Glu Glu Leu
260 265 270
Asp Ile Ile Glu Lys Leu Tyr Gln Gln Ala Met Lys Leu Glu Val Glu
275 280 285
Phe Phe His Ala Gln Pro Leu Ala Gln Pro Thr Ile Val Pro Leu Leu
290 295 300
Lys Asn His Ser Lys Asp Asp Leu Val Ile Phe Ser Asp Phe Asp Leu
305 310 315 320
Thr Cys Thr Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala Ile
325 330 335
Val Thr Ala Pro Lys Asp Glu Gln Ser Arg Ser Gly Gln Gln Ile His
340 345 350
Arg Met Leu Ser Ser Asp Leu Lys Asn Thr Trp Asn Leu Leu Ser Lys
355 360 365
Gln Tyr Thr Glu His Tyr Glu Glu Cys Ile Glu Ser Ile Leu Asn Lys
370 375 380
Lys Lys Ala Asp Lys Phe Asp Tyr Glu Gly Leu Cys Lys Ala Leu Glu
385 390 395 400
Gln Leu Ser Asp Phe Glu Lys Glu Ala Asn Asn Arg Val Ile Glu Ser
405 410 415
Gly Val Leu Lys Gly Leu Asn Leu Glu Asp Ile Lys Arg Ala Gly Glu
420 425 430
Arg Leu Ile Leu Gln Asp Gly Cys Ile Asn Val Phe Gln Lys Ile Leu
435 440 445
Lys Thr Glu Asn Leu Asn Ala Glu Leu His Val Leu Ser Tyr Cys Trp
450 455 460
Cys Gly Asp Leu Ile Arg Ala Ala Phe Ser Ala Gly Gly Val Asp Ala
465 470 475 480
Val Glu Val His Ala Asn Glu Phe Thr Phe Glu Glu Ser Ile Ser Thr
485 490 495
Gly Glu Ile Glu Arg Lys Val Glu Ser Pro Ile Asn Lys Ala Gln Gln
500 505 510
Phe Lys Ser Ile Leu Gln Asn Arg Lys Asn Glu Asn Asn Lys Lys Ser
515 520 525
Phe Leu Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu
530 535 540
Leu Glu Ala Asp Ile Gly Ile Val Val Ser Ser Ser Ser Ser Leu Arg
545 550 555 560
Arg Val Gly Ser His Phe Gly Val Ser Phe Val Pro Leu Phe Ser Gly
565 570 575
Ile Val Gln Lys Gln Lys Gln His Thr Glu Glu Ser Ser Ser Ser Ala
580 585 590
Trp Lys Gly Leu Ser Gly Thr Leu Tyr Thr Val Ser Ser Trp Ala Glu
595 600 605
Ile His Ser Phe Ala Leu Gly Trp Glu
610 615
<210> 128
<211> 1806
<212> DNA
<213> Pyrus x bretschneideri
<220>
<221> CDS
<222> (1)..(1806)
<223> Pyrus x bretschneideri gene encoding TMP phosphatase
[XP_009379735.1]
<400> 128
atg cgc ata ctc ttc ccc cca aac cca atc aaa acc cca act ctc ttc 48
Met Arg Ile Leu Phe Pro Pro Asn Pro Ile Lys Thr Pro Thr Leu Phe
1 5 10 15
aac tcc ctc cgt ctg cga ttc aac tcg ctc cga tcc cac tgt gcc aac 96
Asn Ser Leu Arg Leu Arg Phe Asn Ser Leu Arg Ser His Cys Ala Asn
20 25 30
tca atg gcc gta cct ccg ccg aag tca gcc atg gct tcc gcc gtc gtc 144
Ser Met Ala Val Pro Pro Pro Lys Ser Ala Met Ala Ser Ala Val Val
35 40 45
ggc aac gag gtg ggt ctc gcc cgc cgc ttc tgg atc aag ttc aag cga 192
Gly Asn Glu Val Gly Leu Ala Arg Arg Phe Trp Ile Lys Phe Lys Arg
50 55 60
gaa tcg att ttc gct atg tac act ccc ttc acg ctc tgt ttg gct gct 240
Glu Ser Ile Phe Ala Met Tyr Thr Pro Phe Thr Leu Cys Leu Ala Ala
65 70 75 80
ggg aat ctc aag att gaa act ttc cgc gat tat att gcc caa gat gtt 288
Gly Asn Leu Lys Ile Glu Thr Phe Arg Asp Tyr Ile Ala Gln Asp Val
85 90 95
cac ttt ctc aag gcc ttc gct cac gcg tat gaa ttg gca gaa gat tgt 336
His Phe Leu Lys Ala Phe Ala His Ala Tyr Glu Leu Ala Glu Asp Cys
100 105 110
gca gac gat gat gat gca aag ccc gtg att tct gag ttg agg agg gca 384
Ala Asp Asp Asp Asp Ala Lys Pro Val Ile Ser Glu Leu Arg Arg Ala
115 120 125
gtt ctg cag gag ctg aaa atg cat gat tca ttt gtg aag gaa tgg ggg 432
Val Leu Gln Glu Leu Lys Met His Asp Ser Phe Val Lys Glu Trp Gly
130 135 140
tta cag ggt gct aaa gag acc cct atc aac tcc gct gcg gtg aag tac 480
Leu Gln Gly Ala Lys Glu Thr Pro Ile Asn Ser Ala Ala Val Lys Tyr
145 150 155 160
aca gat ttc tta ttg gca aca gcc tct gga aaa gtt gaa gga gtc aag 528
Thr Asp Phe Leu Leu Ala Thr Ala Ser Gly Lys Val Glu Gly Val Lys
165 170 175
gga cct ggt aaa ctt gca act cca ttt gaa aga acc aaa gtg gct gct 576
Gly Pro Gly Lys Leu Ala Thr Pro Phe Glu Arg Thr Lys Val Ala Ala
180 185 190
tac acc ctt ggc gct atg act cct tgc atg aga ctg tat gcc ttt ctt 624
Tyr Thr Leu Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ala Phe Leu
195 200 205
ggt aag gag ttc aag gca ctt cta gat ccc agc gaa ggc agt cac ccg 672
Gly Lys Glu Phe Lys Ala Leu Leu Asp Pro Ser Glu Gly Ser His Pro
210 215 220
tac ttg aag tgg att gac agt tat tct tct aaa agt ttt cag gca tca 720
Tyr Leu Lys Trp Ile Asp Ser Tyr Ser Ser Lys Ser Phe Gln Ala Ser
225 230 235 240
gct gtg caa atc gaa gag ttg ctg gat aaa cta agt gtc tct ttg aca 768
Ala Val Gln Ile Glu Glu Leu Leu Asp Lys Leu Ser Val Ser Leu Thr
245 250 255
ggc gag gag ctt gac atc atc gaa aag ctt tac cac caa gca atg aaa 816
Gly Glu Glu Leu Asp Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys
260 265 270
ctt gag atc gag ttc ttc tct gct cag tct ctt gtt cag cca act gta 864
Leu Glu Ile Glu Phe Phe Ser Ala Gln Ser Leu Val Gln Pro Thr Val
275 280 285
gtt cct ctg atc aga gaa cat aac cct gca gaa gat cgg ctc atg ata 912
Val Pro Leu Ile Arg Glu His Asn Pro Ala Glu Asp Arg Leu Met Ile
290 295 300
ttt tct gat ttt gat ttg act tgt aca gtc gtt gat tca tct gcc att 960
Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser Ala Ile
305 310 315 320
ttg gct gaa att gca ata gta aca gca cca aaa tct gat caa cat caa 1008
Leu Ala Glu Ile Ala Ile Val Thr Ala Pro Lys Ser Asp Gln His Gln
325 330 335
ccc gaa aat cag att gct cgg atg tct tcg gct gat ctc agg aat aca 1056
Pro Glu Asn Gln Ile Ala Arg Met Ser Ser Ala Asp Leu Arg Asn Thr
340 345 350
tgg ggt ctt ctt tcc agg cag tac aca gaa gag tat gag caa tgc ata 1104
Trp Gly Leu Leu Ser Arg Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile
355 360 365
gaa agc att gtt ccc act gaa aaa gca gtg ttt gac tat gaa aat ttg 1152
Glu Ser Ile Val Pro Thr Glu Lys Ala Val Phe Asp Tyr Glu Asn Leu
370 375 380
ctt aaa gca cta gag aaa ctt tca gat ttt gag agg aag gca aac aat 1200
Leu Lys Ala Leu Glu Lys Leu Ser Asp Phe Glu Arg Lys Ala Asn Asn
385 390 395 400
aga gtc acg aag tct gaa gta ctc aag ggt ctt aat ctc gaa gat ata 1248
Arg Val Thr Lys Ser Glu Val Leu Lys Gly Leu Asn Leu Glu Asp Ile
405 410 415
aaa aga gct ggt gaa cgt ctc att ctt caa gat ggc tgt att aat ttc 1296
Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile Asn Phe
420 425 430
ttt cag aaa att gcc aag agt gaa aac ttg aat gca aat gtt cat gtt 1344
Phe Gln Lys Ile Ala Lys Ser Glu Asn Leu Asn Ala Asn Val His Val
435 440 445
ctt tca tac tgt tgg tgt ggt gat ctc ata aga tcg gcc ttt tca tca 1392
Leu Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ser Ala Phe Ser Ser
450 455 460
ggg ggt tta aac gag ctg gat gta cat gca aat gag ttt acc ttc gag 1440
Gly Gly Leu Asn Glu Leu Asp Val His Ala Asn Glu Phe Thr Phe Glu
465 470 475 480
gaa tcc atc tcc aca ggt gat att gtt aag aag gtg gag tcc cct att 1488
Glu Ser Ile Ser Thr Gly Asp Ile Val Lys Lys Val Glu Ser Pro Ile
485 490 495
gac aag gtt aaa tct ttt aaa gat att ttg aaa aat tgc agc aat gac 1536
Asp Lys Val Lys Ser Phe Lys Asp Ile Leu Lys Asn Cys Ser Asn Asp
500 505 510
aga aag aac ttg act gtt tac att gga gac tcg gtg ggt gac tta ctt 1584
Arg Lys Asn Leu Thr Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu
515 520 525
tgt ctg ctg gag gcg gat att gga atc gta att ggg tca agt tca agc 1632
Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Ile Gly Ser Ser Ser Ser
530 535 540
ctt agg aga gtg gcg act cag ttt ggg gta tct ttt gtt ccg ttg ttc 1680
Leu Arg Arg Val Ala Thr Gln Phe Gly Val Ser Phe Val Pro Leu Phe
545 550 555 560
ccg ggt tta gtt aag aaa cag aaa gaa tgc aca gat gga agg tct cct 1728
Pro Gly Leu Val Lys Lys Gln Lys Glu Cys Thr Asp Gly Arg Ser Pro
565 570 575
agt tgg aaa ggg tta act ggt att ctt tac aca gtg aat agt tgg gcg 1776
Ser Trp Lys Gly Leu Thr Gly Ile Leu Tyr Thr Val Asn Ser Trp Ala
580 585 590
gaa ata cat gcc ttc att ttg ggg tgt taa 1806
Glu Ile His Ala Phe Ile Leu Gly Cys
595 600
<210> 129
<211> 601
<212> PRT
<213> Pyrus x bretschneideri
<400> 129
Met Arg Ile Leu Phe Pro Pro Asn Pro Ile Lys Thr Pro Thr Leu Phe
1 5 10 15
Asn Ser Leu Arg Leu Arg Phe Asn Ser Leu Arg Ser His Cys Ala Asn
20 25 30
Ser Met Ala Val Pro Pro Pro Lys Ser Ala Met Ala Ser Ala Val Val
35 40 45
Gly Asn Glu Val Gly Leu Ala Arg Arg Phe Trp Ile Lys Phe Lys Arg
50 55 60
Glu Ser Ile Phe Ala Met Tyr Thr Pro Phe Thr Leu Cys Leu Ala Ala
65 70 75 80
Gly Asn Leu Lys Ile Glu Thr Phe Arg Asp Tyr Ile Ala Gln Asp Val
85 90 95
His Phe Leu Lys Ala Phe Ala His Ala Tyr Glu Leu Ala Glu Asp Cys
100 105 110
Ala Asp Asp Asp Asp Ala Lys Pro Val Ile Ser Glu Leu Arg Arg Ala
115 120 125
Val Leu Gln Glu Leu Lys Met His Asp Ser Phe Val Lys Glu Trp Gly
130 135 140
Leu Gln Gly Ala Lys Glu Thr Pro Ile Asn Ser Ala Ala Val Lys Tyr
145 150 155 160
Thr Asp Phe Leu Leu Ala Thr Ala Ser Gly Lys Val Glu Gly Val Lys
165 170 175
Gly Pro Gly Lys Leu Ala Thr Pro Phe Glu Arg Thr Lys Val Ala Ala
180 185 190
Tyr Thr Leu Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ala Phe Leu
195 200 205
Gly Lys Glu Phe Lys Ala Leu Leu Asp Pro Ser Glu Gly Ser His Pro
210 215 220
Tyr Leu Lys Trp Ile Asp Ser Tyr Ser Ser Lys Ser Phe Gln Ala Ser
225 230 235 240
Ala Val Gln Ile Glu Glu Leu Leu Asp Lys Leu Ser Val Ser Leu Thr
245 250 255
Gly Glu Glu Leu Asp Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys
260 265 270
Leu Glu Ile Glu Phe Phe Ser Ala Gln Ser Leu Val Gln Pro Thr Val
275 280 285
Val Pro Leu Ile Arg Glu His Asn Pro Ala Glu Asp Arg Leu Met Ile
290 295 300
Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser Ala Ile
305 310 315 320
Leu Ala Glu Ile Ala Ile Val Thr Ala Pro Lys Ser Asp Gln His Gln
325 330 335
Pro Glu Asn Gln Ile Ala Arg Met Ser Ser Ala Asp Leu Arg Asn Thr
340 345 350
Trp Gly Leu Leu Ser Arg Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile
355 360 365
Glu Ser Ile Val Pro Thr Glu Lys Ala Val Phe Asp Tyr Glu Asn Leu
370 375 380
Leu Lys Ala Leu Glu Lys Leu Ser Asp Phe Glu Arg Lys Ala Asn Asn
385 390 395 400
Arg Val Thr Lys Ser Glu Val Leu Lys Gly Leu Asn Leu Glu Asp Ile
405 410 415
Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile Asn Phe
420 425 430
Phe Gln Lys Ile Ala Lys Ser Glu Asn Leu Asn Ala Asn Val His Val
435 440 445
Leu Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ser Ala Phe Ser Ser
450 455 460
Gly Gly Leu Asn Glu Leu Asp Val His Ala Asn Glu Phe Thr Phe Glu
465 470 475 480
Glu Ser Ile Ser Thr Gly Asp Ile Val Lys Lys Val Glu Ser Pro Ile
485 490 495
Asp Lys Val Lys Ser Phe Lys Asp Ile Leu Lys Asn Cys Ser Asn Asp
500 505 510
Arg Lys Asn Leu Thr Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu
515 520 525
Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Ile Gly Ser Ser Ser Ser
530 535 540
Leu Arg Arg Val Ala Thr Gln Phe Gly Val Ser Phe Val Pro Leu Phe
545 550 555 560
Pro Gly Leu Val Lys Lys Gln Lys Glu Cys Thr Asp Gly Arg Ser Pro
565 570 575
Ser Trp Lys Gly Leu Thr Gly Ile Leu Tyr Thr Val Asn Ser Trp Ala
580 585 590
Glu Ile His Ala Phe Ile Leu Gly Cys
595 600
<210> 130
<211> 1797
<212> DNA
<213> Brassica napus
<220>
<221> CDS
<222> (1)..(1797)
<223> Brassica napus gene encoding TMP phosphatase [CDY62623.1]
<400> 130
atg cgc atc ctc aac aac tcg ctc gcc ctt ctc cga tcg ccc cgc gcc 48
Met Arg Ile Leu Asn Asn Ser Leu Ala Leu Leu Arg Ser Pro Arg Ala
1 5 10 15
gcc gcc ccg atc cgt tct cta ctg ttc ggc agc aag aag tct tcc gtc 96
Ala Ala Pro Ile Arg Ser Leu Leu Phe Gly Ser Lys Lys Ser Ser Val
20 25 30
tcc cga tcg gcg gcc gcc ttc tct tcg gcg atg tcg att cct cct cct 144
Ser Arg Ser Ala Ala Ala Phe Ser Ser Ala Met Ser Ile Pro Pro Pro
35 40 45
agc ata tcc acc tcg gaa gaa gct ctg gcg ggg agg ctg tgg atc aag 192
Ser Ile Ser Thr Ser Glu Glu Ala Leu Ala Gly Arg Leu Trp Ile Lys
50 55 60
ttc aac aga gag tgc ctc ttc tct atg tac agc ccc ttc gcc gtt tct 240
Phe Asn Arg Glu Cys Leu Phe Ser Met Tyr Ser Pro Phe Ala Val Ser
65 70 75 80
ttg gcc gcc ggc aat ctc aag atc gag acc ttc cgg cag tat att gct 288
Leu Ala Ala Gly Asn Leu Lys Ile Glu Thr Phe Arg Gln Tyr Ile Ala
85 90 95
cag gat gtt cat ttc ctc aag gcc ttt gct cac gcg tat gag ttg gcc 336
Gln Asp Val His Phe Leu Lys Ala Phe Ala His Ala Tyr Glu Leu Ala
100 105 110
gca gag tgt gct gat gat gat gat gat aag ttg gca att tct gac ttg 384
Ala Glu Cys Ala Asp Asp Asp Asp Asp Lys Leu Ala Ile Ser Asp Leu
115 120 125
agg aaa agc gtc atg gat gag ttg aaa atg cac aac tca ttt gta cag 432
Arg Lys Ser Val Met Asp Glu Leu Lys Met His Asn Ser Phe Val Gln
130 135 140
gat tgg gat tta gac atc agc aaa gaa gta agt gtt aac tca gca aca 480
Asp Trp Asp Leu Asp Ile Ser Lys Glu Val Ser Val Asn Ser Ala Thr
145 150 155 160
ttg aga tac acc gag ttc tta tta gct aca tca tcc gga aaa gta gaa 528
Leu Arg Tyr Thr Glu Phe Leu Leu Ala Thr Ser Ser Gly Lys Val Glu
165 170 175
gga ctc aaa gct ccc ggc atg ctt gat act cca ttt gag aaa acc aaa 576
Gly Leu Lys Ala Pro Gly Met Leu Asp Thr Pro Phe Glu Lys Thr Lys
180 185 190
gtg gcc gcc tac acg ctt ggt gct gtg aca cct tgc atg aag ctg tat 624
Val Ala Ala Tyr Thr Leu Gly Ala Val Thr Pro Cys Met Lys Leu Tyr
195 200 205
gcc ttt ctt ggt aag gag ttt gga gcg ctt cta gat tcg agt gaa gcg 672
Ala Phe Leu Gly Lys Glu Phe Gly Ala Leu Leu Asp Ser Ser Glu Ala
210 215 220
aac cat ccc tac aag aaa tgg atc gaa aat tat tct agt gat gca ttc 720
Asn His Pro Tyr Lys Lys Trp Ile Glu Asn Tyr Ser Ser Asp Ala Phe
225 230 235 240
cag gca tca gct aag caa act gaa gac ttg ctt gag aag ctt agt gtg 768
Gln Ala Ser Ala Lys Gln Thr Glu Asp Leu Leu Glu Lys Leu Ser Val
245 250 255
tgt atg act ggc gaa gag ctg gac atc att gaa aaa ctg tat caa cag 816
Cys Met Thr Gly Glu Glu Leu Asp Ile Ile Glu Lys Leu Tyr Gln Gln
260 265 270
gca atg aaa ctt gaa gta gag ttc ttc cac gca caa ccg ttt gct cag 864
Ala Met Lys Leu Glu Val Glu Phe Phe His Ala Gln Pro Phe Ala Gln
275 280 285
cct acc ata gtt ccg ctg ctg aag aac cat tca aaa gat gag ctg atg 912
Pro Thr Ile Val Pro Leu Leu Lys Asn His Ser Lys Asp Glu Leu Met
290 295 300
ata ttt tct gat ttt gat ctg act tgc acc gtt gtt gat tct tct gct 960
Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser Ala
305 310 315 320
att tta gcc gaa att gca atc gta act gcc ccg aaa gat gat cag ggt 1008
Ile Leu Ala Glu Ile Ala Ile Val Thr Ala Pro Lys Asp Asp Gln Gly
325 330 335
caa caa att aat cgg atg ctt tcg gct gac ctt aag aac acc tgg agt 1056
Gln Gln Ile Asn Arg Met Leu Ser Ala Asp Leu Lys Asn Thr Trp Ser
340 345 350
cta ctt tcc aaa cag tat aca gag cac tat gaa gag tgc ata gag agt 1104
Leu Leu Ser Lys Gln Tyr Thr Glu His Tyr Glu Glu Cys Ile Glu Ser
355 360 365
att ctg aat aag gaa aaa gcg gac aag ttt gac tac gag ggt ttg tgt 1152
Ile Leu Asn Lys Glu Lys Ala Asp Lys Phe Asp Tyr Glu Gly Leu Cys
370 375 380
gaa gca cta gag cag ctg tca gag ttt gag aag aaa gca aac gac cga 1200
Glu Ala Leu Glu Gln Leu Ser Glu Phe Glu Lys Lys Ala Asn Asp Arg
385 390 395 400
gtg ata gag tct ggt gta ctc aag ggc ctg aat ctc gat gac atc aag 1248
Val Ile Glu Ser Gly Val Leu Lys Gly Leu Asn Leu Asp Asp Ile Lys
405 410 415
cga gct ggg gaa agg ttg att ctt caa gat ggc tgc atc aat gtc ttc 1296
Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile Asn Val Phe
420 425 430
cag aaa att ttg aag act cag gat gtg aat gca aaa ctc cac gtg ctt 1344
Gln Lys Ile Leu Lys Thr Gln Asp Val Asn Ala Lys Leu His Val Leu
435 440 445
tcg tat tgt tgg tgt ggt gac ctc atc aga gca gcc ttt tct gca cgg 1392
Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ala Ala Phe Ser Ala Arg
450 455 460
gga gta gat gca gtg gaa gta cat gca aat gaa ttc aca ttc gag gaa 1440
Gly Val Asp Ala Val Glu Val His Ala Asn Glu Phe Thr Phe Glu Glu
465 470 475 480
tcc atc tct act gga gaa ata gaa aga aaa gtg gaa tcc cca atc gac 1488
Ser Ile Ser Thr Gly Glu Ile Glu Arg Lys Val Glu Ser Pro Ile Asp
485 490 495
aag gct caa cag ttc aag agc atc cta caa aac aga aag aag gat gag 1536
Lys Ala Gln Gln Phe Lys Ser Ile Leu Gln Asn Arg Lys Lys Asp Glu
500 505 510
gag aaa agc atc ctc act gtt tac att gga gat tca gta ggt gac ttg 1584
Glu Lys Ser Ile Leu Thr Val Tyr Ile Gly Asp Ser Val Gly Asp Leu
515 520 525
ctc tgt ctc ctg gag gca gac att gga ata gtg gtc gcc tct agc tcg 1632
Leu Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Val Ala Ser Ser Ser
530 535 540
agc ctc agg aga gtg gga agc cat ttc ggg gtc tca ttt gtg cct ttg 1680
Ser Leu Arg Arg Val Gly Ser His Phe Gly Val Ser Phe Val Pro Leu
545 550 555 560
ttc tct gga att gtg caa aaa cag aaa caa gaa gaa acc tgg aag ggg 1728
Phe Ser Gly Ile Val Gln Lys Gln Lys Gln Glu Glu Thr Trp Lys Gly
565 570 575
ctc tct ggc aca ctt tac acg gta tca agc tgg gct gaa ata cat tcc 1776
Leu Ser Gly Thr Leu Tyr Thr Val Ser Ser Trp Ala Glu Ile His Ser
580 585 590
ttc gct ctt gga tgg gag taa 1797
Phe Ala Leu Gly Trp Glu
595
<210> 131
<211> 598
<212> PRT
<213> Brassica napus
<400> 131
Met Arg Ile Leu Asn Asn Ser Leu Ala Leu Leu Arg Ser Pro Arg Ala
1 5 10 15
Ala Ala Pro Ile Arg Ser Leu Leu Phe Gly Ser Lys Lys Ser Ser Val
20 25 30
Ser Arg Ser Ala Ala Ala Phe Ser Ser Ala Met Ser Ile Pro Pro Pro
35 40 45
Ser Ile Ser Thr Ser Glu Glu Ala Leu Ala Gly Arg Leu Trp Ile Lys
50 55 60
Phe Asn Arg Glu Cys Leu Phe Ser Met Tyr Ser Pro Phe Ala Val Ser
65 70 75 80
Leu Ala Ala Gly Asn Leu Lys Ile Glu Thr Phe Arg Gln Tyr Ile Ala
85 90 95
Gln Asp Val His Phe Leu Lys Ala Phe Ala His Ala Tyr Glu Leu Ala
100 105 110
Ala Glu Cys Ala Asp Asp Asp Asp Asp Lys Leu Ala Ile Ser Asp Leu
115 120 125
Arg Lys Ser Val Met Asp Glu Leu Lys Met His Asn Ser Phe Val Gln
130 135 140
Asp Trp Asp Leu Asp Ile Ser Lys Glu Val Ser Val Asn Ser Ala Thr
145 150 155 160
Leu Arg Tyr Thr Glu Phe Leu Leu Ala Thr Ser Ser Gly Lys Val Glu
165 170 175
Gly Leu Lys Ala Pro Gly Met Leu Asp Thr Pro Phe Glu Lys Thr Lys
180 185 190
Val Ala Ala Tyr Thr Leu Gly Ala Val Thr Pro Cys Met Lys Leu Tyr
195 200 205
Ala Phe Leu Gly Lys Glu Phe Gly Ala Leu Leu Asp Ser Ser Glu Ala
210 215 220
Asn His Pro Tyr Lys Lys Trp Ile Glu Asn Tyr Ser Ser Asp Ala Phe
225 230 235 240
Gln Ala Ser Ala Lys Gln Thr Glu Asp Leu Leu Glu Lys Leu Ser Val
245 250 255
Cys Met Thr Gly Glu Glu Leu Asp Ile Ile Glu Lys Leu Tyr Gln Gln
260 265 270
Ala Met Lys Leu Glu Val Glu Phe Phe His Ala Gln Pro Phe Ala Gln
275 280 285
Pro Thr Ile Val Pro Leu Leu Lys Asn His Ser Lys Asp Glu Leu Met
290 295 300
Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser Ala
305 310 315 320
Ile Leu Ala Glu Ile Ala Ile Val Thr Ala Pro Lys Asp Asp Gln Gly
325 330 335
Gln Gln Ile Asn Arg Met Leu Ser Ala Asp Leu Lys Asn Thr Trp Ser
340 345 350
Leu Leu Ser Lys Gln Tyr Thr Glu His Tyr Glu Glu Cys Ile Glu Ser
355 360 365
Ile Leu Asn Lys Glu Lys Ala Asp Lys Phe Asp Tyr Glu Gly Leu Cys
370 375 380
Glu Ala Leu Glu Gln Leu Ser Glu Phe Glu Lys Lys Ala Asn Asp Arg
385 390 395 400
Val Ile Glu Ser Gly Val Leu Lys Gly Leu Asn Leu Asp Asp Ile Lys
405 410 415
Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile Asn Val Phe
420 425 430
Gln Lys Ile Leu Lys Thr Gln Asp Val Asn Ala Lys Leu His Val Leu
435 440 445
Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ala Ala Phe Ser Ala Arg
450 455 460
Gly Val Asp Ala Val Glu Val His Ala Asn Glu Phe Thr Phe Glu Glu
465 470 475 480
Ser Ile Ser Thr Gly Glu Ile Glu Arg Lys Val Glu Ser Pro Ile Asp
485 490 495
Lys Ala Gln Gln Phe Lys Ser Ile Leu Gln Asn Arg Lys Lys Asp Glu
500 505 510
Glu Lys Ser Ile Leu Thr Val Tyr Ile Gly Asp Ser Val Gly Asp Leu
515 520 525
Leu Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Val Ala Ser Ser Ser
530 535 540
Ser Leu Arg Arg Val Gly Ser His Phe Gly Val Ser Phe Val Pro Leu
545 550 555 560
Phe Ser Gly Ile Val Gln Lys Gln Lys Gln Glu Glu Thr Trp Lys Gly
565 570 575
Leu Ser Gly Thr Leu Tyr Thr Val Ser Ser Trp Ala Glu Ile His Ser
580 585 590
Phe Ala Leu Gly Trp Glu
595
<210> 132
<211> 1815
<212> DNA
<213> Glycine max
<220>
<221> CDS
<222> (1)..(1815)
<223> Glycine max gene encoding TMP phosphatase [XP_003536133.1]
<400> 132
atg cgc atg cgg tgg ttc ctc cga agc cca atc atc aaa acc tcg ctg 48
Met Arg Met Arg Trp Phe Leu Arg Ser Pro Ile Ile Lys Thr Ser Leu
1 5 10 15
ctg aat ctg agc cct cca att tcg ttt aga cct cac tgg gcg agg agg 96
Leu Asn Leu Ser Pro Pro Ile Ser Phe Arg Pro His Trp Ala Arg Arg
20 25 30
acc ttc act tct tcg aga ttg tca atg gcg gcc atc cac aac cac agc 144
Thr Phe Thr Ser Ser Arg Leu Ser Met Ala Ala Ile His Asn His Ser
35 40 45
aac agc aac agc gaa acc gga ctc gcg aga cgg ttt tgg atc aag ttc 192
Asn Ser Asn Ser Glu Thr Gly Leu Ala Arg Arg Phe Trp Ile Lys Phe
50 55 60
act cgt gaa tcc atc ttc gcc atg tac act ccc ttc gcc atc gcc ttg 240
Thr Arg Glu Ser Ile Phe Ala Met Tyr Thr Pro Phe Ala Ile Ala Leu
65 70 75 80
gcc tcc ggt aat ttg cac att gat tcc ttc cac cat tac atc gcc caa 288
Ala Ser Gly Asn Leu His Ile Asp Ser Phe His His Tyr Ile Ala Gln
85 90 95
gac gtt cat ttc cta cgc gcc ttt gct caa gcg tat gag ttg gct gaa 336
Asp Val His Phe Leu Arg Ala Phe Ala Gln Ala Tyr Glu Leu Ala Glu
100 105 110
gag tgt gct gat gac gac gat gcg aaa ctt gga atc tgt gag ttg agg 384
Glu Cys Ala Asp Asp Asp Asp Ala Lys Leu Gly Ile Cys Glu Leu Arg
115 120 125
aag gca gtt cta gag gag ctg aag atg cac aac ttg ctg gta cag gaa 432
Lys Ala Val Leu Glu Glu Leu Lys Met His Asn Leu Leu Val Gln Glu
130 135 140
cgg gag ttg gac ctt gcc aaa gag cat ggt att aat tct gca act gtt 480
Arg Glu Leu Asp Leu Ala Lys Glu His Gly Ile Asn Ser Ala Thr Val
145 150 155 160
aag tac aca gag ttc ctg ctg gct aca gcc tct ggg aag att gaa gga 528
Lys Tyr Thr Glu Phe Leu Leu Ala Thr Ala Ser Gly Lys Ile Glu Gly
165 170 175
cta aaa ggt cct ggt aaa ctt gct aca cca ttt gag aaa aca aaa att 576
Leu Lys Gly Pro Gly Lys Leu Ala Thr Pro Phe Glu Lys Thr Lys Ile
180 185 190
gct gct tat act tta ggt gcc atg act cct tgc atg agg ctt tat gcc 624
Ala Ala Tyr Thr Leu Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ala
195 200 205
gtt atg gga aag aag ttc cag gaa ctt ttg gat tcc aat gaa agt act 672
Val Met Gly Lys Lys Phe Gln Glu Leu Leu Asp Ser Asn Glu Ser Thr
210 215 220
cac cca tat aac aag tgg atc aac aac tat tcc tct gat ggt ttc cag 720
His Pro Tyr Asn Lys Trp Ile Asn Asn Tyr Ser Ser Asp Gly Phe Gln
225 230 235 240
gct act act ctg caa act gaa gat ttg ctc gac aaa cta agt gtc tct 768
Ala Thr Thr Leu Gln Thr Glu Asp Leu Leu Asp Lys Leu Ser Val Ser
245 250 255
ttg act ggt gaa gaa ctt gat gtc att gaa aag ctt tat tac caa gca 816
Leu Thr Gly Glu Glu Leu Asp Val Ile Glu Lys Leu Tyr Tyr Gln Ala
260 265 270
atg aag ctt gaa ata gag ttc ttc tct gct cag cca ctc ttc cag cca 864
Met Lys Leu Glu Ile Glu Phe Phe Ser Ala Gln Pro Leu Phe Gln Pro
275 280 285
act ata gta ccc ttg act aaa gga cat aag cct gtg gaa gat cat ctc 912
Thr Ile Val Pro Leu Thr Lys Gly His Lys Pro Val Glu Asp His Leu
290 295 300
att att ttt tct gat ttt gat tta aca tgc acc gta gtt gat tcg tcc 960
Ile Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser
305 310 315 320
gcc atc ttg gct gaa att gcc ata gtg acg gca cca aaa tct gat cag 1008
Ala Ile Leu Ala Glu Ile Ala Ile Val Thr Ala Pro Lys Ser Asp Gln
325 330 335
aat cag cct gaa gat caa att gtt cgg atg tta tct tct gac ctc agg 1056
Asn Gln Pro Glu Asp Gln Ile Val Arg Met Leu Ser Ser Asp Leu Arg
340 345 350
aat aca tgg ggt ttt cta tct aaa cag tat acg gag gag tat gag caa 1104
Asn Thr Trp Gly Phe Leu Ser Lys Gln Tyr Thr Glu Glu Tyr Glu Gln
355 360 365
tgt ata gaa agc att atg cct ccc gat aga ttg aac aat ttc gat tac 1152
Cys Ile Glu Ser Ile Met Pro Pro Asp Arg Leu Asn Asn Phe Asp Tyr
370 375 380
aaa gaa ttg tcg atg gcc ctt gag caa ctt tca aaa ttt gag aac act 1200
Lys Glu Leu Ser Met Ala Leu Glu Gln Leu Ser Lys Phe Glu Asn Thr
385 390 395 400
gca aat aat agg gtt atc gag tca ggg gta ctc aag ggt ata agt cta 1248
Ala Asn Asn Arg Val Ile Glu Ser Gly Val Leu Lys Gly Ile Ser Leu
405 410 415
gaa gat ata aag cgt gct gga gag cgt ctg ata cta caa gat ggt tgc 1296
Glu Asp Ile Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys
420 425 430
cct aac ttc ttt cag agc att gtt aag aat gaa aat ttg aat gcc aac 1344
Pro Asn Phe Phe Gln Ser Ile Val Lys Asn Glu Asn Leu Asn Ala Asn
435 440 445
gtg cat gtt ctt tca tac tgc tgg tgt ggt gac ctc att agg tct act 1392
Val His Val Leu Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ser Thr
450 455 460
ttc tct tcc gct gat tta aat gag ttg aat gtt cat gct aat gag ttc 1440
Phe Ser Ser Ala Asp Leu Asn Glu Leu Asn Val His Ala Asn Glu Phe
465 470 475 480
act tat gag gga tct gtt tcc acg ggt gaa att gtt aag aaa gtg gag 1488
Thr Tyr Glu Gly Ser Val Ser Thr Gly Glu Ile Val Lys Lys Val Glu
485 490 495
tct ccc att gac aag gtt gaa gct ttt cgt aac ata ttg aaa aat tgc 1536
Ser Pro Ile Asp Lys Val Glu Ala Phe Arg Asn Ile Leu Lys Asn Cys
500 505 510
aat gat gac aaa aag aaa tta act gtt tac att ggc gat tca gtg ggt 1584
Asn Asp Asp Lys Lys Lys Leu Thr Val Tyr Ile Gly Asp Ser Val Gly
515 520 525
gat tta ctt tgc cta ctt gaa gct gat gta gga att gtg att ggt tca 1632
Asp Leu Leu Cys Leu Leu Glu Ala Asp Val Gly Ile Val Ile Gly Ser
530 535 540
agt tca agc ctt aga agt gta ggg acg cag ttt ggt att tca ttt gtc 1680
Ser Ser Ser Leu Arg Ser Val Gly Thr Gln Phe Gly Ile Ser Phe Val
545 550 555 560
cca ttg tat tct ggc ttg gtt aag aaa cag aaa gaa tat gtt gaa gga 1728
Pro Leu Tyr Ser Gly Leu Val Lys Lys Gln Lys Glu Tyr Val Glu Gly
565 570 575
agc act tct gat tgg aag ggt tta tct ggc att ctt tac aca gtc tct 1776
Ser Thr Ser Asp Trp Lys Gly Leu Ser Gly Ile Leu Tyr Thr Val Ser
580 585 590
agt tgg gct gaa gtg cat gct ttt att ttg ggt tgc tag 1815
Ser Trp Ala Glu Val His Ala Phe Ile Leu Gly Cys
595 600
<210> 133
<211> 604
<212> PRT
<213> Glycine max
<400> 133
Met Arg Met Arg Trp Phe Leu Arg Ser Pro Ile Ile Lys Thr Ser Leu
1 5 10 15
Leu Asn Leu Ser Pro Pro Ile Ser Phe Arg Pro His Trp Ala Arg Arg
20 25 30
Thr Phe Thr Ser Ser Arg Leu Ser Met Ala Ala Ile His Asn His Ser
35 40 45
Asn Ser Asn Ser Glu Thr Gly Leu Ala Arg Arg Phe Trp Ile Lys Phe
50 55 60
Thr Arg Glu Ser Ile Phe Ala Met Tyr Thr Pro Phe Ala Ile Ala Leu
65 70 75 80
Ala Ser Gly Asn Leu His Ile Asp Ser Phe His His Tyr Ile Ala Gln
85 90 95
Asp Val His Phe Leu Arg Ala Phe Ala Gln Ala Tyr Glu Leu Ala Glu
100 105 110
Glu Cys Ala Asp Asp Asp Asp Ala Lys Leu Gly Ile Cys Glu Leu Arg
115 120 125
Lys Ala Val Leu Glu Glu Leu Lys Met His Asn Leu Leu Val Gln Glu
130 135 140
Arg Glu Leu Asp Leu Ala Lys Glu His Gly Ile Asn Ser Ala Thr Val
145 150 155 160
Lys Tyr Thr Glu Phe Leu Leu Ala Thr Ala Ser Gly Lys Ile Glu Gly
165 170 175
Leu Lys Gly Pro Gly Lys Leu Ala Thr Pro Phe Glu Lys Thr Lys Ile
180 185 190
Ala Ala Tyr Thr Leu Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ala
195 200 205
Val Met Gly Lys Lys Phe Gln Glu Leu Leu Asp Ser Asn Glu Ser Thr
210 215 220
His Pro Tyr Asn Lys Trp Ile Asn Asn Tyr Ser Ser Asp Gly Phe Gln
225 230 235 240
Ala Thr Thr Leu Gln Thr Glu Asp Leu Leu Asp Lys Leu Ser Val Ser
245 250 255
Leu Thr Gly Glu Glu Leu Asp Val Ile Glu Lys Leu Tyr Tyr Gln Ala
260 265 270
Met Lys Leu Glu Ile Glu Phe Phe Ser Ala Gln Pro Leu Phe Gln Pro
275 280 285
Thr Ile Val Pro Leu Thr Lys Gly His Lys Pro Val Glu Asp His Leu
290 295 300
Ile Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser
305 310 315 320
Ala Ile Leu Ala Glu Ile Ala Ile Val Thr Ala Pro Lys Ser Asp Gln
325 330 335
Asn Gln Pro Glu Asp Gln Ile Val Arg Met Leu Ser Ser Asp Leu Arg
340 345 350
Asn Thr Trp Gly Phe Leu Ser Lys Gln Tyr Thr Glu Glu Tyr Glu Gln
355 360 365
Cys Ile Glu Ser Ile Met Pro Pro Asp Arg Leu Asn Asn Phe Asp Tyr
370 375 380
Lys Glu Leu Ser Met Ala Leu Glu Gln Leu Ser Lys Phe Glu Asn Thr
385 390 395 400
Ala Asn Asn Arg Val Ile Glu Ser Gly Val Leu Lys Gly Ile Ser Leu
405 410 415
Glu Asp Ile Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys
420 425 430
Pro Asn Phe Phe Gln Ser Ile Val Lys Asn Glu Asn Leu Asn Ala Asn
435 440 445
Val His Val Leu Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ser Thr
450 455 460
Phe Ser Ser Ala Asp Leu Asn Glu Leu Asn Val His Ala Asn Glu Phe
465 470 475 480
Thr Tyr Glu Gly Ser Val Ser Thr Gly Glu Ile Val Lys Lys Val Glu
485 490 495
Ser Pro Ile Asp Lys Val Glu Ala Phe Arg Asn Ile Leu Lys Asn Cys
500 505 510
Asn Asp Asp Lys Lys Lys Leu Thr Val Tyr Ile Gly Asp Ser Val Gly
515 520 525
Asp Leu Leu Cys Leu Leu Glu Ala Asp Val Gly Ile Val Ile Gly Ser
530 535 540
Ser Ser Ser Leu Arg Ser Val Gly Thr Gln Phe Gly Ile Ser Phe Val
545 550 555 560
Pro Leu Tyr Ser Gly Leu Val Lys Lys Gln Lys Glu Tyr Val Glu Gly
565 570 575
Ser Thr Ser Asp Trp Lys Gly Leu Ser Gly Ile Leu Tyr Thr Val Ser
580 585 590
Ser Trp Ala Glu Val His Ala Phe Ile Leu Gly Cys
595 600
<210> 134
<211> 1845
<212> DNA
<213> Nicotiana tomentosiformis
<220>
<221> CDS
<222> (1)..(1845)
<223> Nicotiana tomentosiformi gene encoding TMP phosphatase
[XP_009615535.1]
<400> 134
atg cgc ttc tca tta tta tcg ccc ctt gtt ctt aac cca gtc atc aga 48
Met Arg Phe Ser Leu Leu Ser Pro Leu Val Leu Asn Pro Val Ile Arg
1 5 10 15
ttc tcc aat tcc aac gcg ctt ttt ggg tta cga ttc caa tta tac cct 96
Phe Ser Asn Ser Asn Ala Leu Phe Gly Leu Arg Phe Gln Leu Tyr Pro
20 25 30
cgt tac tct cgg tat tta cga tcg ccc gtt aca atg gcg tcg gcg aaa 144
Arg Tyr Ser Arg Tyr Leu Arg Ser Pro Val Thr Met Ala Ser Ala Lys
35 40 45
cca aag ccg gcg gcg gcg gtg aac aag ttt ccg gta gag gag gaa tgt 192
Pro Lys Pro Ala Ala Ala Val Asn Lys Phe Pro Val Glu Glu Glu Cys
50 55 60
gtg ggt ata gcg agg aag tgt tgg atc aag ttc aag aga gag tct act 240
Val Gly Ile Ala Arg Lys Cys Trp Ile Lys Phe Lys Arg Glu Ser Thr
65 70 75 80
ttc gct ctg tac act ccg ttt gtg gtt agt ttg gca tca gga acc cta 288
Phe Ala Leu Tyr Thr Pro Phe Val Val Ser Leu Ala Ser Gly Thr Leu
85 90 95
aat ctg gac act ttc cgc cat tac att gct cag gat gtt cac ttc ctc 336
Asn Leu Asp Thr Phe Arg His Tyr Ile Ala Gln Asp Val His Phe Leu
100 105 110
aaa tcc ttc gct caa gcg tat gaa gct gca gaa gag tgt act gac gat 384
Lys Ser Phe Ala Gln Ala Tyr Glu Ala Ala Glu Glu Cys Thr Asp Asp
115 120 125
gac gat gcg aag gtt ggc att agt gag ttg cgg aag aat gtt att gaa 432
Asp Asp Ala Lys Val Gly Ile Ser Glu Leu Arg Lys Asn Val Ile Glu
130 135 140
gaa ctt aaa atg cat gat gca gtt tta aaa gag tgg ggc att gat ctg 480
Glu Leu Lys Met His Asp Ala Val Leu Lys Glu Trp Gly Ile Asp Leu
145 150 155 160
gtc aaa gag tcc agt ctt aac cct gca acg gcc aag tac aca gat ttt 528
Val Lys Glu Ser Ser Leu Asn Pro Ala Thr Ala Lys Tyr Thr Asp Phe
165 170 175
tta tca gct aca gct tca gga aag gtg gaa gga gta aaa gct gct aaa 576
Leu Ser Ala Thr Ala Ser Gly Lys Val Glu Gly Val Lys Ala Ala Lys
180 185 190
ctt gcc aca cca ttt gag aga acg aag ttg gca gct tat act cta ggt 624
Leu Ala Thr Pro Phe Glu Arg Thr Lys Leu Ala Ala Tyr Thr Leu Gly
195 200 205
gct atg act cct tgc atg agg ctt tac gcc tac att ggt aag gag ctg 672
Ala Met Thr Pro Cys Met Arg Leu Tyr Ala Tyr Ile Gly Lys Glu Leu
210 215 220
caa gtg ttc ctc gag gga gag aaa att cat cca tac aag aag tgg att 720
Gln Val Phe Leu Glu Gly Glu Lys Ile His Pro Tyr Lys Lys Trp Ile
225 230 235 240
gac agt tat gcc tct gaa agt ttc cag gca tca gct ctt caa acc gag 768
Asp Ser Tyr Ala Ser Glu Ser Phe Gln Ala Ser Ala Leu Gln Thr Glu
245 250 255
gac ttg ttg gat aaa ctg agt gtc cct ttg aca ggc gag gag ctt gac 816
Asp Leu Leu Asp Lys Leu Ser Val Pro Leu Thr Gly Glu Glu Leu Asp
260 265 270
atc att gaa aag ctt tat cat caa gca atg aaa ctt gaa att gat ttc 864
Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys Leu Glu Ile Asp Phe
275 280 285
ttc tta acc cag cca ctt gtt cag aaa gct gtc atc cct ttg tca aaa 912
Phe Leu Thr Gln Pro Leu Val Gln Lys Ala Val Ile Pro Leu Ser Lys
290 295 300
gat cac aac cct gct gaa cac cgg ctt aca ata ttt tct gat ttc gat 960
Asp His Asn Pro Ala Glu His Arg Leu Thr Ile Phe Ser Asp Phe Asp
305 310 315 320
ttg acg tgc act gtt gtt gat tct tct gcc atc ttg gct gaa att gca 1008
Leu Thr Cys Thr Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala
325 330 335
att ata aca gca ccg aga tct gat caa aat cga cca gag aat caa att 1056
Ile Ile Thr Ala Pro Arg Ser Asp Gln Asn Arg Pro Glu Asn Gln Ile
340 345 350
gcg cgg atg ttg tcg gct gat ttg agg aat aca tgg gga gat ctc tct 1104
Ala Arg Met Leu Ser Ala Asp Leu Arg Asn Thr Trp Gly Asp Leu Ser
355 360 365
aag cag tac act gaa gag tat gag caa tgt ata gag aag atg tta ctt 1152
Lys Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile Glu Lys Met Leu Leu
370 375 380
act gaa aaa gcg gaa aaa ttt gat tat gaa aga ctg cat aaa aca ctt 1200
Thr Glu Lys Ala Glu Lys Phe Asp Tyr Glu Arg Leu His Lys Thr Leu
385 390 395 400
gag gaa ctt tct gat ttt gag aaa aga gca aat act agg gtg act gaa 1248
Glu Glu Leu Ser Asp Phe Glu Lys Arg Ala Asn Thr Arg Val Thr Glu
405 410 415
tct ggg gta ctg aaa ggt tta aac ctt gaa gac ata aaa cga gct ggg 1296
Ser Gly Val Leu Lys Gly Leu Asn Leu Glu Asp Ile Lys Arg Ala Gly
420 425 430
cag cga ttg att ctc cag gat ggt tgc acc aac ttc ttc cag agc ata 1344
Gln Arg Leu Ile Leu Gln Asp Gly Cys Thr Asn Phe Phe Gln Ser Ile
435 440 445
ata aga aat gaa aat ctg aac gca gac att cat gtc ctc tcc tat tgc 1392
Ile Arg Asn Glu Asn Leu Asn Ala Asp Ile His Val Leu Ser Tyr Cys
450 455 460
tgg tgt ggc gac ctt att agg tct tcc ttt tca tca ggg ggt ata gac 1440
Trp Cys Gly Asp Leu Ile Arg Ser Ser Phe Ser Ser Gly Gly Ile Asp
465 470 475 480
gct ctg aat gtg cat gcc aat gag ttt atg ttt caa gaa tct cta tcc 1488
Ala Leu Asn Val His Ala Asn Glu Phe Met Phe Gln Glu Ser Leu Ser
485 490 495
act ggt gaa att gtt aag aaa gtt gaa tcc ccc att gac aag gtt caa 1536
Thr Gly Glu Ile Val Lys Lys Val Glu Ser Pro Ile Asp Lys Val Gln
500 505 510
gca ttc agt aaa att cga atg aac tgt ggc aat gac caa aaa aat ctg 1584
Ala Phe Ser Lys Ile Arg Met Asn Cys Gly Asn Asp Gln Lys Asn Leu
515 520 525
act ctt tat att ggg gat tca gtc ggc gac tta ctt tgc ttg ctt gaa 1632
Thr Leu Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu Leu Glu
530 535 540
gca gat gtt ggc ata gtg ctt ggt acg agc tca agt cta agg acg gtg 1680
Ala Asp Val Gly Ile Val Leu Gly Thr Ser Ser Ser Leu Arg Thr Val
545 550 555 560
ggg aat cat ttt ggt gtt tct ttt gtt cct ctg ttt cca ggt gtt gtc 1728
Gly Asn His Phe Gly Val Ser Phe Val Pro Leu Phe Pro Gly Val Val
565 570 575
cag aaa cag aag atg tgc act ggg gta gac tcg tca agt tgt tgg aag 1776
Gln Lys Gln Lys Met Cys Thr Gly Val Asp Ser Ser Ser Cys Trp Lys
580 585 590
gga cta tct ggt gtt ctc tat act gcc tct agc tgg gct gag ata cat 1824
Gly Leu Ser Gly Val Leu Tyr Thr Ala Ser Ser Trp Ala Glu Ile His
595 600 605
gct ttt gta ttg ggg tca tga 1845
Ala Phe Val Leu Gly Ser
610
<210> 135
<211> 614
<212> PRT
<213> Nicotiana tomentosiformis
<400> 135
Met Arg Phe Ser Leu Leu Ser Pro Leu Val Leu Asn Pro Val Ile Arg
1 5 10 15
Phe Ser Asn Ser Asn Ala Leu Phe Gly Leu Arg Phe Gln Leu Tyr Pro
20 25 30
Arg Tyr Ser Arg Tyr Leu Arg Ser Pro Val Thr Met Ala Ser Ala Lys
35 40 45
Pro Lys Pro Ala Ala Ala Val Asn Lys Phe Pro Val Glu Glu Glu Cys
50 55 60
Val Gly Ile Ala Arg Lys Cys Trp Ile Lys Phe Lys Arg Glu Ser Thr
65 70 75 80
Phe Ala Leu Tyr Thr Pro Phe Val Val Ser Leu Ala Ser Gly Thr Leu
85 90 95
Asn Leu Asp Thr Phe Arg His Tyr Ile Ala Gln Asp Val His Phe Leu
100 105 110
Lys Ser Phe Ala Gln Ala Tyr Glu Ala Ala Glu Glu Cys Thr Asp Asp
115 120 125
Asp Asp Ala Lys Val Gly Ile Ser Glu Leu Arg Lys Asn Val Ile Glu
130 135 140
Glu Leu Lys Met His Asp Ala Val Leu Lys Glu Trp Gly Ile Asp Leu
145 150 155 160
Val Lys Glu Ser Ser Leu Asn Pro Ala Thr Ala Lys Tyr Thr Asp Phe
165 170 175
Leu Ser Ala Thr Ala Ser Gly Lys Val Glu Gly Val Lys Ala Ala Lys
180 185 190
Leu Ala Thr Pro Phe Glu Arg Thr Lys Leu Ala Ala Tyr Thr Leu Gly
195 200 205
Ala Met Thr Pro Cys Met Arg Leu Tyr Ala Tyr Ile Gly Lys Glu Leu
210 215 220
Gln Val Phe Leu Glu Gly Glu Lys Ile His Pro Tyr Lys Lys Trp Ile
225 230 235 240
Asp Ser Tyr Ala Ser Glu Ser Phe Gln Ala Ser Ala Leu Gln Thr Glu
245 250 255
Asp Leu Leu Asp Lys Leu Ser Val Pro Leu Thr Gly Glu Glu Leu Asp
260 265 270
Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys Leu Glu Ile Asp Phe
275 280 285
Phe Leu Thr Gln Pro Leu Val Gln Lys Ala Val Ile Pro Leu Ser Lys
290 295 300
Asp His Asn Pro Ala Glu His Arg Leu Thr Ile Phe Ser Asp Phe Asp
305 310 315 320
Leu Thr Cys Thr Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala
325 330 335
Ile Ile Thr Ala Pro Arg Ser Asp Gln Asn Arg Pro Glu Asn Gln Ile
340 345 350
Ala Arg Met Leu Ser Ala Asp Leu Arg Asn Thr Trp Gly Asp Leu Ser
355 360 365
Lys Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile Glu Lys Met Leu Leu
370 375 380
Thr Glu Lys Ala Glu Lys Phe Asp Tyr Glu Arg Leu His Lys Thr Leu
385 390 395 400
Glu Glu Leu Ser Asp Phe Glu Lys Arg Ala Asn Thr Arg Val Thr Glu
405 410 415
Ser Gly Val Leu Lys Gly Leu Asn Leu Glu Asp Ile Lys Arg Ala Gly
420 425 430
Gln Arg Leu Ile Leu Gln Asp Gly Cys Thr Asn Phe Phe Gln Ser Ile
435 440 445
Ile Arg Asn Glu Asn Leu Asn Ala Asp Ile His Val Leu Ser Tyr Cys
450 455 460
Trp Cys Gly Asp Leu Ile Arg Ser Ser Phe Ser Ser Gly Gly Ile Asp
465 470 475 480
Ala Leu Asn Val His Ala Asn Glu Phe Met Phe Gln Glu Ser Leu Ser
485 490 495
Thr Gly Glu Ile Val Lys Lys Val Glu Ser Pro Ile Asp Lys Val Gln
500 505 510
Ala Phe Ser Lys Ile Arg Met Asn Cys Gly Asn Asp Gln Lys Asn Leu
515 520 525
Thr Leu Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu Leu Glu
530 535 540
Ala Asp Val Gly Ile Val Leu Gly Thr Ser Ser Ser Leu Arg Thr Val
545 550 555 560
Gly Asn His Phe Gly Val Ser Phe Val Pro Leu Phe Pro Gly Val Val
565 570 575
Gln Lys Gln Lys Met Cys Thr Gly Val Asp Ser Ser Ser Cys Trp Lys
580 585 590
Gly Leu Ser Gly Val Leu Tyr Thr Ala Ser Ser Trp Ala Glu Ile His
595 600 605
Ala Phe Val Leu Gly Ser
610
<210> 136
<211> 1779
<212> DNA
<213> Populus trichocarpa
<220>
<221> CDS
<222> (1)..(1779)
<223> Populus trichocarpa gene encoding TMP phosphatase
[XP_002325785.2]
<400> 136
atg cgc cta ctc ttg ttt act tct cca aac cca atc aaa acc tct tca 48
Met Arg Leu Leu Leu Phe Thr Ser Pro Asn Pro Ile Lys Thr Ser Ser
1 5 10 15
tca cta tat ttc ctc aac tcg ctc cga tcc aac tta acc aaa cgc acc 96
Ser Leu Tyr Phe Leu Asn Ser Leu Arg Ser Asn Leu Thr Lys Arg Thr
20 25 30
ttg cca act cgg aga tct ttc atc cct gca aga atg gca atc cct cca 144
Leu Pro Thr Arg Arg Ser Phe Ile Pro Ala Arg Met Ala Ile Pro Pro
35 40 45
cga tca ata gca tca gcg cca tct tgc act aca aca tca ggc aga agt 192
Arg Ser Ile Ala Ser Ala Pro Ser Cys Thr Thr Thr Ser Gly Arg Ser
50 55 60
aac atc aac att gaa gag ggt ctt gct agt aaa ttc tgg atc aag ttt 240
Asn Ile Asn Ile Glu Glu Gly Leu Ala Ser Lys Phe Trp Ile Lys Phe
65 70 75 80
aga aga gaa tcc gtt ttt gct atg tac act cct ttt gtc atc tct ttg 288
Arg Arg Glu Ser Val Phe Ala Met Tyr Thr Pro Phe Val Ile Ser Leu
85 90 95
gct tct ggc act ctc aag att gat tct ttc agg cat tat atc tct caa 336
Ala Ser Gly Thr Leu Lys Ile Asp Ser Phe Arg His Tyr Ile Ser Gln
100 105 110
gat tct cac ttt ctc aaa tct ttt gct cat gcg ttt gaa tta gcg gaa 384
Asp Ser His Phe Leu Lys Ser Phe Ala His Ala Phe Glu Leu Ala Glu
115 120 125
gag tgt gct gat gat gat gaa gca aag cta gca atc tcc gag ttg agg 432
Glu Cys Ala Asp Asp Asp Glu Ala Lys Leu Ala Ile Ser Glu Leu Arg
130 135 140
aag ggt gtc tta gag gag ctg aag atg cac aat tca ttt gta cag gaa 480
Lys Gly Val Leu Glu Glu Leu Lys Met His Asn Ser Phe Val Gln Glu
145 150 155 160
tgg ggt ata gac cca ggt aaa gag ggg act atc aat tct gct act gta 528
Trp Gly Ile Asp Pro Gly Lys Glu Gly Thr Ile Asn Ser Ala Thr Val
165 170 175
aaa tac aca gat ttc ttg ttg gct aca gct tct ggg aag gtt gaa gga 576
Lys Tyr Thr Asp Phe Leu Leu Ala Thr Ala Ser Gly Lys Val Glu Gly
180 185 190
gtg aaa ggt ctt ggt aaa ctt gca act cct ttt gaa aga aca aaa gtt 624
Val Lys Gly Leu Gly Lys Leu Ala Thr Pro Phe Glu Arg Thr Lys Val
195 200 205
gca gcc tat act ctg ggt gcc atg aca cct tgc atg cgg ctg tat tcc 672
Ala Ala Tyr Thr Leu Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ser
210 215 220
ttt cta ggc aag gaa ctc cag gca gtt tta gat ccg gag gaa gat ggg 720
Phe Leu Gly Lys Glu Leu Gln Ala Val Leu Asp Pro Glu Glu Asp Gly
225 230 235 240
cac cct tac aag aag tgg att gac agt tat tcg tct gag agt ttt cag 768
His Pro Tyr Lys Lys Trp Ile Asp Ser Tyr Ser Ser Glu Ser Phe Gln
245 250 255
gca tca gct ctg caa act gaa gac ttg ctg gat aaa ctt agt gtc tcc 816
Ala Ser Ala Leu Gln Thr Glu Asp Leu Leu Asp Lys Leu Ser Val Ser
260 265 270
ttg aca ggc gag gag ctt gac atc att gaa aag ctt tat cac cag gcc 864
Leu Thr Gly Glu Glu Leu Asp Ile Ile Glu Lys Leu Tyr His Gln Ala
275 280 285
atg aaa ctt gaa ata gaa ttc ttc ctt gct cag cca att gct cag aca 912
Met Lys Leu Glu Ile Glu Phe Phe Leu Ala Gln Pro Ile Ala Gln Thr
290 295 300
act tta gct ccc ctg aca aaa ggg cat aac cct gaa gaa gac cgg ctt 960
Thr Leu Ala Pro Leu Thr Lys Gly His Asn Pro Glu Glu Asp Arg Leu
305 310 315 320
gtc ata ttt tct gat ttt gat ttg aca tgc act gtt gtt gac tct tct 1008
Val Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser
325 330 335
gcc att ttg gca gaa att gca ata cta aca gca cca aaa tct gat gtg 1056
Ala Ile Leu Ala Glu Ile Ala Ile Leu Thr Ala Pro Lys Ser Asp Val
340 345 350
gtt caa cct gag act caa att gct cga atg tca tca gct gat ctg agg 1104
Val Gln Pro Glu Thr Gln Ile Ala Arg Met Ser Ser Ala Asp Leu Arg
355 360 365
aac aca tgg ggt ctt ctt tct gga cag tac acg gaa gag tat gaa caa 1152
Asn Thr Trp Gly Leu Leu Ser Gly Gln Tyr Thr Glu Glu Tyr Glu Gln
370 375 380
tgt att gaa agc att atg cca tct gca aaa gtg gaa ttc aac tat gaa 1200
Cys Ile Glu Ser Ile Met Pro Ser Ala Lys Val Glu Phe Asn Tyr Glu
385 390 395 400
gct ctt tgt aaa gca ctt gaa caa ctt tca gac ttt gag cga agg gca 1248
Ala Leu Cys Lys Ala Leu Glu Gln Leu Ser Asp Phe Glu Arg Arg Ala
405 410 415
aat tct aga gtg att gat tct gga gtt ctc aaa ggt ttg aat ctt gaa 1296
Asn Ser Arg Val Ile Asp Ser Gly Val Leu Lys Gly Leu Asn Leu Glu
420 425 430
gat gta aaa cga gcg ggt gaa cgt ttg att ctt cag gat ggt tgc att 1344
Asp Val Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile
435 440 445
ggt ttc ttt cag aaa att gtg aag aat gaa aat ttg aac act aat gtc 1392
Gly Phe Phe Gln Lys Ile Val Lys Asn Glu Asn Leu Asn Thr Asn Val
450 455 460
cat gtg ctc tca tac tgc tgg tgt ggt gat ctc atc aga tca gct ttc 1440
His Val Leu Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ser Ala Phe
465 470 475 480
tcc tca ggg ggt ttg gat gct cta aat att cat gca aat gag tta att 1488
Ser Ser Gly Gly Leu Asp Ala Leu Asn Ile His Ala Asn Glu Leu Ile
485 490 495
ttt gaa gaa tca atc tcc acg gga gag att aac ttg act gtt tac att 1536
Phe Glu Glu Ser Ile Ser Thr Gly Glu Ile Asn Leu Thr Val Tyr Ile
500 505 510
gga gat tca gtt ggt gac ttg ctt tgt cta ctt cag gca gat att ggt 1584
Gly Asp Ser Val Gly Asp Leu Leu Cys Leu Leu Gln Ala Asp Ile Gly
515 520 525
att gta gtt gga tct agt gca agc tta agg agc gtg gga agt caa tat 1632
Ile Val Val Gly Ser Ser Ala Ser Leu Arg Ser Val Gly Ser Gln Tyr
530 535 540
ggt gtt tct ttt gta cca ctg ttc cct ggc ttg gta aga aaa cag aaa 1680
Gly Val Ser Phe Val Pro Leu Phe Pro Gly Leu Val Arg Lys Gln Lys
545 550 555 560
gaa tct gat gga gaa tct cct aat tgg aaa ggg cta tct ggc ata cta 1728
Glu Ser Asp Gly Glu Ser Pro Asn Trp Lys Gly Leu Ser Gly Ile Leu
565 570 575
tat aca gtc tcc agt tgg tca gaa ata cat gcc ttc att ttg ggg tgg 1776
Tyr Thr Val Ser Ser Trp Ser Glu Ile His Ala Phe Ile Leu Gly Trp
580 585 590
tag 1779
<210> 137
<211> 592
<212> PRT
<213> Populus trichocarpa
<400> 137
Met Arg Leu Leu Leu Phe Thr Ser Pro Asn Pro Ile Lys Thr Ser Ser
1 5 10 15
Ser Leu Tyr Phe Leu Asn Ser Leu Arg Ser Asn Leu Thr Lys Arg Thr
20 25 30
Leu Pro Thr Arg Arg Ser Phe Ile Pro Ala Arg Met Ala Ile Pro Pro
35 40 45
Arg Ser Ile Ala Ser Ala Pro Ser Cys Thr Thr Thr Ser Gly Arg Ser
50 55 60
Asn Ile Asn Ile Glu Glu Gly Leu Ala Ser Lys Phe Trp Ile Lys Phe
65 70 75 80
Arg Arg Glu Ser Val Phe Ala Met Tyr Thr Pro Phe Val Ile Ser Leu
85 90 95
Ala Ser Gly Thr Leu Lys Ile Asp Ser Phe Arg His Tyr Ile Ser Gln
100 105 110
Asp Ser His Phe Leu Lys Ser Phe Ala His Ala Phe Glu Leu Ala Glu
115 120 125
Glu Cys Ala Asp Asp Asp Glu Ala Lys Leu Ala Ile Ser Glu Leu Arg
130 135 140
Lys Gly Val Leu Glu Glu Leu Lys Met His Asn Ser Phe Val Gln Glu
145 150 155 160
Trp Gly Ile Asp Pro Gly Lys Glu Gly Thr Ile Asn Ser Ala Thr Val
165 170 175
Lys Tyr Thr Asp Phe Leu Leu Ala Thr Ala Ser Gly Lys Val Glu Gly
180 185 190
Val Lys Gly Leu Gly Lys Leu Ala Thr Pro Phe Glu Arg Thr Lys Val
195 200 205
Ala Ala Tyr Thr Leu Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ser
210 215 220
Phe Leu Gly Lys Glu Leu Gln Ala Val Leu Asp Pro Glu Glu Asp Gly
225 230 235 240
His Pro Tyr Lys Lys Trp Ile Asp Ser Tyr Ser Ser Glu Ser Phe Gln
245 250 255
Ala Ser Ala Leu Gln Thr Glu Asp Leu Leu Asp Lys Leu Ser Val Ser
260 265 270
Leu Thr Gly Glu Glu Leu Asp Ile Ile Glu Lys Leu Tyr His Gln Ala
275 280 285
Met Lys Leu Glu Ile Glu Phe Phe Leu Ala Gln Pro Ile Ala Gln Thr
290 295 300
Thr Leu Ala Pro Leu Thr Lys Gly His Asn Pro Glu Glu Asp Arg Leu
305 310 315 320
Val Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser
325 330 335
Ala Ile Leu Ala Glu Ile Ala Ile Leu Thr Ala Pro Lys Ser Asp Val
340 345 350
Val Gln Pro Glu Thr Gln Ile Ala Arg Met Ser Ser Ala Asp Leu Arg
355 360 365
Asn Thr Trp Gly Leu Leu Ser Gly Gln Tyr Thr Glu Glu Tyr Glu Gln
370 375 380
Cys Ile Glu Ser Ile Met Pro Ser Ala Lys Val Glu Phe Asn Tyr Glu
385 390 395 400
Ala Leu Cys Lys Ala Leu Glu Gln Leu Ser Asp Phe Glu Arg Arg Ala
405 410 415
Asn Ser Arg Val Ile Asp Ser Gly Val Leu Lys Gly Leu Asn Leu Glu
420 425 430
Asp Val Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile
435 440 445
Gly Phe Phe Gln Lys Ile Val Lys Asn Glu Asn Leu Asn Thr Asn Val
450 455 460
His Val Leu Ser Tyr Cys Trp Cys Gly Asp Leu Ile Arg Ser Ala Phe
465 470 475 480
Ser Ser Gly Gly Leu Asp Ala Leu Asn Ile His Ala Asn Glu Leu Ile
485 490 495
Phe Glu Glu Ser Ile Ser Thr Gly Glu Ile Asn Leu Thr Val Tyr Ile
500 505 510
Gly Asp Ser Val Gly Asp Leu Leu Cys Leu Leu Gln Ala Asp Ile Gly
515 520 525
Ile Val Val Gly Ser Ser Ala Ser Leu Arg Ser Val Gly Ser Gln Tyr
530 535 540
Gly Val Ser Phe Val Pro Leu Phe Pro Gly Leu Val Arg Lys Gln Lys
545 550 555 560
Glu Ser Asp Gly Glu Ser Pro Asn Trp Lys Gly Leu Ser Gly Ile Leu
565 570 575
Tyr Thr Val Ser Ser Trp Ser Glu Ile His Ala Phe Ile Leu Gly Trp
580 585 590
<210> 138
<211> 1860
<212> DNA
<213> Jatropha curcas
<220>
<221> CDS
<222> (1)..(1860)
<223> Jatropha curcas gene encoding TMP phosphatase [KDP23738.1]
<400> 138
atg gcg atc cct cca aag cta gct tcc tca tcg tct tcc atg gcc gcc 48
Met Ala Ile Pro Pro Lys Leu Ala Ser Ser Ser Ser Ser Met Ala Ala
1 5 10 15
tcc cct act tct gct ggt gga acc aac gag gaa ggc ctc gct agt aaa 96
Ser Pro Thr Ser Ala Gly Gly Thr Asn Glu Glu Gly Leu Ala Ser Lys
20 25 30
ttc tgg atc aag ttt cgc cga gaa tcg gtt ctc gct atg tac act cct 144
Phe Trp Ile Lys Phe Arg Arg Glu Ser Val Leu Ala Met Tyr Thr Pro
35 40 45
ttc gtc gtc tct ttt gcc gcc ggc aac ctc aag att gag agt ttt agg 192
Phe Val Val Ser Phe Ala Ala Gly Asn Leu Lys Ile Glu Ser Phe Arg
50 55 60
cat tac atc gct cag gat ttt cac ttc ctc aaa gcc ttc gct cac gcg 240
His Tyr Ile Ala Gln Asp Phe His Phe Leu Lys Ala Phe Ala His Ala
65 70 75 80
tat gaa ttg gca gaa gag tgt gct gat gat gat gat gcc aag cta gct 288
Tyr Glu Leu Ala Glu Glu Cys Ala Asp Asp Asp Asp Ala Lys Leu Ala
85 90 95
att gcc gcg ttg agg aag ggg gtc tta gag gag ctg aag ttg cat aaa 336
Ile Ala Ala Leu Arg Lys Gly Val Leu Glu Glu Leu Lys Leu His Lys
100 105 110
tca ttt gta cag gaa tgg ggt atg gac cct tcc aaa gag gtg act atc 384
Ser Phe Val Gln Glu Trp Gly Met Asp Pro Ser Lys Glu Val Thr Ile
115 120 125
aat tct gca act gca aaa tac aca gat ttc ttg ttg gct aca gct tct 432
Asn Ser Ala Thr Ala Lys Tyr Thr Asp Phe Leu Leu Ala Thr Ala Ser
130 135 140
gga aag gtt gaa gga gtg aaa ggt cct ggt aaa ctt gca act cct ttt 480
Gly Lys Val Glu Gly Val Lys Gly Pro Gly Lys Leu Ala Thr Pro Phe
145 150 155 160
gaa aga aca aaa gtt gca gct tac act ctt ggt acc atg aca ccc tgt 528
Glu Arg Thr Lys Val Ala Ala Tyr Thr Leu Gly Thr Met Thr Pro Cys
165 170 175
atg agg ttg tat gcc ttt cta gct aag gag ctg caa gca cta ata gat 576
Met Arg Leu Tyr Ala Phe Leu Ala Lys Glu Leu Gln Ala Leu Ile Asp
180 185 190
gca gaa gct ggt att cat cct tac cag aag tgg att gac aat tac tca 624
Ala Glu Ala Gly Ile His Pro Tyr Gln Lys Trp Ile Asp Asn Tyr Ser
195 200 205
tct gag agt ttt cag gca tca gct ctg caa act gaa gac ttg ctg gat 672
Ser Glu Ser Phe Gln Ala Ser Ala Leu Gln Thr Glu Asp Leu Leu Asp
210 215 220
aaa ctt agt gtc cct ttg aca ggc gaa gag ctt gac atc att gaa aag 720
Lys Leu Ser Val Pro Leu Thr Gly Glu Glu Leu Asp Ile Ile Glu Lys
225 230 235 240
ctt tat cac caa gcc atg aaa ctt gaa ata gag ttc ttc aat gcg cag 768
Leu Tyr His Gln Ala Met Lys Leu Glu Ile Glu Phe Phe Asn Ala Gln
245 250 255
cca ctt gat cag ccc act gtg gtt cct ctg aca aaa gag cat aac cct 816
Pro Leu Asp Gln Pro Thr Val Val Pro Leu Thr Lys Glu His Asn Pro
260 265 270
cta gaa gat cgc ctc gtg ata ttt tct gat ttt gat ttg aca tgc aca 864
Leu Glu Asp Arg Leu Val Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr
275 280 285
gtt gtt gat tcc tct gcc att ttg gca gag att gca att tta aca gca 912
Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala Ile Leu Thr Ala
290 295 300
tca aaa tct gat cag tca caa tct gat aat caa aat gct agg atg tca 960
Ser Lys Ser Asp Gln Ser Gln Ser Asp Asn Gln Asn Ala Arg Met Ser
305 310 315 320
tca act gag cta agg aac aca tgg gtt ctt ctc tct gga cag tat act 1008
Ser Thr Glu Leu Arg Asn Thr Trp Val Leu Leu Ser Gly Gln Tyr Thr
325 330 335
gaa gaa tat gag caa tgc att gaa agc att ctg ccc tct gaa aaa atg 1056
Glu Glu Tyr Glu Gln Cys Ile Glu Ser Ile Leu Pro Ser Glu Lys Met
340 345 350
gag ttc aac ttt gaa gct ttg tgt aaa gca ctc gaa caa ctc tca gac 1104
Glu Phe Asn Phe Glu Ala Leu Cys Lys Ala Leu Glu Gln Leu Ser Asp
355 360 365
ttt gag cga agg gca aat gct aga gtt atc aaa tct gga gtt ctt aag 1152
Phe Glu Arg Arg Ala Asn Ala Arg Val Ile Lys Ser Gly Val Leu Lys
370 375 380
ggt ttg aat ctt gaa gac ata aaa cga gct gtg gag ttc aac ttt gaa 1200
Gly Leu Asn Leu Glu Asp Ile Lys Arg Ala Val Glu Phe Asn Phe Glu
385 390 395 400
gct ttg tgt aaa gca ctc gaa caa ctc tca gac ttt gag cga agg gca 1248
Ala Leu Cys Lys Ala Leu Glu Gln Leu Ser Asp Phe Glu Arg Arg Ala
405 410 415
aat gct aga gtt atc aaa tct gga gtt ctt aag ggt ttg aat ctt gaa 1296
Asn Ala Arg Val Ile Lys Ser Gly Val Leu Lys Gly Leu Asn Leu Glu
420 425 430
gac ata aaa cga gct ggt gaa aga ctg att ctt caa gat ggc tgc acc 1344
Asp Ile Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Thr
435 440 445
agt ttt ttt cag aaa atc tcg aag aat gaa aat ctg aat gct aat ata 1392
Ser Phe Phe Gln Lys Ile Ser Lys Asn Glu Asn Leu Asn Ala Asn Ile
450 455 460
cat ttc ctc tca tat tgt tgg tgt gct gat ctg atc aga tct gct ttc 1440
His Phe Leu Ser Tyr Cys Trp Cys Ala Asp Leu Ile Arg Ser Ala Phe
465 470 475 480
tca tca ggg ggt ttg gat gtt ctg aat ata cat gcg aat gag ttt gat 1488
Ser Ser Gly Gly Leu Asp Val Leu Asn Ile His Ala Asn Glu Phe Asp
485 490 495
ttc gta gaa tca att tca acg ggt gag att att atg aag gtg gaa acc 1536
Phe Val Glu Ser Ile Ser Thr Gly Glu Ile Ile Met Lys Val Glu Thr
500 505 510
cct aca gac aaa gcc caa gct ttt aat aat att tta atg aac tac agc 1584
Pro Thr Asp Lys Ala Gln Ala Phe Asn Asn Ile Leu Met Asn Tyr Ser
515 520 525
cct gac aaa aag aat ttg act gtt tat att gga gac tca gtt ggg gac 1632
Pro Asp Lys Lys Asn Leu Thr Val Tyr Ile Gly Asp Ser Val Gly Asp
530 535 540
ttg ctt tgt ctg ctt gcg gca gat ata ggc atc gtg atc gga tca agc 1680
Leu Leu Cys Leu Leu Ala Ala Asp Ile Gly Ile Val Ile Gly Ser Ser
545 550 555 560
tcc agc cta agg aga gtc gga agt cag ttt ggt gta aca ttt tta cca 1728
Ser Ser Leu Arg Arg Val Gly Ser Gln Phe Gly Val Thr Phe Leu Pro
565 570 575
ttg tat cct ggc ttg gtt aaa aaa cag aga gag tat act gaa gga agc 1776
Leu Tyr Pro Gly Leu Val Lys Lys Gln Arg Glu Tyr Thr Glu Gly Ser
580 585 590
tct tgg aat tgg aag ggt caa tct ggc gtt ctg tac aca gtt tct agt 1824
Ser Trp Asn Trp Lys Gly Gln Ser Gly Val Leu Tyr Thr Val Ser Ser
595 600 605
tgg gct gaa ata cat tcc ttc gtt ttg gga tgg tag 1860
Trp Ala Glu Ile His Ser Phe Val Leu Gly Trp
610 615
<210> 139
<211> 619
<212> PRT
<213> Jatropha curcas
<400> 139
Met Ala Ile Pro Pro Lys Leu Ala Ser Ser Ser Ser Ser Met Ala Ala
1 5 10 15
Ser Pro Thr Ser Ala Gly Gly Thr Asn Glu Glu Gly Leu Ala Ser Lys
20 25 30
Phe Trp Ile Lys Phe Arg Arg Glu Ser Val Leu Ala Met Tyr Thr Pro
35 40 45
Phe Val Val Ser Phe Ala Ala Gly Asn Leu Lys Ile Glu Ser Phe Arg
50 55 60
His Tyr Ile Ala Gln Asp Phe His Phe Leu Lys Ala Phe Ala His Ala
65 70 75 80
Tyr Glu Leu Ala Glu Glu Cys Ala Asp Asp Asp Asp Ala Lys Leu Ala
85 90 95
Ile Ala Ala Leu Arg Lys Gly Val Leu Glu Glu Leu Lys Leu His Lys
100 105 110
Ser Phe Val Gln Glu Trp Gly Met Asp Pro Ser Lys Glu Val Thr Ile
115 120 125
Asn Ser Ala Thr Ala Lys Tyr Thr Asp Phe Leu Leu Ala Thr Ala Ser
130 135 140
Gly Lys Val Glu Gly Val Lys Gly Pro Gly Lys Leu Ala Thr Pro Phe
145 150 155 160
Glu Arg Thr Lys Val Ala Ala Tyr Thr Leu Gly Thr Met Thr Pro Cys
165 170 175
Met Arg Leu Tyr Ala Phe Leu Ala Lys Glu Leu Gln Ala Leu Ile Asp
180 185 190
Ala Glu Ala Gly Ile His Pro Tyr Gln Lys Trp Ile Asp Asn Tyr Ser
195 200 205
Ser Glu Ser Phe Gln Ala Ser Ala Leu Gln Thr Glu Asp Leu Leu Asp
210 215 220
Lys Leu Ser Val Pro Leu Thr Gly Glu Glu Leu Asp Ile Ile Glu Lys
225 230 235 240
Leu Tyr His Gln Ala Met Lys Leu Glu Ile Glu Phe Phe Asn Ala Gln
245 250 255
Pro Leu Asp Gln Pro Thr Val Val Pro Leu Thr Lys Glu His Asn Pro
260 265 270
Leu Glu Asp Arg Leu Val Ile Phe Ser Asp Phe Asp Leu Thr Cys Thr
275 280 285
Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala Ile Leu Thr Ala
290 295 300
Ser Lys Ser Asp Gln Ser Gln Ser Asp Asn Gln Asn Ala Arg Met Ser
305 310 315 320
Ser Thr Glu Leu Arg Asn Thr Trp Val Leu Leu Ser Gly Gln Tyr Thr
325 330 335
Glu Glu Tyr Glu Gln Cys Ile Glu Ser Ile Leu Pro Ser Glu Lys Met
340 345 350
Glu Phe Asn Phe Glu Ala Leu Cys Lys Ala Leu Glu Gln Leu Ser Asp
355 360 365
Phe Glu Arg Arg Ala Asn Ala Arg Val Ile Lys Ser Gly Val Leu Lys
370 375 380
Gly Leu Asn Leu Glu Asp Ile Lys Arg Ala Val Glu Phe Asn Phe Glu
385 390 395 400
Ala Leu Cys Lys Ala Leu Glu Gln Leu Ser Asp Phe Glu Arg Arg Ala
405 410 415
Asn Ala Arg Val Ile Lys Ser Gly Val Leu Lys Gly Leu Asn Leu Glu
420 425 430
Asp Ile Lys Arg Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys Thr
435 440 445
Ser Phe Phe Gln Lys Ile Ser Lys Asn Glu Asn Leu Asn Ala Asn Ile
450 455 460
His Phe Leu Ser Tyr Cys Trp Cys Ala Asp Leu Ile Arg Ser Ala Phe
465 470 475 480
Ser Ser Gly Gly Leu Asp Val Leu Asn Ile His Ala Asn Glu Phe Asp
485 490 495
Phe Val Glu Ser Ile Ser Thr Gly Glu Ile Ile Met Lys Val Glu Thr
500 505 510
Pro Thr Asp Lys Ala Gln Ala Phe Asn Asn Ile Leu Met Asn Tyr Ser
515 520 525
Pro Asp Lys Lys Asn Leu Thr Val Tyr Ile Gly Asp Ser Val Gly Asp
530 535 540
Leu Leu Cys Leu Leu Ala Ala Asp Ile Gly Ile Val Ile Gly Ser Ser
545 550 555 560
Ser Ser Leu Arg Arg Val Gly Ser Gln Phe Gly Val Thr Phe Leu Pro
565 570 575
Leu Tyr Pro Gly Leu Val Lys Lys Gln Arg Glu Tyr Thr Glu Gly Ser
580 585 590
Ser Trp Asn Trp Lys Gly Gln Ser Gly Val Leu Tyr Thr Val Ser Ser
595 600 605
Trp Ala Glu Ile His Ser Phe Val Leu Gly Trp
610 615
<210> 140
<211> 1842
<212> DNA
<213> Citrus sinensis
<220>
<221> CDS
<222> (1)..(1842)
<223> Citrus sinensi gene encoding TMP phosphatase [XP_006484613.1]
<400> 140
atg cgc ttc ctt ttc aca aac cca atc aaa acc cca tta ctc tct tct 48
Met Arg Phe Leu Phe Thr Asn Pro Ile Lys Thr Pro Leu Leu Ser Ser
1 5 10 15
att ctt ttc cat tgt ccc aac tcg ccc cga ctc ggc ctt ctt gac tca 96
Ile Leu Phe His Cys Pro Asn Ser Pro Arg Leu Gly Leu Leu Asp Ser
20 25 30
gtc cga gtc aac tca cct tct tct ttg aca act caa aga tcg tca ctt 144
Val Arg Val Asn Ser Pro Ser Ser Leu Thr Thr Gln Arg Ser Ser Leu
35 40 45
tcg atg gcg gcg att ccc cca aaa tcg ccg agc cct gag gag gag gga 192
Ser Met Ala Ala Ile Pro Pro Lys Ser Pro Ser Pro Glu Glu Glu Gly
50 55 60
ctc gcg agg agg ttg tgg atc aag ttt aag aga gaa tct gtg ttt gcc 240
Leu Ala Arg Arg Leu Trp Ile Lys Phe Lys Arg Glu Ser Val Phe Ala
65 70 75 80
atg tac tcc ccg ttt acg gtt tgt ttg gct tct ggg aac cta aag ctt 288
Met Tyr Ser Pro Phe Thr Val Cys Leu Ala Ser Gly Asn Leu Lys Leu
85 90 95
gaa acc ttc agg cat tac atc gcc caa gat ttt cat ttt ctc aaa gct 336
Glu Thr Phe Arg His Tyr Ile Ala Gln Asp Phe His Phe Leu Lys Ala
100 105 110
ttc gcc caa gcg tat gaa ctg gcg gaa gaa tgt gct gat gat gat gat 384
Phe Ala Gln Ala Tyr Glu Leu Ala Glu Glu Cys Ala Asp Asp Asp Asp
115 120 125
gca aag tta tct atc tct gaa ttg agg aag ggt gta ctt gag gag tta 432
Ala Lys Leu Ser Ile Ser Glu Leu Arg Lys Gly Val Leu Glu Glu Leu
130 135 140
aaa atg cat gat tcc ttt gtg aag gag tgg ggt aca gat ctt gct aaa 480
Lys Met His Asp Ser Phe Val Lys Glu Trp Gly Thr Asp Leu Ala Lys
145 150 155 160
atg gct act gtt aac tct gca act gta aag tat aca gag ttc ttg ttg 528
Met Ala Thr Val Asn Ser Ala Thr Val Lys Tyr Thr Glu Phe Leu Leu
165 170 175
gca aca gct tcc ggg aag gtc gaa ggt gtt aaa ggt cct gga aaa ctt 576
Ala Thr Ala Ser Gly Lys Val Glu Gly Val Lys Gly Pro Gly Lys Leu
180 185 190
gca acc cca ttt gag aaa act aaa gtt gcc gct tac aca ttg ggt gcc 624
Ala Thr Pro Phe Glu Lys Thr Lys Val Ala Ala Tyr Thr Leu Gly Ala
195 200 205
atg tca cct tgt atg agg ctc tat gct ttc ctt gga aag gaa ttc cat 672
Met Ser Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Lys Glu Phe His
210 215 220
ggc ctc cta aat gct aat gaa ggc aat cat cct tac aag aag tgg att 720
Gly Leu Leu Asn Ala Asn Glu Gly Asn His Pro Tyr Lys Lys Trp Ile
225 230 235 240
gac aat tat tct tct gaa agt ttt cag gcc tca gct ctg caa aat gag 768
Asp Asn Tyr Ser Ser Glu Ser Phe Gln Ala Ser Ala Leu Gln Asn Glu
245 250 255
gac ttg ctg gat aaa ctt agt gtc tct ttg aca ggc gaa gaa cta gac 816
Asp Leu Leu Asp Lys Leu Ser Val Ser Leu Thr Gly Glu Glu Leu Asp
260 265 270
ata ata gaa aag ctc tat cac caa gcc atg aaa ctt gaa gta gag ttc 864
Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys Leu Glu Val Glu Phe
275 280 285
ttc tgt gct cag cca ctt gct cag ccc act gta gtt cct ctg att aaa 912
Phe Cys Ala Gln Pro Leu Ala Gln Pro Thr Val Val Pro Leu Ile Lys
290 295 300
ggg cat aat cct gca gga gac cgt cta att ata ttt tct gat ttc gat 960
Gly His Asn Pro Ala Gly Asp Arg Leu Ile Ile Phe Ser Asp Phe Asp
305 310 315 320
ttg act tgc acc att gtt gat tcc tct gcc att ttg gca gag atc gca 1008
Leu Thr Cys Thr Ile Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala
325 330 335
ata gtg aca gca cca aaa tct gac cag aat caa cct gaa aat caa ctt 1056
Ile Val Thr Ala Pro Lys Ser Asp Gln Asn Gln Pro Glu Asn Gln Leu
340 345 350
ggt cgg atg tca tca ggt gag ctg agg aac aca tgg ggt ctt ctt tcc 1104
Gly Arg Met Ser Ser Gly Glu Leu Arg Asn Thr Trp Gly Leu Leu Ser
355 360 365
aaa cag tac aca gag gag tac gaa caa tgc att gaa agc ttc atg ccc 1152
Lys Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile Glu Ser Phe Met Pro
370 375 380
tct gag aaa gtg gag aat ttc aac tat gaa act ttg cat aaa gca ctt 1200
Ser Glu Lys Val Glu Asn Phe Asn Tyr Glu Thr Leu His Lys Ala Leu
385 390 395 400
gag caa ctc tca cac ttt gag aag agg gca aat tct aga gtg atc gaa 1248
Glu Gln Leu Ser His Phe Glu Lys Arg Ala Asn Ser Arg Val Ile Glu
405 410 415
tct gga gtt ctc aag ggt ata aat ctt gaa gat att aaa aaa gct ggt 1296
Ser Gly Val Leu Lys Gly Ile Asn Leu Glu Asp Ile Lys Lys Ala Gly
420 425 430
gaa cgc ctg agt ctt caa gat ggt tgc act acc ttc ttt cag aaa gtt 1344
Glu Arg Leu Ser Leu Gln Asp Gly Cys Thr Thr Phe Phe Gln Lys Val
435 440 445
gta aag aat gaa aat ttg aat gct aat gtc cat gtg ctt tca tac tgt 1392
Val Lys Asn Glu Asn Leu Asn Ala Asn Val His Val Leu Ser Tyr Cys
450 455 460
tgg tgt ggt gat ctc atc aga gca tct ttt tct tca gca ggt tta aat 1440
Trp Cys Gly Asp Leu Ile Arg Ala Ser Phe Ser Ser Ala Gly Leu Asn
465 470 475 480
gca ctg aat gta cat gcg aat gag ttc tca ttc aaa gaa tct att tca 1488
Ala Leu Asn Val His Ala Asn Glu Phe Ser Phe Lys Glu Ser Ile Ser
485 490 495
acg ggt gaa att att gag aaa gtg gag tcc ccc att gac aaa gtt caa 1536
Thr Gly Glu Ile Ile Glu Lys Val Glu Ser Pro Ile Asp Lys Val Gln
500 505 510
gct ttc aac aat act tta gag aaa tac gga act gac aga aag aac ttg 1584
Ala Phe Asn Asn Thr Leu Glu Lys Tyr Gly Thr Asp Arg Lys Asn Leu
515 520 525
agt gtt tac att gga gac tct gtg ggt gac ttg ctt tgt ctg ctt gag 1632
Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu Leu Glu
530 535 540
gct gat ata ggc att gta atc ggg tct agc tca agc tta agg aga gtg 1680
Ala Asp Ile Gly Ile Val Ile Gly Ser Ser Ser Ser Leu Arg Arg Val
545 550 555 560
gga tct caa ttt ggt gtt aca ttt atc ccg ttg tac cct ggc ttg gtt 1728
Gly Ser Gln Phe Gly Val Thr Phe Ile Pro Leu Tyr Pro Gly Leu Val
565 570 575
aag aaa cag aag gag tac act gaa gga agc tct tct aac tgg aag gag 1776
Lys Lys Gln Lys Glu Tyr Thr Glu Gly Ser Ser Ser Asn Trp Lys Glu
580 585 590
aaa tct ggc ata ctt tac aca gtc tca agt tgg gct gaa gta cat gcc 1824
Lys Ser Gly Ile Leu Tyr Thr Val Ser Ser Trp Ala Glu Val His Ala
595 600 605
ttt atc ttg ggg tgg tag 1842
Phe Ile Leu Gly Trp
610
<210> 141
<211> 613
<212> PRT
<213> Citrus sinensis
<400> 141
Met Arg Phe Leu Phe Thr Asn Pro Ile Lys Thr Pro Leu Leu Ser Ser
1 5 10 15
Ile Leu Phe His Cys Pro Asn Ser Pro Arg Leu Gly Leu Leu Asp Ser
20 25 30
Val Arg Val Asn Ser Pro Ser Ser Leu Thr Thr Gln Arg Ser Ser Leu
35 40 45
Ser Met Ala Ala Ile Pro Pro Lys Ser Pro Ser Pro Glu Glu Glu Gly
50 55 60
Leu Ala Arg Arg Leu Trp Ile Lys Phe Lys Arg Glu Ser Val Phe Ala
65 70 75 80
Met Tyr Ser Pro Phe Thr Val Cys Leu Ala Ser Gly Asn Leu Lys Leu
85 90 95
Glu Thr Phe Arg His Tyr Ile Ala Gln Asp Phe His Phe Leu Lys Ala
100 105 110
Phe Ala Gln Ala Tyr Glu Leu Ala Glu Glu Cys Ala Asp Asp Asp Asp
115 120 125
Ala Lys Leu Ser Ile Ser Glu Leu Arg Lys Gly Val Leu Glu Glu Leu
130 135 140
Lys Met His Asp Ser Phe Val Lys Glu Trp Gly Thr Asp Leu Ala Lys
145 150 155 160
Met Ala Thr Val Asn Ser Ala Thr Val Lys Tyr Thr Glu Phe Leu Leu
165 170 175
Ala Thr Ala Ser Gly Lys Val Glu Gly Val Lys Gly Pro Gly Lys Leu
180 185 190
Ala Thr Pro Phe Glu Lys Thr Lys Val Ala Ala Tyr Thr Leu Gly Ala
195 200 205
Met Ser Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Lys Glu Phe His
210 215 220
Gly Leu Leu Asn Ala Asn Glu Gly Asn His Pro Tyr Lys Lys Trp Ile
225 230 235 240
Asp Asn Tyr Ser Ser Glu Ser Phe Gln Ala Ser Ala Leu Gln Asn Glu
245 250 255
Asp Leu Leu Asp Lys Leu Ser Val Ser Leu Thr Gly Glu Glu Leu Asp
260 265 270
Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys Leu Glu Val Glu Phe
275 280 285
Phe Cys Ala Gln Pro Leu Ala Gln Pro Thr Val Val Pro Leu Ile Lys
290 295 300
Gly His Asn Pro Ala Gly Asp Arg Leu Ile Ile Phe Ser Asp Phe Asp
305 310 315 320
Leu Thr Cys Thr Ile Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala
325 330 335
Ile Val Thr Ala Pro Lys Ser Asp Gln Asn Gln Pro Glu Asn Gln Leu
340 345 350
Gly Arg Met Ser Ser Gly Glu Leu Arg Asn Thr Trp Gly Leu Leu Ser
355 360 365
Lys Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile Glu Ser Phe Met Pro
370 375 380
Ser Glu Lys Val Glu Asn Phe Asn Tyr Glu Thr Leu His Lys Ala Leu
385 390 395 400
Glu Gln Leu Ser His Phe Glu Lys Arg Ala Asn Ser Arg Val Ile Glu
405 410 415
Ser Gly Val Leu Lys Gly Ile Asn Leu Glu Asp Ile Lys Lys Ala Gly
420 425 430
Glu Arg Leu Ser Leu Gln Asp Gly Cys Thr Thr Phe Phe Gln Lys Val
435 440 445
Val Lys Asn Glu Asn Leu Asn Ala Asn Val His Val Leu Ser Tyr Cys
450 455 460
Trp Cys Gly Asp Leu Ile Arg Ala Ser Phe Ser Ser Ala Gly Leu Asn
465 470 475 480
Ala Leu Asn Val His Ala Asn Glu Phe Ser Phe Lys Glu Ser Ile Ser
485 490 495
Thr Gly Glu Ile Ile Glu Lys Val Glu Ser Pro Ile Asp Lys Val Gln
500 505 510
Ala Phe Asn Asn Thr Leu Glu Lys Tyr Gly Thr Asp Arg Lys Asn Leu
515 520 525
Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu Leu Glu
530 535 540
Ala Asp Ile Gly Ile Val Ile Gly Ser Ser Ser Ser Leu Arg Arg Val
545 550 555 560
Gly Ser Gln Phe Gly Val Thr Phe Ile Pro Leu Tyr Pro Gly Leu Val
565 570 575
Lys Lys Gln Lys Glu Tyr Thr Glu Gly Ser Ser Ser Asn Trp Lys Glu
580 585 590
Lys Ser Gly Ile Leu Tyr Thr Val Ser Ser Trp Ala Glu Val His Ala
595 600 605
Phe Ile Leu Gly Trp
610
<210> 142
<211> 1728
<212> DNA
<213> Prunus persica
<220>
<221> CDS
<222> (1)..(1728)
<223> Prunus persica gene encoding TMP phosphatase [XP_007199656.1]
<400> 142
atg gcg gca ttg gct cgt cat agc att gtt aga ctc aat cac gaa gga 48
Met Ala Ala Leu Ala Arg His Ser Ile Val Arg Leu Asn His Glu Gly
1 5 10 15
ggc cta gcc aga cgg ctg tgg ttc aag ttc aga gac gac tct gtt ttc 96
Gly Leu Ala Arg Arg Leu Trp Phe Lys Phe Arg Asp Asp Ser Val Phe
20 25 30
tct ctc tac act ccc ttc ttc gtt ggc tta gcc tct gct act ctg cac 144
Ser Leu Tyr Thr Pro Phe Phe Val Gly Leu Ala Ser Ala Thr Leu His
35 40 45
tct gaa act acc ttt cgc cat ttc atc tct cag gac ctc cat ttt ctc 192
Ser Glu Thr Thr Phe Arg His Phe Ile Ser Gln Asp Leu His Phe Leu
50 55 60
aaa gcc ttc gtt ctc gca tat gaa ttg gcg gaa gat tgt gct gat gat 240
Lys Ala Phe Val Leu Ala Tyr Glu Leu Ala Glu Asp Cys Ala Asp Asp
65 70 75 80
gag gac gac aag aat ggt tta cgc gat ttg aga aaa cgt gcc gtc ggc 288
Glu Asp Asp Lys Asn Gly Leu Arg Asp Leu Arg Lys Arg Ala Val Gly
85 90 95
agg ctt caa atg cac gac aca ttt gtc cga gaa tgg ggt ttt gaa ttc 336
Arg Leu Gln Met His Asp Thr Phe Val Arg Glu Trp Gly Phe Glu Phe
100 105 110
cca aat gag gac att tct aaa gac att gca aca acc aaa tac aca gat 384
Pro Asn Glu Asp Ile Ser Lys Asp Ile Ala Thr Thr Lys Tyr Thr Asp
115 120 125
ttc ttg ctt gca aca gca tca ggg aaa att gaa gga gaa aga tcg gtt 432
Phe Leu Leu Ala Thr Ala Ser Gly Lys Ile Glu Gly Glu Arg Ser Val
130 135 140
ctg gac aaa atc gca acc cct ttc gaa aag acc aag gtt gct gca tat 480
Leu Asp Lys Ile Ala Thr Pro Phe Glu Lys Thr Lys Val Ala Ala Tyr
145 150 155 160
aca ctt gct gct ctg gct cct tgt atg aga ctc tat gcc ttc atc agt 528
Thr Leu Ala Ala Leu Ala Pro Cys Met Arg Leu Tyr Ala Phe Ile Ser
165 170 175
act gag atc caa ggc att ata aat cct gat caa gat agc act cac att 576
Thr Glu Ile Gln Gly Ile Ile Asn Pro Asp Gln Asp Ser Thr His Ile
180 185 190
tac aaa agc tgg ata gaa aat tat tcg tct caa gtt ttc gag gaa ata 624
Tyr Lys Ser Trp Ile Glu Asn Tyr Ser Ser Gln Val Phe Glu Glu Ile
195 200 205
gcc ctg caa aat gaa gac atg cta gat aaa ctt agt gtt tct ttg act 672
Ala Leu Gln Asn Glu Asp Met Leu Asp Lys Leu Ser Val Ser Leu Thr
210 215 220
ggt gag gag ctt gag att ata gag aag ctc tat cat caa gct atg aag 720
Gly Glu Glu Leu Glu Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys
225 230 235 240
ctt caa gta gat ttt att gct gct caa cca att tct gat cag caa tct 768
Leu Gln Val Asp Phe Ile Ala Ala Gln Pro Ile Ser Asp Gln Gln Ser
245 250 255
gta gtt cct ttg tct cgg gtg cat gac ttt agc aaa cgc cat ctt acg 816
Val Val Pro Leu Ser Arg Val His Asp Phe Ser Lys Arg His Leu Thr
260 265 270
ata ctt tgt gac ttt gat ttg gca tgc act gct ttt gat tct gct gcc 864
Ile Leu Cys Asp Phe Asp Leu Ala Cys Thr Ala Phe Asp Ser Ala Ala
275 280 285
ata ttg gct gag att gcg atc ata aca gca cca aag gct gat atg gat 912
Ile Leu Ala Glu Ile Ala Ile Ile Thr Ala Pro Lys Ala Asp Met Asp
290 295 300
gga tct gat caa acc caa ctt gct cgg atg cca tca gca gac tta agg 960
Gly Ser Asp Gln Thr Gln Leu Ala Arg Met Pro Ser Ala Asp Leu Arg
305 310 315 320
agc aca tgg gat gtt ctt tca acc caa tac act gaa caa ttt gaa caa 1008
Ser Thr Trp Asp Val Leu Ser Thr Gln Tyr Thr Glu Gln Phe Glu Gln
325 330 335
tgt gta gaa agc att gtg gcc agt gag aga gtg gaa gaa ttc gat tat 1056
Cys Val Glu Ser Ile Val Ala Ser Glu Arg Val Glu Glu Phe Asp Tyr
340 345 350
gaa cgt ctg tgt agc gcg ctt gaa caa ctt gcg gag ttt gag aga aag 1104
Glu Arg Leu Cys Ser Ala Leu Glu Gln Leu Ala Glu Phe Glu Arg Lys
355 360 365
gca aat gaa agg gtg gtt cag tca gga gtg ttg aag ggt tta aat gcg 1152
Ala Asn Glu Arg Val Val Gln Ser Gly Val Leu Lys Gly Leu Asn Ala
370 375 380
gag gat ata aaa agg gct gga cag agc ctc att ctg caa gat ggt tgc 1200
Glu Asp Ile Lys Arg Ala Gly Gln Ser Leu Ile Leu Gln Asp Gly Cys
385 390 395 400
aga agc ttc ttt cag aag att gtg aaa aat aaa aat ctg aaa act gat 1248
Arg Ser Phe Phe Gln Lys Ile Val Lys Asn Lys Asn Leu Lys Thr Asp
405 410 415
gtt cat gtg ctt tca tac tgc tgg tgt aat gac ctc att gta tca gct 1296
Val His Val Leu Ser Tyr Cys Trp Cys Asn Asp Leu Ile Val Ser Ala
420 425 430
ttc tct tca gga gat ttg aat gtc ttg aat gta cat tca aat gag ttg 1344
Phe Ser Ser Gly Asp Leu Asn Val Leu Asn Val His Ser Asn Glu Leu
435 440 445
gtt tat caa gaa tct gtc aca act ggt gaa att gta aag aag atg gag 1392
Val Tyr Gln Glu Ser Val Thr Thr Gly Glu Ile Val Lys Lys Met Glu
450 455 460
tct ccc atg gaa aag ctt caa gtc ttc aac gac gtc cta atc gac cgc 1440
Ser Pro Met Glu Lys Leu Gln Val Phe Asn Asp Val Leu Ile Asp Arg
465 470 475 480
agg ggc gaa ggc aat aaa cac ttg aca gtt tac att gga ggc tca gtg 1488
Arg Gly Glu Gly Asn Lys His Leu Thr Val Tyr Ile Gly Gly Ser Val
485 490 495
ggt gac ttg ctt tgc ctg ctt gaa gca gat ata ggc att gta gtt ggt 1536
Gly Asp Leu Leu Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Val Gly
500 505 510
tca agt tca agc cta agg aga cta ggt gat cat ttt ggt gtt tcc ttt 1584
Ser Ser Ser Ser Leu Arg Arg Leu Gly Asp His Phe Gly Val Ser Phe
515 520 525
gtc cca ttg ttc tct ggc ttg gtg aag agg cag aaa gaa ctt gct gat 1632
Val Pro Leu Phe Ser Gly Leu Val Lys Arg Gln Lys Glu Leu Ala Asp
530 535 540
caa gat tgt gcc tct aat tgg tgg aaa cca ttg tct ggt gtt ctt tat 1680
Gln Asp Cys Ala Ser Asn Trp Trp Lys Pro Leu Ser Gly Val Leu Tyr
545 550 555 560
acg gtg tct agt tgg gct gaa ata cag gca ttc att ttg ggt aca tag 1728
Thr Val Ser Ser Trp Ala Glu Ile Gln Ala Phe Ile Leu Gly Thr
565 570 575
<210> 143
<211> 575
<212> PRT
<213> Prunus persica
<400> 143
Met Ala Ala Leu Ala Arg His Ser Ile Val Arg Leu Asn His Glu Gly
1 5 10 15
Gly Leu Ala Arg Arg Leu Trp Phe Lys Phe Arg Asp Asp Ser Val Phe
20 25 30
Ser Leu Tyr Thr Pro Phe Phe Val Gly Leu Ala Ser Ala Thr Leu His
35 40 45
Ser Glu Thr Thr Phe Arg His Phe Ile Ser Gln Asp Leu His Phe Leu
50 55 60
Lys Ala Phe Val Leu Ala Tyr Glu Leu Ala Glu Asp Cys Ala Asp Asp
65 70 75 80
Glu Asp Asp Lys Asn Gly Leu Arg Asp Leu Arg Lys Arg Ala Val Gly
85 90 95
Arg Leu Gln Met His Asp Thr Phe Val Arg Glu Trp Gly Phe Glu Phe
100 105 110
Pro Asn Glu Asp Ile Ser Lys Asp Ile Ala Thr Thr Lys Tyr Thr Asp
115 120 125
Phe Leu Leu Ala Thr Ala Ser Gly Lys Ile Glu Gly Glu Arg Ser Val
130 135 140
Leu Asp Lys Ile Ala Thr Pro Phe Glu Lys Thr Lys Val Ala Ala Tyr
145 150 155 160
Thr Leu Ala Ala Leu Ala Pro Cys Met Arg Leu Tyr Ala Phe Ile Ser
165 170 175
Thr Glu Ile Gln Gly Ile Ile Asn Pro Asp Gln Asp Ser Thr His Ile
180 185 190
Tyr Lys Ser Trp Ile Glu Asn Tyr Ser Ser Gln Val Phe Glu Glu Ile
195 200 205
Ala Leu Gln Asn Glu Asp Met Leu Asp Lys Leu Ser Val Ser Leu Thr
210 215 220
Gly Glu Glu Leu Glu Ile Ile Glu Lys Leu Tyr His Gln Ala Met Lys
225 230 235 240
Leu Gln Val Asp Phe Ile Ala Ala Gln Pro Ile Ser Asp Gln Gln Ser
245 250 255
Val Val Pro Leu Ser Arg Val His Asp Phe Ser Lys Arg His Leu Thr
260 265 270
Ile Leu Cys Asp Phe Asp Leu Ala Cys Thr Ala Phe Asp Ser Ala Ala
275 280 285
Ile Leu Ala Glu Ile Ala Ile Ile Thr Ala Pro Lys Ala Asp Met Asp
290 295 300
Gly Ser Asp Gln Thr Gln Leu Ala Arg Met Pro Ser Ala Asp Leu Arg
305 310 315 320
Ser Thr Trp Asp Val Leu Ser Thr Gln Tyr Thr Glu Gln Phe Glu Gln
325 330 335
Cys Val Glu Ser Ile Val Ala Ser Glu Arg Val Glu Glu Phe Asp Tyr
340 345 350
Glu Arg Leu Cys Ser Ala Leu Glu Gln Leu Ala Glu Phe Glu Arg Lys
355 360 365
Ala Asn Glu Arg Val Val Gln Ser Gly Val Leu Lys Gly Leu Asn Ala
370 375 380
Glu Asp Ile Lys Arg Ala Gly Gln Ser Leu Ile Leu Gln Asp Gly Cys
385 390 395 400
Arg Ser Phe Phe Gln Lys Ile Val Lys Asn Lys Asn Leu Lys Thr Asp
405 410 415
Val His Val Leu Ser Tyr Cys Trp Cys Asn Asp Leu Ile Val Ser Ala
420 425 430
Phe Ser Ser Gly Asp Leu Asn Val Leu Asn Val His Ser Asn Glu Leu
435 440 445
Val Tyr Gln Glu Ser Val Thr Thr Gly Glu Ile Val Lys Lys Met Glu
450 455 460
Ser Pro Met Glu Lys Leu Gln Val Phe Asn Asp Val Leu Ile Asp Arg
465 470 475 480
Arg Gly Glu Gly Asn Lys His Leu Thr Val Tyr Ile Gly Gly Ser Val
485 490 495
Gly Asp Leu Leu Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Val Gly
500 505 510
Ser Ser Ser Ser Leu Arg Arg Leu Gly Asp His Phe Gly Val Ser Phe
515 520 525
Val Pro Leu Phe Ser Gly Leu Val Lys Arg Gln Lys Glu Leu Ala Asp
530 535 540
Gln Asp Cys Ala Ser Asn Trp Trp Lys Pro Leu Ser Gly Val Leu Tyr
545 550 555 560
Thr Val Ser Ser Trp Ala Glu Ile Gln Ala Phe Ile Leu Gly Thr
565 570 575
<210> 144
<211> 1944
<212> DNA
<213> Phoenix dactylifera
<220>
<221> CDS
<222> (1)..(1944)
<223> Phoenix_dactylifera gene encoding TMP phosphatase [XP_008796407]
<400> 144
atg cga ttc ctc tcc cct ctt ctc ccc ctc cgc cga aac cca aac cct 48
Met Arg Phe Leu Ser Pro Leu Leu Pro Leu Arg Arg Asn Pro Asn Pro
1 5 10 15
agc cct agg ttc ttc tcg ctc tcc cct ccc ata tcc ctc gcc tcc gcc 96
Ser Pro Arg Phe Phe Ser Leu Ser Pro Pro Ile Ser Leu Ala Ser Ala
20 25 30
tgc ccc cga ttc ggt ttc ttg aat cga gat cgc ccc cgg cgc cgc ctt 144
Cys Pro Arg Phe Gly Phe Leu Asn Arg Asp Arg Pro Arg Arg Arg Leu
35 40 45
cca aag ggg ttc cga tcg atc gcc gcg gcg aat cag cgg gcg tcg cct 192
Pro Lys Gly Phe Arg Ser Ile Ala Ala Ala Asn Gln Arg Ala Ser Pro
50 55 60
cca aga ttg gtg ccg gag agg gcg gcc gcc acg agt tct tgg cct tct 240
Pro Arg Leu Val Pro Glu Arg Ala Ala Ala Thr Ser Ser Trp Pro Ser
65 70 75 80
tca gcc gga cga gcc atg gca gtg gtg gcg acg gcg gtt gaa gaa ggc 288
Ser Ala Gly Arg Ala Met Ala Val Val Ala Thr Ala Val Glu Glu Gly
85 90 95
tcc gcg gcg aag cgg ttc tgg atc agg tcc cgg aag gag gcg gtg ttc 336
Ser Ala Ala Lys Arg Phe Trp Ile Arg Ser Arg Lys Glu Ala Val Phe
100 105 110
gcg gag tac acc ccg ttc gtg gtg tgc ctg gcg gcg ggg aga ctg gag 384
Ala Glu Tyr Thr Pro Phe Val Val Cys Leu Ala Ala Gly Arg Leu Glu
115 120 125
atg gag gcc ttc cgc gac tac att gct cag gac gtg cac ttc ctc aat 432
Met Glu Ala Phe Arg Asp Tyr Ile Ala Gln Asp Val His Phe Leu Asn
130 135 140
act ttt gcc caa gcg tat gag atg gcg gaa gag tgt gct gat gat gat 480
Thr Phe Ala Gln Ala Tyr Glu Met Ala Glu Glu Cys Ala Asp Asp Asp
145 150 155 160
gat gcg aag gct gca ata act gat ctg agg aaa gct gtt ttg gag gaa 528
Asp Ala Lys Ala Ala Ile Thr Asp Leu Arg Lys Ala Val Leu Glu Glu
165 170 175
ctg aaa atg cat agt tca ttt gtc caa gaa tgg gga ata gac ccc act 576
Leu Lys Met His Ser Ser Phe Val Gln Glu Trp Gly Ile Asp Pro Thr
180 185 190
aaa gaa atc att cct ttc cct gca aca gta aag tac acc gac ttc ctg 624
Lys Glu Ile Ile Pro Phe Pro Ala Thr Val Lys Tyr Thr Asp Phe Leu
195 200 205
ctt gct aca gct gca gga aaa gtt gaa gga ggg aaa gat cct ggg aaa 672
Leu Ala Thr Ala Ala Gly Lys Val Glu Gly Gly Lys Asp Pro Gly Lys
210 215 220
att gtc act cct ttt gag aag aca aaa att gct gct tat act gta ggt 720
Ile Val Thr Pro Phe Glu Lys Thr Lys Ile Ala Ala Tyr Thr Val Gly
225 230 235 240
gcc atg gct cct tgc atg agg ctt tat gca ttc ttg gga aaa gag ctc 768
Ala Met Ala Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Lys Glu Leu
245 250 255
cag acg tgt ctg caa ctt gac gaa aat tgt cat ccc tac aaa aag tgg 816
Gln Thr Cys Leu Gln Leu Asp Glu Asn Cys His Pro Tyr Lys Lys Trp
260 265 270
att gat aat tat tcc tct gaa agt ttt gag aca gct gct gtg caa ata 864
Ile Asp Asn Tyr Ser Ser Glu Ser Phe Glu Thr Ala Ala Val Gln Ile
275 280 285
gaa gaa ttg ctt gac aaa ttg agt gtt tca ttg act ggg gag gag ctt 912
Glu Glu Leu Leu Asp Lys Leu Ser Val Ser Leu Thr Gly Glu Glu Leu
290 295 300
gaa gac ata gaa aag ctt tac cgc caa gct atg aaa ctt gaa att gaa 960
Glu Asp Ile Glu Lys Leu Tyr Arg Gln Ala Met Lys Leu Glu Ile Glu
305 310 315 320
ttt ttt ctt gct cag cca att gtc cga cca gct gta gtt cct ttg aca 1008
Phe Phe Leu Ala Gln Pro Ile Val Arg Pro Ala Val Val Pro Leu Thr
325 330 335
aga ctg cat gat ccg gca aat tgc ctt gtc att ttt tct gat ttt gac 1056
Arg Leu His Asp Pro Ala Asn Cys Leu Val Ile Phe Ser Asp Phe Asp
340 345 350
ttg aca tgc agt gta gtt gat tcc tct gcc att tta gca gag att gca 1104
Leu Thr Cys Ser Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala
355 360 365
ata tta agt gca cca aag act gat aag act ggg act gat aat tta gat 1152
Ile Leu Ser Ala Pro Lys Thr Asp Lys Thr Gly Thr Asp Asn Leu Asp
370 375 380
gct cga agg tct tct tca gaa atg aga aac tca tgg gat gct ctt tct 1200
Ala Arg Arg Ser Ser Ser Glu Met Arg Asn Ser Trp Asp Ala Leu Ser
385 390 395 400
aaa cag tat aca gaa gag tat gag cag tgc ata gaa agc tta ctt cca 1248
Lys Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile Glu Ser Leu Leu Pro
405 410 415
tta gaa gaa gct aaa aca ttt gat tat gaa ggc ctt tgc aaa agt ttg 1296
Leu Glu Glu Ala Lys Thr Phe Asp Tyr Glu Gly Leu Cys Lys Ser Leu
420 425 430
ggc cag ctc tct gag ttt gag aaa cga gca aat tcc agg gtt att gag 1344
Gly Gln Leu Ser Glu Phe Glu Lys Arg Ala Asn Ser Arg Val Ile Glu
435 440 445
tct ggg gtg cta aag gga atg aat cta gat gac ata aaa aga gct ggg 1392
Ser Gly Val Leu Lys Gly Met Asn Leu Asp Asp Ile Lys Arg Ala Gly
450 455 460
gaa cgt ttg atc ctc caa gat ggt tgt ata gat ttt ttt cag aag gtt 1440
Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile Asp Phe Phe Gln Lys Val
465 470 475 480
gta aag gaa aag gaa aat cta aat tta gat ctc cat gta ctt tct tat 1488
Val Lys Glu Lys Glu Asn Leu Asn Leu Asp Leu His Val Leu Ser Tyr
485 490 495
tgt tgg tgt gcg gat cta ata agg tca gct ttt tca tca gta ggt tgc 1536
Cys Trp Cys Ala Asp Leu Ile Arg Ser Ala Phe Ser Ser Val Gly Cys
500 505 510
cta aat gat ttg aac ata cac tca aat gag ttc aac tat caa gaa tct 1584
Leu Asn Asp Leu Asn Ile His Ser Asn Glu Phe Asn Tyr Gln Glu Ser
515 520 525
att tca acg ggt gaa att gtt agg aag atg gaa tca ccc atg gac aag 1632
Ile Ser Thr Gly Glu Ile Val Arg Lys Met Glu Ser Pro Met Asp Lys
530 535 540
gtt gaa gca ttc aaa agt atc tta agc aac ctt gga agc aat gag aag 1680
Val Glu Ala Phe Lys Ser Ile Leu Ser Asn Leu Gly Ser Asn Glu Lys
545 550 555 560
cgc tta tct gtg tac att gga gat tcg gtt ggt gac ttg ctt tgc ctg 1728
Arg Leu Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu
565 570 575
ttg gaa gca gat gtt ggt att gtg att gga tca agc act agc tta agg 1776
Leu Glu Ala Asp Val Gly Ile Val Ile Gly Ser Ser Thr Ser Leu Arg
580 585 590
aga atc ggg aag cag ttt ggt gtt tct ttc att cca ctc ttc cgt ggt 1824
Arg Ile Gly Lys Gln Phe Gly Val Ser Phe Ile Pro Leu Phe Arg Gly
595 600 605
ttg gta aac aag caa aga caa ctt aat gaa aaa gac tca tct atc tgg 1872
Leu Val Asn Lys Gln Arg Gln Leu Asn Glu Lys Asp Ser Ser Ile Trp
610 615 620
aag ggg ttg tct ggt gtt ctt tat aca gca tca agc tgg tca gaa ata 1920
Lys Gly Leu Ser Gly Val Leu Tyr Thr Ala Ser Ser Trp Ser Glu Ile
625 630 635 640
caa gct ttt att ttg ggg gca taa 1944
Gln Ala Phe Ile Leu Gly Ala
645
<210> 145
<211> 647
<212> PRT
<213> Phoenix dactylifera
<400> 145
Met Arg Phe Leu Ser Pro Leu Leu Pro Leu Arg Arg Asn Pro Asn Pro
1 5 10 15
Ser Pro Arg Phe Phe Ser Leu Ser Pro Pro Ile Ser Leu Ala Ser Ala
20 25 30
Cys Pro Arg Phe Gly Phe Leu Asn Arg Asp Arg Pro Arg Arg Arg Leu
35 40 45
Pro Lys Gly Phe Arg Ser Ile Ala Ala Ala Asn Gln Arg Ala Ser Pro
50 55 60
Pro Arg Leu Val Pro Glu Arg Ala Ala Ala Thr Ser Ser Trp Pro Ser
65 70 75 80
Ser Ala Gly Arg Ala Met Ala Val Val Ala Thr Ala Val Glu Glu Gly
85 90 95
Ser Ala Ala Lys Arg Phe Trp Ile Arg Ser Arg Lys Glu Ala Val Phe
100 105 110
Ala Glu Tyr Thr Pro Phe Val Val Cys Leu Ala Ala Gly Arg Leu Glu
115 120 125
Met Glu Ala Phe Arg Asp Tyr Ile Ala Gln Asp Val His Phe Leu Asn
130 135 140
Thr Phe Ala Gln Ala Tyr Glu Met Ala Glu Glu Cys Ala Asp Asp Asp
145 150 155 160
Asp Ala Lys Ala Ala Ile Thr Asp Leu Arg Lys Ala Val Leu Glu Glu
165 170 175
Leu Lys Met His Ser Ser Phe Val Gln Glu Trp Gly Ile Asp Pro Thr
180 185 190
Lys Glu Ile Ile Pro Phe Pro Ala Thr Val Lys Tyr Thr Asp Phe Leu
195 200 205
Leu Ala Thr Ala Ala Gly Lys Val Glu Gly Gly Lys Asp Pro Gly Lys
210 215 220
Ile Val Thr Pro Phe Glu Lys Thr Lys Ile Ala Ala Tyr Thr Val Gly
225 230 235 240
Ala Met Ala Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Lys Glu Leu
245 250 255
Gln Thr Cys Leu Gln Leu Asp Glu Asn Cys His Pro Tyr Lys Lys Trp
260 265 270
Ile Asp Asn Tyr Ser Ser Glu Ser Phe Glu Thr Ala Ala Val Gln Ile
275 280 285
Glu Glu Leu Leu Asp Lys Leu Ser Val Ser Leu Thr Gly Glu Glu Leu
290 295 300
Glu Asp Ile Glu Lys Leu Tyr Arg Gln Ala Met Lys Leu Glu Ile Glu
305 310 315 320
Phe Phe Leu Ala Gln Pro Ile Val Arg Pro Ala Val Val Pro Leu Thr
325 330 335
Arg Leu His Asp Pro Ala Asn Cys Leu Val Ile Phe Ser Asp Phe Asp
340 345 350
Leu Thr Cys Ser Val Val Asp Ser Ser Ala Ile Leu Ala Glu Ile Ala
355 360 365
Ile Leu Ser Ala Pro Lys Thr Asp Lys Thr Gly Thr Asp Asn Leu Asp
370 375 380
Ala Arg Arg Ser Ser Ser Glu Met Arg Asn Ser Trp Asp Ala Leu Ser
385 390 395 400
Lys Gln Tyr Thr Glu Glu Tyr Glu Gln Cys Ile Glu Ser Leu Leu Pro
405 410 415
Leu Glu Glu Ala Lys Thr Phe Asp Tyr Glu Gly Leu Cys Lys Ser Leu
420 425 430
Gly Gln Leu Ser Glu Phe Glu Lys Arg Ala Asn Ser Arg Val Ile Glu
435 440 445
Ser Gly Val Leu Lys Gly Met Asn Leu Asp Asp Ile Lys Arg Ala Gly
450 455 460
Glu Arg Leu Ile Leu Gln Asp Gly Cys Ile Asp Phe Phe Gln Lys Val
465 470 475 480
Val Lys Glu Lys Glu Asn Leu Asn Leu Asp Leu His Val Leu Ser Tyr
485 490 495
Cys Trp Cys Ala Asp Leu Ile Arg Ser Ala Phe Ser Ser Val Gly Cys
500 505 510
Leu Asn Asp Leu Asn Ile His Ser Asn Glu Phe Asn Tyr Gln Glu Ser
515 520 525
Ile Ser Thr Gly Glu Ile Val Arg Lys Met Glu Ser Pro Met Asp Lys
530 535 540
Val Glu Ala Phe Lys Ser Ile Leu Ser Asn Leu Gly Ser Asn Glu Lys
545 550 555 560
Arg Leu Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys Leu
565 570 575
Leu Glu Ala Asp Val Gly Ile Val Ile Gly Ser Ser Thr Ser Leu Arg
580 585 590
Arg Ile Gly Lys Gln Phe Gly Val Ser Phe Ile Pro Leu Phe Arg Gly
595 600 605
Leu Val Asn Lys Gln Arg Gln Leu Asn Glu Lys Asp Ser Ser Ile Trp
610 615 620
Lys Gly Leu Ser Gly Val Leu Tyr Thr Ala Ser Ser Trp Ser Glu Ile
625 630 635 640
Gln Ala Phe Ile Leu Gly Ala
645
<210> 146
<211> 1860
<212> DNA
<213> Zea mays
<220>
<221> CDS
<222> (1)..(1860)
<223> Zea mays gene encoding TMP phosphatase [XP_008678418.1]
<400> 146
atg ctt gtt ctc cgc cgt ctc cgc ctc cgc ctc cca ctg cca cgc cct 48
Met Leu Val Leu Arg Arg Leu Arg Leu Arg Leu Pro Leu Pro Arg Pro
1 5 10 15
ctt ctc gtc tcc tcc ttc tcc tcc acc tcc ccc tcc tcc tca ccc tcg 96
Leu Leu Val Ser Ser Phe Ser Ser Thr Ser Pro Ser Ser Ser Pro Ser
20 25 30
acc tct agc tcc tcc tcc tgt tgg tcg tcg aca ggc gaa agt aga agg 144
Thr Ser Ser Ser Ser Ser Cys Trp Ser Ser Thr Gly Glu Ser Arg Arg
35 40 45
gcc atg gcg tca tct cct tct ccc gat tcg gcc gcg gtc gtt gcc gag 192
Ala Met Ala Ser Ser Pro Ser Pro Asp Ser Ala Ala Val Val Ala Glu
50 55 60
ggc tcc gcg gct cgc cgc ttc tgg atc gct gcc tcc acg cgc gag gcc 240
Gly Ser Ala Ala Arg Arg Phe Trp Ile Ala Ala Ser Thr Arg Glu Ala
65 70 75 80
gcc ttc gcc gca tac acg ccc ttc ctc ctc tcc ctc gcc gcc ggc aat 288
Ala Phe Ala Ala Tyr Thr Pro Phe Leu Leu Ser Leu Ala Ala Gly Asn
85 90 95
ctg cgg ctc aac gtg ttt cgc cac tac atc gcg cag gac gcg cac ttc 336
Leu Arg Leu Asn Val Phe Arg His Tyr Ile Ala Gln Asp Ala His Phe
100 105 110
ctt cac gcc ttc gct cgc gcg tac gaa atg gcc gag gac tgc gct gat 384
Leu His Ala Phe Ala Arg Ala Tyr Glu Met Ala Glu Asp Cys Ala Asp
115 120 125
gat gac gac gac atg gcc acc ata gcc gcc ctc agg aag gcc atc ctc 432
Asp Asp Asp Asp Met Ala Thr Ile Ala Ala Leu Arg Lys Ala Ile Leu
130 135 140
caa gag ctc aac ctc cac tcc tcc gtt ctg aag gag tgg gga gtt gat 480
Gln Glu Leu Asn Leu His Ser Ser Val Leu Lys Glu Trp Gly Val Asp
145 150 155 160
cct acc aaa gag ata cct cca agt gca gct aca acc aaa tat act gat 528
Pro Thr Lys Glu Ile Pro Pro Ser Ala Ala Thr Thr Lys Tyr Thr Asp
165 170 175
ttc cta ctt gca act gcg gct gga aaa gtt gat ggc aca aaa ggt tct 576
Phe Leu Leu Ala Thr Ala Ala Gly Lys Val Asp Gly Thr Lys Gly Ser
180 185 190
gac aaa atg gtt act cca ttt gag aag act aaa att gct gca tac act 624
Asp Lys Met Val Thr Pro Phe Glu Lys Thr Lys Ile Ala Ala Tyr Thr
195 200 205
gtt ggg gcc atg act cca tgc atg agg ctt tat gca tat cta ggc aaa 672
Val Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ala Tyr Leu Gly Lys
210 215 220
gaa ctc atg gtt ttc ctt aaa caa gat gaa aat cat cca tac aag aaa 720
Glu Leu Met Val Phe Leu Lys Gln Asp Glu Asn His Pro Tyr Lys Lys
225 230 235 240
tgg att aac aca tat gca tcc agt gat ttt gag gac acc aca ctc caa 768
Trp Ile Asn Thr Tyr Ala Ser Ser Asp Phe Glu Asp Thr Thr Leu Gln
245 250 255
ata gaa gaa ttg cta gac aaa cta agt gtc tca tta act ggt gag gaa 816
Ile Glu Glu Leu Leu Asp Lys Leu Ser Val Ser Leu Thr Gly Glu Glu
260 265 270
ctt gag att att ggc aag ctc tac cag caa gct atg aaa ctg gaa gtg 864
Leu Glu Ile Ile Gly Lys Leu Tyr Gln Gln Ala Met Lys Leu Glu Val
275 280 285
gag ttc ttt tct tct cag ctt ata gac caa cct gtt gta gct cca ctt 912
Glu Phe Phe Ser Ser Gln Leu Ile Asp Gln Pro Val Val Ala Pro Leu
290 295 300
tca aga tac tgt gat cca aaa tat aaa ctc ttg atc ttt tct gat ttt 960
Ser Arg Tyr Cys Asp Pro Lys Tyr Lys Leu Leu Ile Phe Ser Asp Phe
305 310 315 320
gat ttg acg tgc act att gtt gat tca tct gcc att ttg gcg gag att 1008
Asp Leu Thr Cys Thr Ile Val Asp Ser Ser Ala Ile Leu Ala Glu Ile
325 330 335
gca att ttg tca ttc caa aag gca aat caa agt ggg att gat aat aac 1056
Ala Ile Leu Ser Phe Gln Lys Ala Asn Gln Ser Gly Ile Asp Asn Asn
340 345 350
ctc gac cgt gca aaa tcg gga gac ctg aga agt tcg tgg aac atg ctc 1104
Leu Asp Arg Ala Lys Ser Gly Asp Leu Arg Ser Ser Trp Asn Met Leu
355 360 365
tct aag caa tac atg gaa gag tat gag aaa tgc atg gaa aga cta ctt 1152
Ser Lys Gln Tyr Met Glu Glu Tyr Glu Lys Cys Met Glu Arg Leu Leu
370 375 380
cct cca gaa gaa tcg aag tca cta gat tat gat aaa ctg tat aaa ggc 1200
Pro Pro Glu Glu Ser Lys Ser Leu Asp Tyr Asp Lys Leu Tyr Lys Gly
385 390 395 400
ctg gag gtg cta gct gag ttt gag aag ctt gca aat tct agg gtt gtc 1248
Leu Glu Val Leu Ala Glu Phe Glu Lys Leu Ala Asn Ser Arg Val Val
405 410 415
gac tct ggt gtg ctg agg gga atg aat ttg gaa gac atc agg aaa gct 1296
Asp Ser Gly Val Leu Arg Gly Met Asn Leu Glu Asp Ile Arg Lys Ala
420 425 430
ggt gag cgt ctt att ctt caa ggt ggc tgt aaa aat ttc ttt cag aag 1344
Gly Glu Arg Leu Ile Leu Gln Gly Gly Cys Lys Asn Phe Phe Gln Lys
435 440 445
att gta aaa aca agg gag aac ctc aat ttg gat gtc cat att ctt tcc 1392
Ile Val Lys Thr Arg Glu Asn Leu Asn Leu Asp Val His Ile Leu Ser
450 455 460
tat tgc tgg tgt gca gaa ctt ata aga tca gcc ttc tca tca gcc ggt 1440
Tyr Cys Trp Cys Ala Glu Leu Ile Arg Ser Ala Phe Ser Ser Ala Gly
465 470 475 480
tgt cta gat ggt ttg aac ata cat tca aat gag ttt gcc ttt gag gat 1488
Cys Leu Asp Gly Leu Asn Ile His Ser Asn Glu Phe Ala Phe Glu Asp
485 490 495
tct gtt tca act ggt gag atc gac aga aag atg cag tct ccg cta gac 1536
Ser Val Ser Thr Gly Glu Ile Asp Arg Lys Met Gln Ser Pro Leu Asp
500 505 510
aaa gtt gaa aag ttc aag agc atc aga agt gac gtg gac agt aca gtg 1584
Lys Val Glu Lys Phe Lys Ser Ile Arg Ser Asp Val Asp Ser Thr Val
515 520 525
cca ttc cta tct gtt tat att gga gac tcg gtt gga gat ttg ctc tgc 1632
Pro Phe Leu Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys
530 535 540
tta ttg gag gct gat att ggt ata gtc att ggg tca acc aca agt ttg 1680
Leu Leu Glu Ala Asp Ile Gly Ile Val Ile Gly Ser Thr Thr Ser Leu
545 550 555 560
cgt agg gtg ggc aaa cag ttt ggt gtt tct ttt gtc cca ttg ttc cct 1728
Arg Arg Val Gly Lys Gln Phe Gly Val Ser Phe Val Pro Leu Phe Pro
565 570 575
ggt cta gta gag aag cag agg caa ctg gcg gag gaa gat gca tcc gta 1776
Gly Leu Val Glu Lys Gln Arg Gln Leu Ala Glu Glu Asp Ala Ser Val
580 585 590
ttc aag gca cgg tct gga gtc ctc tat acg gtt tct agc tgg tca gaa 1824
Phe Lys Ala Arg Ser Gly Val Leu Tyr Thr Val Ser Ser Trp Ser Glu
595 600 605
ata cac gcc ttc gta ctg gga agt gat ttc agc tga 1860
Ile His Ala Phe Val Leu Gly Ser Asp Phe Ser
610 615
<210> 147
<211> 619
<212> PRT
<213> Zea mays
<400> 147
Met Leu Val Leu Arg Arg Leu Arg Leu Arg Leu Pro Leu Pro Arg Pro
1 5 10 15
Leu Leu Val Ser Ser Phe Ser Ser Thr Ser Pro Ser Ser Ser Pro Ser
20 25 30
Thr Ser Ser Ser Ser Ser Cys Trp Ser Ser Thr Gly Glu Ser Arg Arg
35 40 45
Ala Met Ala Ser Ser Pro Ser Pro Asp Ser Ala Ala Val Val Ala Glu
50 55 60
Gly Ser Ala Ala Arg Arg Phe Trp Ile Ala Ala Ser Thr Arg Glu Ala
65 70 75 80
Ala Phe Ala Ala Tyr Thr Pro Phe Leu Leu Ser Leu Ala Ala Gly Asn
85 90 95
Leu Arg Leu Asn Val Phe Arg His Tyr Ile Ala Gln Asp Ala His Phe
100 105 110
Leu His Ala Phe Ala Arg Ala Tyr Glu Met Ala Glu Asp Cys Ala Asp
115 120 125
Asp Asp Asp Asp Met Ala Thr Ile Ala Ala Leu Arg Lys Ala Ile Leu
130 135 140
Gln Glu Leu Asn Leu His Ser Ser Val Leu Lys Glu Trp Gly Val Asp
145 150 155 160
Pro Thr Lys Glu Ile Pro Pro Ser Ala Ala Thr Thr Lys Tyr Thr Asp
165 170 175
Phe Leu Leu Ala Thr Ala Ala Gly Lys Val Asp Gly Thr Lys Gly Ser
180 185 190
Asp Lys Met Val Thr Pro Phe Glu Lys Thr Lys Ile Ala Ala Tyr Thr
195 200 205
Val Gly Ala Met Thr Pro Cys Met Arg Leu Tyr Ala Tyr Leu Gly Lys
210 215 220
Glu Leu Met Val Phe Leu Lys Gln Asp Glu Asn His Pro Tyr Lys Lys
225 230 235 240
Trp Ile Asn Thr Tyr Ala Ser Ser Asp Phe Glu Asp Thr Thr Leu Gln
245 250 255
Ile Glu Glu Leu Leu Asp Lys Leu Ser Val Ser Leu Thr Gly Glu Glu
260 265 270
Leu Glu Ile Ile Gly Lys Leu Tyr Gln Gln Ala Met Lys Leu Glu Val
275 280 285
Glu Phe Phe Ser Ser Gln Leu Ile Asp Gln Pro Val Val Ala Pro Leu
290 295 300
Ser Arg Tyr Cys Asp Pro Lys Tyr Lys Leu Leu Ile Phe Ser Asp Phe
305 310 315 320
Asp Leu Thr Cys Thr Ile Val Asp Ser Ser Ala Ile Leu Ala Glu Ile
325 330 335
Ala Ile Leu Ser Phe Gln Lys Ala Asn Gln Ser Gly Ile Asp Asn Asn
340 345 350
Leu Asp Arg Ala Lys Ser Gly Asp Leu Arg Ser Ser Trp Asn Met Leu
355 360 365
Ser Lys Gln Tyr Met Glu Glu Tyr Glu Lys Cys Met Glu Arg Leu Leu
370 375 380
Pro Pro Glu Glu Ser Lys Ser Leu Asp Tyr Asp Lys Leu Tyr Lys Gly
385 390 395 400
Leu Glu Val Leu Ala Glu Phe Glu Lys Leu Ala Asn Ser Arg Val Val
405 410 415
Asp Ser Gly Val Leu Arg Gly Met Asn Leu Glu Asp Ile Arg Lys Ala
420 425 430
Gly Glu Arg Leu Ile Leu Gln Gly Gly Cys Lys Asn Phe Phe Gln Lys
435 440 445
Ile Val Lys Thr Arg Glu Asn Leu Asn Leu Asp Val His Ile Leu Ser
450 455 460
Tyr Cys Trp Cys Ala Glu Leu Ile Arg Ser Ala Phe Ser Ser Ala Gly
465 470 475 480
Cys Leu Asp Gly Leu Asn Ile His Ser Asn Glu Phe Ala Phe Glu Asp
485 490 495
Ser Val Ser Thr Gly Glu Ile Asp Arg Lys Met Gln Ser Pro Leu Asp
500 505 510
Lys Val Glu Lys Phe Lys Ser Ile Arg Ser Asp Val Asp Ser Thr Val
515 520 525
Pro Phe Leu Ser Val Tyr Ile Gly Asp Ser Val Gly Asp Leu Leu Cys
530 535 540
Leu Leu Glu Ala Asp Ile Gly Ile Val Ile Gly Ser Thr Thr Ser Leu
545 550 555 560
Arg Arg Val Gly Lys Gln Phe Gly Val Ser Phe Val Pro Leu Phe Pro
565 570 575
Gly Leu Val Glu Lys Gln Arg Gln Leu Ala Glu Glu Asp Ala Ser Val
580 585 590
Phe Lys Ala Arg Ser Gly Val Leu Tyr Thr Val Ser Ser Trp Ser Glu
595 600 605
Ile His Ala Phe Val Leu Gly Ser Asp Phe Ser
610 615
<210> 148
<211> 1830
<212> DNA
<213> Oryza sativa
<220>
<221> CDS
<222> (1)..(1830)
<223> Oryza sativa gene encoding TMP phosphatase [NP_001062539.1]
<400> 148
atg cgc ggc ctc ctc cgc cgt gtc tac ctc cgc ctc ccc cct ttc cct 48
Met Arg Gly Leu Leu Arg Arg Val Tyr Leu Arg Leu Pro Pro Phe Pro
1 5 10 15
cct gcc acc tct ctt tat tat tgg tca aga aca aga cct gca gct gca 96
Pro Ala Thr Ser Leu Tyr Tyr Trp Ser Arg Thr Arg Pro Ala Ala Ala
20 25 30
ggg ccc aac cac ccc atc cct agg cgc atg tcg acg tcc tct act gcc 144
Gly Pro Asn His Pro Ile Pro Arg Arg Met Ser Thr Ser Ser Thr Ala
35 40 45
gcg gcg gtc gtt gcc gag ggc tcc gcc gct cgc cgc ttc tgg atc gcc 192
Ala Ala Val Val Ala Glu Gly Ser Ala Ala Arg Arg Phe Trp Ile Ala
50 55 60
gcc gcc tcg agg gag gcc gcc ttc gcc gcc tac acg ccc ttc ctc gtc 240
Ala Ala Ser Arg Glu Ala Ala Phe Ala Ala Tyr Thr Pro Phe Leu Val
65 70 75 80
tcc ctc gcc gcc ggg gcc ctc cgc ctg gat tcc ttc cgc caa tac atc 288
Ser Leu Ala Ala Gly Ala Leu Arg Leu Asp Ser Phe Arg Gln Tyr Ile
85 90 95
gcc cag gat gcc tac ttc ctc cac gcc ttc gcc cgc gcc tat gag atg 336
Ala Gln Asp Ala Tyr Phe Leu His Ala Phe Ala Arg Ala Tyr Glu Met
100 105 110
gcc gag gag tgc gcc gat gac gac gac gac aag gcc acc atc gtc gtc 384
Ala Glu Glu Cys Ala Asp Asp Asp Asp Asp Lys Ala Thr Ile Val Val
115 120 125
ctc agg aag gcc atc ctc cgc gag ctc aac ctc cac gct tcc gtc ctt 432
Leu Arg Lys Ala Ile Leu Arg Glu Leu Asn Leu His Ala Ser Val Leu
130 135 140
cag gaa tgg gga gtc gat ccc aac aaa gaa atc cct cca atc cca gcc 480
Gln Glu Trp Gly Val Asp Pro Asn Lys Glu Ile Pro Pro Ile Pro Ala
145 150 155 160
aca act aag tac act gat ttc tta ctt gca act tcc act gga aag gtt 528
Thr Thr Lys Tyr Thr Asp Phe Leu Leu Ala Thr Ser Thr Gly Lys Val
165 170 175
gat ggt ggg aaa ggt tct gat aaa atg gtc aca cca ttc gag aag acg 576
Asp Gly Gly Lys Gly Ser Asp Lys Met Val Thr Pro Phe Glu Lys Thr
180 185 190
aaa att gct gca tac act gtt ggg gct atg acc cca tgc atg agg ctt 624
Lys Ile Ala Ala Tyr Thr Val Gly Ala Met Thr Pro Cys Met Arg Leu
195 200 205
tat gcg tat ctg ggc aaa gaa ctt gca gtt ttc ttg aaa cag gat gaa 672
Tyr Ala Tyr Leu Gly Lys Glu Leu Ala Val Phe Leu Lys Gln Asp Glu
210 215 220
aat cac cca tac aag aaa tgg att gag act tat gca tcc agt gat ttt 720
Asn His Pro Tyr Lys Lys Trp Ile Glu Thr Tyr Ala Ser Ser Asp Phe
225 230 235 240
gag aat aac gca ctc caa ata gaa gag ttg ctt gat aaa cta agt gtc 768
Glu Asn Asn Ala Leu Gln Ile Glu Glu Leu Leu Asp Lys Leu Ser Val
245 250 255
tct cta act ggc gag gag ctt gag att att ggg aag ctc tac cag caa 816
Ser Leu Thr Gly Glu Glu Leu Glu Ile Ile Gly Lys Leu Tyr Gln Gln
260 265 270
gct atg agg ctg gaa gtt gag ttc ttc tct gct cag cca gta gac caa 864
Ala Met Arg Leu Glu Val Glu Phe Phe Ser Ala Gln Pro Val Asp Gln
275 280 285
cct gtt gta gct cca ctc tca aga tat tgt ggt ccg aaa gat aag ctc 912
Pro Val Val Ala Pro Leu Ser Arg Tyr Cys Gly Pro Lys Asp Lys Leu
290 295 300
ttg ata ttt tgt gat ttt gat ttg aca tgc act gtt gtt gat tca tct 960
Leu Ile Phe Cys Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser
305 310 315 320
gcc att ttg gcg gag att gca atc ttg tca cac caa aag gct agt caa 1008
Ala Ile Leu Ala Glu Ile Ala Ile Leu Ser His Gln Lys Ala Ser Gln
325 330 335
ggt ggg gct gat agt tcc ctt gat cgt aca aaa tca gcg gac ttg aga 1056
Gly Gly Ala Asp Ser Ser Leu Asp Arg Thr Lys Ser Ala Asp Leu Arg
340 345 350
aat tca tgg aac atg ctc tca aat caa tac atg gaa gag tat gag caa 1104
Asn Ser Trp Asn Met Leu Ser Asn Gln Tyr Met Glu Glu Tyr Glu Gln
355 360 365
tgc ata gca agc ttg ctt cct cca gaa gaa gca agg tca cta gac tat 1152
Cys Ile Ala Ser Leu Leu Pro Pro Glu Glu Ala Arg Ser Leu Asp Tyr
370 375 380
gat caa ctg tat aaa ggt ttg gag gtg cta tcg cag ttt gag aaa ctt 1200
Asp Gln Leu Tyr Lys Gly Leu Glu Val Leu Ser Gln Phe Glu Lys Leu
385 390 395 400
gca aac tct agg gtg gtt gat tct ggt gtc ctg agg gga atg aat tta 1248
Ala Asn Ser Arg Val Val Asp Ser Gly Val Leu Arg Gly Met Asn Leu
405 410 415
gat gac atc cga aaa gct gga gag agg ctt att ctg caa gat gga tgc 1296
Asp Asp Ile Arg Lys Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys
420 425 430
aaa att ttt ttt caa aag att ggc aaa aca agg gag aac ctc aat tta 1344
Lys Ile Phe Phe Gln Lys Ile Gly Lys Thr Arg Glu Asn Leu Asn Leu
435 440 445
gat gtc cat att ctt tcc tat tgc tgg tgc gca gat ctt ata agg tca 1392
Asp Val His Ile Leu Ser Tyr Cys Trp Cys Ala Asp Leu Ile Arg Ser
450 455 460
gct ttt tca tca gtt ggt tgt cta gac ggg ctg aac ata cat tca aat 1440
Ala Phe Ser Ser Val Gly Cys Leu Asp Gly Leu Asn Ile His Ser Asn
465 470 475 480
gag ttt gct ttt gag gga tct gtt tca act ggt cat att aac aga caa 1488
Glu Phe Ala Phe Glu Gly Ser Val Ser Thr Gly His Ile Asn Arg Gln
485 490 495
atg gag tct cct ctg gac aaa gct gaa aag ttc aag agc atc aaa agc 1536
Met Glu Ser Pro Leu Asp Lys Ala Glu Lys Phe Lys Ser Ile Lys Ser
500 505 510
gac gtg ggt agt aca ggg aca tta ttg tca gtc tat att ggg gac tcg 1584
Asp Val Gly Ser Thr Gly Thr Leu Leu Ser Val Tyr Ile Gly Asp Ser
515 520 525
gtt gga gat ttg ctt tgc ttg ttg gag gca gat att ggt att gtt gtt 1632
Val Gly Asp Leu Leu Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Val
530 535 540
gga tca agc aca acc ttg cgg aga gtg ggc aaa cag ttt ggt gtt tca 1680
Gly Ser Ser Thr Thr Leu Arg Arg Val Gly Lys Gln Phe Gly Val Ser
545 550 555 560
ttt gtt cct ctg ttc act ggg ttg gta gag aag cag agg cga ata gaa 1728
Phe Val Pro Leu Phe Thr Gly Leu Val Glu Lys Gln Arg Arg Ile Glu
565 570 575
aag gaa gaa tca tcc atc ttc aag gca cgg tct gga att ctt tat acg 1776
Lys Glu Glu Ser Ser Ile Phe Lys Ala Arg Ser Gly Ile Leu Tyr Thr
580 585 590
gtt tct agc tgg tcg gag gta cag gct ttc atc ctg gga aat gat ttc 1824
Val Ser Ser Trp Ser Glu Val Gln Ala Phe Ile Leu Gly Asn Asp Phe
595 600 605
agc tga 1830
Ser
<210> 149
<211> 609
<212> PRT
<213> Oryza sativa
<400> 149
Met Arg Gly Leu Leu Arg Arg Val Tyr Leu Arg Leu Pro Pro Phe Pro
1 5 10 15
Pro Ala Thr Ser Leu Tyr Tyr Trp Ser Arg Thr Arg Pro Ala Ala Ala
20 25 30
Gly Pro Asn His Pro Ile Pro Arg Arg Met Ser Thr Ser Ser Thr Ala
35 40 45
Ala Ala Val Val Ala Glu Gly Ser Ala Ala Arg Arg Phe Trp Ile Ala
50 55 60
Ala Ala Ser Arg Glu Ala Ala Phe Ala Ala Tyr Thr Pro Phe Leu Val
65 70 75 80
Ser Leu Ala Ala Gly Ala Leu Arg Leu Asp Ser Phe Arg Gln Tyr Ile
85 90 95
Ala Gln Asp Ala Tyr Phe Leu His Ala Phe Ala Arg Ala Tyr Glu Met
100 105 110
Ala Glu Glu Cys Ala Asp Asp Asp Asp Asp Lys Ala Thr Ile Val Val
115 120 125
Leu Arg Lys Ala Ile Leu Arg Glu Leu Asn Leu His Ala Ser Val Leu
130 135 140
Gln Glu Trp Gly Val Asp Pro Asn Lys Glu Ile Pro Pro Ile Pro Ala
145 150 155 160
Thr Thr Lys Tyr Thr Asp Phe Leu Leu Ala Thr Ser Thr Gly Lys Val
165 170 175
Asp Gly Gly Lys Gly Ser Asp Lys Met Val Thr Pro Phe Glu Lys Thr
180 185 190
Lys Ile Ala Ala Tyr Thr Val Gly Ala Met Thr Pro Cys Met Arg Leu
195 200 205
Tyr Ala Tyr Leu Gly Lys Glu Leu Ala Val Phe Leu Lys Gln Asp Glu
210 215 220
Asn His Pro Tyr Lys Lys Trp Ile Glu Thr Tyr Ala Ser Ser Asp Phe
225 230 235 240
Glu Asn Asn Ala Leu Gln Ile Glu Glu Leu Leu Asp Lys Leu Ser Val
245 250 255
Ser Leu Thr Gly Glu Glu Leu Glu Ile Ile Gly Lys Leu Tyr Gln Gln
260 265 270
Ala Met Arg Leu Glu Val Glu Phe Phe Ser Ala Gln Pro Val Asp Gln
275 280 285
Pro Val Val Ala Pro Leu Ser Arg Tyr Cys Gly Pro Lys Asp Lys Leu
290 295 300
Leu Ile Phe Cys Asp Phe Asp Leu Thr Cys Thr Val Val Asp Ser Ser
305 310 315 320
Ala Ile Leu Ala Glu Ile Ala Ile Leu Ser His Gln Lys Ala Ser Gln
325 330 335
Gly Gly Ala Asp Ser Ser Leu Asp Arg Thr Lys Ser Ala Asp Leu Arg
340 345 350
Asn Ser Trp Asn Met Leu Ser Asn Gln Tyr Met Glu Glu Tyr Glu Gln
355 360 365
Cys Ile Ala Ser Leu Leu Pro Pro Glu Glu Ala Arg Ser Leu Asp Tyr
370 375 380
Asp Gln Leu Tyr Lys Gly Leu Glu Val Leu Ser Gln Phe Glu Lys Leu
385 390 395 400
Ala Asn Ser Arg Val Val Asp Ser Gly Val Leu Arg Gly Met Asn Leu
405 410 415
Asp Asp Ile Arg Lys Ala Gly Glu Arg Leu Ile Leu Gln Asp Gly Cys
420 425 430
Lys Ile Phe Phe Gln Lys Ile Gly Lys Thr Arg Glu Asn Leu Asn Leu
435 440 445
Asp Val His Ile Leu Ser Tyr Cys Trp Cys Ala Asp Leu Ile Arg Ser
450 455 460
Ala Phe Ser Ser Val Gly Cys Leu Asp Gly Leu Asn Ile His Ser Asn
465 470 475 480
Glu Phe Ala Phe Glu Gly Ser Val Ser Thr Gly His Ile Asn Arg Gln
485 490 495
Met Glu Ser Pro Leu Asp Lys Ala Glu Lys Phe Lys Ser Ile Lys Ser
500 505 510
Asp Val Gly Ser Thr Gly Thr Leu Leu Ser Val Tyr Ile Gly Asp Ser
515 520 525
Val Gly Asp Leu Leu Cys Leu Leu Glu Ala Asp Ile Gly Ile Val Val
530 535 540
Gly Ser Ser Thr Thr Leu Arg Arg Val Gly Lys Gln Phe Gly Val Ser
545 550 555 560
Phe Val Pro Leu Phe Thr Gly Leu Val Glu Lys Gln Arg Arg Ile Glu
565 570 575
Lys Glu Glu Ser Ser Ile Phe Lys Ala Arg Ser Gly Ile Leu Tyr Thr
580 585 590
Val Ser Ser Trp Ser Glu Val Gln Ala Phe Ile Leu Gly Asn Asp Phe
595 600 605
Ser
<210> 150
<211> 1683
<212> DNA
<213> Picea sitchensis
<220>
<221> CDS
<222> (1)..(1683)
<223> Picea_sitchensis gene encoding TMP phosphatase [ABR16455]
<400> 150
atg ggg gtc gcc gat gaa gca gga gtc gcc aga agg cta tgg aca aag 48
Met Gly Val Ala Asp Glu Ala Gly Val Ala Arg Arg Leu Trp Thr Lys
1 5 10 15
ttc aag aaa gac acc gcg ctt gca cag tat aat tcc ttt gtt gtt gct 96
Phe Lys Lys Asp Thr Ala Leu Ala Gln Tyr Asn Ser Phe Val Val Ala
20 25 30
ttg gcg gcc ggg acg ctc aac atg acg tct ttt cag cag tac atg gcg 144
Leu Ala Ala Gly Thr Leu Asn Met Thr Ser Phe Gln Gln Tyr Met Ala
35 40 45
cag gat gct tat ttt ctc aaa gca ttt gct cag gcg tac aca atg gca 192
Gln Asp Ala Tyr Phe Leu Lys Ala Phe Ala Gln Ala Tyr Thr Met Ala
50 55 60
gag gat tgc gca gat gat gac gac gac aaa gca tcg atc cgt gaa cta 240
Glu Asp Cys Ala Asp Asp Asp Asp Asp Lys Ala Ser Ile Arg Glu Leu
65 70 75 80
cga aaa gcc gct gag gaa gag ctc aat ctg cac aat tcc ttg gct gag 288
Arg Lys Ala Ala Glu Glu Glu Leu Asn Leu His Asn Ser Leu Ala Glu
85 90 95
gac tgg gac gtt gaa ttt gca aaa gag tgc tct ccc aat atg gca aca 336
Asp Trp Asp Val Glu Phe Ala Lys Glu Cys Ser Pro Asn Met Ala Thr
100 105 110
gtc aag tac aca gaa ttt tta ttg gca aca gct gct ggc aag gtg gaa 384
Val Lys Tyr Thr Glu Phe Leu Leu Ala Thr Ala Ala Gly Lys Val Glu
115 120 125
gga ggg aag gga cca agc aga agt gtg act cct ttt gag aaa aca aaa 432
Gly Gly Lys Gly Pro Ser Arg Ser Val Thr Pro Phe Glu Lys Thr Lys
130 135 140
ata gca gca tac aca gtg ggt gcc atg acc ccg tgc atg agg ctt tat 480
Ile Ala Ala Tyr Thr Val Gly Ala Met Thr Pro Cys Met Arg Leu Tyr
145 150 155 160
gct ttc ttg ggc caa gaa att gtc aaa gcc ctg gaa cct gat tgc agt 528
Ala Phe Leu Gly Gln Glu Ile Val Lys Ala Leu Glu Pro Asp Cys Ser
165 170 175
aat cat cca tat aag cag tgg att gaa aca tac tct tct gca aag ttt 576
Asn His Pro Tyr Lys Gln Trp Ile Glu Thr Tyr Ser Ser Ala Lys Phe
180 185 190
gag gca tcg gca tta caa act gaa gag ttg ctt gac aaa ctg gct att 624
Glu Ala Ser Ala Leu Gln Thr Glu Glu Leu Leu Asp Lys Leu Ala Ile
195 200 205
tcg cta act ggg gaa gag ctt gaa gtg ctg cgg agg ttg tat tat cat 672
Ser Leu Thr Gly Glu Glu Leu Glu Val Leu Arg Arg Leu Tyr Tyr His
210 215 220
gcc tta aaa cta gaa ata gaa ttc ttt tcc gct cag cct ttc tct cag 720
Ala Leu Lys Leu Glu Ile Glu Phe Phe Ser Ala Gln Pro Phe Ser Gln
225 230 235 240
aga aca tta gtt ccg atg ttg aaa ctg ggt gat tca gcc agc cgc cga 768
Arg Thr Leu Val Pro Met Leu Lys Leu Gly Asp Ser Ala Ser Arg Arg
245 250 255
tat acc att gtc tca gat ttc gat ttg tct tgc act gtc ttg gat tct 816
Tyr Thr Ile Val Ser Asp Phe Asp Leu Ser Cys Thr Val Leu Asp Ser
260 265 270
tca gca gta tta gca gaa att gca ata ttg act act ctc aaa act gag 864
Ser Ala Val Leu Ala Glu Ile Ala Ile Leu Thr Thr Leu Lys Thr Glu
275 280 285
caa aat ggt gct gaa aac tta agt gat cac aag tca tca tcg gag ttg 912
Gln Asn Gly Ala Glu Asn Leu Ser Asp His Lys Ser Ser Ser Glu Leu
290 295 300
aga aaa act tgg gat gca ctt tct agt caa tat tct gaa gaa tgt gaa 960
Arg Lys Thr Trp Asp Ala Leu Ser Ser Gln Tyr Ser Glu Glu Cys Glu
305 310 315 320
gaa tgc tta agg aag act ctg cca cct gaa gaa gtg ggc tct ttt gat 1008
Glu Cys Leu Arg Lys Thr Leu Pro Pro Glu Glu Val Gly Ser Phe Asp
325 330 335
tat gaa ggc cta cac caa tct ctt gag cat ctg tct cag ttt gaa atg 1056
Tyr Glu Gly Leu His Gln Ser Leu Glu His Leu Ser Gln Phe Glu Met
340 345 350
gag gca aac tct aaa gtt gtc gag tca ggt gtc ctt gag ggc att aat 1104
Glu Ala Asn Ser Lys Val Val Glu Ser Gly Val Leu Glu Gly Ile Asn
355 360 365
ata gat gac att aaa aag gca gga gag cgt ctt gca ttt cag gat gga 1152
Ile Asp Asp Ile Lys Lys Ala Gly Glu Arg Leu Ala Phe Gln Asp Gly
370 375 380
tgc gca aac ttt ttt gaa caa atc cta acg aaa atg gac agc tta aat 1200
Cys Ala Asn Phe Phe Glu Gln Ile Leu Thr Lys Met Asp Ser Leu Asn
385 390 395 400
gtg gat gtg cac ata att tct gtt tgt tgg agt gga gat atc atc agg 1248
Val Asp Val His Ile Ile Ser Val Cys Trp Ser Gly Asp Ile Ile Arg
405 410 415
gct gct ttt tca tca agc ggt ttg gat ggt tta cag gtt cat tca aat 1296
Ala Ala Phe Ser Ser Ser Gly Leu Asp Gly Leu Gln Val His Ser Asn
420 425 430
gaa ctc acc ttt gtg gaa tca gtc tct act ggt ggt att gat agg cgt 1344
Glu Leu Thr Phe Val Glu Ser Val Ser Thr Gly Gly Ile Asp Arg Arg
435 440 445
gtt gag tcc cca gtt gac aag ttg aaa atc ttc aat aat att tgg agt 1392
Val Glu Ser Pro Val Asp Lys Leu Lys Ile Phe Asn Asn Ile Trp Ser
450 455 460
tct tca aag gac cag gac acg gaa cat atc tct ata tac att ggg gac 1440
Ser Ser Lys Asp Gln Asp Thr Glu His Ile Ser Ile Tyr Ile Gly Asp
465 470 475 480
ggt tta ggt gac ttg ctt tgt ctt ctt cag gca gat att gga ata gtg 1488
Gly Leu Gly Asp Leu Leu Cys Leu Leu Gln Ala Asp Ile Gly Ile Val
485 490 495
att ggt aca agc tca acg cta aga agg gtt gga aaa cgt ttt gga gta 1536
Ile Gly Thr Ser Ser Thr Leu Arg Arg Val Gly Lys Arg Phe Gly Val
500 505 510
tcc ttt gtt cct ttg ttt tct ggt ctt ctc aaa cag gag aga gca tat 1584
Ser Phe Val Pro Leu Phe Ser Gly Leu Leu Lys Gln Glu Arg Ala Tyr
515 520 525
gta gaa ggt tct agt tgt tgg aca aaa caa agt ggt att ctt tat acc 1632
Val Glu Gly Ser Ser Cys Trp Thr Lys Gln Ser Gly Ile Leu Tyr Thr
530 535 540
gtc tct agt tgg agt gaa ata cat gct ttt att ttg ggc tct tcc aat 1680
Val Ser Ser Trp Ser Glu Ile His Ala Phe Ile Leu Gly Ser Ser Asn
545 550 555 560
tga 1683
<210> 151
<211> 560
<212> PRT
<213> Picea sitchensis
<400> 151
Met Gly Val Ala Asp Glu Ala Gly Val Ala Arg Arg Leu Trp Thr Lys
1 5 10 15
Phe Lys Lys Asp Thr Ala Leu Ala Gln Tyr Asn Ser Phe Val Val Ala
20 25 30
Leu Ala Ala Gly Thr Leu Asn Met Thr Ser Phe Gln Gln Tyr Met Ala
35 40 45
Gln Asp Ala Tyr Phe Leu Lys Ala Phe Ala Gln Ala Tyr Thr Met Ala
50 55 60
Glu Asp Cys Ala Asp Asp Asp Asp Asp Lys Ala Ser Ile Arg Glu Leu
65 70 75 80
Arg Lys Ala Ala Glu Glu Glu Leu Asn Leu His Asn Ser Leu Ala Glu
85 90 95
Asp Trp Asp Val Glu Phe Ala Lys Glu Cys Ser Pro Asn Met Ala Thr
100 105 110
Val Lys Tyr Thr Glu Phe Leu Leu Ala Thr Ala Ala Gly Lys Val Glu
115 120 125
Gly Gly Lys Gly Pro Ser Arg Ser Val Thr Pro Phe Glu Lys Thr Lys
130 135 140
Ile Ala Ala Tyr Thr Val Gly Ala Met Thr Pro Cys Met Arg Leu Tyr
145 150 155 160
Ala Phe Leu Gly Gln Glu Ile Val Lys Ala Leu Glu Pro Asp Cys Ser
165 170 175
Asn His Pro Tyr Lys Gln Trp Ile Glu Thr Tyr Ser Ser Ala Lys Phe
180 185 190
Glu Ala Ser Ala Leu Gln Thr Glu Glu Leu Leu Asp Lys Leu Ala Ile
195 200 205
Ser Leu Thr Gly Glu Glu Leu Glu Val Leu Arg Arg Leu Tyr Tyr His
210 215 220
Ala Leu Lys Leu Glu Ile Glu Phe Phe Ser Ala Gln Pro Phe Ser Gln
225 230 235 240
Arg Thr Leu Val Pro Met Leu Lys Leu Gly Asp Ser Ala Ser Arg Arg
245 250 255
Tyr Thr Ile Val Ser Asp Phe Asp Leu Ser Cys Thr Val Leu Asp Ser
260 265 270
Ser Ala Val Leu Ala Glu Ile Ala Ile Leu Thr Thr Leu Lys Thr Glu
275 280 285
Gln Asn Gly Ala Glu Asn Leu Ser Asp His Lys Ser Ser Ser Glu Leu
290 295 300
Arg Lys Thr Trp Asp Ala Leu Ser Ser Gln Tyr Ser Glu Glu Cys Glu
305 310 315 320
Glu Cys Leu Arg Lys Thr Leu Pro Pro Glu Glu Val Gly Ser Phe Asp
325 330 335
Tyr Glu Gly Leu His Gln Ser Leu Glu His Leu Ser Gln Phe Glu Met
340 345 350
Glu Ala Asn Ser Lys Val Val Glu Ser Gly Val Leu Glu Gly Ile Asn
355 360 365
Ile Asp Asp Ile Lys Lys Ala Gly Glu Arg Leu Ala Phe Gln Asp Gly
370 375 380
Cys Ala Asn Phe Phe Glu Gln Ile Leu Thr Lys Met Asp Ser Leu Asn
385 390 395 400
Val Asp Val His Ile Ile Ser Val Cys Trp Ser Gly Asp Ile Ile Arg
405 410 415
Ala Ala Phe Ser Ser Ser Gly Leu Asp Gly Leu Gln Val His Ser Asn
420 425 430
Glu Leu Thr Phe Val Glu Ser Val Ser Thr Gly Gly Ile Asp Arg Arg
435 440 445
Val Glu Ser Pro Val Asp Lys Leu Lys Ile Phe Asn Asn Ile Trp Ser
450 455 460
Ser Ser Lys Asp Gln Asp Thr Glu His Ile Ser Ile Tyr Ile Gly Asp
465 470 475 480
Gly Leu Gly Asp Leu Leu Cys Leu Leu Gln Ala Asp Ile Gly Ile Val
485 490 495
Ile Gly Thr Ser Ser Thr Leu Arg Arg Val Gly Lys Arg Phe Gly Val
500 505 510
Ser Phe Val Pro Leu Phe Ser Gly Leu Leu Lys Gln Glu Arg Ala Tyr
515 520 525
Val Glu Gly Ser Ser Cys Trp Thr Lys Gln Ser Gly Ile Leu Tyr Thr
530 535 540
Val Ser Ser Trp Ser Glu Ile His Ala Phe Ile Leu Gly Ser Ser Asn
545 550 555 560
<210> 152
<211> 1641
<212> DNA
<213> Physcomitrella patens
<220>
<221> CDS
<222> (1)..(1641)
<223> Physcomitrella patens gene encoding TMP phosphatase
[XP_001769831]
<400> 152
atg aat ttg agc acg caa gct aac aca ggg ttg gcg aag agc ttc tgg 48
Met Asn Leu Ser Thr Gln Ala Asn Thr Gly Leu Ala Lys Ser Phe Trp
1 5 10 15
gct agt tgt aag aga gag gct tat gca tca ctc tac cat ccg ttt gtg 96
Ala Ser Cys Lys Arg Glu Ala Tyr Ala Ser Leu Tyr His Pro Phe Val
20 25 30
gtt gcg tta gcg gct ggc acc ttg cca aaa caa act ttt caa cgt tac 144
Val Ala Leu Ala Ala Gly Thr Leu Pro Lys Gln Thr Phe Gln Arg Tyr
35 40 45
atg gca cag gat gcc tat ttc ttg gag gcg ttc aag aat gcg tat caa 192
Met Ala Gln Asp Ala Tyr Phe Leu Glu Ala Phe Lys Asn Ala Tyr Gln
50 55 60
ctg gct atg gaa acc act aca gac gaa gag gca aag gcc atc att gag 240
Leu Ala Met Glu Thr Thr Thr Asp Glu Glu Ala Lys Ala Ile Ile Glu
65 70 75 80
tcc ctt cag aga gat gtg cag gaa gag ctc aat ttg cac tcg tcg atc 288
Ser Leu Gln Arg Asp Val Gln Glu Glu Leu Asn Leu His Ser Ser Ile
85 90 95
atg cag tct ttg gat gct acc gat cag aat tgc ttt gaa cca aac atg 336
Met Gln Ser Leu Asp Ala Thr Asp Gln Asn Cys Phe Glu Pro Asn Met
100 105 110
gca aca aca gcg tat tgt gat ttt ctg cta gcc aca gct aca gga agt 384
Ala Thr Thr Ala Tyr Cys Asp Phe Leu Leu Ala Thr Ala Thr Gly Ser
115 120 125
aac gaa gca caa aaa ttt gga agc aca agt gct caa atc ata acc gct 432
Asn Glu Ala Gln Lys Phe Gly Ser Thr Ser Ala Gln Ile Ile Thr Ala
130 135 140
atg act cct tgc atg cgg cta tat gca ttt ttg ggg cag gag ctc aaa 480
Met Thr Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Gln Glu Leu Lys
145 150 155 160
aaa cac gtt gat cat gtt gct gac cat cct tac cag gag tgg att gat 528
Lys His Val Asp His Val Ala Asp His Pro Tyr Gln Glu Trp Ile Asp
165 170 175
act tac tct gct gca gag ttc gag gct gca gct tcg aag att gag cag 576
Thr Tyr Ser Ala Ala Glu Phe Glu Ala Ala Ala Ser Lys Ile Glu Gln
180 185 190
ctg cta gac aag tta act gct act ttg act gga aag cat gaa ata gca 624
Leu Leu Asp Lys Leu Thr Ala Thr Leu Thr Gly Lys His Glu Ile Ala
195 200 205
ttc tta gaa agt ctc tat ctt caa gcc atg aac ttg gag gtg gat ttc 672
Phe Leu Glu Ser Leu Tyr Leu Gln Ala Met Asn Leu Glu Val Asp Phe
210 215 220
ttc ggt gct cag ctg tta ggg cct gtg ctc gta ccc ttc ctc aaa tgc 720
Phe Gly Ala Gln Leu Leu Gly Pro Val Leu Val Pro Phe Leu Lys Cys
225 230 235 240
caa ccg gct cca gag agc tat ata tta ctt gcg tct gac ttt gat tcc 768
Gln Pro Ala Pro Glu Ser Tyr Ile Leu Leu Ala Ser Asp Phe Asp Ser
245 250 255
acg tgc acg ata tct gat tca tgc ccc ata ttg gca gac ctg acc gtg 816
Thr Cys Thr Ile Ser Asp Ser Cys Pro Ile Leu Ala Asp Leu Thr Val
260 265 270
caa act gcg cga aaa tct cac ggt ggt cgt tca gtt ggt gaa tca ggg 864
Gln Thr Ala Arg Lys Ser His Gly Gly Arg Ser Val Gly Glu Ser Gly
275 280 285
gcc agc ttg ttg aaa aaa aga tgg gat gat ctc gtc atg cag tat atg 912
Ala Ser Leu Leu Lys Lys Arg Trp Asp Asp Leu Val Met Gln Tyr Met
290 295 300
gac gag tat gag gac gtt ctg aag cga agc ctg gtg aaa aaa gat aat 960
Asp Glu Tyr Glu Asp Val Leu Lys Arg Ser Leu Val Lys Lys Asp Asn
305 310 315 320
ggc agt gtt aat gcg ctc agt gca gag aat ctc caa gag ttt ctg aag 1008
Gly Ser Val Asn Ala Leu Ser Ala Glu Asn Leu Gln Glu Phe Leu Lys
325 330 335
gaa atg tcc aac ttc gaa cag aag gcc aat gcg agg gtc gaa gag gct 1056
Glu Met Ser Asn Phe Glu Gln Lys Ala Asn Ala Arg Val Glu Glu Ala
340 345 350
gca gtt cta aaa ggc tta tct ctg gct tcg att caa gaa gct gga aaa 1104
Ala Val Leu Lys Gly Leu Ser Leu Ala Ser Ile Gln Glu Ala Gly Lys
355 360 365
tcc atg cct ctt cgt gag ggc tgt tct gac ttt ttt aag cgt ctg gaa 1152
Ser Met Pro Leu Arg Glu Gly Cys Ser Asp Phe Phe Lys Arg Leu Glu
370 375 380
tca gga gag gtt ctt gtt gac aca tgt ata ttg tct gtg tgc tgg agc 1200
Ser Gly Glu Val Leu Val Asp Thr Cys Ile Leu Ser Val Cys Trp Ser
385 390 395 400
aaa acc ttc atc gaa gct gtc ttg gaa aag gtt cgt att cca aac atc 1248
Lys Thr Phe Ile Glu Ala Val Leu Glu Lys Val Arg Ile Pro Asn Ile
405 410 415
aat gcc aac gag ctc gtt ttc gaa gga cgc att tcc acc ggt gct att 1296
Asn Ala Asn Glu Leu Val Phe Glu Gly Arg Ile Ser Thr Gly Ala Ile
420 425 430
atc aaa aac gtc gaa acg gct ctt gac aag caa aga cac ttc gtt cag 1344
Ile Lys Asn Val Glu Thr Ala Leu Asp Lys Gln Arg His Phe Val Gln
435 440 445
ttg ctg gat aat cta aaa cca act caa gac gtg ctg tcc att tat gtt 1392
Leu Leu Asp Asn Leu Lys Pro Thr Gln Asp Val Leu Ser Ile Tyr Val
450 455 460
ggt gat agt ctg act gat ctt ctc tgc cta atc aga gca gac ctg ggt 1440
Gly Asp Ser Leu Thr Asp Leu Leu Cys Leu Ile Arg Ala Asp Leu Gly
465 470 475 480
ata gtt ctc ggt gac agc agc gct ctg aag cag gtg tat ggg cca aaa 1488
Ile Val Leu Gly Asp Ser Ser Ala Leu Lys Gln Val Tyr Gly Pro Lys
485 490 495
atg gcc ccc ctc ttc atg aaa gcc ata ctc ttg gag cag gca aac atg 1536
Met Ala Pro Leu Phe Met Lys Ala Ile Leu Leu Glu Gln Ala Asn Met
500 505 510
cga ggc agg cag caa ccc aca ggt tac gtc ttc act gtc tcc agt tgg 1584
Arg Gly Arg Gln Gln Pro Thr Gly Tyr Val Phe Thr Val Ser Ser Trp
515 520 525
tat gag gtg gaa gcc ttt ctg ttg ggt cct gct aga aac aga cct ttg 1632
Tyr Glu Val Glu Ala Phe Leu Leu Gly Pro Ala Arg Asn Arg Pro Leu
530 535 540
tac atc tag 1641
Tyr Ile
545
<210> 153
<211> 546
<212> PRT
<213> Physcomitrella patens
<400> 153
Met Asn Leu Ser Thr Gln Ala Asn Thr Gly Leu Ala Lys Ser Phe Trp
1 5 10 15
Ala Ser Cys Lys Arg Glu Ala Tyr Ala Ser Leu Tyr His Pro Phe Val
20 25 30
Val Ala Leu Ala Ala Gly Thr Leu Pro Lys Gln Thr Phe Gln Arg Tyr
35 40 45
Met Ala Gln Asp Ala Tyr Phe Leu Glu Ala Phe Lys Asn Ala Tyr Gln
50 55 60
Leu Ala Met Glu Thr Thr Thr Asp Glu Glu Ala Lys Ala Ile Ile Glu
65 70 75 80
Ser Leu Gln Arg Asp Val Gln Glu Glu Leu Asn Leu His Ser Ser Ile
85 90 95
Met Gln Ser Leu Asp Ala Thr Asp Gln Asn Cys Phe Glu Pro Asn Met
100 105 110
Ala Thr Thr Ala Tyr Cys Asp Phe Leu Leu Ala Thr Ala Thr Gly Ser
115 120 125
Asn Glu Ala Gln Lys Phe Gly Ser Thr Ser Ala Gln Ile Ile Thr Ala
130 135 140
Met Thr Pro Cys Met Arg Leu Tyr Ala Phe Leu Gly Gln Glu Leu Lys
145 150 155 160
Lys His Val Asp His Val Ala Asp His Pro Tyr Gln Glu Trp Ile Asp
165 170 175
Thr Tyr Ser Ala Ala Glu Phe Glu Ala Ala Ala Ser Lys Ile Glu Gln
180 185 190
Leu Leu Asp Lys Leu Thr Ala Thr Leu Thr Gly Lys His Glu Ile Ala
195 200 205
Phe Leu Glu Ser Leu Tyr Leu Gln Ala Met Asn Leu Glu Val Asp Phe
210 215 220
Phe Gly Ala Gln Leu Leu Gly Pro Val Leu Val Pro Phe Leu Lys Cys
225 230 235 240
Gln Pro Ala Pro Glu Ser Tyr Ile Leu Leu Ala Ser Asp Phe Asp Ser
245 250 255
Thr Cys Thr Ile Ser Asp Ser Cys Pro Ile Leu Ala Asp Leu Thr Val
260 265 270
Gln Thr Ala Arg Lys Ser His Gly Gly Arg Ser Val Gly Glu Ser Gly
275 280 285
Ala Ser Leu Leu Lys Lys Arg Trp Asp Asp Leu Val Met Gln Tyr Met
290 295 300
Asp Glu Tyr Glu Asp Val Leu Lys Arg Ser Leu Val Lys Lys Asp Asn
305 310 315 320
Gly Ser Val Asn Ala Leu Ser Ala Glu Asn Leu Gln Glu Phe Leu Lys
325 330 335
Glu Met Ser Asn Phe Glu Gln Lys Ala Asn Ala Arg Val Glu Glu Ala
340 345 350
Ala Val Leu Lys Gly Leu Ser Leu Ala Ser Ile Gln Glu Ala Gly Lys
355 360 365
Ser Met Pro Leu Arg Glu Gly Cys Ser Asp Phe Phe Lys Arg Leu Glu
370 375 380
Ser Gly Glu Val Leu Val Asp Thr Cys Ile Leu Ser Val Cys Trp Ser
385 390 395 400
Lys Thr Phe Ile Glu Ala Val Leu Glu Lys Val Arg Ile Pro Asn Ile
405 410 415
Asn Ala Asn Glu Leu Val Phe Glu Gly Arg Ile Ser Thr Gly Ala Ile
420 425 430
Ile Lys Asn Val Glu Thr Ala Leu Asp Lys Gln Arg His Phe Val Gln
435 440 445
Leu Leu Asp Asn Leu Lys Pro Thr Gln Asp Val Leu Ser Ile Tyr Val
450 455 460
Gly Asp Ser Leu Thr Asp Leu Leu Cys Leu Ile Arg Ala Asp Leu Gly
465 470 475 480
Ile Val Leu Gly Asp Ser Ser Ala Leu Lys Gln Val Tyr Gly Pro Lys
485 490 495
Met Ala Pro Leu Phe Met Lys Ala Ile Leu Leu Glu Gln Ala Asn Met
500 505 510
Arg Gly Arg Gln Gln Pro Thr Gly Tyr Val Phe Thr Val Ser Ser Trp
515 520 525
Tyr Glu Val Glu Ala Phe Leu Leu Gly Pro Ala Arg Asn Arg Pro Leu
530 535 540
Tyr Ile
545
<210> 154
<211> 1593
<212> DNA
<213> Selaginella moellendorffii
<220>
<221> CDS
<222> (1)..(1593)
<223> Selaginella_moellendorffii gene encoding TMP phosphatase
[XP_002990363]
<400> 154
atg tcg tgt ttg ctt aga aat gta gtg gcc aga gga ttg agg agc ttg 48
Met Ser Cys Leu Leu Arg Asn Val Val Ala Arg Gly Leu Arg Ser Leu
1 5 10 15
gca agc gcc cag gcg atg gag cca tcc att tca aag cgc ttg tgg cag 96
Ala Ser Ala Gln Ala Met Glu Pro Ser Ile Ser Lys Arg Leu Trp Gln
20 25 30
caa tcc aag cgc gag gca atg gta tgt ctg tat cat cca ttt gtg gtg 144
Gln Ser Lys Arg Glu Ala Met Val Cys Leu Tyr His Pro Phe Val Val
35 40 45
tcc atc gct gct ggg acg ctg gat ctt cac agc ttc cag cga ttc ata 192
Ser Ile Ala Ala Gly Thr Leu Asp Leu His Ser Phe Gln Arg Phe Ile
50 55 60
gcg cag gat tcc ttc ttc ctg acg gca ttc gcg aaa gcc tat ggt ttg 240
Ala Gln Asp Ser Phe Phe Leu Thr Ala Phe Ala Lys Ala Tyr Gly Leu
65 70 75 80
gcc ata gag cgc agc gat gat cga gaa gtt aaa tct gag att tgc aag 288
Ala Ile Glu Arg Ser Asp Asp Arg Glu Val Lys Ser Glu Ile Cys Lys
85 90 95
ctc caa cag gct gtg tac gag gaa ctt gag ctc cat tct tcc ctc atg 336
Leu Gln Gln Ala Val Tyr Glu Glu Leu Glu Leu His Ser Ser Leu Met
100 105 110
aag gct tgg aac ttc gat cat aca cca cca tcg cca gca act tgt gct 384
Lys Ala Trp Asn Phe Asp His Thr Pro Pro Ser Pro Ala Thr Cys Ala
115 120 125
tac aca gat ttt ctc ctc gca gtg gct gct ggg aag aaa att gaa tgc 432
Tyr Thr Asp Phe Leu Leu Ala Val Ala Ala Gly Lys Lys Ile Glu Cys
130 135 140
gag aaa act aag gtg ccg atg ctc gct ctg gca gca atg gct ccg tgc 480
Glu Lys Thr Lys Val Pro Met Leu Ala Leu Ala Ala Met Ala Pro Cys
145 150 155 160
atg cgt ctc tac gct ttc cta ggc caa gag acg aga gtt ttc tct cga 528
Met Arg Leu Tyr Ala Phe Leu Gly Gln Glu Thr Arg Val Phe Ser Arg
165 170 175
gaa aat cat cca tat cgc gac tgg att tcg act tac tcg tcg cct ggt 576
Glu Asn His Pro Tyr Arg Asp Trp Ile Ser Thr Tyr Ser Ser Pro Gly
180 185 190
ttc gag act gct gct act cga ctc gag cag ctt ctc gat agc ctc tcg 624
Phe Glu Thr Ala Ala Thr Arg Leu Glu Gln Leu Leu Asp Ser Leu Ser
195 200 205
gaa gct caa gag act acg gca gcg gaa ttt cag agt atg caa agt ttg 672
Glu Ala Gln Glu Thr Thr Ala Ala Glu Phe Gln Ser Met Gln Ser Leu
210 215 220
tat cac cgt gcc ata gcg tac gag gtg agc ttc ttc gat gcc cag gaa 720
Tyr His Arg Ala Ile Ala Tyr Glu Val Ser Phe Phe Asp Ala Gln Glu
225 230 235 240
gtg cgt ggc agc aac gct ttt gtc ccg ctg cta gag agt gta gca ctc 768
Val Arg Gly Ser Asn Ala Phe Val Pro Leu Leu Glu Ser Val Ala Leu
245 250 255
aag gat cgc aac ttc gtc ctc atc tct gat ttt gat tct act tgc acc 816
Lys Asp Arg Asn Phe Val Leu Ile Ser Asp Phe Asp Ser Thr Cys Thr
260 265 270
gtc tct gat tca tcc cca gtt cta gcg gag ctg gct atg gcg gtc gat 864
Val Ser Asp Ser Ser Pro Val Leu Ala Glu Leu Ala Met Ala Val Asp
275 280 285
cca aat gta agg agg aaa tgg agc agc ctc tcg gac gag tat ttc agg 912
Pro Asn Val Arg Arg Lys Trp Ser Ser Leu Ser Asp Glu Tyr Phe Arg
290 295 300
gac tac tcc aaa ctc ctg gaa gaa gtt gtt ctt cgt gag tac gac tac 960
Asp Tyr Ser Lys Leu Leu Glu Glu Val Val Leu Arg Glu Tyr Asp Tyr
305 310 315 320
gat gcg atc aaa gag gct ctc caa gtt ctt tcc gag ttt gag aag caa 1008
Asp Ala Ile Lys Glu Ala Leu Gln Val Leu Ser Glu Phe Glu Lys Gln
325 330 335
ggg aac gcg aaa atc gac gcc tcc cgc gtt ttg caa ggc att aag atc 1056
Gly Asn Ala Lys Ile Asp Ala Ser Arg Val Leu Gln Gly Ile Lys Ile
340 345 350
gat gat atc aag caa gcc gga caa aac atg gca ctt caa gct ggc tgt 1104
Asp Asp Ile Lys Gln Ala Gly Gln Asn Met Ala Leu Gln Ala Gly Cys
355 360 365
gcc agt gtg ctt tgc agg cta agt tcc aaa atc tct tgt caa atc ctc 1152
Ala Ser Val Leu Cys Arg Leu Ser Ser Lys Ile Ser Cys Gln Ile Leu
370 375 380
tcg gtt tgc tgg agc cgg acc ttc atc gaa gca gct ttc tcc aaa gag 1200
Ser Val Cys Trp Ser Arg Thr Phe Ile Glu Ala Ala Phe Ser Lys Glu
385 390 395 400
aat atc acc aat gtt cct gtc cat tcc aac gaa ctc gaa aac gat ggg 1248
Asn Ile Thr Asn Val Pro Val His Ser Asn Glu Leu Glu Asn Asp Gly
405 410 415
aac ttt aca acc ggg agc ttg atc aga cgc gtc gag aca ccg att gac 1296
Asn Phe Thr Thr Gly Ser Leu Ile Arg Arg Val Glu Thr Pro Ile Asp
420 425 430
aag gaa gag acg atg ttt cgt gag att cta cac gct ccg gac gac aag 1344
Lys Glu Glu Thr Met Phe Arg Glu Ile Leu His Ala Pro Asp Asp Lys
435 440 445
ttt gtg att ttc att gga gac agc ctc acg gat ctg cta gcc ttg ctc 1392
Phe Val Ile Phe Ile Gly Asp Ser Leu Thr Asp Leu Leu Ala Leu Leu
450 455 460
cga gct gac att gga att gtt cta gga acg agc tcc agc ctc gat cga 1440
Arg Ala Asp Ile Gly Ile Val Leu Gly Thr Ser Ser Ser Leu Asp Arg
465 470 475 480
gcc tcc aaa gcc ttt gga gtg aag atc gtg cca ctc ttt tcc ggc ctc 1488
Ala Ser Lys Ala Phe Gly Val Lys Ile Val Pro Leu Phe Ser Gly Leu
485 490 495
gtc cag cgg cag caa agc tct cga tca gcg tgg aga aaa gag gaa gga 1536
Val Gln Arg Gln Gln Ser Ser Arg Ser Ala Trp Arg Lys Glu Glu Gly
500 505 510
gtt ttg tat cga gct tct gga tgg ctg gag ata gaa gcg ttt cta gct 1584
Val Leu Tyr Arg Ala Ser Gly Trp Leu Glu Ile Glu Ala Phe Leu Ala
515 520 525
ggt aat tag 1593
Gly Asn
530
<210> 155
<211> 530
<212> PRT
<213> Selaginella moellendorffii
<400> 155
Met Ser Cys Leu Leu Arg Asn Val Val Ala Arg Gly Leu Arg Ser Leu
1 5 10 15
Ala Ser Ala Gln Ala Met Glu Pro Ser Ile Ser Lys Arg Leu Trp Gln
20 25 30
Gln Ser Lys Arg Glu Ala Met Val Cys Leu Tyr His Pro Phe Val Val
35 40 45
Ser Ile Ala Ala Gly Thr Leu Asp Leu His Ser Phe Gln Arg Phe Ile
50 55 60
Ala Gln Asp Ser Phe Phe Leu Thr Ala Phe Ala Lys Ala Tyr Gly Leu
65 70 75 80
Ala Ile Glu Arg Ser Asp Asp Arg Glu Val Lys Ser Glu Ile Cys Lys
85 90 95
Leu Gln Gln Ala Val Tyr Glu Glu Leu Glu Leu His Ser Ser Leu Met
100 105 110
Lys Ala Trp Asn Phe Asp His Thr Pro Pro Ser Pro Ala Thr Cys Ala
115 120 125
Tyr Thr Asp Phe Leu Leu Ala Val Ala Ala Gly Lys Lys Ile Glu Cys
130 135 140
Glu Lys Thr Lys Val Pro Met Leu Ala Leu Ala Ala Met Ala Pro Cys
145 150 155 160
Met Arg Leu Tyr Ala Phe Leu Gly Gln Glu Thr Arg Val Phe Ser Arg
165 170 175
Glu Asn His Pro Tyr Arg Asp Trp Ile Ser Thr Tyr Ser Ser Pro Gly
180 185 190
Phe Glu Thr Ala Ala Thr Arg Leu Glu Gln Leu Leu Asp Ser Leu Ser
195 200 205
Glu Ala Gln Glu Thr Thr Ala Ala Glu Phe Gln Ser Met Gln Ser Leu
210 215 220
Tyr His Arg Ala Ile Ala Tyr Glu Val Ser Phe Phe Asp Ala Gln Glu
225 230 235 240
Val Arg Gly Ser Asn Ala Phe Val Pro Leu Leu Glu Ser Val Ala Leu
245 250 255
Lys Asp Arg Asn Phe Val Leu Ile Ser Asp Phe Asp Ser Thr Cys Thr
260 265 270
Val Ser Asp Ser Ser Pro Val Leu Ala Glu Leu Ala Met Ala Val Asp
275 280 285
Pro Asn Val Arg Arg Lys Trp Ser Ser Leu Ser Asp Glu Tyr Phe Arg
290 295 300
Asp Tyr Ser Lys Leu Leu Glu Glu Val Val Leu Arg Glu Tyr Asp Tyr
305 310 315 320
Asp Ala Ile Lys Glu Ala Leu Gln Val Leu Ser Glu Phe Glu Lys Gln
325 330 335
Gly Asn Ala Lys Ile Asp Ala Ser Arg Val Leu Gln Gly Ile Lys Ile
340 345 350
Asp Asp Ile Lys Gln Ala Gly Gln Asn Met Ala Leu Gln Ala Gly Cys
355 360 365
Ala Ser Val Leu Cys Arg Leu Ser Ser Lys Ile Ser Cys Gln Ile Leu
370 375 380
Ser Val Cys Trp Ser Arg Thr Phe Ile Glu Ala Ala Phe Ser Lys Glu
385 390 395 400
Asn Ile Thr Asn Val Pro Val His Ser Asn Glu Leu Glu Asn Asp Gly
405 410 415
Asn Phe Thr Thr Gly Ser Leu Ile Arg Arg Val Glu Thr Pro Ile Asp
420 425 430
Lys Glu Glu Thr Met Phe Arg Glu Ile Leu His Ala Pro Asp Asp Lys
435 440 445
Phe Val Ile Phe Ile Gly Asp Ser Leu Thr Asp Leu Leu Ala Leu Leu
450 455 460
Arg Ala Asp Ile Gly Ile Val Leu Gly Thr Ser Ser Ser Leu Asp Arg
465 470 475 480
Ala Ser Lys Ala Phe Gly Val Lys Ile Val Pro Leu Phe Ser Gly Leu
485 490 495
Val Gln Arg Gln Gln Ser Ser Arg Ser Ala Trp Arg Lys Glu Glu Gly
500 505 510
Val Leu Tyr Arg Ala Ser Gly Trp Leu Glu Ile Glu Ala Phe Leu Ala
515 520 525
Gly Asn
530
<210> 156
<211> 648
<212> DNA
<213> Anaerotruncus colihominis
<220>
<221> CDS
<222> (1)..(648)
<223> Anaerotruncus colihominis gene encoding TMP phosphatase
[WP_006874980]
<400> 156
atg atc aaa ggc gcg att ttt gat atg gac ggt acg ctg att gat tcc 48
Met Ile Lys Gly Ala Ile Phe Asp Met Asp Gly Thr Leu Ile Asp Ser
1 5 10 15
atg cct cta tgg gag gac tgc gga cgg gcc ttt tta tcc gcg cgc ggc 96
Met Pro Leu Trp Glu Asp Cys Gly Arg Ala Phe Leu Ser Ala Arg Gly
20 25 30
att act gcg cgt gac gat ctg ggc gaa acg ctc aaa tcc ctg tcg atg 144
Ile Thr Ala Arg Asp Asp Leu Gly Glu Thr Leu Lys Ser Leu Ser Met
35 40 45
gag caa acg gct aat tat ttg cgg gac gca tac ggt att tcc gag aca 192
Glu Gln Thr Ala Asn Tyr Leu Arg Asp Ala Tyr Gly Ile Ser Glu Thr
50 55 60
acc tct gaa atc att gag atg atc aat gga atg gtt act gac gca tat 240
Thr Ser Glu Ile Ile Glu Met Ile Asn Gly Met Val Thr Asp Ala Tyr
65 70 75 80
cag cgc acc atc ccg ctt aaa cgt gac att gcc gcg ttt ctc gag cgc 288
Gln Arg Thr Ile Pro Leu Lys Arg Asp Ile Ala Ala Phe Leu Glu Arg
85 90 95
ctc agg cag gcg gat gtg cgc atg tgt gtc gca acg gca acg gac cgt 336
Leu Arg Gln Ala Asp Val Arg Met Cys Val Ala Thr Ala Thr Asp Arg
100 105 110
cca ctg gtg gag gcg gcg ctt gga cgc ctt gac ctc ctg ccc ttt ttt 384
Pro Leu Val Glu Ala Ala Leu Gly Arg Leu Asp Leu Leu Pro Phe Phe
115 120 125
gaa cgg att ttc acc tgt tcg gag gtg ggg gcc ggc aag gac cgc ccc 432
Glu Arg Ile Phe Thr Cys Ser Glu Val Gly Ala Gly Lys Asp Arg Pro
130 135 140
gat atc ttt gag cag gcg tgc gcc gcg ctt ggc acg ccg cgc ggc gaa 480
Asp Ile Phe Glu Gln Ala Cys Ala Ala Leu Gly Thr Pro Arg Gly Glu
145 150 155 160
acc gtc atc ttt gag gat gct ctt tat gcg att gaa aca gct cgg cgc 528
Thr Val Ile Phe Glu Asp Ala Leu Tyr Ala Ile Glu Thr Ala Arg Arg
165 170 175
gcc ggg ttc cgc gtt gtc gca atc gcg gac gac gcc tcc gcc ggc gac 576
Ala Gly Phe Arg Val Val Ala Ile Ala Asp Asp Ala Ser Ala Gly Asp
180 185 190
gag gcg cgc ata gcc gca ctg tct gag caa tat ata cat aac tat gag 624
Glu Ala Arg Ile Ala Ala Leu Ser Glu Gln Tyr Ile His Asn Tyr Glu
195 200 205
gaa tgc gag gta aac agt tta tga 648
Glu Cys Glu Val Asn Ser Leu
210 215
<210> 157
<211> 215
<212> PRT
<213> Anaerotruncus colihominis
<400> 157
Met Ile Lys Gly Ala Ile Phe Asp Met Asp Gly Thr Leu Ile Asp Ser
1 5 10 15
Met Pro Leu Trp Glu Asp Cys Gly Arg Ala Phe Leu Ser Ala Arg Gly
20 25 30
Ile Thr Ala Arg Asp Asp Leu Gly Glu Thr Leu Lys Ser Leu Ser Met
35 40 45
Glu Gln Thr Ala Asn Tyr Leu Arg Asp Ala Tyr Gly Ile Ser Glu Thr
50 55 60
Thr Ser Glu Ile Ile Glu Met Ile Asn Gly Met Val Thr Asp Ala Tyr
65 70 75 80
Gln Arg Thr Ile Pro Leu Lys Arg Asp Ile Ala Ala Phe Leu Glu Arg
85 90 95
Leu Arg Gln Ala Asp Val Arg Met Cys Val Ala Thr Ala Thr Asp Arg
100 105 110
Pro Leu Val Glu Ala Ala Leu Gly Arg Leu Asp Leu Leu Pro Phe Phe
115 120 125
Glu Arg Ile Phe Thr Cys Ser Glu Val Gly Ala Gly Lys Asp Arg Pro
130 135 140
Asp Ile Phe Glu Gln Ala Cys Ala Ala Leu Gly Thr Pro Arg Gly Glu
145 150 155 160
Thr Val Ile Phe Glu Asp Ala Leu Tyr Ala Ile Glu Thr Ala Arg Arg
165 170 175
Ala Gly Phe Arg Val Val Ala Ile Ala Asp Asp Ala Ser Ala Gly Asp
180 185 190
Glu Ala Arg Ile Ala Ala Leu Ser Glu Gln Tyr Ile His Asn Tyr Glu
195 200 205
Glu Cys Glu Val Asn Ser Leu
210 215
<210> 158
<211> 666
<212> DNA
<213> Eubacterium ventriosum
<220>
<221> CDS
<222> (1)..(666)
<223> Eubacterium ventriosum gene encoding TMP phosphatase
[WP_005362972]
<400> 158
atg tca aca gga ttt ata ttt gat gta gat gga aca ata cta gac tca 48
Met Ser Thr Gly Phe Ile Phe Asp Val Asp Gly Thr Ile Leu Asp Ser
1 5 10 15
atg gga ata tgg atg aac gta gga gaa cta tat cta aaa gat atg gga 96
Met Gly Ile Trp Met Asn Val Gly Glu Leu Tyr Leu Lys Asp Met Gly
20 25 30
ata aag gcg gaa cca aat ctt gga gaa att cta ttc gaa atg aca atg 144
Ile Lys Ala Glu Pro Asn Leu Gly Glu Ile Leu Phe Glu Met Thr Met
35 40 45
aat gaa ggt gca gaa tac ata caa aaa aag tat aat cta aac ctt aca 192
Asn Glu Gly Ala Glu Tyr Ile Gln Lys Lys Tyr Asn Leu Asn Leu Thr
50 55 60
aca gaa gaa ata tgc acc gga ata aac aac cgt gta tac aaa ttc tac 240
Thr Glu Glu Ile Cys Thr Gly Ile Asn Asn Arg Val Tyr Lys Phe Tyr
65 70 75 80
gaa aaa gaa gca atg cca aaa cca aaa gtt atc gac ttt ata gaa caa 288
Glu Lys Glu Ala Met Pro Lys Pro Lys Val Ile Asp Phe Ile Glu Gln
85 90 95
gcc tac gag aac aaa atc cca atg aca ata gca acg tca aca gac aga 336
Ala Tyr Glu Asn Lys Ile Pro Met Thr Ile Ala Thr Ser Thr Asp Arg
100 105 110
cca atg ata gaa gca gct ttc aaa aga ctg cac ata gac aaa tat ttt 384
Pro Met Ile Glu Ala Ala Phe Lys Arg Leu His Ile Asp Lys Tyr Phe
115 120 125
aaa aaa ata ttt acc acg aca gag gtt ggg tat gga aaa gac aaa ccg 432
Lys Lys Ile Phe Thr Thr Thr Glu Val Gly Tyr Gly Lys Asp Lys Pro
130 135 140
gac atc ttc ata aaa gca atg gaa gaa atg gga aca aca cca aag caa 480
Asp Ile Phe Ile Lys Ala Met Glu Glu Met Gly Thr Thr Pro Lys Gln
145 150 155 160
aca tgg cta ttt gaa gat gga gca tac tca ata gaa aca gcc aaa caa 528
Thr Trp Leu Phe Glu Asp Gly Ala Tyr Ser Ile Glu Thr Ala Lys Gln
165 170 175
cta ggc ata aaa aca ata gga atc tac gat cct gca agc gaa aaa gac 576
Leu Gly Ile Lys Thr Ile Gly Ile Tyr Asp Pro Ala Ser Glu Lys Asp
180 185 190
cag gaa aaa ata aga aac cta aca aac atc tac ata aaa aat tgg aca 624
Gln Glu Lys Ile Arg Asn Leu Thr Asn Ile Tyr Ile Lys Asn Trp Thr
195 200 205
gaa cac aaa acc cta ctt aac caa ata caa aac aac aag tag 666
Glu His Lys Thr Leu Leu Asn Gln Ile Gln Asn Asn Lys
210 215 220
<210> 159
<211> 221
<212> PRT
<213> Eubacterium ventriosum
<400> 159
Met Ser Thr Gly Phe Ile Phe Asp Val Asp Gly Thr Ile Leu Asp Ser
1 5 10 15
Met Gly Ile Trp Met Asn Val Gly Glu Leu Tyr Leu Lys Asp Met Gly
20 25 30
Ile Lys Ala Glu Pro Asn Leu Gly Glu Ile Leu Phe Glu Met Thr Met
35 40 45
Asn Glu Gly Ala Glu Tyr Ile Gln Lys Lys Tyr Asn Leu Asn Leu Thr
50 55 60
Thr Glu Glu Ile Cys Thr Gly Ile Asn Asn Arg Val Tyr Lys Phe Tyr
65 70 75 80
Glu Lys Glu Ala Met Pro Lys Pro Lys Val Ile Asp Phe Ile Glu Gln
85 90 95
Ala Tyr Glu Asn Lys Ile Pro Met Thr Ile Ala Thr Ser Thr Asp Arg
100 105 110
Pro Met Ile Glu Ala Ala Phe Lys Arg Leu His Ile Asp Lys Tyr Phe
115 120 125
Lys Lys Ile Phe Thr Thr Thr Glu Val Gly Tyr Gly Lys Asp Lys Pro
130 135 140
Asp Ile Phe Ile Lys Ala Met Glu Glu Met Gly Thr Thr Pro Lys Gln
145 150 155 160
Thr Trp Leu Phe Glu Asp Gly Ala Tyr Ser Ile Glu Thr Ala Lys Gln
165 170 175
Leu Gly Ile Lys Thr Ile Gly Ile Tyr Asp Pro Ala Ser Glu Lys Asp
180 185 190
Gln Glu Lys Ile Arg Asn Leu Thr Asn Ile Tyr Ile Lys Asn Trp Thr
195 200 205
Glu His Lys Thr Leu Leu Asn Gln Ile Gln Asn Asn Lys
210 215 220
<210> 160
<211> 1482
<212> DNA
<213> Coprococcus eutactus
<220>
<221> CDS
<222> (1)..(1482)
<223> Coprococcus eutactus ATCC 27759 gene encoding TMP phosphatase
[EDP27707]
<400> 160
atg aaa aag ata gtt atc agc gat ata aaa ggt gcg ata ttt gac atg 48
Met Lys Lys Ile Val Ile Ser Asp Ile Lys Gly Ala Ile Phe Asp Met
1 5 10 15
gat gga gtt ctg ctg gac tct atg ccg atg tgg gac cat gcg ggc gag 96
Asp Gly Val Leu Leu Asp Ser Met Pro Met Trp Asp His Ala Gly Glu
20 25 30
atg tac ctt gca gga cag ggg ata gag gct gag cct gat ctt gaa aaa 144
Met Tyr Leu Ala Gly Gln Gly Ile Glu Ala Glu Pro Asp Leu Glu Lys
35 40 45
gtc ttg ttt aca atg act atg caa aag ggc gct gaa tat ata cgt gat 192
Val Leu Phe Thr Met Thr Met Gln Lys Gly Ala Glu Tyr Ile Arg Asp
50 55 60
cat tat ggg tta aaa ctc acg gcg gat gag atc ata gat ggc ata aat 240
His Tyr Gly Leu Lys Leu Thr Ala Asp Glu Ile Ile Asp Gly Ile Asn
65 70 75 80
gag act gtg aga gat ttc tat gca aat aag gtt gtg cct aag aat gga 288
Glu Thr Val Arg Asp Phe Tyr Ala Asn Lys Val Val Pro Lys Asn Gly
85 90 95
gtc ctt aag ttc ctc agg ctg ttg aag agt cac aat ata cct gta acc 336
Val Leu Lys Phe Leu Arg Leu Leu Lys Ser His Asn Ile Pro Val Thr
100 105 110
gtt gca act tcg acc gac aga tgc cat gtg gag gct gct ctt tca aga 384
Val Ala Thr Ser Thr Asp Arg Cys His Val Glu Ala Ala Leu Ser Arg
115 120 125
aat gga ctt atg gaa tat gta gac aag ata ttt acg tgt tcg gaa gtt 432
Asn Gly Leu Met Glu Tyr Val Asp Lys Ile Phe Thr Cys Ser Glu Val
130 135 140
ggc gtt gga aag gct gcc tct cca aag ata tat gag ctt gcg gcc gaa 480
Gly Val Gly Lys Ala Ala Ser Pro Lys Ile Tyr Glu Leu Ala Ala Glu
145 150 155 160
ttt atg ggg acg aaa gtc ggc gag tca ttt gtg ttc gag gat gcc tat 528
Phe Met Gly Thr Lys Val Gly Glu Ser Phe Val Phe Glu Asp Ala Tyr
165 170 175
cat gcg gcc gag aca gct cag aat gcg gga ttt aca gtt gta gga ctc 576
His Ala Ala Glu Thr Ala Gln Asn Ala Gly Phe Thr Val Val Gly Leu
180 185 190
tat gac gag tca agc cgt gac atg caa gca gaa ctt aag gtt cac tgc 624
Tyr Asp Glu Ser Ser Arg Asp Met Gln Ala Glu Leu Lys Val His Cys
195 200 205
aat tat tac tat ttg gga ttt gcc gag ctt ata gat gag ctg ctg cct 672
Asn Tyr Tyr Tyr Leu Gly Phe Ala Glu Leu Ile Asp Glu Leu Leu Pro
210 215 220
gac aga agc cag ctt gca ccg gtt ctt acc atc gcg ggc agt gat tca 720
Asp Arg Ser Gln Leu Ala Pro Val Leu Thr Ile Ala Gly Ser Asp Ser
225 230 235 240
tcg gga ggt gcg gga ata cag gca gat ctt aag acc atg cag gca aat 768
Ser Gly Gly Ala Gly Ile Gln Ala Asp Leu Lys Thr Met Gln Ala Asn
245 250 255
gga gtg ttt ggc atg agc gca gta act gcc ttg acg gcg cag aat acc 816
Gly Val Phe Gly Met Ser Ala Val Thr Ala Leu Thr Ala Gln Asn Thr
260 265 270
aca ggt gtg aca tcc atc atg aat gtg aca cct gac ata ctt gca gat 864
Thr Gly Val Thr Ser Ile Met Asn Val Thr Pro Asp Ile Leu Ala Asp
275 280 285
cag ata gat gca gta ttt aca gat ata aga cca cag gcg gtc aag ata 912
Gln Ile Asp Ala Val Phe Thr Asp Ile Arg Pro Gln Ala Val Lys Ile
290 295 300
ggt atg gtg tct gtg cca gaa ctt ata aat gtg atc gca gac aag ctt 960
Gly Met Val Ser Val Pro Glu Leu Ile Asn Val Ile Ala Asp Lys Leu
305 310 315 320
gaa ttt tac agg gcg gag aat gtg gtg ctt gat cct gtg atg gtt gcg 1008
Glu Phe Tyr Arg Ala Glu Asn Val Val Leu Asp Pro Val Met Val Ala
325 330 335
aca agc ggt gct aaa ctc ata agc gat gat gct gtg gac gtt ttg aca 1056
Thr Ser Gly Ala Lys Leu Ile Ser Asp Asp Ala Val Asp Val Leu Thr
340 345 350
gga agg ctg ttc cca ctt gca aag ctg atc acc cca aat att cca gag 1104
Gly Arg Leu Phe Pro Leu Ala Lys Leu Ile Thr Pro Asn Ile Pro Glu
355 360 365
aca gag gcc ctc aca ggt atg agt atc cgg tct aag gaa gat atg gaa 1152
Thr Glu Ala Leu Thr Gly Met Ser Ile Arg Ser Lys Glu Asp Met Glu
370 375 380
agt gca gca agg aaa ata tat gaa aaa tat ggc tgc tca gtt ctt gtg 1200
Ser Ala Ala Arg Lys Ile Tyr Glu Lys Tyr Gly Cys Ser Val Leu Val
385 390 395 400
aag ggc gga cat agc ata aac gat gcg aat gat atg ctg ttt gat gga 1248
Lys Gly Gly His Ser Ile Asn Asp Ala Asn Asp Met Leu Phe Asp Gly
405 410 415
gag aat gta tca tgg ttt tca ggt gag aga ata gaa aat ccg aat acc 1296
Glu Asn Val Ser Trp Phe Ser Gly Glu Arg Ile Glu Asn Pro Asn Thr
420 425 430
cat gga acg ggg tgt aca ctc tca agt gca ata gcc tcc aac ctt gca 1344
His Gly Thr Gly Cys Thr Leu Ser Ser Ala Ile Ala Ser Asn Leu Ala
435 440 445
aag gga tat gat ata gaa act tct gtg cag aga gca aaa gcg tac atc 1392
Lys Gly Tyr Asp Ile Glu Thr Ser Val Gln Arg Ala Lys Ala Tyr Ile
450 455 460
tca gga gcc ctg gct gcg atg ctt gat cta gga aga gga agc ggc ccg 1440
Ser Gly Ala Leu Ala Ala Met Leu Asp Leu Gly Arg Gly Ser Gly Pro
465 470 475 480
tta aac cat ggc ttt gat ata gac agc aga ttc atg ata taa 1482
Leu Asn His Gly Phe Asp Ile Asp Ser Arg Phe Met Ile
485 490
<210> 161
<211> 493
<212> PRT
<213> Coprococcus eutactus
<400> 161
Met Lys Lys Ile Val Ile Ser Asp Ile Lys Gly Ala Ile Phe Asp Met
1 5 10 15
Asp Gly Val Leu Leu Asp Ser Met Pro Met Trp Asp His Ala Gly Glu
20 25 30
Met Tyr Leu Ala Gly Gln Gly Ile Glu Ala Glu Pro Asp Leu Glu Lys
35 40 45
Val Leu Phe Thr Met Thr Met Gln Lys Gly Ala Glu Tyr Ile Arg Asp
50 55 60
His Tyr Gly Leu Lys Leu Thr Ala Asp Glu Ile Ile Asp Gly Ile Asn
65 70 75 80
Glu Thr Val Arg Asp Phe Tyr Ala Asn Lys Val Val Pro Lys Asn Gly
85 90 95
Val Leu Lys Phe Leu Arg Leu Leu Lys Ser His Asn Ile Pro Val Thr
100 105 110
Val Ala Thr Ser Thr Asp Arg Cys His Val Glu Ala Ala Leu Ser Arg
115 120 125
Asn Gly Leu Met Glu Tyr Val Asp Lys Ile Phe Thr Cys Ser Glu Val
130 135 140
Gly Val Gly Lys Ala Ala Ser Pro Lys Ile Tyr Glu Leu Ala Ala Glu
145 150 155 160
Phe Met Gly Thr Lys Val Gly Glu Ser Phe Val Phe Glu Asp Ala Tyr
165 170 175
His Ala Ala Glu Thr Ala Gln Asn Ala Gly Phe Thr Val Val Gly Leu
180 185 190
Tyr Asp Glu Ser Ser Arg Asp Met Gln Ala Glu Leu Lys Val His Cys
195 200 205
Asn Tyr Tyr Tyr Leu Gly Phe Ala Glu Leu Ile Asp Glu Leu Leu Pro
210 215 220
Asp Arg Ser Gln Leu Ala Pro Val Leu Thr Ile Ala Gly Ser Asp Ser
225 230 235 240
Ser Gly Gly Ala Gly Ile Gln Ala Asp Leu Lys Thr Met Gln Ala Asn
245 250 255
Gly Val Phe Gly Met Ser Ala Val Thr Ala Leu Thr Ala Gln Asn Thr
260 265 270
Thr Gly Val Thr Ser Ile Met Asn Val Thr Pro Asp Ile Leu Ala Asp
275 280 285
Gln Ile Asp Ala Val Phe Thr Asp Ile Arg Pro Gln Ala Val Lys Ile
290 295 300
Gly Met Val Ser Val Pro Glu Leu Ile Asn Val Ile Ala Asp Lys Leu
305 310 315 320
Glu Phe Tyr Arg Ala Glu Asn Val Val Leu Asp Pro Val Met Val Ala
325 330 335
Thr Ser Gly Ala Lys Leu Ile Ser Asp Asp Ala Val Asp Val Leu Thr
340 345 350
Gly Arg Leu Phe Pro Leu Ala Lys Leu Ile Thr Pro Asn Ile Pro Glu
355 360 365
Thr Glu Ala Leu Thr Gly Met Ser Ile Arg Ser Lys Glu Asp Met Glu
370 375 380
Ser Ala Ala Arg Lys Ile Tyr Glu Lys Tyr Gly Cys Ser Val Leu Val
385 390 395 400
Lys Gly Gly His Ser Ile Asn Asp Ala Asn Asp Met Leu Phe Asp Gly
405 410 415
Glu Asn Val Ser Trp Phe Ser Gly Glu Arg Ile Glu Asn Pro Asn Thr
420 425 430
His Gly Thr Gly Cys Thr Leu Ser Ser Ala Ile Ala Ser Asn Leu Ala
435 440 445
Lys Gly Tyr Asp Ile Glu Thr Ser Val Gln Arg Ala Lys Ala Tyr Ile
450 455 460
Ser Gly Ala Leu Ala Ala Met Leu Asp Leu Gly Arg Gly Ser Gly Pro
465 470 475 480
Leu Asn His Gly Phe Asp Ile Asp Ser Arg Phe Met Ile
485 490
<210> 162
<211> 663
<212> DNA
<213> Ruminococcus bromii
<220>
<221> CDS
<222> (1)..(663)
<223> Ruminococcus bromii L2-63 gene encoding TMP phosphatase
[CBL14666]
<400> 162
atg att aaa tct gca ata ttt gat gtt gac ggc aca ctt ctc gat tca 48
Met Ile Lys Ser Ala Ile Phe Asp Val Asp Gly Thr Leu Leu Asp Ser
1 5 10 15
atg aag ata tgg gat gat gca gga gag cgt tac ctc tcg tct gtc ggc 96
Met Lys Ile Trp Asp Asp Ala Gly Glu Arg Tyr Leu Ser Ser Val Gly
20 25 30
aaa aca gcc gaa aac gga ctt tcc gaa aag ctc tgt gat atg agt ctg 144
Lys Thr Ala Glu Asn Gly Leu Ser Glu Lys Leu Cys Asp Met Ser Leu
35 40 45
acg gag ggt gcg gag tat atg aaa aag cag tat gct ctt tcc ttt tca 192
Thr Glu Gly Ala Glu Tyr Met Lys Lys Gln Tyr Ala Leu Ser Phe Ser
50 55 60
act gat gaa ata gtt tcg ggt gtg ctg aaa atc att gaa gat ttt tac 240
Thr Asp Glu Ile Val Ser Gly Val Leu Lys Ile Ile Glu Asp Phe Tyr
65 70 75 80
ttt tat gag gtc ggt tta aaa aac gat gca aaa gaa att ttg cag ttt 288
Phe Tyr Glu Val Gly Leu Lys Asn Asp Ala Lys Glu Ile Leu Gln Phe
85 90 95
ttg gaa tcg aac aat atc aaa atg att att gca aca tca agc gac aaa 336
Leu Glu Ser Asn Asn Ile Lys Met Ile Ile Ala Thr Ser Ser Asp Lys
100 105 110
acg cat att aaa aag gca ttt gaa agg ctc ggt att cta aaa tat ttt 384
Thr His Ile Lys Lys Ala Phe Glu Arg Leu Gly Ile Leu Lys Tyr Phe
115 120 125
acg gat att gtg acc tgt tca caa gtc gga aaa ggc aaa aca agc ccc 432
Thr Asp Ile Val Thr Cys Ser Gln Val Gly Lys Gly Lys Thr Ser Pro
130 135 140
gac att tac ctt gtc tgt gca gat aaa ctc gga aca gct ccg agt gaa 480
Asp Ile Tyr Leu Val Cys Ala Asp Lys Leu Gly Thr Ala Pro Ser Glu
145 150 155 160
acg ctt gta ttc gag gac gct gtt ttt gcc gca gaa act gct cac aag 528
Thr Leu Val Phe Glu Asp Ala Val Phe Ala Ala Glu Thr Ala His Lys
165 170 175
gca ggt ttc aaa acg gtg gga gtg tat gac gaa ttg agc agg aat aat 576
Ala Gly Phe Lys Thr Val Gly Val Tyr Asp Glu Leu Ser Arg Asn Asn
180 185 190
aaa aac aga ata aaa gcc gtt tgc gat tac tac gca gac agc ttt gaa 624
Lys Asn Arg Ile Lys Ala Val Cys Asp Tyr Tyr Ala Asp Ser Phe Glu
195 200 205
aaa gcg gca gat tgg ggg cac cac ctt ttg tcg ctg taa 663
Lys Ala Ala Asp Trp Gly His His Leu Leu Ser Leu
210 215 220
<210> 163
<211> 220
<212> PRT
<213> Ruminococcus bromii
<400> 163
Met Ile Lys Ser Ala Ile Phe Asp Val Asp Gly Thr Leu Leu Asp Ser
1 5 10 15
Met Lys Ile Trp Asp Asp Ala Gly Glu Arg Tyr Leu Ser Ser Val Gly
20 25 30
Lys Thr Ala Glu Asn Gly Leu Ser Glu Lys Leu Cys Asp Met Ser Leu
35 40 45
Thr Glu Gly Ala Glu Tyr Met Lys Lys Gln Tyr Ala Leu Ser Phe Ser
50 55 60
Thr Asp Glu Ile Val Ser Gly Val Leu Lys Ile Ile Glu Asp Phe Tyr
65 70 75 80
Phe Tyr Glu Val Gly Leu Lys Asn Asp Ala Lys Glu Ile Leu Gln Phe
85 90 95
Leu Glu Ser Asn Asn Ile Lys Met Ile Ile Ala Thr Ser Ser Asp Lys
100 105 110
Thr His Ile Lys Lys Ala Phe Glu Arg Leu Gly Ile Leu Lys Tyr Phe
115 120 125
Thr Asp Ile Val Thr Cys Ser Gln Val Gly Lys Gly Lys Thr Ser Pro
130 135 140
Asp Ile Tyr Leu Val Cys Ala Asp Lys Leu Gly Thr Ala Pro Ser Glu
145 150 155 160
Thr Leu Val Phe Glu Asp Ala Val Phe Ala Ala Glu Thr Ala His Lys
165 170 175
Ala Gly Phe Lys Thr Val Gly Val Tyr Asp Glu Leu Ser Arg Asn Asn
180 185 190
Lys Asn Arg Ile Lys Ala Val Cys Asp Tyr Tyr Ala Asp Ser Phe Glu
195 200 205
Lys Ala Ala Asp Trp Gly His His Leu Leu Ser Leu
210 215 220
<210> 164
<211> 1434
<212> DNA
<213> Dorea longicatena
<220>
<221> CDS
<222> (1)..(1434)
<223> Dorea longicatena DSM13814 gene encoding TMP phosphatase
[EDM62146]
<400> 164
atg ata aaa gga gca ata ttt gat gta gac gga acc ctt ctg gat tcc 48
Met Ile Lys Gly Ala Ile Phe Asp Val Asp Gly Thr Leu Leu Asp Ser
1 5 10 15
atg gag atc tgg gaa gac gta gga gtc cgt tat ctg aac agt atc ggt 96
Met Glu Ile Trp Glu Asp Val Gly Val Arg Tyr Leu Asn Ser Ile Gly
20 25 30
ata gag gca gag ccg gat ctt ggg acg gtg tta ttt aca atg agc atc 144
Ile Glu Ala Glu Pro Asp Leu Gly Thr Val Leu Phe Thr Met Ser Ile
35 40 45
cag gaa ggt gca gca tat gta aaa gaa cat tat cat ctg tcc cag gag 192
Gln Glu Gly Ala Ala Tyr Val Lys Glu His Tyr His Leu Ser Gln Glu
50 55 60
ccg gaa gaa att gtg cag gga gtt ctg gac atc atc agc aat tat tat 240
Pro Glu Glu Ile Val Gln Gly Val Leu Asp Ile Ile Ser Asn Tyr Tyr
65 70 75 80
aag aaa acc gca cta tta aag agt gga gtg aag gaa ctt ctg gaa aag 288
Lys Lys Thr Ala Leu Leu Lys Ser Gly Val Lys Glu Leu Leu Glu Lys
85 90 95
ctt gat aag cat aat atc cca atg acg gtt gca tca tcc aat aat aaa 336
Leu Asp Lys His Asn Ile Pro Met Thr Val Ala Ser Ser Asn Asn Lys
100 105 110
aaa gag ata gag atg gca ttt gag cgt ctg gga att gca aaa tat ttt 384
Lys Glu Ile Glu Met Ala Phe Glu Arg Leu Gly Ile Ala Lys Tyr Phe
115 120 125
gac cgg atc ttt acc tgt gaa gag gtc ggt gcg gga aag acg aag ccg 432
Asp Arg Ile Phe Thr Cys Glu Glu Val Gly Ala Gly Lys Thr Lys Pro
130 135 140
gat att tat ctg cgg gca gca gaa tat ctc gga acc cgt ccg gag gag 480
Asp Ile Tyr Leu Arg Ala Ala Glu Tyr Leu Gly Thr Arg Pro Glu Glu
145 150 155 160
acg gtt gta ttc gaa gat gtc att cat gca atc cgt act gca aag cag 528
Thr Val Val Phe Glu Asp Val Ile His Ala Ile Arg Thr Ala Lys Gln
165 170 175
gca ggg ttc cag gtt gta gga atc tat gat gaa gca agt aag gat gac 576
Ala Gly Phe Gln Val Val Gly Ile Tyr Asp Glu Ala Ser Lys Asp Asp
180 185 190
cag gaa gag gtt cag aga gaa gta gac tgg tat tgt aga gag tgg gca 624
Gln Glu Glu Val Gln Arg Glu Val Asp Trp Tyr Cys Arg Glu Trp Ala
195 200 205
gaa ctt atg aaa aaa aag aca gca att aca atc gcc gga agt gat tca 672
Glu Leu Met Lys Lys Lys Thr Ala Ile Thr Ile Ala Gly Ser Asp Ser
210 215 220
agt gga ggt gca gga att cag gca gac atc aag acg atg cag gca aac 720
Ser Gly Gly Ala Gly Ile Gln Ala Asp Ile Lys Thr Met Gln Ala Asn
225 230 235 240
gga gtc tac gca atg agt gca atc acc gca ctg aca gcc cag aat aca 768
Gly Val Tyr Ala Met Ser Ala Ile Thr Ala Leu Thr Ala Gln Asn Thr
245 250 255
acc gga gta acc gga atc atg gaa gta tct ccg gaa ttt cta gaa caa 816
Thr Gly Val Thr Gly Ile Met Glu Val Ser Pro Glu Phe Leu Glu Gln
260 265 270
cag ttg gac gca gtt atc aca gac atc cgt ccg gat gca gtg aaa atc 864
Gln Leu Asp Ala Val Ile Thr Asp Ile Arg Pro Asp Ala Val Lys Ile
275 280 285
ggt atg gtg tca tca gaa gag tta ata aaa atg ata tca aag aaa cta 912
Gly Met Val Ser Ser Glu Glu Leu Ile Lys Met Ile Ser Lys Lys Leu
290 295 300
aaa gag tac cat ctg gag aat atc gta gtt gat cca gtg atg gta gca 960
Lys Glu Tyr His Leu Glu Asn Ile Val Val Asp Pro Val Met Val Ala
305 310 315 320
aca agc gga tcc aga ctg atc agt gaa acc gcg att gat aca tta aaa 1008
Thr Ser Gly Ser Arg Leu Ile Ser Glu Thr Ala Ile Asp Thr Leu Lys
325 330 335
aca cag ctg ctg cca atg gca act gtg atc aca ccg aat atc cca gag 1056
Thr Gln Leu Leu Pro Met Ala Thr Val Ile Thr Pro Asn Ile Pro Glu
340 345 350
gca gaa gtt ctt gca gaa atg gag att aga tca gaa gat gat atg gtg 1104
Ala Glu Val Leu Ala Glu Met Glu Ile Arg Ser Glu Asp Asp Met Val
355 360 365
gaa gca gca aag aag att cat gaa atg tat cac tgt gca gtc tta tgc 1152
Glu Ala Ala Lys Lys Ile His Glu Met Tyr His Cys Ala Val Leu Cys
370 375 380
aaa ggc gga cac agc ctg aat gat gcg aat gat ctc cta tac cag gat 1200
Lys Gly Gly His Ser Leu Asn Asp Ala Asn Asp Leu Leu Tyr Gln Asp
385 390 395 400
gga gaa aca aca tgg ttc cac gga aaa aga atc aac aac ccg aac act 1248
Gly Glu Thr Thr Trp Phe His Gly Lys Arg Ile Asn Asn Pro Asn Thr
405 410 415
cac gga acc ggc tgt acc tta tcc agc gca atc gca tcc aat ctg gca 1296
His Gly Thr Gly Cys Thr Leu Ser Ser Ala Ile Ala Ser Asn Leu Ala
420 425 430
aaa gga tat tct ctg gaa gaa tct att cac cgc gcg aaa gag tat atc 1344
Lys Gly Tyr Ser Leu Glu Glu Ser Ile His Arg Ala Lys Glu Tyr Ile
435 440 445
agc ggg gcg ttg gaa gcc atg tta gat ctg gga aaa gga agc gga ccg 1392
Ser Gly Ala Leu Glu Ala Met Leu Asp Leu Gly Lys Gly Ser Gly Pro
450 455 460
atg gat cat ggg ttt gag atg cgg ggg aga ttt tct att taa 1434
Met Asp His Gly Phe Glu Met Arg Gly Arg Phe Ser Ile
465 470 475
<210> 165
<211> 477
<212> PRT
<213> Dorea longicatena
<400> 165
Met Ile Lys Gly Ala Ile Phe Asp Val Asp Gly Thr Leu Leu Asp Ser
1 5 10 15
Met Glu Ile Trp Glu Asp Val Gly Val Arg Tyr Leu Asn Ser Ile Gly
20 25 30
Ile Glu Ala Glu Pro Asp Leu Gly Thr Val Leu Phe Thr Met Ser Ile
35 40 45
Gln Glu Gly Ala Ala Tyr Val Lys Glu His Tyr His Leu Ser Gln Glu
50 55 60
Pro Glu Glu Ile Val Gln Gly Val Leu Asp Ile Ile Ser Asn Tyr Tyr
65 70 75 80
Lys Lys Thr Ala Leu Leu Lys Ser Gly Val Lys Glu Leu Leu Glu Lys
85 90 95
Leu Asp Lys His Asn Ile Pro Met Thr Val Ala Ser Ser Asn Asn Lys
100 105 110
Lys Glu Ile Glu Met Ala Phe Glu Arg Leu Gly Ile Ala Lys Tyr Phe
115 120 125
Asp Arg Ile Phe Thr Cys Glu Glu Val Gly Ala Gly Lys Thr Lys Pro
130 135 140
Asp Ile Tyr Leu Arg Ala Ala Glu Tyr Leu Gly Thr Arg Pro Glu Glu
145 150 155 160
Thr Val Val Phe Glu Asp Val Ile His Ala Ile Arg Thr Ala Lys Gln
165 170 175
Ala Gly Phe Gln Val Val Gly Ile Tyr Asp Glu Ala Ser Lys Asp Asp
180 185 190
Gln Glu Glu Val Gln Arg Glu Val Asp Trp Tyr Cys Arg Glu Trp Ala
195 200 205
Glu Leu Met Lys Lys Lys Thr Ala Ile Thr Ile Ala Gly Ser Asp Ser
210 215 220
Ser Gly Gly Ala Gly Ile Gln Ala Asp Ile Lys Thr Met Gln Ala Asn
225 230 235 240
Gly Val Tyr Ala Met Ser Ala Ile Thr Ala Leu Thr Ala Gln Asn Thr
245 250 255
Thr Gly Val Thr Gly Ile Met Glu Val Ser Pro Glu Phe Leu Glu Gln
260 265 270
Gln Leu Asp Ala Val Ile Thr Asp Ile Arg Pro Asp Ala Val Lys Ile
275 280 285
Gly Met Val Ser Ser Glu Glu Leu Ile Lys Met Ile Ser Lys Lys Leu
290 295 300
Lys Glu Tyr His Leu Glu Asn Ile Val Val Asp Pro Val Met Val Ala
305 310 315 320
Thr Ser Gly Ser Arg Leu Ile Ser Glu Thr Ala Ile Asp Thr Leu Lys
325 330 335
Thr Gln Leu Leu Pro Met Ala Thr Val Ile Thr Pro Asn Ile Pro Glu
340 345 350
Ala Glu Val Leu Ala Glu Met Glu Ile Arg Ser Glu Asp Asp Met Val
355 360 365
Glu Ala Ala Lys Lys Ile His Glu Met Tyr His Cys Ala Val Leu Cys
370 375 380
Lys Gly Gly His Ser Leu Asn Asp Ala Asn Asp Leu Leu Tyr Gln Asp
385 390 395 400
Gly Glu Thr Thr Trp Phe His Gly Lys Arg Ile Asn Asn Pro Asn Thr
405 410 415
His Gly Thr Gly Cys Thr Leu Ser Ser Ala Ile Ala Ser Asn Leu Ala
420 425 430
Lys Gly Tyr Ser Leu Glu Glu Ser Ile His Arg Ala Lys Glu Tyr Ile
435 440 445
Ser Gly Ala Leu Glu Ala Met Leu Asp Leu Gly Lys Gly Ser Gly Pro
450 455 460
Met Asp His Gly Phe Glu Met Arg Gly Arg Phe Ser Ile
465 470 475
<210> 166
<211> 1305
<212> DNA
<213> Lachnospiraceae bacterium
<220>
<221> CDS
<222> (1)..(1305)
<223> Lachnospiraceae_bacterium_3_1_57FAA_CT1 gene encoding TMP
phosphatase [EPC05128]
<400> 166
atg aaa tgt gac aga aag aca atg ctt ctt tat gcg gtg acc gat cgg 48
Met Lys Cys Asp Arg Lys Thr Met Leu Leu Tyr Ala Val Thr Asp Arg
1 5 10 15
gcc tgg aca gga gaa aag aca ctg ctt atg cag gtc gag gaa gcg ctg 96
Ala Trp Thr Gly Glu Lys Thr Leu Leu Met Gln Val Glu Glu Ala Leu
20 25 30
gca gga ggt gtg acc tgt gtc cag ctt cgt gaa aag gat atg cca aag 144
Ala Gly Gly Val Thr Cys Val Gln Leu Arg Glu Lys Asp Met Pro Lys
35 40 45
gag cag ttc ctg gaa gaa gcg gag agt ata aaa aga ctt tgt cat aaa 192
Glu Gln Phe Leu Glu Glu Ala Glu Ser Ile Lys Arg Leu Cys His Lys
50 55 60
tat ggg atc cct ttt ata att gac gat gat gtg gag ctg gcc gta cgc 240
Tyr Gly Ile Pro Phe Ile Ile Asp Asp Asp Val Glu Leu Ala Val Arg
65 70 75 80
tgc ggc gcg gac ggg gtg cat gtg gga cag cat gat atg gag gca ggc 288
Cys Gly Ala Asp Gly Val His Val Gly Gln His Asp Met Glu Ala Gly
85 90 95
gcg gtc cgc cgg aaa atc gga gac ggc atg ctg ctg ggc gta tca gtc 336
Ala Val Arg Arg Lys Ile Gly Asp Gly Met Leu Leu Gly Val Ser Val
100 105 110
cag act gtg gaa cag gca gtg gaa gcc gaa aaa aag gga gcg gat tac 384
Gln Thr Val Glu Gln Ala Val Glu Ala Glu Lys Lys Gly Ala Asp Tyr
115 120 125
ctt ggt gtg ggc gct gtg ttt tcc act tcc acg aaa acg gac gca cag 432
Leu Gly Val Gly Ala Val Phe Ser Thr Ser Thr Lys Thr Asp Ala Gln
130 135 140
gag gtt tcc ctg gat acc ctc cgg gaa atc tgc cgg gcg gtg tcc gta 480
Glu Val Ser Leu Asp Thr Leu Arg Glu Ile Cys Arg Ala Val Ser Val
145 150 155 160
ccc gtc tgt gca atc gga ggg ata cac aaa gga aat atg cat ttg ctg 528
Pro Val Cys Ala Ile Gly Gly Ile His Lys Gly Asn Met His Leu Leu
165 170 175
cag gat acg gga atc gat ggg gtg gct ttg gtg tcg gcc atc ttt tcc 576
Gln Asp Thr Gly Ile Asp Gly Val Ala Leu Val Ser Ala Ile Phe Ser
180 185 190
agt ccc tgc ata cag aag gaa tgc agg gag ctg cgg gtc ctg gca gag 624
Ser Pro Cys Ile Gln Lys Glu Cys Arg Glu Leu Arg Val Leu Ala Glu
195 200 205
aga ctg aaa agg aaa ggg gct att ttt gat gcg gac gga acc ctg ctg 672
Arg Leu Lys Arg Lys Gly Ala Ile Phe Asp Ala Asp Gly Thr Leu Leu
210 215 220
gat tcc atg tcc gtt tgg gat act ctg ggt gaa aaa tat ctg cgg aaa 720
Asp Ser Met Ser Val Trp Asp Thr Leu Gly Glu Lys Tyr Leu Arg Lys
225 230 235 240
aag ggt att gtt ccg gaa aag aac atc agg gaa aca ata aaa aat atg 768
Lys Gly Ile Val Pro Glu Lys Asn Ile Arg Glu Thr Ile Lys Asn Met
245 250 255
agt ctt cct cag gct gcg gtc tat ttt cag act gct tat ggg att gcg 816
Ser Leu Pro Gln Ala Ala Val Tyr Phe Gln Thr Ala Tyr Gly Ile Ala
260 265 270
gat gca gaa gac aag att ata gag gat att aat gga ata gcg gcg tcc 864
Asp Ala Glu Asp Lys Ile Ile Glu Asp Ile Asn Gly Ile Ala Ala Ser
275 280 285
ttt tac atc aat gag gtg aag ctg aag gaa ggc gtg aaa acg gtt ctg 912
Phe Tyr Ile Asn Glu Val Lys Leu Lys Glu Gly Val Lys Thr Val Leu
290 295 300
gac aag ctg aag cag aaa aac gta aag atg tgt gtg gcg acg gct acg 960
Asp Lys Leu Lys Gln Lys Asn Val Lys Met Cys Val Ala Thr Ala Thr
305 310 315 320
gac aaa ggg ctg att gaa aag gca ctt gag aga aac gga atc aga gat 1008
Asp Lys Gly Leu Ile Glu Lys Ala Leu Glu Arg Asn Gly Ile Arg Asp
325 330 335
tat ttt gag gct gtc ctc acc tgc acg gat gtg ggc gcg gga aag gat 1056
Tyr Phe Glu Ala Val Leu Thr Cys Thr Asp Val Gly Ala Gly Lys Asp
340 345 350
gag ccg gtt atc ttc cgt aag gcc ggg cag ctt ctc gga aca gca aaa 1104
Glu Pro Val Ile Phe Arg Lys Ala Gly Gln Leu Leu Gly Thr Ala Lys
355 360 365
gag gat acc att gta att gaa gat gcc ttg tat gct gtt aag aca gcg 1152
Glu Asp Thr Ile Val Ile Glu Asp Ala Leu Tyr Ala Val Lys Thr Ala
370 375 380
aaa gag gac ggt ttc ctg gtg gcg gct gtt tat gat ccg tca gca gaa 1200
Lys Glu Asp Gly Phe Leu Val Ala Ala Val Tyr Asp Pro Ser Ala Glu
385 390 395 400
aag gag gaa ccg gag atc cgg gag atc tct gac ttc tat ttc cgg tca 1248
Lys Glu Glu Pro Glu Ile Arg Glu Ile Ser Asp Phe Tyr Phe Arg Ser
405 410 415
ttt aat gaa atg gag agt tat ctg aat gaa aaa agt tct tac gat agc 1296
Phe Asn Glu Met Glu Ser Tyr Leu Asn Glu Lys Ser Ser Tyr Asp Ser
420 425 430
ggg ctc tga 1305
Gly Leu
<210> 167
<211> 434
<212> PRT
<213> Lachnospiraceae bacterium
<400> 167
Met Lys Cys Asp Arg Lys Thr Met Leu Leu Tyr Ala Val Thr Asp Arg
1 5 10 15
Ala Trp Thr Gly Glu Lys Thr Leu Leu Met Gln Val Glu Glu Ala Leu
20 25 30
Ala Gly Gly Val Thr Cys Val Gln Leu Arg Glu Lys Asp Met Pro Lys
35 40 45
Glu Gln Phe Leu Glu Glu Ala Glu Ser Ile Lys Arg Leu Cys His Lys
50 55 60
Tyr Gly Ile Pro Phe Ile Ile Asp Asp Asp Val Glu Leu Ala Val Arg
65 70 75 80
Cys Gly Ala Asp Gly Val His Val Gly Gln His Asp Met Glu Ala Gly
85 90 95
Ala Val Arg Arg Lys Ile Gly Asp Gly Met Leu Leu Gly Val Ser Val
100 105 110
Gln Thr Val Glu Gln Ala Val Glu Ala Glu Lys Lys Gly Ala Asp Tyr
115 120 125
Leu Gly Val Gly Ala Val Phe Ser Thr Ser Thr Lys Thr Asp Ala Gln
130 135 140
Glu Val Ser Leu Asp Thr Leu Arg Glu Ile Cys Arg Ala Val Ser Val
145 150 155 160
Pro Val Cys Ala Ile Gly Gly Ile His Lys Gly Asn Met His Leu Leu
165 170 175
Gln Asp Thr Gly Ile Asp Gly Val Ala Leu Val Ser Ala Ile Phe Ser
180 185 190
Ser Pro Cys Ile Gln Lys Glu Cys Arg Glu Leu Arg Val Leu Ala Glu
195 200 205
Arg Leu Lys Arg Lys Gly Ala Ile Phe Asp Ala Asp Gly Thr Leu Leu
210 215 220
Asp Ser Met Ser Val Trp Asp Thr Leu Gly Glu Lys Tyr Leu Arg Lys
225 230 235 240
Lys Gly Ile Val Pro Glu Lys Asn Ile Arg Glu Thr Ile Lys Asn Met
245 250 255
Ser Leu Pro Gln Ala Ala Val Tyr Phe Gln Thr Ala Tyr Gly Ile Ala
260 265 270
Asp Ala Glu Asp Lys Ile Ile Glu Asp Ile Asn Gly Ile Ala Ala Ser
275 280 285
Phe Tyr Ile Asn Glu Val Lys Leu Lys Glu Gly Val Lys Thr Val Leu
290 295 300
Asp Lys Leu Lys Gln Lys Asn Val Lys Met Cys Val Ala Thr Ala Thr
305 310 315 320
Asp Lys Gly Leu Ile Glu Lys Ala Leu Glu Arg Asn Gly Ile Arg Asp
325 330 335
Tyr Phe Glu Ala Val Leu Thr Cys Thr Asp Val Gly Ala Gly Lys Asp
340 345 350
Glu Pro Val Ile Phe Arg Lys Ala Gly Gln Leu Leu Gly Thr Ala Lys
355 360 365
Glu Asp Thr Ile Val Ile Glu Asp Ala Leu Tyr Ala Val Lys Thr Ala
370 375 380
Lys Glu Asp Gly Phe Leu Val Ala Ala Val Tyr Asp Pro Ser Ala Glu
385 390 395 400
Lys Glu Glu Pro Glu Ile Arg Glu Ile Ser Asp Phe Tyr Phe Arg Ser
405 410 415
Phe Asn Glu Met Glu Ser Tyr Leu Asn Glu Lys Ser Ser Tyr Asp Ser
420 425 430
Gly Leu
<210> 168
<211> 1305
<212> DNA
<213> Fusicatenibacter
<220>
<221> CDS
<222> (1)..(1305)
<223> Fusicatenibacter gene encoding TMP phosphatase [CUQ30753]
<400> 168
atg aaa tgt aac aga aag aca atg ctt ctt tat gcg gtg acc gac cgg 48
Met Lys Cys Asn Arg Lys Thr Met Leu Leu Tyr Ala Val Thr Asp Arg
1 5 10 15
gcc tgg aca gga gaa aag aca ctg ctt acg cag gtc gag gaa gcg ctg 96
Ala Trp Thr Gly Glu Lys Thr Leu Leu Thr Gln Val Glu Glu Ala Leu
20 25 30
gca gga ggt gta acc tgt gtc cag ctt cgt gaa aag gat atg cca aag 144
Ala Gly Gly Val Thr Cys Val Gln Leu Arg Glu Lys Asp Met Pro Lys
35 40 45
gag cag ttc ctg gaa gaa gcg gag agt ata aaa aga ctt tgc cat aaa 192
Glu Gln Phe Leu Glu Glu Ala Glu Ser Ile Lys Arg Leu Cys His Lys
50 55 60
tat ggt gtc cct ttt ata att gac gat gat gtg gag ctg gcc gta cgc 240
Tyr Gly Val Pro Phe Ile Ile Asp Asp Asp Val Glu Leu Ala Val Arg
65 70 75 80
tgc ggt gcg gac ggg gta cat gtg gga cag cat gat atg gag gca ggc 288
Cys Gly Ala Asp Gly Val His Val Gly Gln His Asp Met Glu Ala Gly
85 90 95
gcg gtc cgc cgg aaa atc gga gac ggc atg ctg ctg ggc gta tca gtc 336
Ala Val Arg Arg Lys Ile Gly Asp Gly Met Leu Leu Gly Val Ser Val
100 105 110
cag act gtg gaa cag gca gtg gaa gcc gag aaa aag gga gcg gat tac 384
Gln Thr Val Glu Gln Ala Val Glu Ala Glu Lys Lys Gly Ala Asp Tyr
115 120 125
ctt ggt gtg ggc gct gtg ttt tcc act tcc acg aaa acg gac gca cag 432
Leu Gly Val Gly Ala Val Phe Ser Thr Ser Thr Lys Thr Asp Ala Gln
130 135 140
gag gtt tcc ctg gat acc ctc cgg gaa atc tgc cgg gcg gtg tcc gta 480
Glu Val Ser Leu Asp Thr Leu Arg Glu Ile Cys Arg Ala Val Ser Val
145 150 155 160
ccc gtc tgt gca atc gga ggg ata cac aaa gga aat atg cat ttg ctg 528
Pro Val Cys Ala Ile Gly Gly Ile His Lys Gly Asn Met His Leu Leu
165 170 175
cag gat acg gga atc gat ggg gtg gct ttg gtg tcg gcc atc ttt tcc 576
Gln Asp Thr Gly Ile Asp Gly Val Ala Leu Val Ser Ala Ile Phe Ser
180 185 190
agt ccc tgc ata cag aag gaa tgc agg gag ctg cgg gcc ctg gca gag 624
Ser Pro Cys Ile Gln Lys Glu Cys Arg Glu Leu Arg Ala Leu Ala Glu
195 200 205
agg ctg aaa agg aaa ggg gct att ttt gat gcg gac gga acc ctg ctg 672
Arg Leu Lys Arg Lys Gly Ala Ile Phe Asp Ala Asp Gly Thr Leu Leu
210 215 220
gat tcc atg tct gtt tgg gat acc ctg ggt gaa aaa tat ctg cgg aaa 720
Asp Ser Met Ser Val Trp Asp Thr Leu Gly Glu Lys Tyr Leu Arg Lys
225 230 235 240
aag ggt att gtt ccg gaa aag aac atc agg gaa aca ata aaa aat atg 768
Lys Gly Ile Val Pro Glu Lys Asn Ile Arg Glu Thr Ile Lys Asn Met
245 250 255
agt ctt cct cag gcc gcg gtc tat ttt caa act gct tat ggg atc acg 816
Ser Leu Pro Gln Ala Ala Val Tyr Phe Gln Thr Ala Tyr Gly Ile Thr
260 265 270
gat gca gaa gac aag att ata gag gat att aat gga ata gcg gcg tcc 864
Asp Ala Glu Asp Lys Ile Ile Glu Asp Ile Asn Gly Ile Ala Ala Ser
275 280 285
ttt tac atc aat gag gtg aag ctg aag gaa ggc gtg aaa acg gtt ctg 912
Phe Tyr Ile Asn Glu Val Lys Leu Lys Glu Gly Val Lys Thr Val Leu
290 295 300
gac aag ctg aag cag aaa aac gta aag atg tgt gtg gcg acg gct acg 960
Asp Lys Leu Lys Gln Lys Asn Val Lys Met Cys Val Ala Thr Ala Thr
305 310 315 320
gac aag ggg ctg att gaa aag gca ctt gag aga aac gga atc aga gat 1008
Asp Lys Gly Leu Ile Glu Lys Ala Leu Glu Arg Asn Gly Ile Arg Asp
325 330 335
tat ttt gag gct gtc ctc acc tgc acg gat gtg ggc gcg gga aag gat 1056
Tyr Phe Glu Ala Val Leu Thr Cys Thr Asp Val Gly Ala Gly Lys Asp
340 345 350
gag ccg gtt atc ttc cgt aag gcc ggg cag ctt ctc gga acc gca aaa 1104
Glu Pro Val Ile Phe Arg Lys Ala Gly Gln Leu Leu Gly Thr Ala Lys
355 360 365
gag gat acc att gta att gaa gat gcc ttg tat gct gtt aag aca gcg 1152
Glu Asp Thr Ile Val Ile Glu Asp Ala Leu Tyr Ala Val Lys Thr Ala
370 375 380
aaa gag gac ggt ttc ctg gtg gcg gct gtt tat gat ccg tca gca gaa 1200
Lys Glu Asp Gly Phe Leu Val Ala Ala Val Tyr Asp Pro Ser Ala Glu
385 390 395 400
aag gag gaa ccg gag atc cgg gag atc tct gac ttc tat ttc cgg tca 1248
Lys Glu Glu Pro Glu Ile Arg Glu Ile Ser Asp Phe Tyr Phe Arg Ser
405 410 415
ttt aat gaa atg gag agt tat ctg aat gaa aaa agt tct tac gat agc 1296
Phe Asn Glu Met Glu Ser Tyr Leu Asn Glu Lys Ser Ser Tyr Asp Ser
420 425 430
ggg ctc tga 1305
Gly Leu
<210> 169
<211> 434
<212> PRT
<213> Fusicatenibacter
<400> 169
Met Lys Cys Asn Arg Lys Thr Met Leu Leu Tyr Ala Val Thr Asp Arg
1 5 10 15
Ala Trp Thr Gly Glu Lys Thr Leu Leu Thr Gln Val Glu Glu Ala Leu
20 25 30
Ala Gly Gly Val Thr Cys Val Gln Leu Arg Glu Lys Asp Met Pro Lys
35 40 45
Glu Gln Phe Leu Glu Glu Ala Glu Ser Ile Lys Arg Leu Cys His Lys
50 55 60
Tyr Gly Val Pro Phe Ile Ile Asp Asp Asp Val Glu Leu Ala Val Arg
65 70 75 80
Cys Gly Ala Asp Gly Val His Val Gly Gln His Asp Met Glu Ala Gly
85 90 95
Ala Val Arg Arg Lys Ile Gly Asp Gly Met Leu Leu Gly Val Ser Val
100 105 110
Gln Thr Val Glu Gln Ala Val Glu Ala Glu Lys Lys Gly Ala Asp Tyr
115 120 125
Leu Gly Val Gly Ala Val Phe Ser Thr Ser Thr Lys Thr Asp Ala Gln
130 135 140
Glu Val Ser Leu Asp Thr Leu Arg Glu Ile Cys Arg Ala Val Ser Val
145 150 155 160
Pro Val Cys Ala Ile Gly Gly Ile His Lys Gly Asn Met His Leu Leu
165 170 175
Gln Asp Thr Gly Ile Asp Gly Val Ala Leu Val Ser Ala Ile Phe Ser
180 185 190
Ser Pro Cys Ile Gln Lys Glu Cys Arg Glu Leu Arg Ala Leu Ala Glu
195 200 205
Arg Leu Lys Arg Lys Gly Ala Ile Phe Asp Ala Asp Gly Thr Leu Leu
210 215 220
Asp Ser Met Ser Val Trp Asp Thr Leu Gly Glu Lys Tyr Leu Arg Lys
225 230 235 240
Lys Gly Ile Val Pro Glu Lys Asn Ile Arg Glu Thr Ile Lys Asn Met
245 250 255
Ser Leu Pro Gln Ala Ala Val Tyr Phe Gln Thr Ala Tyr Gly Ile Thr
260 265 270
Asp Ala Glu Asp Lys Ile Ile Glu Asp Ile Asn Gly Ile Ala Ala Ser
275 280 285
Phe Tyr Ile Asn Glu Val Lys Leu Lys Glu Gly Val Lys Thr Val Leu
290 295 300
Asp Lys Leu Lys Gln Lys Asn Val Lys Met Cys Val Ala Thr Ala Thr
305 310 315 320
Asp Lys Gly Leu Ile Glu Lys Ala Leu Glu Arg Asn Gly Ile Arg Asp
325 330 335
Tyr Phe Glu Ala Val Leu Thr Cys Thr Asp Val Gly Ala Gly Lys Asp
340 345 350
Glu Pro Val Ile Phe Arg Lys Ala Gly Gln Leu Leu Gly Thr Ala Lys
355 360 365
Glu Asp Thr Ile Val Ile Glu Asp Ala Leu Tyr Ala Val Lys Thr Ala
370 375 380
Lys Glu Asp Gly Phe Leu Val Ala Ala Val Tyr Asp Pro Ser Ala Glu
385 390 395 400
Lys Glu Glu Pro Glu Ile Arg Glu Ile Ser Asp Phe Tyr Phe Arg Ser
405 410 415
Phe Asn Glu Met Glu Ser Tyr Leu Asn Glu Lys Ser Ser Tyr Asp Ser
420 425 430
Gly Leu
<210> 170
<211> 1296
<212> DNA
<213> Clostridium species
<220>
<221> CDS
<222> (1)..(1296)
<223> Clostridium sp KLE1755 gene encoding TMP phosphatase [ERI68966]:
<400> 170
atg aaa tgt gac aga agc atg ctg ctc ctc tat gcc gta acc gac cgt 48
Met Lys Cys Asp Arg Ser Met Leu Leu Leu Tyr Ala Val Thr Asp Arg
1 5 10 15
gcc tgg acg ggt aaa aaa aca ctg ctg cag cag gtg gag gaa gcc ctg 96
Ala Trp Thr Gly Lys Lys Thr Leu Leu Gln Gln Val Glu Glu Ala Leu
20 25 30
gca ggc ggc gcc acc tgc atc cag ctt cgg gaa aag gag ctg ccg gag 144
Ala Gly Gly Ala Thr Cys Ile Gln Leu Arg Glu Lys Glu Leu Pro Glu
35 40 45
gaa gaa ttc cgg cag gaa gcc ctg gct gtg aaa gaa ctt tgc cgc aga 192
Glu Glu Phe Arg Gln Glu Ala Leu Ala Val Lys Glu Leu Cys Arg Arg
50 55 60
tac cat gtc cct ttc ctc att aac gac aac gta gag ctg gct gtc agc 240
Tyr His Val Pro Phe Leu Ile Asn Asp Asn Val Glu Leu Ala Val Ser
65 70 75 80
tgc ggc gcg gac ggc gtc cat gtg ggc cag cac gac atg tct gcg gcg 288
Cys Gly Ala Asp Gly Val His Val Gly Gln His Asp Met Ser Ala Ala
85 90 95
gat gtg cgc cgc aga atc ggc ccc ggc aaa ata ctg gga gta tcc gcg 336
Asp Val Arg Arg Arg Ile Gly Pro Gly Lys Ile Leu Gly Val Ser Ala
100 105 110
cag acg gtg gag cag gcc cgc cag gcg gaa gaa gac ggc gca gat tat 384
Gln Thr Val Glu Gln Ala Arg Gln Ala Glu Glu Asp Gly Ala Asp Tyr
115 120 125
ctg ggc gtg ggc gct gtt ttt tcc acc tcc acc aaa tcc gac gca gac 432
Leu Gly Val Gly Ala Val Phe Ser Thr Ser Thr Lys Ser Asp Ala Asp
130 135 140
gcg gta tcc cat gag acc ctg caa aag atc tgc gcc gca gta tcc atc 480
Ala Val Ser His Glu Thr Leu Gln Lys Ile Cys Ala Ala Val Ser Ile
145 150 155 160
ccc gtc tgc gcc ata ggc ggc atc cat aaa gaa aat ctg cat ttg ctc 528
Pro Val Cys Ala Ile Gly Gly Ile His Lys Glu Asn Leu His Leu Leu
165 170 175
aaa ggc aca ggc atc gcc ggc gtg gcc ctt gtt tcc gcc atc ttc gca 576
Lys Gly Thr Gly Ile Ala Gly Val Ala Leu Val Ser Ala Ile Phe Ala
180 185 190
agc ccg gat atc cgt aag tcc tgc gaa gac ctg aaa aaa ctg gcc ctg 624
Ser Pro Asp Ile Arg Lys Ser Cys Glu Asp Leu Lys Lys Leu Ala Leu
195 200 205
cag ata aac gcg cag gac aca ctg gaa gca ctg ctg cat aca aac atc 672
Gln Ile Asn Ala Gln Asp Thr Leu Glu Ala Leu Leu His Thr Asn Ile
210 215 220
cgc gga gcc atc ttt gac gcg gac ggc acc ctt tta gac tcc atg ggc 720
Arg Gly Ala Ile Phe Asp Ala Asp Gly Thr Leu Leu Asp Ser Met Gly
225 230 235 240
atc tgg gat act ctg ggg gaa gat tac ctg cgt aca aaa ggg aaa atc 768
Ile Trp Asp Thr Leu Gly Glu Asp Tyr Leu Arg Thr Lys Gly Lys Ile
245 250 255
ccc cgg gaa aac ctg cgt gaa acc ttc cgc gac atg agc ctt ctc cag 816
Pro Arg Glu Asn Leu Arg Glu Thr Phe Arg Asp Met Ser Leu Leu Gln
260 265 270
gcc gcc tgc tat tac cgg gaa aat tac gcc ctt acg gaa agc cct gaa 864
Ala Ala Cys Tyr Tyr Arg Glu Asn Tyr Ala Leu Thr Glu Ser Pro Glu
275 280 285
aaa ata gtg gaa gag ctt aac gcc atg atc gcc tcc ttc tat gaa aaa 912
Lys Ile Val Glu Glu Leu Asn Ala Met Ile Ala Ser Phe Tyr Glu Lys
290 295 300
gaa gcc ccc ctg aag gaa gga gcc gcc gcc ttc ctg gaa gcg ctt tgc 960
Glu Ala Pro Leu Lys Glu Gly Ala Ala Ala Phe Leu Glu Ala Leu Cys
305 310 315 320
caa aga aac ata aaa atg tgc att gca aca gcc acc gat cac agc ctt 1008
Gln Arg Asn Ile Lys Met Cys Ile Ala Thr Ala Thr Asp His Ser Leu
325 330 335
atc cgg gcc gcc ctg aag cga tgc gga gtg ctg cat tac ttt act ttt 1056
Ile Arg Ala Ala Leu Lys Arg Cys Gly Val Leu His Tyr Phe Thr Phe
340 345 350
ata ctt acc tgc gga caa gca gga gcg gga aaa gac acc ccc gcc att 1104
Ile Leu Thr Cys Gly Gln Ala Gly Ala Gly Lys Asp Thr Pro Ala Ile
355 360 365
tat gaa gaa gcc ctg gcc ctg ctt gga acc gga aaa aaa gaa acc ttc 1152
Tyr Glu Glu Ala Leu Ala Leu Leu Gly Thr Gly Lys Lys Glu Thr Phe
370 375 380
gtt ttt gaa gat gcc ctg tac gcc ctg aaa acg gcg aaa aca gcc ggc 1200
Val Phe Glu Asp Ala Leu Tyr Ala Leu Lys Thr Ala Lys Thr Ala Gly
385 390 395 400
ttt cct aca gtc ggt gta aaa gac ccc tcc tcc gcc gga cag gaa ggg 1248
Phe Pro Thr Val Gly Val Lys Asp Pro Ser Ser Ala Gly Gln Glu Gly
405 410 415
gag att ata aaa caa gcc gat tac tat ctt tat acc ttc acg aaa tga 1296
Glu Ile Ile Lys Gln Ala Asp Tyr Tyr Leu Tyr Thr Phe Thr Lys
420 425 430
<210> 171
<211> 431
<212> PRT
<213> Clostridium species
<400> 171
Met Lys Cys Asp Arg Ser Met Leu Leu Leu Tyr Ala Val Thr Asp Arg
1 5 10 15
Ala Trp Thr Gly Lys Lys Thr Leu Leu Gln Gln Val Glu Glu Ala Leu
20 25 30
Ala Gly Gly Ala Thr Cys Ile Gln Leu Arg Glu Lys Glu Leu Pro Glu
35 40 45
Glu Glu Phe Arg Gln Glu Ala Leu Ala Val Lys Glu Leu Cys Arg Arg
50 55 60
Tyr His Val Pro Phe Leu Ile Asn Asp Asn Val Glu Leu Ala Val Ser
65 70 75 80
Cys Gly Ala Asp Gly Val His Val Gly Gln His Asp Met Ser Ala Ala
85 90 95
Asp Val Arg Arg Arg Ile Gly Pro Gly Lys Ile Leu Gly Val Ser Ala
100 105 110
Gln Thr Val Glu Gln Ala Arg Gln Ala Glu Glu Asp Gly Ala Asp Tyr
115 120 125
Leu Gly Val Gly Ala Val Phe Ser Thr Ser Thr Lys Ser Asp Ala Asp
130 135 140
Ala Val Ser His Glu Thr Leu Gln Lys Ile Cys Ala Ala Val Ser Ile
145 150 155 160
Pro Val Cys Ala Ile Gly Gly Ile His Lys Glu Asn Leu His Leu Leu
165 170 175
Lys Gly Thr Gly Ile Ala Gly Val Ala Leu Val Ser Ala Ile Phe Ala
180 185 190
Ser Pro Asp Ile Arg Lys Ser Cys Glu Asp Leu Lys Lys Leu Ala Leu
195 200 205
Gln Ile Asn Ala Gln Asp Thr Leu Glu Ala Leu Leu His Thr Asn Ile
210 215 220
Arg Gly Ala Ile Phe Asp Ala Asp Gly Thr Leu Leu Asp Ser Met Gly
225 230 235 240
Ile Trp Asp Thr Leu Gly Glu Asp Tyr Leu Arg Thr Lys Gly Lys Ile
245 250 255
Pro Arg Glu Asn Leu Arg Glu Thr Phe Arg Asp Met Ser Leu Leu Gln
260 265 270
Ala Ala Cys Tyr Tyr Arg Glu Asn Tyr Ala Leu Thr Glu Ser Pro Glu
275 280 285
Lys Ile Val Glu Glu Leu Asn Ala Met Ile Ala Ser Phe Tyr Glu Lys
290 295 300
Glu Ala Pro Leu Lys Glu Gly Ala Ala Ala Phe Leu Glu Ala Leu Cys
305 310 315 320
Gln Arg Asn Ile Lys Met Cys Ile Ala Thr Ala Thr Asp His Ser Leu
325 330 335
Ile Arg Ala Ala Leu Lys Arg Cys Gly Val Leu His Tyr Phe Thr Phe
340 345 350
Ile Leu Thr Cys Gly Gln Ala Gly Ala Gly Lys Asp Thr Pro Ala Ile
355 360 365
Tyr Glu Glu Ala Leu Ala Leu Leu Gly Thr Gly Lys Lys Glu Thr Phe
370 375 380
Val Phe Glu Asp Ala Leu Tyr Ala Leu Lys Thr Ala Lys Thr Ala Gly
385 390 395 400
Phe Pro Thr Val Gly Val Lys Asp Pro Ser Ser Ala Gly Gln Glu Gly
405 410 415
Glu Ile Ile Lys Gln Ala Asp Tyr Tyr Leu Tyr Thr Phe Thr Lys
420 425 430
<210> 172
<211> 1452
<212> DNA
<213> Eubacterium hallii
<220>
<221> CDS
<222> (1)..(1452)
<223> Eubacterium hallii gene encoding TMP phosphatase [EEG35494]
<400> 172
atg ata aaa gga gca atc ttt gat att gat gga act tta ctt gat tcc 48
Met Ile Lys Gly Ala Ile Phe Asp Ile Asp Gly Thr Leu Leu Asp Ser
1 5 10 15
atg ccc atc tgg gaa aat gca gga gcg aga tat ctt gct act ctt ggc 96
Met Pro Ile Trp Glu Asn Ala Gly Ala Arg Tyr Leu Ala Thr Leu Gly
20 25 30
att aag gca aag cca gat tta aaa gaa cgg ctg gat gct tta tct ttg 144
Ile Lys Ala Lys Pro Asp Leu Lys Glu Arg Leu Asp Ala Leu Ser Leu
35 40 45
cca gaa gga gcc atc tat atg caa aaa gag tat ggc ctt tcg gta tca 192
Pro Glu Gly Ala Ile Tyr Met Gln Lys Glu Tyr Gly Leu Ser Val Ser
50 55 60
gca gaa gac att tta gaa gga gtc aat cag gtt gta aaa gat ttt tac 240
Ala Glu Asp Ile Leu Glu Gly Val Asn Gln Val Val Lys Asp Phe Tyr
65 70 75 80
tat aaa gaa gcg gtc atg aag ccg gga gcc tat gcc tta gta aaa cgt 288
Tyr Lys Glu Ala Val Met Lys Pro Gly Ala Tyr Ala Leu Val Lys Arg
85 90 95
ctg aaa gaa aat ggt gtg aag tta att ata gcc aca gcg aca gat aag 336
Leu Lys Glu Asn Gly Val Lys Leu Ile Ile Ala Thr Ala Thr Asp Lys
100 105 110
gag atg gca aag gcg gcg ctt att cgt aac ggc ata tgg cag gac ttt 384
Glu Met Ala Lys Ala Ala Leu Ile Arg Asn Gly Ile Trp Gln Asp Phe
115 120 125
acg gga atg att acc tgc gag gaa gcc gga gcc gga aag aca agc ccg 432
Thr Gly Met Ile Thr Cys Glu Glu Ala Gly Ala Gly Lys Thr Ser Pro
130 135 140
aag gta ttt gag ctt gca agg caa aag cta ggc act aaa aaa gag gaa 480
Lys Val Phe Glu Leu Ala Arg Gln Lys Leu Gly Thr Lys Lys Glu Glu
145 150 155 160
aca tgg gta ttt gaa gat tct tta tat gcg gtg aaa act gct act gaa 528
Thr Trp Val Phe Glu Asp Ser Leu Tyr Ala Val Lys Thr Ala Thr Glu
165 170 175
gct gga ttt cca gta tgc agt atc tac gat acc tac agt gtg gga aat 576
Ala Gly Phe Pro Val Cys Ser Ile Tyr Asp Thr Tyr Ser Val Gly Asn
180 185 190
gcg aaa gaa atc cag aaa ctt tct aat att tat gtg aga gat ttt tcg 624
Ala Lys Glu Ile Gln Lys Leu Ser Asn Ile Tyr Val Arg Asp Phe Ser
195 200 205
gag ata ggt gat tat tct ttt tca aat atg aaa aca gtt ctt aca att 672
Glu Ile Gly Asp Tyr Ser Phe Ser Asn Met Lys Thr Val Leu Thr Ile
210 215 220
gca ggc agt gat tcg agc gga gga gca ggt att caa gcg gat atc aag 720
Ala Gly Ser Asp Ser Ser Gly Gly Ala Gly Ile Gln Ala Asp Ile Lys
225 230 235 240
act tta act gtt cat aaa gta tat gcc atg act tgt atc acc gca ctt 768
Thr Leu Thr Val His Lys Val Tyr Ala Met Thr Cys Ile Thr Ala Leu
245 250 255
acc gca caa aat aca gtc gga att acc ggg att atg cca gta cca gca 816
Thr Ala Gln Asn Thr Val Gly Ile Thr Gly Ile Met Pro Val Pro Ala
260 265 270
gaa ttt ttt aaa aaa cag atg gaa agc att ttc aca gat ata aag cca 864
Glu Phe Phe Lys Lys Gln Met Glu Ser Ile Phe Thr Asp Ile Lys Pro
275 280 285
gat gcg gtg aaa att gga atg att gct tca aag gaa cag gca gag att 912
Asp Ala Val Lys Ile Gly Met Ile Ala Ser Lys Glu Gln Ala Glu Ile
290 295 300
atc gca gaa tac ctg gaa aaa tat tct atc aaa aat gta gtg gca gac 960
Ile Ala Glu Tyr Leu Glu Lys Tyr Ser Ile Lys Asn Val Val Ala Asp
305 310 315 320
ccg gtg atg att tcg aca agc ggt acg gtt tta gta gaa gaa aca acg 1008
Pro Val Met Ile Ser Thr Ser Gly Thr Val Leu Val Glu Glu Thr Thr
325 330 335
aga aag ata tta tat gag aaa tta tat cca aaa gtt tcc ctg cta acc 1056
Arg Lys Ile Leu Tyr Glu Lys Leu Tyr Pro Lys Val Ser Leu Leu Thr
340 345 350
ccg aac att cca gaa acc gaa ttt tta tcc ggg ata aaa att acc gat 1104
Pro Asn Ile Pro Glu Thr Glu Phe Leu Ser Gly Ile Lys Ile Thr Asp
355 360 365
aaa aaa aca agg gaa gaa gca gca aaa gtc att gca gac agg tgg aat 1152
Lys Lys Thr Arg Glu Glu Ala Ala Lys Val Ile Ala Asp Arg Trp Asn
370 375 380
tgt gcg gtc tta agt aag ggc ggt cac agc gaa gaa aat gcg gac gat 1200
Cys Ala Val Leu Ser Lys Gly Gly His Ser Glu Glu Asn Ala Asp Asp
385 390 395 400
ttg ctt tat gag agt ttt ttg cag gaa gaa aaa aaa gaa aaa gcc gtt 1248
Leu Leu Tyr Glu Ser Phe Leu Gln Glu Glu Lys Lys Glu Lys Ala Val
405 410 415
tgg ttt cca gaa gaa aga att gat aat cca aac aca cac gga acc ggc 1296
Trp Phe Pro Glu Glu Arg Ile Asp Asn Pro Asn Thr His Gly Thr Gly
420 425 430
tgt aca ctt tca agt gcg gta gcg gca aat ctg gca aag gga ttt cct 1344
Cys Thr Leu Ser Ser Ala Val Ala Ala Asn Leu Ala Lys Gly Phe Pro
435 440 445
gta gaa gaa tcc gta aaa aag gca aaa gta tac atc agc gga gca att 1392
Val Glu Glu Ser Val Lys Lys Ala Lys Val Tyr Ile Ser Gly Ala Ile
450 455 460
aga gca atg ctg aat ctt gga cag gga aat ggc ccg cta aat cat atg 1440
Arg Ala Met Leu Asn Leu Gly Gln Gly Asn Gly Pro Leu Asn His Met
465 470 475 480
tgg gat ttg taa 1452
Trp Asp Leu
<210> 173
<211> 483
<212> PRT
<213> Eubacterium hallii
<400> 173
Met Ile Lys Gly Ala Ile Phe Asp Ile Asp Gly Thr Leu Leu Asp Ser
1 5 10 15
Met Pro Ile Trp Glu Asn Ala Gly Ala Arg Tyr Leu Ala Thr Leu Gly
20 25 30
Ile Lys Ala Lys Pro Asp Leu Lys Glu Arg Leu Asp Ala Leu Ser Leu
35 40 45
Pro Glu Gly Ala Ile Tyr Met Gln Lys Glu Tyr Gly Leu Ser Val Ser
50 55 60
Ala Glu Asp Ile Leu Glu Gly Val Asn Gln Val Val Lys Asp Phe Tyr
65 70 75 80
Tyr Lys Glu Ala Val Met Lys Pro Gly Ala Tyr Ala Leu Val Lys Arg
85 90 95
Leu Lys Glu Asn Gly Val Lys Leu Ile Ile Ala Thr Ala Thr Asp Lys
100 105 110
Glu Met Ala Lys Ala Ala Leu Ile Arg Asn Gly Ile Trp Gln Asp Phe
115 120 125
Thr Gly Met Ile Thr Cys Glu Glu Ala Gly Ala Gly Lys Thr Ser Pro
130 135 140
Lys Val Phe Glu Leu Ala Arg Gln Lys Leu Gly Thr Lys Lys Glu Glu
145 150 155 160
Thr Trp Val Phe Glu Asp Ser Leu Tyr Ala Val Lys Thr Ala Thr Glu
165 170 175
Ala Gly Phe Pro Val Cys Ser Ile Tyr Asp Thr Tyr Ser Val Gly Asn
180 185 190
Ala Lys Glu Ile Gln Lys Leu Ser Asn Ile Tyr Val Arg Asp Phe Ser
195 200 205
Glu Ile Gly Asp Tyr Ser Phe Ser Asn Met Lys Thr Val Leu Thr Ile
210 215 220
Ala Gly Ser Asp Ser Ser Gly Gly Ala Gly Ile Gln Ala Asp Ile Lys
225 230 235 240
Thr Leu Thr Val His Lys Val Tyr Ala Met Thr Cys Ile Thr Ala Leu
245 250 255
Thr Ala Gln Asn Thr Val Gly Ile Thr Gly Ile Met Pro Val Pro Ala
260 265 270
Glu Phe Phe Lys Lys Gln Met Glu Ser Ile Phe Thr Asp Ile Lys Pro
275 280 285
Asp Ala Val Lys Ile Gly Met Ile Ala Ser Lys Glu Gln Ala Glu Ile
290 295 300
Ile Ala Glu Tyr Leu Glu Lys Tyr Ser Ile Lys Asn Val Val Ala Asp
305 310 315 320
Pro Val Met Ile Ser Thr Ser Gly Thr Val Leu Val Glu Glu Thr Thr
325 330 335
Arg Lys Ile Leu Tyr Glu Lys Leu Tyr Pro Lys Val Ser Leu Leu Thr
340 345 350
Pro Asn Ile Pro Glu Thr Glu Phe Leu Ser Gly Ile Lys Ile Thr Asp
355 360 365
Lys Lys Thr Arg Glu Glu Ala Ala Lys Val Ile Ala Asp Arg Trp Asn
370 375 380
Cys Ala Val Leu Ser Lys Gly Gly His Ser Glu Glu Asn Ala Asp Asp
385 390 395 400
Leu Leu Tyr Glu Ser Phe Leu Gln Glu Glu Lys Lys Glu Lys Ala Val
405 410 415
Trp Phe Pro Glu Glu Arg Ile Asp Asn Pro Asn Thr His Gly Thr Gly
420 425 430
Cys Thr Leu Ser Ser Ala Val Ala Ala Asn Leu Ala Lys Gly Phe Pro
435 440 445
Val Glu Glu Ser Val Lys Lys Ala Lys Val Tyr Ile Ser Gly Ala Ile
450 455 460
Arg Ala Met Leu Asn Leu Gly Gln Gly Asn Gly Pro Leu Asn His Met
465 470 475 480
Trp Asp Leu
<210> 174
<211> 1362
<212> DNA
<213> Eubacterium species
<220>
<221> CDS
<222> (1)..(1362)
<223> Eubacterium sp. CAG:252 gene encoding TMP phosphatase [CDB67556]
<400> 174
atg aaa aat aaa ttt ttc aca cgc gag att tgt gtc tgc gtg cac ttg 48
Met Lys Asn Lys Phe Phe Thr Arg Glu Ile Cys Val Cys Val His Leu
1 5 10 15
aca caa act cgt tat gcg caa aaa acg tgc gca gaa atg agg aat agt 96
Thr Gln Thr Arg Tyr Ala Gln Lys Thr Cys Ala Glu Met Arg Asn Ser
20 25 30
gtg aag gtt aaa gct gag gat atg cag cta tac gct gtt aca gat aca 144
Val Lys Val Lys Ala Glu Asp Met Gln Leu Tyr Ala Val Thr Asp Thr
35 40 45
cag tgg ctt aat gga cgt gac ttt ctt gaa gta ata gaa agc gtt ctt 192
Gln Trp Leu Asn Gly Arg Asp Phe Leu Glu Val Ile Glu Ser Val Leu
50 55 60
gca aat gga gct aca ttt tta cag tta agg gaa aaa aat gcc aca cat 240
Ala Asn Gly Ala Thr Phe Leu Gln Leu Arg Glu Lys Asn Ala Thr His
65 70 75 80
gag gaa ata gtg gca aag gcg aag gca ata aag cca ata gct aag aag 288
Glu Glu Ile Val Ala Lys Ala Lys Ala Ile Lys Pro Ile Ala Lys Lys
85 90 95
tac gga gtg cct ttt gtc ata gat gat gac ata tat gca gct aaa gag 336
Tyr Gly Val Pro Phe Val Ile Asp Asp Asp Ile Tyr Ala Ala Lys Glu
100 105 110
gca gac gtg gat ggt gtc cac ata ggg cag aat gat gca agc tat gag 384
Ala Asp Val Asp Gly Val His Ile Gly Gln Asn Asp Ala Ser Tyr Glu
115 120 125
aag gca aga gaa gtt ctt gga gaa ggc aag ata ata gga atg acg gtc 432
Lys Ala Arg Glu Val Leu Gly Glu Gly Lys Ile Ile Gly Met Thr Val
130 135 140
aag aca agg cag cag gca gaa aat gcc ata aga ctt ggc gct gac tat 480
Lys Thr Arg Gln Gln Ala Glu Asn Ala Ile Arg Leu Gly Ala Asp Tyr
145 150 155 160
gtt gga atg ggg gca gtg ttt cat aca agc act aaa aaa gat gca aag 528
Val Gly Met Gly Ala Val Phe His Thr Ser Thr Lys Lys Asp Ala Lys
165 170 175
gat atg agc agg gaa aca ctt tta gag ctt gca ggg atg atg gag gat 576
Asp Met Ser Arg Glu Thr Leu Leu Glu Leu Ala Gly Met Met Glu Asp
180 185 190
att ccg gtg gtc gcc att ggc ggc ata agc tat gat aac tgc gat tac 624
Ile Pro Val Val Ala Ile Gly Gly Ile Ser Tyr Asp Asn Cys Asp Tyr
195 200 205
tta aag gac aca ggt gtt gat gga ata gca gtt gtt tca gcc ata ttt 672
Leu Lys Asp Thr Gly Val Asp Gly Ile Ala Val Val Ser Ala Ile Phe
210 215 220
gca agt gat gac tgt gcg ctt gcc aca aga aag ctt ttt gta aag aca 720
Ala Ser Asp Asp Cys Ala Leu Ala Thr Arg Lys Leu Phe Val Lys Thr
225 230 235 240
agg gaa ttg ttt gga aag aaa aga aac ata ata atg gat atg gat ggt 768
Arg Glu Leu Phe Gly Lys Lys Arg Asn Ile Ile Met Asp Met Asp Gly
245 250 255
acg ctt gca gac tct atg cct ttc tgg aaa aac agc gca aga gag tat 816
Thr Leu Ala Asp Ser Met Pro Phe Trp Lys Asn Ser Ala Arg Glu Tyr
260 265 270
gcg ata tta cgt gga gca gat att ccg gat aat ttc gat gag ata act 864
Ala Ile Leu Arg Gly Ala Asp Ile Pro Asp Asn Phe Asp Glu Ile Thr
275 280 285
ggc gtt atg gac ctt aat gat tat gct gag tat gtt aaa aat gtt ctt 912
Gly Val Met Asp Leu Asn Asp Tyr Ala Glu Tyr Val Lys Asn Val Leu
290 295 300
ggc ata gat act aat ctt gag cag ata aca gaa gcg gct gtc gag att 960
Gly Ile Asp Thr Asn Leu Glu Gln Ile Thr Glu Ala Ala Val Glu Ile
305 310 315 320
atg aat aaa cat tac gaa aaa gat ata cct gca aag gac ggt atg aca 1008
Met Asn Lys His Tyr Glu Lys Asp Ile Pro Ala Lys Asp Gly Met Thr
325 330 335
gag ctt gtc acg aga gaa tat aag gcc gga agc aga ctt gtt gtg ttt 1056
Glu Leu Val Thr Arg Glu Tyr Lys Ala Gly Ser Arg Leu Val Val Phe
340 345 350
acg gct tca gat aga aga agt gtt gaa att ctt ctt tca cac ctt gga 1104
Thr Ala Ser Asp Arg Arg Ser Val Glu Ile Leu Leu Ser His Leu Gly
355 360 365
ata aga gaa tgt ttt tat gat ata tat aca gtc tat gat gta gga ctt 1152
Ile Arg Glu Cys Phe Tyr Asp Ile Tyr Thr Val Tyr Asp Val Gly Leu
370 375 380
aag aag agt gat aag aac agc tat ctt aag gtg gca gag ctt gca ggc 1200
Lys Lys Ser Asp Lys Asn Ser Tyr Leu Lys Val Ala Glu Leu Ala Gly
385 390 395 400
atg aaa gat aca tca cag gta tgg gta tat gag gat ata tta aga ggt 1248
Met Lys Asp Thr Ser Gln Val Trp Val Tyr Glu Asp Ile Leu Arg Gly
405 410 415
gta aag gca gcg aaa gag gcc gga ctt aat gtg tgt gca gtg tat gat 1296
Val Lys Ala Ala Lys Glu Ala Gly Leu Asn Val Cys Ala Val Tyr Asp
420 425 430
gag gac tcc gca ggc gac tgg gag gac ata aaa gag ctt gcg gat aag 1344
Glu Asp Ser Ala Gly Asp Trp Glu Asp Ile Lys Glu Leu Ala Asp Lys
435 440 445
acc ctt gaa ctt gtg taa 1362
Thr Leu Glu Leu Val
450
<210> 175
<211> 453
<212> PRT
<213> Eubacterium species
<400> 175
Met Lys Asn Lys Phe Phe Thr Arg Glu Ile Cys Val Cys Val His Leu
1 5 10 15
Thr Gln Thr Arg Tyr Ala Gln Lys Thr Cys Ala Glu Met Arg Asn Ser
20 25 30
Val Lys Val Lys Ala Glu Asp Met Gln Leu Tyr Ala Val Thr Asp Thr
35 40 45
Gln Trp Leu Asn Gly Arg Asp Phe Leu Glu Val Ile Glu Ser Val Leu
50 55 60
Ala Asn Gly Ala Thr Phe Leu Gln Leu Arg Glu Lys Asn Ala Thr His
65 70 75 80
Glu Glu Ile Val Ala Lys Ala Lys Ala Ile Lys Pro Ile Ala Lys Lys
85 90 95
Tyr Gly Val Pro Phe Val Ile Asp Asp Asp Ile Tyr Ala Ala Lys Glu
100 105 110
Ala Asp Val Asp Gly Val His Ile Gly Gln Asn Asp Ala Ser Tyr Glu
115 120 125
Lys Ala Arg Glu Val Leu Gly Glu Gly Lys Ile Ile Gly Met Thr Val
130 135 140
Lys Thr Arg Gln Gln Ala Glu Asn Ala Ile Arg Leu Gly Ala Asp Tyr
145 150 155 160
Val Gly Met Gly Ala Val Phe His Thr Ser Thr Lys Lys Asp Ala Lys
165 170 175
Asp Met Ser Arg Glu Thr Leu Leu Glu Leu Ala Gly Met Met Glu Asp
180 185 190
Ile Pro Val Val Ala Ile Gly Gly Ile Ser Tyr Asp Asn Cys Asp Tyr
195 200 205
Leu Lys Asp Thr Gly Val Asp Gly Ile Ala Val Val Ser Ala Ile Phe
210 215 220
Ala Ser Asp Asp Cys Ala Leu Ala Thr Arg Lys Leu Phe Val Lys Thr
225 230 235 240
Arg Glu Leu Phe Gly Lys Lys Arg Asn Ile Ile Met Asp Met Asp Gly
245 250 255
Thr Leu Ala Asp Ser Met Pro Phe Trp Lys Asn Ser Ala Arg Glu Tyr
260 265 270
Ala Ile Leu Arg Gly Ala Asp Ile Pro Asp Asn Phe Asp Glu Ile Thr
275 280 285
Gly Val Met Asp Leu Asn Asp Tyr Ala Glu Tyr Val Lys Asn Val Leu
290 295 300
Gly Ile Asp Thr Asn Leu Glu Gln Ile Thr Glu Ala Ala Val Glu Ile
305 310 315 320
Met Asn Lys His Tyr Glu Lys Asp Ile Pro Ala Lys Asp Gly Met Thr
325 330 335
Glu Leu Val Thr Arg Glu Tyr Lys Ala Gly Ser Arg Leu Val Val Phe
340 345 350
Thr Ala Ser Asp Arg Arg Ser Val Glu Ile Leu Leu Ser His Leu Gly
355 360 365
Ile Arg Glu Cys Phe Tyr Asp Ile Tyr Thr Val Tyr Asp Val Gly Leu
370 375 380
Lys Lys Ser Asp Lys Asn Ser Tyr Leu Lys Val Ala Glu Leu Ala Gly
385 390 395 400
Met Lys Asp Thr Ser Gln Val Trp Val Tyr Glu Asp Ile Leu Arg Gly
405 410 415
Val Lys Ala Ala Lys Glu Ala Gly Leu Asn Val Cys Ala Val Tyr Asp
420 425 430
Glu Asp Ser Ala Gly Asp Trp Glu Asp Ile Lys Glu Leu Ala Asp Lys
435 440 445
Thr Leu Glu Leu Val
450
<210> 176
<211> 1260
<212> DNA
<213> Lachnospiraceae pectinoschiza
<220>
<221> CDS
<222> (1)..(1260)
<223> Lachnospiraceae pectinoschiza gene encoding TMP phosphatase
[CUQ76318]
<400> 176
atg aaa gtt acc cgt gaa gat atg cag ctt tac gcc gtt aca gat acg 48
Met Lys Val Thr Arg Glu Asp Met Gln Leu Tyr Ala Val Thr Asp Thr
1 5 10 15
caa tgg ctt aat ggc agg gat ttc tat gaa gag att gag aaa gtc ctt 96
Gln Trp Leu Asn Gly Arg Asp Phe Tyr Glu Glu Ile Glu Lys Val Leu
20 25 30
gcg gca gga gct aca ttt ttg cag tta aga gaa aag gat tcg aca cac 144
Ala Ala Gly Ala Thr Phe Leu Gln Leu Arg Glu Lys Asp Ser Thr His
35 40 45
gag gag att gta aaa aaa gca ttg gca att aaa ccg ata gca aga aga 192
Glu Glu Ile Val Lys Lys Ala Leu Ala Ile Lys Pro Ile Ala Arg Arg
50 55 60
tat ggt gtg cca ttt gtt ata gat gat gat ata tac gcg gcg tta gag 240
Tyr Gly Val Pro Phe Val Ile Asp Asp Asp Ile Tyr Ala Ala Leu Glu
65 70 75 80
gca gat gtt gac gga gtt cat ata gga caa agt gat gca agc tac gaa 288
Ala Asp Val Asp Gly Val His Ile Gly Gln Ser Asp Ala Ser Tyr Glu
85 90 95
aca gca aga gag ctt cta gga cct gac aag ata ata gga atg aca gta 336
Thr Ala Arg Glu Leu Leu Gly Pro Asp Lys Ile Ile Gly Met Thr Val
100 105 110
aag aca cca gag cag gcg gca aat gcg gca aga ctt ggt gct gat tat 384
Lys Thr Pro Glu Gln Ala Ala Asn Ala Ala Arg Leu Gly Ala Asp Tyr
115 120 125
gtt gga atg gga gct gta ttt cat aca agc acg aag aaa gat gcc aaa 432
Val Gly Met Gly Ala Val Phe His Thr Ser Thr Lys Lys Asp Ala Lys
130 135 140
gat tta agc agg gat aat ctt ctt aag ctt aca gct atg ctt gat atg 480
Asp Leu Ser Arg Asp Asn Leu Leu Lys Leu Thr Ala Met Leu Asp Met
145 150 155 160
ccg ata gtt gca att ggc ggc att aat tat gac aac tgt gat tat tta 528
Pro Ile Val Ala Ile Gly Gly Ile Asn Tyr Asp Asn Cys Asp Tyr Leu
165 170 175
aaa gat aca ggc gtg gac gga att gct gtt gta tcg gcg ata ttt gca 576
Lys Asp Thr Gly Val Asp Gly Ile Ala Val Val Ser Ala Ile Phe Ala
180 185 190
agt gat gac tgc gcg gag gcg aca cga aag ctt tat aag aag aca aga 624
Ser Asp Asp Cys Ala Glu Ala Thr Arg Lys Leu Tyr Lys Lys Thr Arg
195 200 205
aag ctg ttt aat tat aat aag aac ata ata ttt gat atg gac gga aca 672
Lys Leu Phe Asn Tyr Asn Lys Asn Ile Ile Phe Asp Met Asp Gly Thr
210 215 220
ctt gtt gac tct atg ccg ttc tgg aag aat agt gca agg gaa tat gcc 720
Leu Val Asp Ser Met Pro Phe Trp Lys Asn Ser Ala Arg Glu Tyr Ala
225 230 235 240
att tta aga ggt gct aag ctt cca aag aat ttt gat gag ata aca gga 768
Ile Leu Arg Gly Ala Lys Leu Pro Lys Asn Phe Asp Glu Ile Thr Gly
245 250 255
gtt atg gac ctt tcg gaa tat gcg gct tat ctg caa aat gtt ctt ggg 816
Val Met Asp Leu Ser Glu Tyr Ala Ala Tyr Leu Gln Asn Val Leu Gly
260 265 270
att gat aca tcg cta gaa cag ata aca gag gca gca gtt gat att atg 864
Ile Asp Thr Ser Leu Glu Gln Ile Thr Glu Ala Ala Val Asp Ile Met
275 280 285
aat aag cat tat gca agt gat att cct gca aag aag gga atg ata aag 912
Asn Lys His Tyr Ala Ser Asp Ile Pro Ala Lys Lys Gly Met Ile Lys
290 295 300
ctt ata aga aga gaa tat gag gct gga agc aag ctt gta ata ttc agt 960
Leu Ile Arg Arg Glu Tyr Glu Ala Gly Ser Lys Leu Val Ile Phe Ser
305 310 315 320
gct tcc gat act tcc agt gtg gaa att ctt ctt aaa agg tta gaa ata 1008
Ala Ser Asp Thr Ser Ser Val Glu Ile Leu Leu Lys Arg Leu Glu Ile
325 330 335
tat gaa tgt ttt gag gga ata tac aca gta tat gat gtc ggc ata gga 1056
Tyr Glu Cys Phe Glu Gly Ile Tyr Thr Val Tyr Asp Val Gly Ile Gly
340 345 350
aag agt gat aag gaa agc tat aaa aag gtt gcc agg tca gca gga atg 1104
Lys Ser Asp Lys Glu Ser Tyr Lys Lys Val Ala Arg Ser Ala Gly Met
355 360 365
gat ata tct gat acg tgg gtg tat gag gat att cta aga ggc gtt cgg 1152
Asp Ile Ser Asp Thr Trp Val Tyr Glu Asp Ile Leu Arg Gly Val Arg
370 375 380
gcg gca cat aat gct gga ttg aaa gtg tgt gcg gta tat gat aaa gac 1200
Ala Ala His Asn Ala Gly Leu Lys Val Cys Ala Val Tyr Asp Lys Asp
385 390 395 400
tcg gca gat gac tgg gat gag ata tgc agt att gca gat aaa tgt ata 1248
Ser Ala Asp Asp Trp Asp Glu Ile Cys Ser Ile Ala Asp Lys Cys Ile
405 410 415
ata acc gga taa 1260
Ile Thr Gly
<210> 177
<211> 419
<212> PRT
<213> Lachnospiraceae pectinoschiza
<400> 177
Met Lys Val Thr Arg Glu Asp Met Gln Leu Tyr Ala Val Thr Asp Thr
1 5 10 15
Gln Trp Leu Asn Gly Arg Asp Phe Tyr Glu Glu Ile Glu Lys Val Leu
20 25 30
Ala Ala Gly Ala Thr Phe Leu Gln Leu Arg Glu Lys Asp Ser Thr His
35 40 45
Glu Glu Ile Val Lys Lys Ala Leu Ala Ile Lys Pro Ile Ala Arg Arg
50 55 60
Tyr Gly Val Pro Phe Val Ile Asp Asp Asp Ile Tyr Ala Ala Leu Glu
65 70 75 80
Ala Asp Val Asp Gly Val His Ile Gly Gln Ser Asp Ala Ser Tyr Glu
85 90 95
Thr Ala Arg Glu Leu Leu Gly Pro Asp Lys Ile Ile Gly Met Thr Val
100 105 110
Lys Thr Pro Glu Gln Ala Ala Asn Ala Ala Arg Leu Gly Ala Asp Tyr
115 120 125
Val Gly Met Gly Ala Val Phe His Thr Ser Thr Lys Lys Asp Ala Lys
130 135 140
Asp Leu Ser Arg Asp Asn Leu Leu Lys Leu Thr Ala Met Leu Asp Met
145 150 155 160
Pro Ile Val Ala Ile Gly Gly Ile Asn Tyr Asp Asn Cys Asp Tyr Leu
165 170 175
Lys Asp Thr Gly Val Asp Gly Ile Ala Val Val Ser Ala Ile Phe Ala
180 185 190
Ser Asp Asp Cys Ala Glu Ala Thr Arg Lys Leu Tyr Lys Lys Thr Arg
195 200 205
Lys Leu Phe Asn Tyr Asn Lys Asn Ile Ile Phe Asp Met Asp Gly Thr
210 215 220
Leu Val Asp Ser Met Pro Phe Trp Lys Asn Ser Ala Arg Glu Tyr Ala
225 230 235 240
Ile Leu Arg Gly Ala Lys Leu Pro Lys Asn Phe Asp Glu Ile Thr Gly
245 250 255
Val Met Asp Leu Ser Glu Tyr Ala Ala Tyr Leu Gln Asn Val Leu Gly
260 265 270
Ile Asp Thr Ser Leu Glu Gln Ile Thr Glu Ala Ala Val Asp Ile Met
275 280 285
Asn Lys His Tyr Ala Ser Asp Ile Pro Ala Lys Lys Gly Met Ile Lys
290 295 300
Leu Ile Arg Arg Glu Tyr Glu Ala Gly Ser Lys Leu Val Ile Phe Ser
305 310 315 320
Ala Ser Asp Thr Ser Ser Val Glu Ile Leu Leu Lys Arg Leu Glu Ile
325 330 335
Tyr Glu Cys Phe Glu Gly Ile Tyr Thr Val Tyr Asp Val Gly Ile Gly
340 345 350
Lys Ser Asp Lys Glu Ser Tyr Lys Lys Val Ala Arg Ser Ala Gly Met
355 360 365
Asp Ile Ser Asp Thr Trp Val Tyr Glu Asp Ile Leu Arg Gly Val Arg
370 375 380
Ala Ala His Asn Ala Gly Leu Lys Val Cys Ala Val Tyr Asp Lys Asp
385 390 395 400
Ser Ala Asp Asp Trp Asp Glu Ile Cys Ser Ile Ala Asp Lys Cys Ile
405 410 415
Ile Thr Gly
<210> 178
<211> 1296
<212> DNA
<213> Peptostreptococcaceae bacterium
<220>
<221> CDS
<222> (1)..(1296)
<223> Peptostreptococcaceae bacterium OBRC8 gene encoding TMP
phosphatase[WP_009530263]
<400> 178
atg aaa aat att gac tat aca atg tat tac gtc acc gat gaa gac ctt 48
Met Lys Asn Ile Asp Tyr Thr Met Tyr Tyr Val Thr Asp Glu Asp Leu
1 5 10 15
ttg agc agt aat cat acc ttg gaa aca tct gta caa gat gcc att tta 96
Leu Ser Ser Asn His Thr Leu Glu Thr Ser Val Gln Asp Ala Ile Leu
20 25 30
ggt ggc tgt aca atg ata cag ctt cga gaa aaa cat tca tcc act ctc 144
Gly Gly Cys Thr Met Ile Gln Leu Arg Glu Lys His Ser Ser Thr Leu
35 40 45
gat ttt tat aac aaa gcc ata aaa att aaa gcc att tgc gac aag tac 192
Asp Phe Tyr Asn Lys Ala Ile Lys Ile Lys Ala Ile Cys Asp Lys Tyr
50 55 60
aac ata cct ctt ata ata aat gac aga ata gat gta gct ctt gca ata 240
Asn Ile Pro Leu Ile Ile Asn Asp Arg Ile Asp Val Ala Leu Ala Ile
65 70 75 80
aac gca gac gga gta cat ctc gga caa gac gat atg cct ctt gat att 288
Asn Ala Asp Gly Val His Leu Gly Gln Asp Asp Met Pro Leu Asp Ile
85 90 95
gca aga aaa att atg gga gat ggc aaa att ata gga ata tca act gca 336
Ala Arg Lys Ile Met Gly Asp Gly Lys Ile Ile Gly Ile Ser Thr Ala
100 105 110
act tta gat gaa gct cta atc gct caa caa ggc ggt gca gat tat gta 384
Thr Leu Asp Glu Ala Leu Ile Ala Gln Gln Gly Gly Ala Asp Tyr Val
115 120 125
gga gta ggt gct atg tac agc aca aac aca aaa acc gat gcc aat ttg 432
Gly Val Gly Ala Met Tyr Ser Thr Asn Thr Lys Thr Asp Ala Asn Leu
130 135 140
aca act ata aac gag ctt aca aaa ata aaa aac aat cta aaa ata cct 480
Thr Thr Ile Asn Glu Leu Thr Lys Ile Lys Asn Asn Leu Lys Ile Pro
145 150 155 160
gta gtt gca atc ggc ggt ata aac ctt gac aca ata cct gct cta aaa 528
Val Val Ala Ile Gly Gly Ile Asn Leu Asp Thr Ile Pro Ala Leu Lys
165 170 175
cct gca caa ata gac gga gtt gca ata gta tcc gct ata tct atg cag 576
Pro Ala Gln Ile Asp Gly Val Ala Ile Val Ser Ala Ile Ser Met Gln
180 185 190
gaa gat acc gta tct gca aca aga aaa tta aaa aat act ttt ttg aaa 624
Glu Asp Thr Val Ser Ala Thr Arg Lys Leu Lys Asn Thr Phe Leu Lys
195 200 205
caa tat caa act aaa ggc gta ata ttc gat att gac ggt act ctg ctt 672
Gln Tyr Gln Thr Lys Gly Val Ile Phe Asp Ile Asp Gly Thr Leu Leu
210 215 220
gaa act atg aac ata tgg gac aat gta ctt cta aac ctt atg aat aca 720
Glu Thr Met Asn Ile Trp Asp Asn Val Leu Leu Asn Leu Met Asn Thr
225 230 235 240
ctt aat atc agc tat acc gaa gat gaa ata caa aaa ata tgg aat atg 768
Leu Asn Ile Ser Tyr Thr Glu Asp Glu Ile Gln Lys Ile Trp Asn Met
245 250 255
ggt ttt gca gag ctt gca cag ttc agc ata aaa aaa ttc aag ctt gat 816
Gly Phe Ala Glu Leu Ala Gln Phe Ser Ile Lys Lys Phe Lys Leu Asp
260 265 270
atg agt gta aaa gaa ttt tgg caa ctt ata aaa aaa tta tca gtc gaa 864
Met Ser Val Lys Glu Phe Trp Gln Leu Ile Lys Lys Leu Ser Val Glu
275 280 285
gag tat aaa aat agc aaa ata cac tta aaa aaa ggt gca aaa aaa ctg 912
Glu Tyr Lys Asn Ser Lys Ile His Leu Lys Lys Gly Ala Lys Lys Leu
290 295 300
ctt gag tat ctc aaa gaa aaa ggt gta aaa tta gcc ata gca act gcc 960
Leu Glu Tyr Leu Lys Glu Lys Gly Val Lys Leu Ala Ile Ala Thr Ala
305 310 315 320
ctt tgc aaa gaa cag tat gaa ata gtg ctt aca aag aca ggt atc ata 1008
Leu Cys Lys Glu Gln Tyr Glu Ile Val Leu Thr Lys Thr Gly Ile Ile
325 330 335
gac tat ttt gac ata ata gca tca agc gta gat tta aaa atg gaa aaa 1056
Asp Tyr Phe Asp Ile Ile Ala Ser Ser Val Asp Leu Lys Met Glu Lys
340 345 350
tca gac aga caa ata ttt gac tat ata gca aaa aat cta caa gtt cca 1104
Ser Asp Arg Gln Ile Phe Asp Tyr Ile Ala Lys Asn Leu Gln Val Pro
355 360 365
aac aaa aat ctt att ttc ttt gaa gac gac ata aac tcg tca aca ggt 1152
Asn Lys Asn Leu Ile Phe Phe Glu Asp Asp Ile Asn Ser Ser Thr Gly
370 375 380
gcc aag ttg gca gga cta aaa ctg tgc att gta tca aac aag aaa tat 1200
Ala Lys Leu Ala Gly Leu Lys Leu Cys Ile Val Ser Asn Lys Lys Tyr
385 390 395 400
aac ggt aac agc aaa ttt gac gct ctc ata gat tat aaa ata gat gat 1248
Asn Gly Asn Ser Lys Phe Asp Ala Leu Ile Asp Tyr Lys Ile Asp Asp
405 410 415
ttt gaa aat aaa ttg ata tat gat gaa ata ata gtg gag aaa aat tag 1296
Phe Glu Asn Lys Leu Ile Tyr Asp Glu Ile Ile Val Glu Lys Asn
420 425 430
<210> 179
<211> 431
<212> PRT
<213> Peptostreptococcaceae bacterium
<400> 179
Met Lys Asn Ile Asp Tyr Thr Met Tyr Tyr Val Thr Asp Glu Asp Leu
1 5 10 15
Leu Ser Ser Asn His Thr Leu Glu Thr Ser Val Gln Asp Ala Ile Leu
20 25 30
Gly Gly Cys Thr Met Ile Gln Leu Arg Glu Lys His Ser Ser Thr Leu
35 40 45
Asp Phe Tyr Asn Lys Ala Ile Lys Ile Lys Ala Ile Cys Asp Lys Tyr
50 55 60
Asn Ile Pro Leu Ile Ile Asn Asp Arg Ile Asp Val Ala Leu Ala Ile
65 70 75 80
Asn Ala Asp Gly Val His Leu Gly Gln Asp Asp Met Pro Leu Asp Ile
85 90 95
Ala Arg Lys Ile Met Gly Asp Gly Lys Ile Ile Gly Ile Ser Thr Ala
100 105 110
Thr Leu Asp Glu Ala Leu Ile Ala Gln Gln Gly Gly Ala Asp Tyr Val
115 120 125
Gly Val Gly Ala Met Tyr Ser Thr Asn Thr Lys Thr Asp Ala Asn Leu
130 135 140
Thr Thr Ile Asn Glu Leu Thr Lys Ile Lys Asn Asn Leu Lys Ile Pro
145 150 155 160
Val Val Ala Ile Gly Gly Ile Asn Leu Asp Thr Ile Pro Ala Leu Lys
165 170 175
Pro Ala Gln Ile Asp Gly Val Ala Ile Val Ser Ala Ile Ser Met Gln
180 185 190
Glu Asp Thr Val Ser Ala Thr Arg Lys Leu Lys Asn Thr Phe Leu Lys
195 200 205
Gln Tyr Gln Thr Lys Gly Val Ile Phe Asp Ile Asp Gly Thr Leu Leu
210 215 220
Glu Thr Met Asn Ile Trp Asp Asn Val Leu Leu Asn Leu Met Asn Thr
225 230 235 240
Leu Asn Ile Ser Tyr Thr Glu Asp Glu Ile Gln Lys Ile Trp Asn Met
245 250 255
Gly Phe Ala Glu Leu Ala Gln Phe Ser Ile Lys Lys Phe Lys Leu Asp
260 265 270
Met Ser Val Lys Glu Phe Trp Gln Leu Ile Lys Lys Leu Ser Val Glu
275 280 285
Glu Tyr Lys Asn Ser Lys Ile His Leu Lys Lys Gly Ala Lys Lys Leu
290 295 300
Leu Glu Tyr Leu Lys Glu Lys Gly Val Lys Leu Ala Ile Ala Thr Ala
305 310 315 320
Leu Cys Lys Glu Gln Tyr Glu Ile Val Leu Thr Lys Thr Gly Ile Ile
325 330 335
Asp Tyr Phe Asp Ile Ile Ala Ser Ser Val Asp Leu Lys Met Glu Lys
340 345 350
Ser Asp Arg Gln Ile Phe Asp Tyr Ile Ala Lys Asn Leu Gln Val Pro
355 360 365
Asn Lys Asn Leu Ile Phe Phe Glu Asp Asp Ile Asn Ser Ser Thr Gly
370 375 380
Ala Lys Leu Ala Gly Leu Lys Leu Cys Ile Val Ser Asn Lys Lys Tyr
385 390 395 400
Asn Gly Asn Ser Lys Phe Asp Ala Leu Ile Asp Tyr Lys Ile Asp Asp
405 410 415
Phe Glu Asn Lys Leu Ile Tyr Asp Glu Ile Ile Val Glu Lys Asn
420 425 430
<210> 180
<211> 1296
<212> DNA
<213> Peptostreptococcaceae bacterium
<220>
<221> CDS
<222> (1)..(1296)
<223> Peptostreptococcaceae bacterium CM2 gene encoding TMP
phosphatase[WP_009527854]
<400> 180
atg aaa aat att gac tat aca atg tat tac gtc acc gat gaa gac ctt 48
Met Lys Asn Ile Asp Tyr Thr Met Tyr Tyr Val Thr Asp Glu Asp Leu
1 5 10 15
ttg agc agt aat cac acc ttg gaa aca tct gtg caa gat gcc att tta 96
Leu Ser Ser Asn His Thr Leu Glu Thr Ser Val Gln Asp Ala Ile Leu
20 25 30
ggt ggc tgt aca atg ata cag ctt cga gaa aaa cat tca tcc act ctc 144
Gly Gly Cys Thr Met Ile Gln Leu Arg Glu Lys His Ser Ser Thr Leu
35 40 45
gat ttt tat aac aaa gcc gta aaa att aaa gct att tgc gac aag tac 192
Asp Phe Tyr Asn Lys Ala Val Lys Ile Lys Ala Ile Cys Asp Lys Tyr
50 55 60
aac ata cct ctt ata ata aat gac aga ata gac gta gct ctt gca ata 240
Asn Ile Pro Leu Ile Ile Asn Asp Arg Ile Asp Val Ala Leu Ala Ile
65 70 75 80
aat gca gac gga gta cat ctc gga caa gac gat atg cct ctt gat att 288
Asn Ala Asp Gly Val His Leu Gly Gln Asp Asp Met Pro Leu Asp Ile
85 90 95
gca aga aaa att atg gga gat ggc aaa att ata gga ata tca acc tca 336
Ala Arg Lys Ile Met Gly Asp Gly Lys Ile Ile Gly Ile Ser Thr Ser
100 105 110
act tta gat gaa gct cta atc gct caa caa ggc ggt gca gat tat gta 384
Thr Leu Asp Glu Ala Leu Ile Ala Gln Gln Gly Gly Ala Asp Tyr Val
115 120 125
ggt gta ggt gct atg tac agc aca aac aca aaa act gat gcc aat ttg 432
Gly Val Gly Ala Met Tyr Ser Thr Asn Thr Lys Thr Asp Ala Asn Leu
130 135 140
aca act ata gac gag ctt aca aaa ata aaa aac aat tta aaa ata cct 480
Thr Thr Ile Asp Glu Leu Thr Lys Ile Lys Asn Asn Leu Lys Ile Pro
145 150 155 160
gtt gtt gca atc ggc ggt ata aac ctt gac act ata ccc gct cta aaa 528
Val Val Ala Ile Gly Gly Ile Asn Leu Asp Thr Ile Pro Ala Leu Lys
165 170 175
cct gcg caa ata gac gga gtt gca ata gta tcc gct ata tct atg cag 576
Pro Ala Gln Ile Asp Gly Val Ala Ile Val Ser Ala Ile Ser Met Gln
180 185 190
gaa gat acc gta tct gca aca aga aaa tta aaa aat act ttt ttg aaa 624
Glu Asp Thr Val Ser Ala Thr Arg Lys Leu Lys Asn Thr Phe Leu Lys
195 200 205
caa tat caa act aaa ggc gta ata ttc gat att gac ggt act ctg ctt 672
Gln Tyr Gln Thr Lys Gly Val Ile Phe Asp Ile Asp Gly Thr Leu Leu
210 215 220
gaa act atg aac ata tgg gac aat gta ctt cta aat ctt atg aat acg 720
Glu Thr Met Asn Ile Trp Asp Asn Val Leu Leu Asn Leu Met Asn Thr
225 230 235 240
ctt aat atc cgc tat acc gaa gat gaa ata caa aag ata tgg aat atg 768
Leu Asn Ile Arg Tyr Thr Glu Asp Glu Ile Gln Lys Ile Trp Asn Met
245 250 255
ggt ttt gca gag ctt gca cag ttc agc ata aaa aaa ttc aag ctt gat 816
Gly Phe Ala Glu Leu Ala Gln Phe Ser Ile Lys Lys Phe Lys Leu Asp
260 265 270
atg agt gta aaa gaa ttt tgg caa ctt ata aaa aaa tta tca gtc gaa 864
Met Ser Val Lys Glu Phe Trp Gln Leu Ile Lys Lys Leu Ser Val Glu
275 280 285
gag tat aaa aat agc aaa ata cac tta aaa aaa ggt gca aaa aaa ctg 912
Glu Tyr Lys Asn Ser Lys Ile His Leu Lys Lys Gly Ala Lys Lys Leu
290 295 300
ctt gag tat ctc aaa gaa aaa ggt gta aaa tta gcc ata gca act gcc 960
Leu Glu Tyr Leu Lys Glu Lys Gly Val Lys Leu Ala Ile Ala Thr Ala
305 310 315 320
ctt tgc aaa gaa cag tat gaa ata gtg ctt aca aag aca ggt atc ata 1008
Leu Cys Lys Glu Gln Tyr Glu Ile Val Leu Thr Lys Thr Gly Ile Ile
325 330 335
gac tat ttt gac ata ata gca tca agc gta gat tta aaa atg gaa aaa 1056
Asp Tyr Phe Asp Ile Ile Ala Ser Ser Val Asp Leu Lys Met Glu Lys
340 345 350
tca gat aga caa ata ttt gac tat ata gca aaa aat cta caa gtt cca 1104
Ser Asp Arg Gln Ile Phe Asp Tyr Ile Ala Lys Asn Leu Gln Val Pro
355 360 365
aac aaa aat ttt att ttc ttt gaa gac gac ata aac tcg tca aca ggt 1152
Asn Lys Asn Phe Ile Phe Phe Glu Asp Asp Ile Asn Ser Ser Thr Gly
370 375 380
gca aaa cgt gca gga gta aaa ctg tgc att gta tca aac aag aaa tat 1200
Ala Lys Arg Ala Gly Val Lys Leu Cys Ile Val Ser Asn Lys Lys Tyr
385 390 395 400
aat ggt aac agc aaa ttt gac gct ctc ata gat tat aaa ata gat gat 1248
Asn Gly Asn Ser Lys Phe Asp Ala Leu Ile Asp Tyr Lys Ile Asp Asp
405 410 415
ttt gaa aat aaa ttg ata tat gat gaa ata ata gtg gag aaa aat tag 1296
Phe Glu Asn Lys Leu Ile Tyr Asp Glu Ile Ile Val Glu Lys Asn
420 425 430
<210> 181
<211> 431
<212> PRT
<213> Peptostreptococcaceae bacterium
<400> 181
Met Lys Asn Ile Asp Tyr Thr Met Tyr Tyr Val Thr Asp Glu Asp Leu
1 5 10 15
Leu Ser Ser Asn His Thr Leu Glu Thr Ser Val Gln Asp Ala Ile Leu
20 25 30
Gly Gly Cys Thr Met Ile Gln Leu Arg Glu Lys His Ser Ser Thr Leu
35 40 45
Asp Phe Tyr Asn Lys Ala Val Lys Ile Lys Ala Ile Cys Asp Lys Tyr
50 55 60
Asn Ile Pro Leu Ile Ile Asn Asp Arg Ile Asp Val Ala Leu Ala Ile
65 70 75 80
Asn Ala Asp Gly Val His Leu Gly Gln Asp Asp Met Pro Leu Asp Ile
85 90 95
Ala Arg Lys Ile Met Gly Asp Gly Lys Ile Ile Gly Ile Ser Thr Ser
100 105 110
Thr Leu Asp Glu Ala Leu Ile Ala Gln Gln Gly Gly Ala Asp Tyr Val
115 120 125
Gly Val Gly Ala Met Tyr Ser Thr Asn Thr Lys Thr Asp Ala Asn Leu
130 135 140
Thr Thr Ile Asp Glu Leu Thr Lys Ile Lys Asn Asn Leu Lys Ile Pro
145 150 155 160
Val Val Ala Ile Gly Gly Ile Asn Leu Asp Thr Ile Pro Ala Leu Lys
165 170 175
Pro Ala Gln Ile Asp Gly Val Ala Ile Val Ser Ala Ile Ser Met Gln
180 185 190
Glu Asp Thr Val Ser Ala Thr Arg Lys Leu Lys Asn Thr Phe Leu Lys
195 200 205
Gln Tyr Gln Thr Lys Gly Val Ile Phe Asp Ile Asp Gly Thr Leu Leu
210 215 220
Glu Thr Met Asn Ile Trp Asp Asn Val Leu Leu Asn Leu Met Asn Thr
225 230 235 240
Leu Asn Ile Arg Tyr Thr Glu Asp Glu Ile Gln Lys Ile Trp Asn Met
245 250 255
Gly Phe Ala Glu Leu Ala Gln Phe Ser Ile Lys Lys Phe Lys Leu Asp
260 265 270
Met Ser Val Lys Glu Phe Trp Gln Leu Ile Lys Lys Leu Ser Val Glu
275 280 285
Glu Tyr Lys Asn Ser Lys Ile His Leu Lys Lys Gly Ala Lys Lys Leu
290 295 300
Leu Glu Tyr Leu Lys Glu Lys Gly Val Lys Leu Ala Ile Ala Thr Ala
305 310 315 320
Leu Cys Lys Glu Gln Tyr Glu Ile Val Leu Thr Lys Thr Gly Ile Ile
325 330 335
Asp Tyr Phe Asp Ile Ile Ala Ser Ser Val Asp Leu Lys Met Glu Lys
340 345 350
Ser Asp Arg Gln Ile Phe Asp Tyr Ile Ala Lys Asn Leu Gln Val Pro
355 360 365
Asn Lys Asn Phe Ile Phe Phe Glu Asp Asp Ile Asn Ser Ser Thr Gly
370 375 380
Ala Lys Arg Ala Gly Val Lys Leu Cys Ile Val Ser Asn Lys Lys Tyr
385 390 395 400
Asn Gly Asn Ser Lys Phe Asp Ala Leu Ile Asp Tyr Lys Ile Asp Asp
405 410 415
Phe Glu Asn Lys Leu Ile Tyr Asp Glu Ile Ile Val Glu Lys Asn
420 425 430
<210> 182
<211> 1365
<212> DNA
<213> Atopobium species
<220>
<221> CDS
<222> (1)..(1365)
<223> Atopobium sp. ICM42b gene encoding TMP phosphatase [WP_035427744]
<400> 182
atg cag gtg acc ggt gca att ttt gat tgc gat gga act ctt gtt gat 48
Met Gln Val Thr Gly Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp
1 5 10 15
tca atg cgc gtt tgg cat aac gtt ttt ggc gct gtt ctt cct aaa tat 96
Ser Met Arg Val Trp His Asn Val Phe Gly Ala Val Leu Pro Lys Tyr
20 25 30
ggc aag act att gat tcg gat att ttt gac cgc gta gag gct gtt tcc 144
Gly Lys Thr Ile Asp Ser Asp Ile Phe Asp Arg Val Glu Ala Val Ser
35 40 45
ctc att ggt gga tgt cag att tgc gtt gat gaa ctg gat ttg cct att 192
Leu Ile Gly Gly Cys Gln Ile Cys Val Asp Glu Leu Asp Leu Pro Ile
50 55 60
aca gcg gaa gct ttg tat gaa gag ttc tgc gcg tac gta att gat cag 240
Thr Ala Glu Ala Leu Tyr Glu Glu Phe Cys Ala Tyr Val Ile Asp Gln
65 70 75 80
tac caa cat cat gtt tca atc att ccc ggt gca aag gag ttc tta cag 288
Tyr Gln His His Val Ser Ile Ile Pro Gly Ala Lys Glu Phe Leu Gln
85 90 95
gag ctc tac gat gca ggt att cct atg gcc gtt gct tcg tca act ccc 336
Glu Leu Tyr Asp Ala Gly Ile Pro Met Ala Val Ala Ser Ser Thr Pro
100 105 110
gtg cga gaa gtt cgt gca gct ctg gca gct caa ggt att gag cac ctc 384
Val Arg Glu Val Arg Ala Ala Leu Ala Ala Gln Gly Ile Glu His Leu
115 120 125
ttc aaa aca gtg gtc tca aca gaa gat gtg ggg gga gtg gac aag gtt 432
Phe Lys Thr Val Val Ser Thr Glu Asp Val Gly Gly Val Asp Lys Val
130 135 140
gag cct gat gtt tat ctt gag gct ctt cgc cgt ctt ggc acc gat aag 480
Glu Pro Asp Val Tyr Leu Glu Ala Leu Arg Arg Leu Gly Thr Asp Lys
145 150 155 160
gca act acc tgg gtc ttc gag gat gcc ccg ttt ggc gca cag aca gca 528
Ala Thr Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Ala Gln Thr Ala
165 170 175
caa aat gcg ggc ttt cct gtg gta gcg ctc tac aat gat cat gac ggc 576
Gln Asn Ala Gly Phe Pro Val Val Ala Leu Tyr Asn Asp His Asp Gly
180 185 190
cgc gac ccc gtc ttt atg cgc gag cac tct aac atc ttt gcc cac acc 624
Arg Asp Pro Val Phe Met Arg Glu His Ser Asn Ile Phe Ala His Thr
195 200 205
tac ggc gag ctg tcg ctt ctg cgc ctt cag gac tac gag cgc cct ctg 672
Tyr Gly Glu Leu Ser Leu Leu Arg Leu Gln Asp Tyr Glu Arg Pro Leu
210 215 220
acc gca gcg cct tct ggc gag aaa ccc ctt gag gtc ctt gtt gtg ggc 720
Thr Ala Ala Pro Ser Gly Glu Lys Pro Leu Glu Val Leu Val Val Gly
225 230 235 240
gga tcc cca gag gcg gtt tca cac acg acg ctg tct acc tgc gcc caa 768
Gly Ser Pro Glu Ala Val Ser His Thr Thr Leu Ser Thr Cys Ala Gln
245 250 255
agc gct gac tac ctg ata gcg gtt gac cat ggt gca gat gca tgt cac 816
Ser Ala Asp Tyr Leu Ile Ala Val Asp His Gly Ala Asp Ala Cys His
260 265 270
gct gcc ggc gtg att cca cag ctt gcg ctt gga gac ttt gac tcg gct 864
Ala Ala Gly Val Ile Pro Gln Leu Ala Leu Gly Asp Phe Asp Ser Ala
275 280 285
aca cca gaa act ctg gct tgg ctc aaa gag cag cag gta cct tgc atg 912
Thr Pro Glu Thr Leu Ala Trp Leu Lys Glu Gln Gln Val Pro Cys Met
290 295 300
aag ttt aat gcg gac aag tac gat acc gac ctg gct ctt gct tta aag 960
Lys Phe Asn Ala Asp Lys Tyr Asp Thr Asp Leu Ala Leu Ala Leu Lys
305 310 315 320
tcc gcc gag tac gag gct att cgt aga gat agc aag ctc tct ctt acg 1008
Ser Ala Glu Tyr Glu Ala Ile Arg Arg Asp Ser Lys Leu Ser Leu Thr
325 330 335
gtt gtc tcc aca tct ggc gga cac ctt gat cac cag ctt gta gtg ctt 1056
Val Val Ser Thr Ser Gly Gly His Leu Asp His Gln Leu Val Val Leu
340 345 350
ggt ctt ctc gcc acg tgg gca aag acg ggc aag gca agt gtt cga gtt 1104
Gly Leu Leu Ala Thr Trp Ala Lys Thr Gly Lys Ala Ser Val Arg Val
355 360 365
att gaa aat gac ttt gag atg cgc ttt tta act gca ggc cag gtt gat 1152
Ile Glu Asn Asp Phe Glu Met Arg Phe Leu Thr Ala Gly Gln Val Asp
370 375 380
tct tgg cag ctg agc gca act gat gta ggt aaa aag atg tcc ctt gtg 1200
Ser Trp Gln Leu Ser Ala Thr Asp Val Gly Lys Lys Met Ser Leu Val
385 390 395 400
gct ttg tca gag gag tgc gag gtt tct gag gcc ggc atg aag tgg aat 1248
Ala Leu Ser Glu Glu Cys Glu Val Ser Glu Ala Gly Met Lys Trp Asn
405 410 415
ctt gat cac cag aag ttc acc ttg ctg gga gac gac ggt att tca aat 1296
Leu Asp His Gln Lys Phe Thr Leu Leu Gly Asp Asp Gly Ile Ser Asn
420 425 430
atc gtc gaa tca gac aat tcc tgg gta agg tgc gag aag ggc tgt ctt 1344
Ile Val Glu Ser Asp Asn Ser Trp Val Arg Cys Glu Lys Gly Cys Leu
435 440 445
ttg gtg cag ctt tgg aac taa 1365
Leu Val Gln Leu Trp Asn
450
<210> 183
<211> 454
<212> PRT
<213> Atopobium species
<400> 183
Met Gln Val Thr Gly Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp
1 5 10 15
Ser Met Arg Val Trp His Asn Val Phe Gly Ala Val Leu Pro Lys Tyr
20 25 30
Gly Lys Thr Ile Asp Ser Asp Ile Phe Asp Arg Val Glu Ala Val Ser
35 40 45
Leu Ile Gly Gly Cys Gln Ile Cys Val Asp Glu Leu Asp Leu Pro Ile
50 55 60
Thr Ala Glu Ala Leu Tyr Glu Glu Phe Cys Ala Tyr Val Ile Asp Gln
65 70 75 80
Tyr Gln His His Val Ser Ile Ile Pro Gly Ala Lys Glu Phe Leu Gln
85 90 95
Glu Leu Tyr Asp Ala Gly Ile Pro Met Ala Val Ala Ser Ser Thr Pro
100 105 110
Val Arg Glu Val Arg Ala Ala Leu Ala Ala Gln Gly Ile Glu His Leu
115 120 125
Phe Lys Thr Val Val Ser Thr Glu Asp Val Gly Gly Val Asp Lys Val
130 135 140
Glu Pro Asp Val Tyr Leu Glu Ala Leu Arg Arg Leu Gly Thr Asp Lys
145 150 155 160
Ala Thr Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Ala Gln Thr Ala
165 170 175
Gln Asn Ala Gly Phe Pro Val Val Ala Leu Tyr Asn Asp His Asp Gly
180 185 190
Arg Asp Pro Val Phe Met Arg Glu His Ser Asn Ile Phe Ala His Thr
195 200 205
Tyr Gly Glu Leu Ser Leu Leu Arg Leu Gln Asp Tyr Glu Arg Pro Leu
210 215 220
Thr Ala Ala Pro Ser Gly Glu Lys Pro Leu Glu Val Leu Val Val Gly
225 230 235 240
Gly Ser Pro Glu Ala Val Ser His Thr Thr Leu Ser Thr Cys Ala Gln
245 250 255
Ser Ala Asp Tyr Leu Ile Ala Val Asp His Gly Ala Asp Ala Cys His
260 265 270
Ala Ala Gly Val Ile Pro Gln Leu Ala Leu Gly Asp Phe Asp Ser Ala
275 280 285
Thr Pro Glu Thr Leu Ala Trp Leu Lys Glu Gln Gln Val Pro Cys Met
290 295 300
Lys Phe Asn Ala Asp Lys Tyr Asp Thr Asp Leu Ala Leu Ala Leu Lys
305 310 315 320
Ser Ala Glu Tyr Glu Ala Ile Arg Arg Asp Ser Lys Leu Ser Leu Thr
325 330 335
Val Val Ser Thr Ser Gly Gly His Leu Asp His Gln Leu Val Val Leu
340 345 350
Gly Leu Leu Ala Thr Trp Ala Lys Thr Gly Lys Ala Ser Val Arg Val
355 360 365
Ile Glu Asn Asp Phe Glu Met Arg Phe Leu Thr Ala Gly Gln Val Asp
370 375 380
Ser Trp Gln Leu Ser Ala Thr Asp Val Gly Lys Lys Met Ser Leu Val
385 390 395 400
Ala Leu Ser Glu Glu Cys Glu Val Ser Glu Ala Gly Met Lys Trp Asn
405 410 415
Leu Asp His Gln Lys Phe Thr Leu Leu Gly Asp Asp Gly Ile Ser Asn
420 425 430
Ile Val Glu Ser Asp Asn Ser Trp Val Arg Cys Glu Lys Gly Cys Leu
435 440 445
Leu Val Gln Leu Trp Asn
450
<210> 184
<211> 1365
<212> DNA
<213> Atopobium parvulum
<220>
<221> CDS
<222> (1)..(1365)
<223> Atopobium parvulum gene encoding TMP phosphatase [WP_035433109]
<400> 184
atg cag gtg acc ggt gca att ttt gat tgc gat gga act ctt gtt gat 48
Met Gln Val Thr Gly Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp
1 5 10 15
tca atg cac gtt tgg cac aac gtt ttt ggc gct gtt ctt cct aaa tac 96
Ser Met His Val Trp His Asn Val Phe Gly Ala Val Leu Pro Lys Tyr
20 25 30
ggc aag act att gat tcg gat att ttt gac cgc gta gag gct gtt tcc 144
Gly Lys Thr Ile Asp Ser Asp Ile Phe Asp Arg Val Glu Ala Val Ser
35 40 45
ctc att ggt gga tgt cag att tgc gtt gat gag ctg gat ttg cct att 192
Leu Ile Gly Gly Cys Gln Ile Cys Val Asp Glu Leu Asp Leu Pro Ile
50 55 60
aca gcg gaa gct tta tat gaa gag ttc tgc gcg tac gta act gat cag 240
Thr Ala Glu Ala Leu Tyr Glu Glu Phe Cys Ala Tyr Val Thr Asp Gln
65 70 75 80
tac cga cat cat gtt tca atc att ccc ggt gca aag gag ttc tta cag 288
Tyr Arg His His Val Ser Ile Ile Pro Gly Ala Lys Glu Phe Leu Gln
85 90 95
gaa ctc cac gac gca ggc att cct atg gcc gtt gct tcg tca act ccc 336
Glu Leu His Asp Ala Gly Ile Pro Met Ala Val Ala Ser Ser Thr Pro
100 105 110
gtg cga gaa gtt cgt gca gct ctg gca gct caa ggt att gag cac ctc 384
Val Arg Glu Val Arg Ala Ala Leu Ala Ala Gln Gly Ile Glu His Leu
115 120 125
ttt aaa aca gtg gtc tca acg gaa gat gtg ggg gga gtg gac aag gtt 432
Phe Lys Thr Val Val Ser Thr Glu Asp Val Gly Gly Val Asp Lys Val
130 135 140
gag cca gat gtt tac ctt gag gct ctt cgc cgt ctt ggc act gat aag 480
Glu Pro Asp Val Tyr Leu Glu Ala Leu Arg Arg Leu Gly Thr Asp Lys
145 150 155 160
gca act acc tgg gtc ttc gag gat gct ccg ttt ggc gca cag aca gca 528
Ala Thr Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Ala Gln Thr Ala
165 170 175
caa aat gca ggc ttt cct gtg gct gta ctc tac aac gac cac gat ggc 576
Gln Asn Ala Gly Phe Pro Val Ala Val Leu Tyr Asn Asp His Asp Gly
180 185 190
cgc gac ccc gtc ttt atg cgc gag cac tct aac atc ttt gcc cac acc 624
Arg Asp Pro Val Phe Met Arg Glu His Ser Asn Ile Phe Ala His Thr
195 200 205
tac ggc gag ctg tcg ctt ctg cgc ctt cag gac tac gag cgc cct ctg 672
Tyr Gly Glu Leu Ser Leu Leu Arg Leu Gln Asp Tyr Glu Arg Pro Leu
210 215 220
acc gca gcg cct tct ggc gag aaa ccc ctt gag gtc ctt gtt gtg ggc 720
Thr Ala Ala Pro Ser Gly Glu Lys Pro Leu Glu Val Leu Val Val Gly
225 230 235 240
gga tcc cca gag gcg gtt tcg cac acg acg ctg tct acc tgc gcc caa 768
Gly Ser Pro Glu Ala Val Ser His Thr Thr Leu Ser Thr Cys Ala Gln
245 250 255
agc gct gac tac ctg ata gcg gtt gac cat ggc gca gat gtc tgt cac 816
Ser Ala Asp Tyr Leu Ile Ala Val Asp His Gly Ala Asp Val Cys His
260 265 270
gct gcc ggc gtg att cca caa ctt gcg ctt gga gac ttt gac tcc gct 864
Ala Ala Gly Val Ile Pro Gln Leu Ala Leu Gly Asp Phe Asp Ser Ala
275 280 285
aca cca gaa act ctg gct tgg ctc aaa gag cag cag gta cct tgc atg 912
Thr Pro Glu Thr Leu Ala Trp Leu Lys Glu Gln Gln Val Pro Cys Met
290 295 300
aag ttt aat gcg gac aag tac gat acc gac ctg gcg cta gca ttg aaa 960
Lys Phe Asn Ala Asp Lys Tyr Asp Thr Asp Leu Ala Leu Ala Leu Lys
305 310 315 320
tca gct gaa tat gag gca att cgt aga gat agc aag ctc tct ctg acg 1008
Ser Ala Glu Tyr Glu Ala Ile Arg Arg Asp Ser Lys Leu Ser Leu Thr
325 330 335
gtt gtc tcc aca tct ggc ggc cac ctt gat cac cag ctt gta gtg ctt 1056
Val Val Ser Thr Ser Gly Gly His Leu Asp His Gln Leu Val Val Leu
340 345 350
ggt ctt ctc gcc acg tgg gca aag acg ggc aag gca agc gtt cga gtt 1104
Gly Leu Leu Ala Thr Trp Ala Lys Thr Gly Lys Ala Ser Val Arg Val
355 360 365
att gag aat gac ttt gag atg cgc ttt tta gtt gct ggc cag gtg gat 1152
Ile Glu Asn Asp Phe Glu Met Arg Phe Leu Val Ala Gly Gln Val Asp
370 375 380
tct tgg cag ctg aac act atc aat gta ggt aaa aag att tct ctt gta 1200
Ser Trp Gln Leu Asn Thr Ile Asn Val Gly Lys Lys Ile Ser Leu Val
385 390 395 400
gct ttg tca gag gag tgc gag gtt tct gag gcc ggc atg aag tgg aat 1248
Ala Leu Ser Glu Glu Cys Glu Val Ser Glu Ala Gly Met Lys Trp Asn
405 410 415
ctt gat cac cag aag ttc acc ttg ctg gga gac gac ggt att tca aac 1296
Leu Asp His Gln Lys Phe Thr Leu Leu Gly Asp Asp Gly Ile Ser Asn
420 425 430
ata gtt gaa tca gac aat tcc tgg gta agg tgc gag aag ggc tgt ctt 1344
Ile Val Glu Ser Asp Asn Ser Trp Val Arg Cys Glu Lys Gly Cys Leu
435 440 445
ttg gtg cag ctt tgg aac taa 1365
Leu Val Gln Leu Trp Asn
450
<210> 185
<211> 454
<212> PRT
<213> Atopobium parvulum
<400> 185
Met Gln Val Thr Gly Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp
1 5 10 15
Ser Met His Val Trp His Asn Val Phe Gly Ala Val Leu Pro Lys Tyr
20 25 30
Gly Lys Thr Ile Asp Ser Asp Ile Phe Asp Arg Val Glu Ala Val Ser
35 40 45
Leu Ile Gly Gly Cys Gln Ile Cys Val Asp Glu Leu Asp Leu Pro Ile
50 55 60
Thr Ala Glu Ala Leu Tyr Glu Glu Phe Cys Ala Tyr Val Thr Asp Gln
65 70 75 80
Tyr Arg His His Val Ser Ile Ile Pro Gly Ala Lys Glu Phe Leu Gln
85 90 95
Glu Leu His Asp Ala Gly Ile Pro Met Ala Val Ala Ser Ser Thr Pro
100 105 110
Val Arg Glu Val Arg Ala Ala Leu Ala Ala Gln Gly Ile Glu His Leu
115 120 125
Phe Lys Thr Val Val Ser Thr Glu Asp Val Gly Gly Val Asp Lys Val
130 135 140
Glu Pro Asp Val Tyr Leu Glu Ala Leu Arg Arg Leu Gly Thr Asp Lys
145 150 155 160
Ala Thr Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Ala Gln Thr Ala
165 170 175
Gln Asn Ala Gly Phe Pro Val Ala Val Leu Tyr Asn Asp His Asp Gly
180 185 190
Arg Asp Pro Val Phe Met Arg Glu His Ser Asn Ile Phe Ala His Thr
195 200 205
Tyr Gly Glu Leu Ser Leu Leu Arg Leu Gln Asp Tyr Glu Arg Pro Leu
210 215 220
Thr Ala Ala Pro Ser Gly Glu Lys Pro Leu Glu Val Leu Val Val Gly
225 230 235 240
Gly Ser Pro Glu Ala Val Ser His Thr Thr Leu Ser Thr Cys Ala Gln
245 250 255
Ser Ala Asp Tyr Leu Ile Ala Val Asp His Gly Ala Asp Val Cys His
260 265 270
Ala Ala Gly Val Ile Pro Gln Leu Ala Leu Gly Asp Phe Asp Ser Ala
275 280 285
Thr Pro Glu Thr Leu Ala Trp Leu Lys Glu Gln Gln Val Pro Cys Met
290 295 300
Lys Phe Asn Ala Asp Lys Tyr Asp Thr Asp Leu Ala Leu Ala Leu Lys
305 310 315 320
Ser Ala Glu Tyr Glu Ala Ile Arg Arg Asp Ser Lys Leu Ser Leu Thr
325 330 335
Val Val Ser Thr Ser Gly Gly His Leu Asp His Gln Leu Val Val Leu
340 345 350
Gly Leu Leu Ala Thr Trp Ala Lys Thr Gly Lys Ala Ser Val Arg Val
355 360 365
Ile Glu Asn Asp Phe Glu Met Arg Phe Leu Val Ala Gly Gln Val Asp
370 375 380
Ser Trp Gln Leu Asn Thr Ile Asn Val Gly Lys Lys Ile Ser Leu Val
385 390 395 400
Ala Leu Ser Glu Glu Cys Glu Val Ser Glu Ala Gly Met Lys Trp Asn
405 410 415
Leu Asp His Gln Lys Phe Thr Leu Leu Gly Asp Asp Gly Ile Ser Asn
420 425 430
Ile Val Glu Ser Asp Asn Ser Trp Val Arg Cys Glu Lys Gly Cys Leu
435 440 445
Leu Val Gln Leu Trp Asn
450
<210> 186
<211> 1383
<212> DNA
<213> Atopobium rimae
<220>
<221> CDS
<222> (1)..(1383)
<223> Atopobium rimae gene encoding TMP phosphatase [WP_003148415]
<400> 186
atg cag ata acg ggt gca atc ttt gat ctt gat ggg aca ctg gtt gac 48
Met Gln Ile Thr Gly Ala Ile Phe Asp Leu Asp Gly Thr Leu Val Asp
1 5 10 15
tcc atg tgg atg tgg aga aga tcg ttc gga gat gtt tta gaa gac ctg 96
Ser Met Trp Met Trp Arg Arg Ser Phe Gly Asp Val Leu Glu Asp Leu
20 25 30
cat atc aat atg act ccg gat ttt ttt aaa agg gtc gag gcc att tcg 144
His Ile Asn Met Thr Pro Asp Phe Phe Lys Arg Val Glu Ala Ile Ser
35 40 45
ctt tac gat ggt tgc gta gcg tgt att gag gaa ttt aat ctt cct tta 192
Leu Tyr Asp Gly Cys Val Ala Cys Ile Glu Glu Phe Asn Leu Pro Leu
50 55 60
tcc gca gaa gag ctg tat gaa aag ttc ctt ttg tat gta caa acg gta 240
Ser Ala Glu Glu Leu Tyr Glu Lys Phe Leu Leu Tyr Val Gln Thr Val
65 70 75 80
tat tcg cac gat att aaa agc att gcg ggg gct acc gac ttt ctc cag 288
Tyr Ser His Asp Ile Lys Ser Ile Ala Gly Ala Thr Asp Phe Leu Gln
85 90 95
gaa ctt ttt gac gca gga ata cct ctt gcc att gct tct tct acg cca 336
Glu Leu Phe Asp Ala Gly Ile Pro Leu Ala Ile Ala Ser Ser Thr Pro
100 105 110
tct cgt gcc ata cat gtt gct ctt gaa gcc caa ggt atg gag aag ttt 384
Ser Arg Ala Ile His Val Ala Leu Glu Ala Gln Gly Met Glu Lys Phe
115 120 125
ttt aaa gcg gtt gtg tgt acc gaa gac gtc ggg ggt gtc gat aaa gca 432
Phe Lys Ala Val Val Cys Thr Glu Asp Val Gly Gly Val Asp Lys Ala
130 135 140
aaa ccc gat gtc tat ctt gag gct ctc aga cgc ctg ggc acc gat aaa 480
Lys Pro Asp Val Tyr Leu Glu Ala Leu Arg Arg Leu Gly Thr Asp Lys
145 150 155 160
gca cac acg tgg gtc ttt gag gac gct gag ttt ggt gta cat acg gca 528
Ala His Thr Trp Val Phe Glu Asp Ala Glu Phe Gly Val His Thr Ala
165 170 175
caa acc gag ggc ttt ccc gtt gtt gcg ctg ttc aat ggc aaa gac ggc 576
Gln Thr Glu Gly Phe Pro Val Val Ala Leu Phe Asn Gly Lys Asp Gly
180 185 190
cgt gat ctt gag tat atg aag gcg cac tct gat ctt ctc gca cat gat 624
Arg Asp Leu Glu Tyr Met Lys Ala His Ser Asp Leu Leu Ala His Asp
195 200 205
tat cga gaa ctc tct ctt gcc cgc att tac gat tat gaa cgg gtg acg 672
Tyr Arg Glu Leu Ser Leu Ala Arg Ile Tyr Asp Tyr Glu Arg Val Thr
210 215 220
aat cag cca cat ctg ggc gcc tca tcg gct cag aag gtc ttt tcg gtt 720
Asn Gln Pro His Leu Gly Ala Ser Ser Ala Gln Lys Val Phe Ser Val
225 230 235 240
ctc gtt gtt gat gga tct ccc acg cca agt tca gcc gcg ctg gtt tca 768
Leu Val Val Asp Gly Ser Pro Thr Pro Ser Ser Ala Ala Leu Val Ser
245 250 255
gaa ctt tca tca tgc tcg gat tat gtc gtt gct gca gat cgc ggg gca 816
Glu Leu Ser Ser Cys Ser Asp Tyr Val Val Ala Ala Asp Arg Gly Ala
260 265 270
tat atc tgc aag gag gcc ggt gtc gtt cct gat att gcg tgc gga gac 864
Tyr Ile Cys Lys Glu Ala Gly Val Val Pro Asp Ile Ala Cys Gly Asp
275 280 285
ttt gat tcc gtg gga gaa gag aca ctc tct tgg atc cat gca caa aag 912
Phe Asp Ser Val Gly Glu Glu Thr Leu Ser Trp Ile His Ala Gln Lys
290 295 300
gtg cac acg att gct tat cct caa gat aag tac gag acc gat ttg tct 960
Val His Thr Ile Ala Tyr Pro Gln Asp Lys Tyr Glu Thr Asp Leu Ser
305 310 315 320
ctt gca ctc aat gcc gct tgc cat gaa gca acc cgt caa gca ctt ccg 1008
Leu Ala Leu Asn Ala Ala Cys His Glu Ala Thr Arg Gln Ala Leu Pro
325 330 335
ctg tca ctg aca ctt acc tgc gct tcc ggc ggc agg ctt gat cat gag 1056
Leu Ser Leu Thr Leu Thr Cys Ala Ser Gly Gly Arg Leu Asp His Glu
340 345 350
ctt ggt gta gtg ggg ctt ctg gct cga tta agc act gcc tca tgg agg 1104
Leu Gly Val Val Gly Leu Leu Ala Arg Leu Ser Thr Ala Ser Trp Arg
355 360 365
gtg cgg att gtt gag gat gcc ttt gaa gca agg att ctt tcg gca gat 1152
Val Arg Ile Val Glu Asp Ala Phe Glu Ala Arg Ile Leu Ser Ala Asp
370 375 380
acg tat gcg gcg tgg agg ctc tca gaa aaa gat cga gga aag aca ctg 1200
Thr Tyr Ala Ala Trp Arg Leu Ser Glu Lys Asp Arg Gly Lys Thr Leu
385 390 395 400
tcg gtg ctt ccg ctt cag gaa gaa acg gtg att acc gag atc ggt atg 1248
Ser Val Leu Pro Leu Gln Glu Glu Thr Val Ile Thr Glu Ile Gly Met
405 410 415
caa tgg gac ctt gcc tca cga act ttg ctg ctc ctg tct gat gaa gga 1296
Gln Trp Asp Leu Ala Ser Arg Thr Leu Leu Leu Leu Ser Asp Glu Gly
420 425 430
att tcc aat gtg gta caa acg gat gtg gca caa ata cat tgc gag aag 1344
Ile Ser Asn Val Val Gln Thr Asp Val Ala Gln Ile His Cys Glu Lys
435 440 445
ggc aag gcg ctc gtg gtg ctt ctc gca aat gaa tcg tga 1383
Gly Lys Ala Leu Val Val Leu Leu Ala Asn Glu Ser
450 455 460
<210> 187
<211> 460
<212> PRT
<213> Atopobium rimae
<400> 187
Met Gln Ile Thr Gly Ala Ile Phe Asp Leu Asp Gly Thr Leu Val Asp
1 5 10 15
Ser Met Trp Met Trp Arg Arg Ser Phe Gly Asp Val Leu Glu Asp Leu
20 25 30
His Ile Asn Met Thr Pro Asp Phe Phe Lys Arg Val Glu Ala Ile Ser
35 40 45
Leu Tyr Asp Gly Cys Val Ala Cys Ile Glu Glu Phe Asn Leu Pro Leu
50 55 60
Ser Ala Glu Glu Leu Tyr Glu Lys Phe Leu Leu Tyr Val Gln Thr Val
65 70 75 80
Tyr Ser His Asp Ile Lys Ser Ile Ala Gly Ala Thr Asp Phe Leu Gln
85 90 95
Glu Leu Phe Asp Ala Gly Ile Pro Leu Ala Ile Ala Ser Ser Thr Pro
100 105 110
Ser Arg Ala Ile His Val Ala Leu Glu Ala Gln Gly Met Glu Lys Phe
115 120 125
Phe Lys Ala Val Val Cys Thr Glu Asp Val Gly Gly Val Asp Lys Ala
130 135 140
Lys Pro Asp Val Tyr Leu Glu Ala Leu Arg Arg Leu Gly Thr Asp Lys
145 150 155 160
Ala His Thr Trp Val Phe Glu Asp Ala Glu Phe Gly Val His Thr Ala
165 170 175
Gln Thr Glu Gly Phe Pro Val Val Ala Leu Phe Asn Gly Lys Asp Gly
180 185 190
Arg Asp Leu Glu Tyr Met Lys Ala His Ser Asp Leu Leu Ala His Asp
195 200 205
Tyr Arg Glu Leu Ser Leu Ala Arg Ile Tyr Asp Tyr Glu Arg Val Thr
210 215 220
Asn Gln Pro His Leu Gly Ala Ser Ser Ala Gln Lys Val Phe Ser Val
225 230 235 240
Leu Val Val Asp Gly Ser Pro Thr Pro Ser Ser Ala Ala Leu Val Ser
245 250 255
Glu Leu Ser Ser Cys Ser Asp Tyr Val Val Ala Ala Asp Arg Gly Ala
260 265 270
Tyr Ile Cys Lys Glu Ala Gly Val Val Pro Asp Ile Ala Cys Gly Asp
275 280 285
Phe Asp Ser Val Gly Glu Glu Thr Leu Ser Trp Ile His Ala Gln Lys
290 295 300
Val His Thr Ile Ala Tyr Pro Gln Asp Lys Tyr Glu Thr Asp Leu Ser
305 310 315 320
Leu Ala Leu Asn Ala Ala Cys His Glu Ala Thr Arg Gln Ala Leu Pro
325 330 335
Leu Ser Leu Thr Leu Thr Cys Ala Ser Gly Gly Arg Leu Asp His Glu
340 345 350
Leu Gly Val Val Gly Leu Leu Ala Arg Leu Ser Thr Ala Ser Trp Arg
355 360 365
Val Arg Ile Val Glu Asp Ala Phe Glu Ala Arg Ile Leu Ser Ala Asp
370 375 380
Thr Tyr Ala Ala Trp Arg Leu Ser Glu Lys Asp Arg Gly Lys Thr Leu
385 390 395 400
Ser Val Leu Pro Leu Gln Glu Glu Thr Val Ile Thr Glu Ile Gly Met
405 410 415
Gln Trp Asp Leu Ala Ser Arg Thr Leu Leu Leu Leu Ser Asp Glu Gly
420 425 430
Ile Ser Asn Val Val Gln Thr Asp Val Ala Gln Ile His Cys Glu Lys
435 440 445
Gly Lys Ala Leu Val Val Leu Leu Ala Asn Glu Ser
450 455 460
<210> 188
<211> 1380
<212> DNA
<213> Olsenella uli
<220>
<221> CDS
<222> (1)..(1380)
<223> Olsenella uli gene encoding TMP phosphatase [WP_013251930]
<400> 188
atg ccc atc aag gcc gcc atc ttc gac tgt gac gga acg ctg gtc gac 48
Met Pro Ile Lys Ala Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp
1 5 10 15
tcc atg ccc ctg tgg cat gac gtg acg gtc gaa ctg ctg cgc cgc cac 96
Ser Met Pro Leu Trp His Asp Val Thr Val Glu Leu Leu Arg Arg His
20 25 30
cat gtc gcc gac gcc gag gag gcg ttc gtc cgc acc gag tcg ctt ccc 144
His Val Ala Asp Ala Glu Glu Ala Phe Val Arg Thr Glu Ser Leu Pro
35 40 45
atg gtc gag atg tgc cat gcc ttc cac gac gag tgg ggc gtt gag gcc 192
Met Val Glu Met Cys His Ala Phe His Asp Glu Trp Gly Val Glu Ala
50 55 60
gag ggc gag gag ctg gtg cgc gag ctg gtc gat atg gtc cgc gag ggg 240
Glu Gly Glu Glu Leu Val Arg Glu Leu Val Asp Met Val Arg Glu Gly
65 70 75 80
tat cgc agc cgg gtt agc ctg ctg ccg ggc tgc cgg gcg ttt ctg gac 288
Tyr Arg Ser Arg Val Ser Leu Leu Pro Gly Cys Arg Ala Phe Leu Asp
85 90 95
gag ctg gcg tct gcg ggc gtc cgc atg gtc gtc gcg tcg tcg acg gct 336
Glu Leu Ala Ser Ala Gly Val Arg Met Val Val Ala Ser Ser Thr Ala
100 105 110
ccg gag gag ctc tcc gtc gcg cta tcg gcg cag ggg gtc gac ggc tac 384
Pro Glu Glu Leu Ser Val Ala Leu Ser Ala Gln Gly Val Asp Gly Tyr
115 120 125
ttc gag cgg gtc ttc tcc acg gga ggc ccc ata cgc agc aag gac tac 432
Phe Glu Arg Val Phe Ser Thr Gly Gly Pro Ile Arg Ser Lys Asp Tyr
130 135 140
ccg gac atc tgg gag ctg gtc ctg gac tac ctg ggc acc gac ccg gct 480
Pro Asp Ile Trp Glu Leu Val Leu Asp Tyr Leu Gly Thr Asp Pro Ala
145 150 155 160
gac acc tgg gtc ttc gag gac gcc ccg ttt ggg atg cgg acg gcc cga 528
Asp Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Met Arg Thr Ala Arg
165 170 175
tcg gtc ggc gcc aac acc gtc tgc ctg ttc agc cca cac ggg gac cgc 576
Ser Val Gly Ala Asn Thr Val Cys Leu Phe Ser Pro His Gly Asp Arg
180 185 190
gac ctt gcg gcc tgc gag cgc tac gct gac ata ctg gtc cac agc tac 624
Asp Leu Ala Ala Cys Glu Arg Tyr Ala Asp Ile Leu Val His Ser Tyr
195 200 205
cac gag cta tcg ctc gcc ctg ctg gac gac tac gcc cgt ccg ccg caa 672
His Glu Leu Ser Leu Ala Leu Leu Asp Asp Tyr Ala Arg Pro Pro Gln
210 215 220
gcg tcc ccc tcg gcc cac cct cgc ctc gcg ccg ctt cgc gtc ctc gtc 720
Ala Ser Pro Ser Ala His Pro Arg Leu Ala Pro Leu Arg Val Leu Val
225 230 235 240
gtg ggc gcc tcg ccc gag cgc ccg tct tcg gcg ctg ctc cgc tcc ctg 768
Val Gly Ala Ser Pro Glu Arg Pro Ser Ser Ala Leu Leu Arg Ser Leu
245 250 255
gcc gcc agt acc gac tac gtc atc gcc gcc gac gcc ggg gcc gac gcg 816
Ala Ala Ser Thr Asp Tyr Val Ile Ala Ala Asp Ala Gly Ala Asp Ala
260 265 270
ctg cgc tcc tgt ggc atc gcc ccc gac gtc ttc tgc ggc gac gcc gac 864
Leu Arg Ser Cys Gly Ile Ala Pro Asp Val Phe Cys Gly Asp Ala Asp
275 280 285
tcg gca acg ggc gaa tcg gct gcg tgg gcc cgc tcg gtc gcc cgt gcg 912
Ser Ala Thr Gly Glu Ser Ala Ala Trp Ala Arg Ser Val Ala Arg Ala
290 295 300
gac ata gag ttt ccc tcc gag aag tac gcg acc gac ctc gcc ctc gcc 960
Asp Ile Glu Phe Pro Ser Glu Lys Tyr Ala Thr Asp Leu Ala Leu Ala
305 310 315 320
atc tcc tgc gcc cgc cat gag gcc gct cga cgc aac gcg cgg ctg gag 1008
Ile Ser Cys Ala Arg His Glu Ala Ala Arg Arg Asn Ala Arg Leu Glu
325 330 335
ctc acg ctg acc ggc gtc acg ggc ggc agg ccc gac cac gcc ctt gcc 1056
Leu Thr Leu Thr Gly Val Thr Gly Gly Arg Pro Asp His Ala Leu Ala
340 345 350
gtc gtg ggt cag ctc gcg cgg aac gct gac gcc tcg ccg cgc atc gtg 1104
Val Val Gly Gln Leu Ala Arg Asn Ala Asp Ala Ser Pro Arg Ile Val
355 360 365
gag gac ggc ttc gag tgc cga ctg ctc agc ccc tct ggc act gcg tgc 1152
Glu Asp Gly Phe Glu Cys Arg Leu Leu Ser Pro Ser Gly Thr Ala Cys
370 375 380
tgg gag ctg ggt ggg gcc cac gtg cca gcc gcc ggg gtc gag ggg acg 1200
Trp Glu Leu Gly Gly Ala His Val Pro Ala Ala Gly Val Glu Gly Thr
385 390 395 400
ctc ttc tcg gcc att ccc gtg gca gag ggg acc atg ctc tcc gag cgg 1248
Leu Phe Ser Ala Ile Pro Val Ala Glu Gly Thr Met Leu Ser Glu Arg
405 410 415
ggc ttc aag tgg gag ctg gat cat cgt gag ctg ccc ctt ctg ggg gat 1296
Gly Phe Lys Trp Glu Leu Asp His Arg Glu Leu Pro Leu Leu Gly Asp
420 425 430
gag gga atc tcg aac gtg gtc acg tcc gcg acg gcc agc gtc gag tgc 1344
Glu Gly Ile Ser Asn Val Val Thr Ser Ala Thr Ala Ser Val Glu Cys
435 440 445
cat gcc ggc gca gtt gcg gcg ttc ctg ttg gca tag 1380
His Ala Gly Ala Val Ala Ala Phe Leu Leu Ala
450 455
<210> 189
<211> 459
<212> PRT
<213> Olsenella uli
<400> 189
Met Pro Ile Lys Ala Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp
1 5 10 15
Ser Met Pro Leu Trp His Asp Val Thr Val Glu Leu Leu Arg Arg His
20 25 30
His Val Ala Asp Ala Glu Glu Ala Phe Val Arg Thr Glu Ser Leu Pro
35 40 45
Met Val Glu Met Cys His Ala Phe His Asp Glu Trp Gly Val Glu Ala
50 55 60
Glu Gly Glu Glu Leu Val Arg Glu Leu Val Asp Met Val Arg Glu Gly
65 70 75 80
Tyr Arg Ser Arg Val Ser Leu Leu Pro Gly Cys Arg Ala Phe Leu Asp
85 90 95
Glu Leu Ala Ser Ala Gly Val Arg Met Val Val Ala Ser Ser Thr Ala
100 105 110
Pro Glu Glu Leu Ser Val Ala Leu Ser Ala Gln Gly Val Asp Gly Tyr
115 120 125
Phe Glu Arg Val Phe Ser Thr Gly Gly Pro Ile Arg Ser Lys Asp Tyr
130 135 140
Pro Asp Ile Trp Glu Leu Val Leu Asp Tyr Leu Gly Thr Asp Pro Ala
145 150 155 160
Asp Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Met Arg Thr Ala Arg
165 170 175
Ser Val Gly Ala Asn Thr Val Cys Leu Phe Ser Pro His Gly Asp Arg
180 185 190
Asp Leu Ala Ala Cys Glu Arg Tyr Ala Asp Ile Leu Val His Ser Tyr
195 200 205
His Glu Leu Ser Leu Ala Leu Leu Asp Asp Tyr Ala Arg Pro Pro Gln
210 215 220
Ala Ser Pro Ser Ala His Pro Arg Leu Ala Pro Leu Arg Val Leu Val
225 230 235 240
Val Gly Ala Ser Pro Glu Arg Pro Ser Ser Ala Leu Leu Arg Ser Leu
245 250 255
Ala Ala Ser Thr Asp Tyr Val Ile Ala Ala Asp Ala Gly Ala Asp Ala
260 265 270
Leu Arg Ser Cys Gly Ile Ala Pro Asp Val Phe Cys Gly Asp Ala Asp
275 280 285
Ser Ala Thr Gly Glu Ser Ala Ala Trp Ala Arg Ser Val Ala Arg Ala
290 295 300
Asp Ile Glu Phe Pro Ser Glu Lys Tyr Ala Thr Asp Leu Ala Leu Ala
305 310 315 320
Ile Ser Cys Ala Arg His Glu Ala Ala Arg Arg Asn Ala Arg Leu Glu
325 330 335
Leu Thr Leu Thr Gly Val Thr Gly Gly Arg Pro Asp His Ala Leu Ala
340 345 350
Val Val Gly Gln Leu Ala Arg Asn Ala Asp Ala Ser Pro Arg Ile Val
355 360 365
Glu Asp Gly Phe Glu Cys Arg Leu Leu Ser Pro Ser Gly Thr Ala Cys
370 375 380
Trp Glu Leu Gly Gly Ala His Val Pro Ala Ala Gly Val Glu Gly Thr
385 390 395 400
Leu Phe Ser Ala Ile Pro Val Ala Glu Gly Thr Met Leu Ser Glu Arg
405 410 415
Gly Phe Lys Trp Glu Leu Asp His Arg Glu Leu Pro Leu Leu Gly Asp
420 425 430
Glu Gly Ile Ser Asn Val Val Thr Ser Ala Thr Ala Ser Val Glu Cys
435 440 445
His Ala Gly Ala Val Ala Ala Phe Leu Leu Ala
450 455
<210> 190
<211> 762
<212> DNA
<213> Atopobium minutum
<220>
<221> CDS
<222> (1)..(762)
<223> Atopobium minutum gene encoding TMP phosphatase [KRN55115]
<400> 190
atg tgg gct aaa acc tct cga cat tgt acg caa aaa ggc ttt acc atg 48
Met Trp Ala Lys Thr Ser Arg His Cys Thr Gln Lys Gly Phe Thr Met
1 5 10 15
aac cct gca cgc att tta ttt gat gga gga act tgt atg gca ata agc 96
Asn Pro Ala Arg Ile Leu Phe Asp Gly Gly Thr Cys Met Ala Ile Ser
20 25 30
ggc gca atc ttt gac tgt gac ggc acg ctg gtt gat tct atg tat atg 144
Gly Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp Ser Met Tyr Met
35 40 45
tgg tgg gac gcc ttt ccc cgc ctg ctt gcc agc cat ggc ttt gct atg 192
Trp Trp Asp Ala Phe Pro Arg Leu Leu Ala Ser His Gly Phe Ala Met
50 55 60
acg cct cag atc gag aaa atc ttg cat gag tgt gag gcg gtc agc ttg 240
Thr Pro Gln Ile Glu Lys Ile Leu His Glu Cys Glu Ala Val Ser Leu
65 70 75 80
gat gaa gag atc cat acg ctg cgc aac gct ctt gct att ccc gct tct 288
Asp Glu Glu Ile His Thr Leu Arg Asn Ala Leu Ala Ile Pro Ala Ser
85 90 95
gcc gag cag cta gca caa gaa tta tcc cag aat att agc aat gcg tat 336
Ala Glu Gln Leu Ala Gln Glu Leu Ser Gln Asn Ile Ser Asn Ala Tyr
100 105 110
gcc tca gag atc aaa gca tgg cct gcc gtt aag ccg ttc ttg gat cag 384
Ala Ser Glu Ile Lys Ala Trp Pro Ala Val Lys Pro Phe Leu Asp Gln
115 120 125
ctc aaa gac gca ggt atc ccc atg atc att tgt act tct acc gga gcc 432
Leu Lys Asp Ala Gly Ile Pro Met Ile Ile Cys Thr Ser Thr Gly Ala
130 135 140
aaa gaa gtt ggt ctg tgc atg gat cat ctt ggt ttg tcc aag ttt ttt 480
Lys Glu Val Gly Leu Cys Met Asp His Leu Gly Leu Ser Lys Phe Phe
145 150 155 160
gta gat att gtc agc gcg gaa gaa aac aat ttc acc aaa act gag cca 528
Val Asp Ile Val Ser Ala Glu Glu Asn Asn Phe Thr Lys Thr Glu Pro
165 170 175
gat atc tat tac tat gcg cta aaa aag ctt ggt acc act aaa gag aca 576
Asp Ile Tyr Tyr Tyr Ala Leu Lys Lys Leu Gly Thr Thr Lys Glu Thr
180 185 190
acc tgg gta ttt gag gat gct ccg ttt ggc ctt act acc tct gag cgt 624
Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Leu Thr Thr Ser Glu Arg
195 200 205
gca gga ttt cct aat gtg tgc gtc ttt aat gcg cac gat aag cgc gat 672
Ala Gly Phe Pro Asn Val Cys Val Phe Asn Ala His Asp Lys Arg Asp
210 215 220
gag gac ttt ttg cgt ctt cat gct acg ttg ttt acg cac ata tat gag 720
Glu Asp Phe Leu Arg Leu His Ala Thr Leu Phe Thr His Ile Tyr Glu
225 230 235 240
gat att tcc ctt gcg gat ttg cag tcg tac ccc acc aag taa 762
Asp Ile Ser Leu Ala Asp Leu Gln Ser Tyr Pro Thr Lys
245 250
<210> 191
<211> 253
<212> PRT
<213> Atopobium minutum
<400> 191
Met Trp Ala Lys Thr Ser Arg His Cys Thr Gln Lys Gly Phe Thr Met
1 5 10 15
Asn Pro Ala Arg Ile Leu Phe Asp Gly Gly Thr Cys Met Ala Ile Ser
20 25 30
Gly Ala Ile Phe Asp Cys Asp Gly Thr Leu Val Asp Ser Met Tyr Met
35 40 45
Trp Trp Asp Ala Phe Pro Arg Leu Leu Ala Ser His Gly Phe Ala Met
50 55 60
Thr Pro Gln Ile Glu Lys Ile Leu His Glu Cys Glu Ala Val Ser Leu
65 70 75 80
Asp Glu Glu Ile His Thr Leu Arg Asn Ala Leu Ala Ile Pro Ala Ser
85 90 95
Ala Glu Gln Leu Ala Gln Glu Leu Ser Gln Asn Ile Ser Asn Ala Tyr
100 105 110
Ala Ser Glu Ile Lys Ala Trp Pro Ala Val Lys Pro Phe Leu Asp Gln
115 120 125
Leu Lys Asp Ala Gly Ile Pro Met Ile Ile Cys Thr Ser Thr Gly Ala
130 135 140
Lys Glu Val Gly Leu Cys Met Asp His Leu Gly Leu Ser Lys Phe Phe
145 150 155 160
Val Asp Ile Val Ser Ala Glu Glu Asn Asn Phe Thr Lys Thr Glu Pro
165 170 175
Asp Ile Tyr Tyr Tyr Ala Leu Lys Lys Leu Gly Thr Thr Lys Glu Thr
180 185 190
Thr Trp Val Phe Glu Asp Ala Pro Phe Gly Leu Thr Thr Ser Glu Arg
195 200 205
Ala Gly Phe Pro Asn Val Cys Val Phe Asn Ala His Asp Lys Arg Asp
210 215 220
Glu Asp Phe Leu Arg Leu His Ala Thr Leu Phe Thr His Ile Tyr Glu
225 230 235 240
Asp Ile Ser Leu Ala Asp Leu Gln Ser Tyr Pro Thr Lys
245 250
<210> 192
<211> 648
<212> DNA
<213> Syntrophomonas wolfei
<220>
<221> CDS
<222> (1)..(648)
<223> Syntrophomonas wolfei gene encoding TMP phosphatase
[WP_011640074]
<400> 192
atg gga gag aaa tta ata att ttt atg gat ttc gat ggc act att tct 48
Met Gly Glu Lys Leu Ile Ile Phe Met Asp Phe Asp Gly Thr Ile Ser
1 5 10 15
cgg gag gat gtc tgc aat aag atg gca gcc agg tat gcc ggc agg gac 96
Arg Glu Asp Val Cys Asn Lys Met Ala Ala Arg Tyr Ala Gly Arg Asp
20 25 30
tgg gag gaa ata aac cgc ctc tgg gaa gag gga ggt att act act gga 144
Trp Glu Glu Ile Asn Arg Leu Trp Glu Glu Gly Gly Ile Thr Thr Gly
35 40 45
gag tgc gcc agt cgt att ctt tca tca atg gag gta ggg gcg gct gaa 192
Glu Cys Ala Ser Arg Ile Leu Ser Ser Met Glu Val Gly Ala Ala Glu
50 55 60
ttg gag gcc ttt ttt cag gct cag gaa gta gac ccc ggc ttt tcc cct 240
Leu Glu Ala Phe Phe Gln Ala Gln Glu Val Asp Pro Gly Phe Ser Pro
65 70 75 80
ttc ctg gac tgg gta caa aaa aat cag cac ctc ccc att ata ttg agc 288
Phe Leu Asp Trp Val Gln Lys Asn Gln His Leu Pro Ile Ile Leu Ser
85 90 95
gat ggt tat gac cgc tat ata aaa agc ata tta cgg ggc cag ggc tgg 336
Asp Gly Tyr Asp Arg Tyr Ile Lys Ser Ile Leu Arg Gly Gln Gly Trp
100 105 110
gaa atc gag ttt tat gcc aat aaa tta tac tgg gat gac gcc tgg cgg 384
Glu Ile Glu Phe Tyr Ala Asn Lys Leu Tyr Trp Asp Asp Ala Trp Arg
115 120 125
atg gaa tcg ccc tac ctg gat gaa gaa tgc ttt aaa tgt ggg gta tgc 432
Met Glu Ser Pro Tyr Leu Asp Glu Glu Cys Phe Lys Cys Gly Val Cys
130 135 140
aag agc aag ata atc cag gaa aga agt tta ccc ggc tat ctc aca gta 480
Lys Ser Lys Ile Ile Gln Glu Arg Ser Leu Pro Gly Tyr Leu Thr Val
145 150 155 160
tat atc gga gat ggc tac tcc gat ttc tgc ccg gcg gcc tct tgt gat 528
Tyr Ile Gly Asp Gly Tyr Ser Asp Phe Cys Pro Ala Ala Ser Cys Asp
165 170 175
att gtt ttt gcc aaa aat gaa ctg gcc ggc tac tgc cag aaa gag ggt 576
Ile Val Phe Ala Lys Asn Glu Leu Ala Gly Tyr Cys Gln Lys Glu Gly
180 185 190
tta act tac tac ccc tac cgg gat ttt cac gat att ctc cag caa ctg 624
Leu Thr Tyr Tyr Pro Tyr Arg Asp Phe His Asp Ile Leu Gln Gln Leu
195 200 205
ccg agg att gtt agc agg atg tag 648
Pro Arg Ile Val Ser Arg Met
210 215
<210> 193
<211> 215
<212> PRT
<213> Syntrophomonas wolfei
<400> 193
Met Gly Glu Lys Leu Ile Ile Phe Met Asp Phe Asp Gly Thr Ile Ser
1 5 10 15
Arg Glu Asp Val Cys Asn Lys Met Ala Ala Arg Tyr Ala Gly Arg Asp
20 25 30
Trp Glu Glu Ile Asn Arg Leu Trp Glu Glu Gly Gly Ile Thr Thr Gly
35 40 45
Glu Cys Ala Ser Arg Ile Leu Ser Ser Met Glu Val Gly Ala Ala Glu
50 55 60
Leu Glu Ala Phe Phe Gln Ala Gln Glu Val Asp Pro Gly Phe Ser Pro
65 70 75 80
Phe Leu Asp Trp Val Gln Lys Asn Gln His Leu Pro Ile Ile Leu Ser
85 90 95
Asp Gly Tyr Asp Arg Tyr Ile Lys Ser Ile Leu Arg Gly Gln Gly Trp
100 105 110
Glu Ile Glu Phe Tyr Ala Asn Lys Leu Tyr Trp Asp Asp Ala Trp Arg
115 120 125
Met Glu Ser Pro Tyr Leu Asp Glu Glu Cys Phe Lys Cys Gly Val Cys
130 135 140
Lys Ser Lys Ile Ile Gln Glu Arg Ser Leu Pro Gly Tyr Leu Thr Val
145 150 155 160
Tyr Ile Gly Asp Gly Tyr Ser Asp Phe Cys Pro Ala Ala Ser Cys Asp
165 170 175
Ile Val Phe Ala Lys Asn Glu Leu Ala Gly Tyr Cys Gln Lys Glu Gly
180 185 190
Leu Thr Tyr Tyr Pro Tyr Arg Asp Phe His Asp Ile Leu Gln Gln Leu
195 200 205
Pro Arg Ile Val Ser Arg Met
210 215
<210> 194
<211> 666
<212> DNA
<213> Desulfitobacterium hafniense
<220>
<221> CDS
<222> (1)..(666)
<223> Desulfitobacterium hafniense gene encoding TMP phosphatase
[WP_018212876]
<400> 194
atg gag gaa ttg aac agt att ttc ttc gtg gat ttt gac ggc acc atc 48
Met Glu Glu Leu Asn Ser Ile Phe Phe Val Asp Phe Asp Gly Thr Ile
1 5 10 15
gtc act cag gat atg tgt gca gtc ctc gtt gaa acc ttg gcc ggg gaa 96
Val Thr Gln Asp Met Cys Ala Val Leu Val Glu Thr Leu Ala Gly Glu
20 25 30
gga tgg cgg gag att aat gaa ctt tgg gaa aga aaa gag ctt tcc acc 144
Gly Trp Arg Glu Ile Asn Glu Leu Trp Glu Arg Lys Glu Leu Ser Thr
35 40 45
ctg gag tgc gcc cgc cgg acc ttt aaa ctc ttt aac agc aat gac ccg 192
Leu Glu Cys Ala Arg Arg Thr Phe Lys Leu Phe Asn Ser Asn Asp Pro
50 55 60
gaa gtt ttt cgc cag ctt atc ggg cag gcg gtg ttc gat ccc gga ttt 240
Glu Val Phe Arg Gln Leu Ile Gly Gln Ala Val Phe Asp Pro Gly Phe
65 70 75 80
tta gat ttt gcc gct ttt tgt gaa cag aga gga ttt ccc ctc atc att 288
Leu Asp Phe Ala Ala Phe Cys Glu Gln Arg Gly Phe Pro Leu Ile Ile
85 90 95
ctc agc gac gga tat gat ttc tat att gag tac ctc ttg caa aga gag 336
Leu Ser Asp Gly Tyr Asp Phe Tyr Ile Glu Tyr Leu Leu Gln Arg Glu
100 105 110
gga ttg aac ctg cca tac tat gcc aac aaa ttg ctg ttt gct ccc caa 384
Gly Leu Asn Leu Pro Tyr Tyr Ala Asn Lys Leu Leu Phe Ala Pro Gln
115 120 125
ctt gac gta gaa acc ccc tac agc tcc ggc gaa tgt gat cta tgc ggg 432
Leu Asp Val Glu Thr Pro Tyr Ser Ser Gly Glu Cys Asp Leu Cys Gly
130 135 140
gtc tgc aaa ctg cag ctg atg gaa aaa ttg ctt aaa ccc ggt tgc cga 480
Val Cys Lys Leu Gln Leu Met Glu Lys Leu Leu Lys Pro Gly Cys Arg
145 150 155 160
tcc gtc tat atc gga gat ggg act tcc gat ttt tgc ccg gcg gaa agg 528
Ser Val Tyr Ile Gly Asp Gly Thr Ser Asp Phe Cys Pro Ala Glu Arg
165 170 175
gcg gat aag gtc ttt gcc agg agc agg ctt tat cag cat tgc cag gag 576
Ala Asp Lys Val Phe Ala Arg Ser Arg Leu Tyr Gln His Cys Gln Glu
180 185 190
gtg ggc aaa gaa gcc cag cta ttc caa tcg ttt cag gat att ctt cag 624
Val Gly Lys Glu Ala Gln Leu Phe Gln Ser Phe Gln Asp Ile Leu Gln
195 200 205
aca gtt gaa cat tgg gga agg gaa gag gag gaa ggg act tga 666
Thr Val Glu His Trp Gly Arg Glu Glu Glu Glu Gly Thr
210 215 220
<210> 195
<211> 221
<212> PRT
<213> Desulfitobacterium hafniense
<400> 195
Met Glu Glu Leu Asn Ser Ile Phe Phe Val Asp Phe Asp Gly Thr Ile
1 5 10 15
Val Thr Gln Asp Met Cys Ala Val Leu Val Glu Thr Leu Ala Gly Glu
20 25 30
Gly Trp Arg Glu Ile Asn Glu Leu Trp Glu Arg Lys Glu Leu Ser Thr
35 40 45
Leu Glu Cys Ala Arg Arg Thr Phe Lys Leu Phe Asn Ser Asn Asp Pro
50 55 60
Glu Val Phe Arg Gln Leu Ile Gly Gln Ala Val Phe Asp Pro Gly Phe
65 70 75 80
Leu Asp Phe Ala Ala Phe Cys Glu Gln Arg Gly Phe Pro Leu Ile Ile
85 90 95
Leu Ser Asp Gly Tyr Asp Phe Tyr Ile Glu Tyr Leu Leu Gln Arg Glu
100 105 110
Gly Leu Asn Leu Pro Tyr Tyr Ala Asn Lys Leu Leu Phe Ala Pro Gln
115 120 125
Leu Asp Val Glu Thr Pro Tyr Ser Ser Gly Glu Cys Asp Leu Cys Gly
130 135 140
Val Cys Lys Leu Gln Leu Met Glu Lys Leu Leu Lys Pro Gly Cys Arg
145 150 155 160
Ser Val Tyr Ile Gly Asp Gly Thr Ser Asp Phe Cys Pro Ala Glu Arg
165 170 175
Ala Asp Lys Val Phe Ala Arg Ser Arg Leu Tyr Gln His Cys Gln Glu
180 185 190
Val Gly Lys Glu Ala Gln Leu Phe Gln Ser Phe Gln Asp Ile Leu Gln
195 200 205
Thr Val Glu His Trp Gly Arg Glu Glu Glu Glu Gly Thr
210 215 220
<210> 196
<211> 642
<212> DNA
<213> Pelotomaculum thermopropionicum
<220>
<221> CDS
<222> (1)..(642)
<223> Pelotomaculum thermopropionicum gene encoding TMP phosphatase
[WP_012032097]
<400> 196
atg gaa aaa gtt ttt ttt gtt gat ttt gac ggg acg gta acc aaa aag 48
Met Glu Lys Val Phe Phe Val Asp Phe Asp Gly Thr Val Thr Lys Lys
1 5 10 15
gat acc tgc gtg gcc atg atc gag gcc ttt gcc ggc ggc aac tgg aga 96
Asp Thr Cys Val Ala Met Ile Glu Ala Phe Ala Gly Gly Asn Trp Arg
20 25 30
gag att aac gag gcg tgg gaa aga aaa gaa att tcc acg gaa gaa tgt 144
Glu Ile Asn Glu Ala Trp Glu Arg Lys Glu Ile Ser Thr Glu Glu Cys
35 40 45
gca aac atg atc ttc agg ctt ttc cgc gcc ggc att gaa gac atc agg 192
Ala Asn Met Ile Phe Arg Leu Phe Arg Ala Gly Ile Glu Asp Ile Arg
50 55 60
aag ctt ttg gac ggt atc gag ata gac ggc cat ttt aaa gat ttt ctt 240
Lys Leu Leu Asp Gly Ile Glu Ile Asp Gly His Phe Lys Asp Phe Leu
65 70 75 80
tct ttt tgc cgg gaa aga ggc tat aaa ata tac atc ctc agc gac ggt 288
Ser Phe Cys Arg Glu Arg Gly Tyr Lys Ile Tyr Ile Leu Ser Asp Gly
85 90 95
tac gac ttt tgc att gag acg gtg ttt aaa aaa cac gga ata gag ctg 336
Tyr Asp Phe Cys Ile Glu Thr Val Phe Lys Lys His Gly Ile Glu Leu
100 105 110
ccg tac tat gcc aac aaa atg gtt tac ggc aat ggt ttt aaa ata gaa 384
Pro Tyr Tyr Ala Asn Lys Met Val Tyr Gly Asn Gly Phe Lys Ile Glu
115 120 125
tgc ttc agg ccc aac ccg gcc tgc ggt att tgc ggg acc tgc aag acc 432
Cys Phe Arg Pro Asn Pro Ala Cys Gly Ile Cys Gly Thr Cys Lys Thr
130 135 140
aag ctg att gag gag ctt aaa ggg gac ggc agc cag gtt att tac att 480
Lys Leu Ile Glu Glu Leu Lys Gly Asp Gly Ser Gln Val Ile Tyr Ile
145 150 155 160
ggc gac gga tat tcg gac aca tgc ccg gcc atg aaa gcc gat gtg gtt 528
Gly Asp Gly Tyr Ser Asp Thr Cys Pro Ala Met Lys Ala Asp Val Val
165 170 175
ttt gcc aag gga gta ttg tac agg cat tgc cgg gaa aac ggc aaa aag 576
Phe Ala Lys Gly Val Leu Tyr Arg His Cys Arg Glu Asn Gly Lys Lys
180 185 190
gct att tat tat aat aac ttt ggt gat att att aat tat ttt ttc caa 624
Ala Ile Tyr Tyr Asn Asn Phe Gly Asp Ile Ile Asn Tyr Phe Phe Gln
195 200 205
ata aaa aaa agt ttg taa 642
Ile Lys Lys Ser Leu
210
<210> 197
<211> 213
<212> PRT
<213> Pelotomaculum thermopropionicum
<400> 197
Met Glu Lys Val Phe Phe Val Asp Phe Asp Gly Thr Val Thr Lys Lys
1 5 10 15
Asp Thr Cys Val Ala Met Ile Glu Ala Phe Ala Gly Gly Asn Trp Arg
20 25 30
Glu Ile Asn Glu Ala Trp Glu Arg Lys Glu Ile Ser Thr Glu Glu Cys
35 40 45
Ala Asn Met Ile Phe Arg Leu Phe Arg Ala Gly Ile Glu Asp Ile Arg
50 55 60
Lys Leu Leu Asp Gly Ile Glu Ile Asp Gly His Phe Lys Asp Phe Leu
65 70 75 80
Ser Phe Cys Arg Glu Arg Gly Tyr Lys Ile Tyr Ile Leu Ser Asp Gly
85 90 95
Tyr Asp Phe Cys Ile Glu Thr Val Phe Lys Lys His Gly Ile Glu Leu
100 105 110
Pro Tyr Tyr Ala Asn Lys Met Val Tyr Gly Asn Gly Phe Lys Ile Glu
115 120 125
Cys Phe Arg Pro Asn Pro Ala Cys Gly Ile Cys Gly Thr Cys Lys Thr
130 135 140
Lys Leu Ile Glu Glu Leu Lys Gly Asp Gly Ser Gln Val Ile Tyr Ile
145 150 155 160
Gly Asp Gly Tyr Ser Asp Thr Cys Pro Ala Met Lys Ala Asp Val Val
165 170 175
Phe Ala Lys Gly Val Leu Tyr Arg His Cys Arg Glu Asn Gly Lys Lys
180 185 190
Ala Ile Tyr Tyr Asn Asn Phe Gly Asp Ile Ile Asn Tyr Phe Phe Gln
195 200 205
Ile Lys Lys Ser Leu
210
<210> 198
<211> 651
<212> DNA
<213> Desulfotomaculum ruminis
<220>
<221> CDS
<222> (1)..(651)
<223> Desulfotomaculum ruminis gene encoding TMP phosphatase
[WP_013840216]
<400> 198
atg gaa acc att ctt ttt ctg gat ttt gac ggc acc att acc gag cag 48
Met Glu Thr Ile Leu Phe Leu Asp Phe Asp Gly Thr Ile Thr Glu Gln
1 5 10 15
gat acc tgc gat atg ctg atg gag cgc tac ggc aat gcg gaa tgt ctg 96
Asp Thr Cys Asp Met Leu Met Glu Arg Tyr Gly Asn Ala Glu Cys Leu
20 25 30
gaa ttg aac cgg cgc tgg gaa cgc aag gaa att tcc acc atg gaa tgt 144
Glu Leu Asn Arg Arg Trp Glu Arg Lys Glu Ile Ser Thr Met Glu Cys
35 40 45
gcc cgg cag tcc ttc cgg caa atg cag gta act ccc gag gtt cta aag 192
Ala Arg Gln Ser Phe Arg Gln Met Gln Val Thr Pro Glu Val Leu Lys
50 55 60
cgg ttg gtg cag gag gtg aag gta gac cct cat ttg aaa gaa ttg ctc 240
Arg Leu Val Gln Glu Val Lys Val Asp Pro His Leu Lys Glu Leu Leu
65 70 75 80
cgt ttc tgt gag cag gag aat tac ccc gcc tat att ttg agc gat ggg 288
Arg Phe Cys Glu Gln Glu Asn Tyr Pro Ala Tyr Ile Leu Ser Asp Gly
85 90 95
tat gaa ccc atc att cag ggg gta ctg cag cgg gaa gga ata aaa ata 336
Tyr Glu Pro Ile Ile Gln Gly Val Leu Gln Arg Glu Gly Ile Lys Ile
100 105 110
tct tgt ttt tgc aac ggg ttg tcc ttt gac ggc cag tac cgg gtc atg 384
Ser Cys Phe Cys Asn Gly Leu Ser Phe Asp Gly Gln Tyr Arg Val Met
115 120 125
gcg cct cac tat aat ccc cgg tgc ggc cgg tgc gga acc tgt aaa caa 432
Ala Pro His Tyr Asn Pro Arg Cys Gly Arg Cys Gly Thr Cys Lys Gln
130 135 140
aag ctg gtg gaa cgc ctg ggt cag ccg ggc gcc cgg aag att ttt gtg 480
Lys Leu Val Glu Arg Leu Gly Gln Pro Gly Ala Arg Lys Ile Phe Val
145 150 155 160
gga gac ggt tat tcg gat ttc tgt gcc gca gag tcc tgc agt aag gtc 528
Gly Asp Gly Tyr Ser Asp Phe Cys Ala Ala Glu Ser Cys Ser Lys Val
165 170 175
ttt gct aaa aaa aat tta ttg aag tat tgc ctg gaa aac cag att ccg 576
Phe Ala Lys Lys Asn Leu Leu Lys Tyr Cys Leu Glu Asn Gln Ile Pro
180 185 190
gcc cac ccc tat gaa acc ctg gga gag gtt tta cag tgg ctg aga gga 624
Ala His Pro Tyr Glu Thr Leu Gly Glu Val Leu Gln Trp Leu Arg Gly
195 200 205
gag gct gaa cat gga cat ccg gtt taa 651
Glu Ala Glu His Gly His Pro Val
210 215
<210> 199
<211> 216
<212> PRT
<213> Desulfotomaculum ruminis
<400> 199
Met Glu Thr Ile Leu Phe Leu Asp Phe Asp Gly Thr Ile Thr Glu Gln
1 5 10 15
Asp Thr Cys Asp Met Leu Met Glu Arg Tyr Gly Asn Ala Glu Cys Leu
20 25 30
Glu Leu Asn Arg Arg Trp Glu Arg Lys Glu Ile Ser Thr Met Glu Cys
35 40 45
Ala Arg Gln Ser Phe Arg Gln Met Gln Val Thr Pro Glu Val Leu Lys
50 55 60
Arg Leu Val Gln Glu Val Lys Val Asp Pro His Leu Lys Glu Leu Leu
65 70 75 80
Arg Phe Cys Glu Gln Glu Asn Tyr Pro Ala Tyr Ile Leu Ser Asp Gly
85 90 95
Tyr Glu Pro Ile Ile Gln Gly Val Leu Gln Arg Glu Gly Ile Lys Ile
100 105 110
Ser Cys Phe Cys Asn Gly Leu Ser Phe Asp Gly Gln Tyr Arg Val Met
115 120 125
Ala Pro His Tyr Asn Pro Arg Cys Gly Arg Cys Gly Thr Cys Lys Gln
130 135 140
Lys Leu Val Glu Arg Leu Gly Gln Pro Gly Ala Arg Lys Ile Phe Val
145 150 155 160
Gly Asp Gly Tyr Ser Asp Phe Cys Ala Ala Glu Ser Cys Ser Lys Val
165 170 175
Phe Ala Lys Lys Asn Leu Leu Lys Tyr Cys Leu Glu Asn Gln Ile Pro
180 185 190
Ala His Pro Tyr Glu Thr Leu Gly Glu Val Leu Gln Trp Leu Arg Gly
195 200 205
Glu Ala Glu His Gly His Pro Val
210 215
<210> 200
<211> 1896
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1896)
<223> ThiC gene from E. coli encoding HMP-P synthase
<400> 200
atg tct gca aca aaa ctg acc cgc cgc gaa caa cgc gcc cgg gcc caa 48
Met Ser Ala Thr Lys Leu Thr Arg Arg Glu Gln Arg Ala Arg Ala Gln
1 5 10 15
cat ttt atc gac acc ctg gaa ggc acc gcc ttt ccc aac tca aaa cgc 96
His Phe Ile Asp Thr Leu Glu Gly Thr Ala Phe Pro Asn Ser Lys Arg
20 25 30
att tat atc act ggc aca cac ccc ggc gtg cgc gtg ccg atg cgt gag 144
Ile Tyr Ile Thr Gly Thr His Pro Gly Val Arg Val Pro Met Arg Glu
35 40 45
atc cag ctt agc ccg acg cta att ggc ggt agc aaa gaa cag ccg cag 192
Ile Gln Leu Ser Pro Thr Leu Ile Gly Gly Ser Lys Glu Gln Pro Gln
50 55 60
tac gaa gaa aac gaa gcg att ccg gtc tac gac acc tcc ggc ccg tat 240
Tyr Glu Glu Asn Glu Ala Ile Pro Val Tyr Asp Thr Ser Gly Pro Tyr
65 70 75 80
ggt gat ccg cag att gcc att aac gtg cag caa ggg ctg gca aaa cta 288
Gly Asp Pro Gln Ile Ala Ile Asn Val Gln Gln Gly Leu Ala Lys Leu
85 90 95
cgc cag ccg tgg atc gat gcg cgc ggc gat acc gaa gaa ctt acc gtg 336
Arg Gln Pro Trp Ile Asp Ala Arg Gly Asp Thr Glu Glu Leu Thr Val
100 105 110
cgc agt tcc gat tac act aaa gcg cgg ctg gca gat gat ggc ctc gac 384
Arg Ser Ser Asp Tyr Thr Lys Ala Arg Leu Ala Asp Asp Gly Leu Asp
115 120 125
gaa ctg cgt ttt agc ggc gta cta aca cca aaa cgc gcc aaa gca gga 432
Glu Leu Arg Phe Ser Gly Val Leu Thr Pro Lys Arg Ala Lys Ala Gly
130 135 140
cgc cgt gtc acc caa ctg cac tac gcc cgc cag ggc atc atc acg ccg 480
Arg Arg Val Thr Gln Leu His Tyr Ala Arg Gln Gly Ile Ile Thr Pro
145 150 155 160
gaa atg gaa ttc atc gcc atc cgc gag aat atg ggc cgc gag cgc atc 528
Glu Met Glu Phe Ile Ala Ile Arg Glu Asn Met Gly Arg Glu Arg Ile
165 170 175
cgt agc gag gtt tta cgc cac cag cat ccg gga atg agc ttt ggc gca 576
Arg Ser Glu Val Leu Arg His Gln His Pro Gly Met Ser Phe Gly Ala
180 185 190
cat ctg ccg gaa aat atc act gcg gaa ttt gtc cgt gat gaa gtt gct 624
His Leu Pro Glu Asn Ile Thr Ala Glu Phe Val Arg Asp Glu Val Ala
195 200 205
gcc gga cgt gcg att atc ccg gcc aac att aat cat ccg gaa tcg gag 672
Ala Gly Arg Ala Ile Ile Pro Ala Asn Ile Asn His Pro Glu Ser Glu
210 215 220
ccg atg att att ggt cgc aat ttc ctg gta aaa gtt aac gcc aat atc 720
Pro Met Ile Ile Gly Arg Asn Phe Leu Val Lys Val Asn Ala Asn Ile
225 230 235 240
ggc aac tcg gcg gtc acc tct tcc atc gaa gaa gaa gtg gaa aag ctg 768
Gly Asn Ser Ala Val Thr Ser Ser Ile Glu Glu Glu Val Glu Lys Leu
245 250 255
gta tgg tcc acg cgc tgg gga gcg gat acg gtg atg gat ctc tcc acc 816
Val Trp Ser Thr Arg Trp Gly Ala Asp Thr Val Met Asp Leu Ser Thr
260 265 270
ggt cgc tat att cac gaa acc cgc gag tgg att ttg cgt aac agc ccg 864
Gly Arg Tyr Ile His Glu Thr Arg Glu Trp Ile Leu Arg Asn Ser Pro
275 280 285
gtg ccg atc ggt aca gtg ccg atc tac cag gcg ctg gag aag gtt aac 912
Val Pro Ile Gly Thr Val Pro Ile Tyr Gln Ala Leu Glu Lys Val Asn
290 295 300
ggg atc gcc gaa gat ctt acc tgg gaa gcg ttc cgc gac acg ctg ctg 960
Gly Ile Ala Glu Asp Leu Thr Trp Glu Ala Phe Arg Asp Thr Leu Leu
305 310 315 320
gaa cag gcc gag caa ggt gtg gat tac ttc act atc cat gcg ggc gta 1008
Glu Gln Ala Glu Gln Gly Val Asp Tyr Phe Thr Ile His Ala Gly Val
325 330 335
ctg ctg cgc tat gtg ccg atg acc gcg aaa cgc ctg acc ggt atc gtc 1056
Leu Leu Arg Tyr Val Pro Met Thr Ala Lys Arg Leu Thr Gly Ile Val
340 345 350
tct cgc ggc ggt tcg att atg gcg aaa tgg tgc ctc tcc cat cat cag 1104
Ser Arg Gly Gly Ser Ile Met Ala Lys Trp Cys Leu Ser His His Gln
355 360 365
gaa aat ttc ctc tat caa cac ttc cgc gaa att tgt gaa atc tgt gcc 1152
Glu Asn Phe Leu Tyr Gln His Phe Arg Glu Ile Cys Glu Ile Cys Ala
370 375 380
gct tat gac gtt tcg ctg tcg ctg ggc gac ggt ctg cgc ccc ggt tct 1200
Ala Tyr Asp Val Ser Leu Ser Leu Gly Asp Gly Leu Arg Pro Gly Ser
385 390 395 400
att cag gac gcc aac gat gaa gcg cag ttt gcc gag ctg cat acg ctg 1248
Ile Gln Asp Ala Asn Asp Glu Ala Gln Phe Ala Glu Leu His Thr Leu
405 410 415
ggc gaa ctg acc aaa att gcc tgg gaa tat gac gtg cag gtg atg att 1296
Gly Glu Leu Thr Lys Ile Ala Trp Glu Tyr Asp Val Gln Val Met Ile
420 425 430
gaa ggc cca ggc cac gtg ccg atg cag atg atc cgc cgc aat atg acc 1344
Glu Gly Pro Gly His Val Pro Met Gln Met Ile Arg Arg Asn Met Thr
435 440 445
gag gag tta gag cac tgc cac gaa gcg ccg ttt tac act ctg ggg ccg 1392
Glu Glu Leu Glu His Cys His Glu Ala Pro Phe Tyr Thr Leu Gly Pro
450 455 460
cta act acc gat att gcg ccg ggc tat gac cac ttc acg tcg ggg att 1440
Leu Thr Thr Asp Ile Ala Pro Gly Tyr Asp His Phe Thr Ser Gly Ile
465 470 475 480
ggt gcg gcg atg att ggc tgg ttt ggc tgc gcg atg ctc tgt tac gta 1488
Gly Ala Ala Met Ile Gly Trp Phe Gly Cys Ala Met Leu Cys Tyr Val
485 490 495
acg cca aaa gag cat ctg ggt ctg ccc aat aaa gaa gat gtt aag cag 1536
Thr Pro Lys Glu His Leu Gly Leu Pro Asn Lys Glu Asp Val Lys Gln
500 505 510
ggg ctt atc acc tat aag att gct gcc cac gcc gct gac ctg gcg aaa 1584
Gly Leu Ile Thr Tyr Lys Ile Ala Ala His Ala Ala Asp Leu Ala Lys
515 520 525
ggg cat ccg ggc gcg caa att cgc gat aac gcc atg tcg aaa gcc cgc 1632
Gly His Pro Gly Ala Gln Ile Arg Asp Asn Ala Met Ser Lys Ala Arg
530 535 540
ttc gaa ttt cgc tgg gaa gac cag ttt aat ctg gcc ctc gac ccg ttt 1680
Phe Glu Phe Arg Trp Glu Asp Gln Phe Asn Leu Ala Leu Asp Pro Phe
545 550 555 560
acc gcc cgc gct tat cac gat gaa acc ctg ccg caa gag tca ggt aaa 1728
Thr Ala Arg Ala Tyr His Asp Glu Thr Leu Pro Gln Glu Ser Gly Lys
565 570 575
gtc gcc cat ttt tgc tcc atg tgt ggg ccg aaa ttc tgc tcg atg aaa 1776
Val Ala His Phe Cys Ser Met Cys Gly Pro Lys Phe Cys Ser Met Lys
580 585 590
atc agc cag gaa gtg cgt gat tac gcc gcc acg caa act att gaa atg 1824
Ile Ser Gln Glu Val Arg Asp Tyr Ala Ala Thr Gln Thr Ile Glu Met
595 600 605
gga atg gcg gat atg tcg gag aac ttc cgt gcc aga ggc gga gaa atc 1872
Gly Met Ala Asp Met Ser Glu Asn Phe Arg Ala Arg Gly Gly Glu Ile
610 615 620
tac ctg cgt aag gag gaa gcg tga 1896
Tyr Leu Arg Lys Glu Glu Ala
625 630
<210> 201
<211> 631
<212> PRT
<213> Escherichia coli
<400> 201
Met Ser Ala Thr Lys Leu Thr Arg Arg Glu Gln Arg Ala Arg Ala Gln
1 5 10 15
His Phe Ile Asp Thr Leu Glu Gly Thr Ala Phe Pro Asn Ser Lys Arg
20 25 30
Ile Tyr Ile Thr Gly Thr His Pro Gly Val Arg Val Pro Met Arg Glu
35 40 45
Ile Gln Leu Ser Pro Thr Leu Ile Gly Gly Ser Lys Glu Gln Pro Gln
50 55 60
Tyr Glu Glu Asn Glu Ala Ile Pro Val Tyr Asp Thr Ser Gly Pro Tyr
65 70 75 80
Gly Asp Pro Gln Ile Ala Ile Asn Val Gln Gln Gly Leu Ala Lys Leu
85 90 95
Arg Gln Pro Trp Ile Asp Ala Arg Gly Asp Thr Glu Glu Leu Thr Val
100 105 110
Arg Ser Ser Asp Tyr Thr Lys Ala Arg Leu Ala Asp Asp Gly Leu Asp
115 120 125
Glu Leu Arg Phe Ser Gly Val Leu Thr Pro Lys Arg Ala Lys Ala Gly
130 135 140
Arg Arg Val Thr Gln Leu His Tyr Ala Arg Gln Gly Ile Ile Thr Pro
145 150 155 160
Glu Met Glu Phe Ile Ala Ile Arg Glu Asn Met Gly Arg Glu Arg Ile
165 170 175
Arg Ser Glu Val Leu Arg His Gln His Pro Gly Met Ser Phe Gly Ala
180 185 190
His Leu Pro Glu Asn Ile Thr Ala Glu Phe Val Arg Asp Glu Val Ala
195 200 205
Ala Gly Arg Ala Ile Ile Pro Ala Asn Ile Asn His Pro Glu Ser Glu
210 215 220
Pro Met Ile Ile Gly Arg Asn Phe Leu Val Lys Val Asn Ala Asn Ile
225 230 235 240
Gly Asn Ser Ala Val Thr Ser Ser Ile Glu Glu Glu Val Glu Lys Leu
245 250 255
Val Trp Ser Thr Arg Trp Gly Ala Asp Thr Val Met Asp Leu Ser Thr
260 265 270
Gly Arg Tyr Ile His Glu Thr Arg Glu Trp Ile Leu Arg Asn Ser Pro
275 280 285
Val Pro Ile Gly Thr Val Pro Ile Tyr Gln Ala Leu Glu Lys Val Asn
290 295 300
Gly Ile Ala Glu Asp Leu Thr Trp Glu Ala Phe Arg Asp Thr Leu Leu
305 310 315 320
Glu Gln Ala Glu Gln Gly Val Asp Tyr Phe Thr Ile His Ala Gly Val
325 330 335
Leu Leu Arg Tyr Val Pro Met Thr Ala Lys Arg Leu Thr Gly Ile Val
340 345 350
Ser Arg Gly Gly Ser Ile Met Ala Lys Trp Cys Leu Ser His His Gln
355 360 365
Glu Asn Phe Leu Tyr Gln His Phe Arg Glu Ile Cys Glu Ile Cys Ala
370 375 380
Ala Tyr Asp Val Ser Leu Ser Leu Gly Asp Gly Leu Arg Pro Gly Ser
385 390 395 400
Ile Gln Asp Ala Asn Asp Glu Ala Gln Phe Ala Glu Leu His Thr Leu
405 410 415
Gly Glu Leu Thr Lys Ile Ala Trp Glu Tyr Asp Val Gln Val Met Ile
420 425 430
Glu Gly Pro Gly His Val Pro Met Gln Met Ile Arg Arg Asn Met Thr
435 440 445
Glu Glu Leu Glu His Cys His Glu Ala Pro Phe Tyr Thr Leu Gly Pro
450 455 460
Leu Thr Thr Asp Ile Ala Pro Gly Tyr Asp His Phe Thr Ser Gly Ile
465 470 475 480
Gly Ala Ala Met Ile Gly Trp Phe Gly Cys Ala Met Leu Cys Tyr Val
485 490 495
Thr Pro Lys Glu His Leu Gly Leu Pro Asn Lys Glu Asp Val Lys Gln
500 505 510
Gly Leu Ile Thr Tyr Lys Ile Ala Ala His Ala Ala Asp Leu Ala Lys
515 520 525
Gly His Pro Gly Ala Gln Ile Arg Asp Asn Ala Met Ser Lys Ala Arg
530 535 540
Phe Glu Phe Arg Trp Glu Asp Gln Phe Asn Leu Ala Leu Asp Pro Phe
545 550 555 560
Thr Ala Arg Ala Tyr His Asp Glu Thr Leu Pro Gln Glu Ser Gly Lys
565 570 575
Val Ala His Phe Cys Ser Met Cys Gly Pro Lys Phe Cys Ser Met Lys
580 585 590
Ile Ser Gln Glu Val Arg Asp Tyr Ala Ala Thr Gln Thr Ile Glu Met
595 600 605
Gly Met Ala Asp Met Ser Glu Asn Phe Arg Ala Arg Gly Gly Glu Ile
610 615 620
Tyr Leu Arg Lys Glu Glu Ala
625 630
<210> 202
<211> 1371
<212> DNA
<213> Synechococcus elongatus
<220>
<221> CDS
<222> (1)..(1371)
<223> ThiC gene from Synechococcus_elongatus_PCC_7942:_[NC_007604]
encoding a HMP-P synthase
<400> 202
atg cgc agc gac tgg atc gca ccc cgc cga ggc caa gcc aac gtc act 48
Met Arg Ser Asp Trp Ile Ala Pro Arg Arg Gly Gln Ala Asn Val Thr
1 5 10 15
caa atg cac tac gcc cgc caa ggc gtg atc acc gaa gaa atg gac ttc 96
Gln Met His Tyr Ala Arg Gln Gly Val Ile Thr Glu Glu Met Asp Phe
20 25 30
gtg gcg cgg cgc gaa aat ctg cca gcc gat cta att cgg gat gaa gtg 144
Val Ala Arg Arg Glu Asn Leu Pro Ala Asp Leu Ile Arg Asp Glu Val
35 40 45
gca cgg ggt cgg atg att atc ccc gcc aac atc aac cac acc aat ttg 192
Ala Arg Gly Arg Met Ile Ile Pro Ala Asn Ile Asn His Thr Asn Leu
50 55 60
gag ccg atg gcg atc ggc att gcc tcc aag tgc aag gtc aac gcc aac 240
Glu Pro Met Ala Ile Gly Ile Ala Ser Lys Cys Lys Val Asn Ala Asn
65 70 75 80
atc ggt gct tcg cct aac gcc tcc aac atc gat gaa gaa gtc gag aag 288
Ile Gly Ala Ser Pro Asn Ala Ser Asn Ile Asp Glu Glu Val Glu Lys
85 90 95
ctg aag ctc gcg gtc aaa tac ggt gcc gat acc gtc atg gac ctc tcg 336
Leu Lys Leu Ala Val Lys Tyr Gly Ala Asp Thr Val Met Asp Leu Ser
100 105 110
acc ggc ggc ggc aac ctc gat gag att cgc acc gcg atc atc aat gct 384
Thr Gly Gly Gly Asn Leu Asp Glu Ile Arg Thr Ala Ile Ile Asn Ala
115 120 125
tcg ccg gta ccg atc ggc acc gtg ccg gtc tac caa gcc ctg gaa tcc 432
Ser Pro Val Pro Ile Gly Thr Val Pro Val Tyr Gln Ala Leu Glu Ser
130 135 140
gtt cac ggg cgc atc gaa aaa ctc agc gcc gac gac ttc ttg cat gtg 480
Val His Gly Arg Ile Glu Lys Leu Ser Ala Asp Asp Phe Leu His Val
145 150 155 160
atc gaa aag cac tgc gaa cag ggc gtc gac tac caa acc atc cac gcc 528
Ile Glu Lys His Cys Glu Gln Gly Val Asp Tyr Gln Thr Ile His Ala
165 170 175
ggt ctg ctg att gaa cac ctg ccc aag gtc aag agc cgg atc acc ggg 576
Gly Leu Leu Ile Glu His Leu Pro Lys Val Lys Ser Arg Ile Thr Gly
180 185 190
att gtt tcg cgg ggc ggc ggc atc att gcc cag tgg atg ctc tac cac 624
Ile Val Ser Arg Gly Gly Gly Ile Ile Ala Gln Trp Met Leu Tyr His
195 200 205
cac aag caa aac ccg ctc tat acc cac ttt cgc gac atc atc gaa atc 672
His Lys Gln Asn Pro Leu Tyr Thr His Phe Arg Asp Ile Ile Glu Ile
210 215 220
ttc aag cgc tac gac tgt agc ttc agc ttg ggt gac tcg ctg cgg ccg 720
Phe Lys Arg Tyr Asp Cys Ser Phe Ser Leu Gly Asp Ser Leu Arg Pro
225 230 235 240
ggt tgc ctg cac gat gct agc gac gat gcc cag ctc agc gag ctg aag 768
Gly Cys Leu His Asp Ala Ser Asp Asp Ala Gln Leu Ser Glu Leu Lys
245 250 255
act ctc ggt caa ctg acg cgg gtt gct tgg gaa cac gac gtg caa gtc 816
Thr Leu Gly Gln Leu Thr Arg Val Ala Trp Glu His Asp Val Gln Val
260 265 270
atg gtc gaa ggg cca ggc cac gtt ccc atg gac cag atc gag ttc aac 864
Met Val Glu Gly Pro Gly His Val Pro Met Asp Gln Ile Glu Phe Asn
275 280 285
gtc cgc aag caa atg gaa gag tgc tca gaa gct ccc ttc tac gtc ttg 912
Val Arg Lys Gln Met Glu Glu Cys Ser Glu Ala Pro Phe Tyr Val Leu
290 295 300
ggt ccc ctc gtg acc gac att gca ccg ggc tat gac cac atc acc agc 960
Gly Pro Leu Val Thr Asp Ile Ala Pro Gly Tyr Asp His Ile Thr Ser
305 310 315 320
gcg atc ggg gca gca atg gcg ggc tgg tat ggc acg gca atg ctc tgc 1008
Ala Ile Gly Ala Ala Met Ala Gly Trp Tyr Gly Thr Ala Met Leu Cys
325 330 335
tac gtc acg ccc aaa gag cac ttg ggt ctg ccc aat gcg gaa gat gtg 1056
Tyr Val Thr Pro Lys Glu His Leu Gly Leu Pro Asn Ala Glu Asp Val
340 345 350
cgc aat ggt ttg atc gcc tac aaa att gcg gct cat gca gca gat atc 1104
Arg Asn Gly Leu Ile Ala Tyr Lys Ile Ala Ala His Ala Ala Asp Ile
355 360 365
gct cgc cac cgt ccg ggt gct cgc gat cgc gat gat gaa ctg agt cgg 1152
Ala Arg His Arg Pro Gly Ala Arg Asp Arg Asp Asp Glu Leu Ser Arg
370 375 380
gca cgc tac gcc ttc gac tgg aac aag caa ttt gac ttg agc ctc gat 1200
Ala Arg Tyr Ala Phe Asp Trp Asn Lys Gln Phe Asp Leu Ser Leu Asp
385 390 395 400
cca gag cgg gcg cgg gaa tac cac gac gaa act ctg cca gca gat atc 1248
Pro Glu Arg Ala Arg Glu Tyr His Asp Glu Thr Leu Pro Ala Asp Ile
405 410 415
tac aaa acg gca gaa ttc tgt tcg atg tgt gga ccg aag cac tgt ccg 1296
Tyr Lys Thr Ala Glu Phe Cys Ser Met Cys Gly Pro Lys His Cys Pro
420 425 430
atg caa acc aag atc acc gag gaa gat cta acc gag ttg gaa aaa ttc 1344
Met Gln Thr Lys Ile Thr Glu Glu Asp Leu Thr Glu Leu Glu Lys Phe
435 440 445
ctc gag aaa gat agc gct ctg gcg tag 1371
Leu Glu Lys Asp Ser Ala Leu Ala
450 455
<210> 203
<211> 456
<212> PRT
<213> Synechococcus elongatus
<400> 203
Met Arg Ser Asp Trp Ile Ala Pro Arg Arg Gly Gln Ala Asn Val Thr
1 5 10 15
Gln Met His Tyr Ala Arg Gln Gly Val Ile Thr Glu Glu Met Asp Phe
20 25 30
Val Ala Arg Arg Glu Asn Leu Pro Ala Asp Leu Ile Arg Asp Glu Val
35 40 45
Ala Arg Gly Arg Met Ile Ile Pro Ala Asn Ile Asn His Thr Asn Leu
50 55 60
Glu Pro Met Ala Ile Gly Ile Ala Ser Lys Cys Lys Val Asn Ala Asn
65 70 75 80
Ile Gly Ala Ser Pro Asn Ala Ser Asn Ile Asp Glu Glu Val Glu Lys
85 90 95
Leu Lys Leu Ala Val Lys Tyr Gly Ala Asp Thr Val Met Asp Leu Ser
100 105 110
Thr Gly Gly Gly Asn Leu Asp Glu Ile Arg Thr Ala Ile Ile Asn Ala
115 120 125
Ser Pro Val Pro Ile Gly Thr Val Pro Val Tyr Gln Ala Leu Glu Ser
130 135 140
Val His Gly Arg Ile Glu Lys Leu Ser Ala Asp Asp Phe Leu His Val
145 150 155 160
Ile Glu Lys His Cys Glu Gln Gly Val Asp Tyr Gln Thr Ile His Ala
165 170 175
Gly Leu Leu Ile Glu His Leu Pro Lys Val Lys Ser Arg Ile Thr Gly
180 185 190
Ile Val Ser Arg Gly Gly Gly Ile Ile Ala Gln Trp Met Leu Tyr His
195 200 205
His Lys Gln Asn Pro Leu Tyr Thr His Phe Arg Asp Ile Ile Glu Ile
210 215 220
Phe Lys Arg Tyr Asp Cys Ser Phe Ser Leu Gly Asp Ser Leu Arg Pro
225 230 235 240
Gly Cys Leu His Asp Ala Ser Asp Asp Ala Gln Leu Ser Glu Leu Lys
245 250 255
Thr Leu Gly Gln Leu Thr Arg Val Ala Trp Glu His Asp Val Gln Val
260 265 270
Met Val Glu Gly Pro Gly His Val Pro Met Asp Gln Ile Glu Phe Asn
275 280 285
Val Arg Lys Gln Met Glu Glu Cys Ser Glu Ala Pro Phe Tyr Val Leu
290 295 300
Gly Pro Leu Val Thr Asp Ile Ala Pro Gly Tyr Asp His Ile Thr Ser
305 310 315 320
Ala Ile Gly Ala Ala Met Ala Gly Trp Tyr Gly Thr Ala Met Leu Cys
325 330 335
Tyr Val Thr Pro Lys Glu His Leu Gly Leu Pro Asn Ala Glu Asp Val
340 345 350
Arg Asn Gly Leu Ile Ala Tyr Lys Ile Ala Ala His Ala Ala Asp Ile
355 360 365
Ala Arg His Arg Pro Gly Ala Arg Asp Arg Asp Asp Glu Leu Ser Arg
370 375 380
Ala Arg Tyr Ala Phe Asp Trp Asn Lys Gln Phe Asp Leu Ser Leu Asp
385 390 395 400
Pro Glu Arg Ala Arg Glu Tyr His Asp Glu Thr Leu Pro Ala Asp Ile
405 410 415
Tyr Lys Thr Ala Glu Phe Cys Ser Met Cys Gly Pro Lys His Cys Pro
420 425 430
Met Gln Thr Lys Ile Thr Glu Glu Asp Leu Thr Glu Leu Glu Lys Phe
435 440 445
Leu Glu Lys Asp Ser Ala Leu Ala
450 455
<210> 204
<211> 1764
<212> DNA
<213> Corynebacterium glutamicum
<220>
<221> CDS
<222> (1)..(1764)
<223> ThiC gene from Corynebacterium glutamicum encoding an HMP-P
synthase
<400> 204
atg acg cct acc caa aat gag atc cac ccg aaa cat agc tac tcc ccc 48
Met Thr Pro Thr Gln Asn Glu Ile His Pro Lys His Ser Tyr Ser Pro
1 5 10 15
atc cgc aag gac ggt ctc gag gtc ccg gag acc gaa atc cgc ctc gat 96
Ile Arg Lys Asp Gly Leu Glu Val Pro Glu Thr Glu Ile Arg Leu Asp
20 25 30
gac tcg cca agc ggc ccc aac gaa ccc ttc cgc atc tac cgc acc cgt 144
Asp Ser Pro Ser Gly Pro Asn Glu Pro Phe Arg Ile Tyr Arg Thr Arg
35 40 45
ggc cca gaa acc aac ccc aag cag gga ctt ccg cgg ctg cgc gag tca 192
Gly Pro Glu Thr Asn Pro Lys Gln Gly Leu Pro Arg Leu Arg Glu Ser
50 55 60
tgg atc acc gcc cgc ggc gac gtt gcc acc tat cag ggg cgc gag cgt 240
Trp Ile Thr Ala Arg Gly Asp Val Ala Thr Tyr Gln Gly Arg Glu Arg
65 70 75 80
ttg ctt atc gac gac ggc cgc tcg gca atg cgt cga ggt caa gct tcg 288
Leu Leu Ile Asp Asp Gly Arg Ser Ala Met Arg Arg Gly Gln Ala Ser
85 90 95
gct gag tgg aaa ggc caa aaa cca gct cct ttg aag gcg cta cct ggc 336
Ala Glu Trp Lys Gly Gln Lys Pro Ala Pro Leu Lys Ala Leu Pro Gly
100 105 110
aaa aga gtc acc caa atg gcc tat gca cgt gct ggc gtg att act cgt 384
Lys Arg Val Thr Gln Met Ala Tyr Ala Arg Ala Gly Val Ile Thr Arg
115 120 125
gaa atg gag ttt gta gcg ctg cgc gaa cac gtt gat gcg gag ttt gtg 432
Glu Met Glu Phe Val Ala Leu Arg Glu His Val Asp Ala Glu Phe Val
130 135 140
cgc tct gag gtg gcg cgc ggt cgg gcc att att ccc aac aac gtc aac 480
Arg Ser Glu Val Ala Arg Gly Arg Ala Ile Ile Pro Asn Asn Val Asn
145 150 155 160
cac ccc gaa tct gaa ccg atg att att ggt cgc aaa ttt ttg acc aaa 528
His Pro Glu Ser Glu Pro Met Ile Ile Gly Arg Lys Phe Leu Thr Lys
165 170 175
atc aac gcc aat att ggc aat tct gcg gtc acc tct tca atc gag gaa 576
Ile Asn Ala Asn Ile Gly Asn Ser Ala Val Thr Ser Ser Ile Glu Glu
180 185 190
gag gtg tcc aag ctg cag tgg gcc acg cgc tgg ggt gcc gat acc gtg 624
Glu Val Ser Lys Leu Gln Trp Ala Thr Arg Trp Gly Ala Asp Thr Val
195 200 205
atg gat cta tcc acc ggc gat gat att cac acc acc cgc gaa tgg att 672
Met Asp Leu Ser Thr Gly Asp Asp Ile His Thr Thr Arg Glu Trp Ile
210 215 220
atc cgc aac tcc ccc gtt cct atc ggc acc gtc ccg atc tac caa gcg 720
Ile Arg Asn Ser Pro Val Pro Ile Gly Thr Val Pro Ile Tyr Gln Ala
225 230 235 240
ctg gaa aaa gta aat ggc gtg gcc gca gac ctt aac tgg gaa gta ttc 768
Leu Glu Lys Val Asn Gly Val Ala Ala Asp Leu Asn Trp Glu Val Phe
245 250 255
cgc gat acc atc att gag cag tgt gaa caa ggc gtg gac tat atg acc 816
Arg Asp Thr Ile Ile Glu Gln Cys Glu Gln Gly Val Asp Tyr Met Thr
260 265 270
atc cac gcc ggc gtc ctg ctg gct tat atc cca ctg act acc cgt cgt 864
Ile His Ala Gly Val Leu Leu Ala Tyr Ile Pro Leu Thr Thr Arg Arg
275 280 285
gtc acc ggc att gtc tcc cgc ggc gga tcc att atg gcc ggt tgg tgt 912
Val Thr Gly Ile Val Ser Arg Gly Gly Ser Ile Met Ala Gly Trp Cys
290 295 300
ctg gcg cat cac cgc gaa tca ttc ctc tac gag cat ttc gac gag ctg 960
Leu Ala His His Arg Glu Ser Phe Leu Tyr Glu His Phe Asp Glu Leu
305 310 315 320
tgc gaa atc ttt gca caa tat gac gtc gca ttc tcc ctc ggt gat ggc 1008
Cys Glu Ile Phe Ala Gln Tyr Asp Val Ala Phe Ser Leu Gly Asp Gly
325 330 335
cta cgc ccc gga tcg ctt gcc gat gcc aac gac gcc gcg caa ttc gcc 1056
Leu Arg Pro Gly Ser Leu Ala Asp Ala Asn Asp Ala Ala Gln Phe Ala
340 345 350
gag ctg aaa acc att ggt gag ctc acc caa cgc gcc tgg gaa tac gat 1104
Glu Leu Lys Thr Ile Gly Glu Leu Thr Gln Arg Ala Trp Glu Tyr Asp
355 360 365
gta caa gta atg gtc gaa gga cct gga cac gtg cca cta aac atg atc 1152
Val Gln Val Met Val Glu Gly Pro Gly His Val Pro Leu Asn Met Ile
370 375 380
cag gaa aac aac gag ctg gaa caa aag tgg gca gcg gac gca cct ttt 1200
Gln Glu Asn Asn Glu Leu Glu Gln Lys Trp Ala Ala Asp Ala Pro Phe
385 390 395 400
tac act ctt gga cca cta gtt acc gac atc gct cca ggt tat gac cac 1248
Tyr Thr Leu Gly Pro Leu Val Thr Asp Ile Ala Pro Gly Tyr Asp His
405 410 415
atc act tct gcc att ggt gca gct cac atc gcc atg ggt ggc acc gcc 1296
Ile Thr Ser Ala Ile Gly Ala Ala His Ile Ala Met Gly Gly Thr Ala
420 425 430
atg ctg tgt tat gtc acc ccg aaa gaa cac ctt ggc ctg ccc aac cgt 1344
Met Leu Cys Tyr Val Thr Pro Lys Glu His Leu Gly Leu Pro Asn Arg
435 440 445
gac gac gtc aaa acc ggc gta atc acc tac aag ctc gct gcc cac gca 1392
Asp Asp Val Lys Thr Gly Val Ile Thr Tyr Lys Leu Ala Ala His Ala
450 455 460
gca gat gtg gcc aag ggt cat ccc ggc gcg cgt gcc tgg gac gac gcc 1440
Ala Asp Val Ala Lys Gly His Pro Gly Ala Arg Ala Trp Asp Asp Ala
465 470 475 480
atg agt aaa gcg cgt ttt gaa ttc cgt tgg aat gat cag ttt gcg ctc 1488
Met Ser Lys Ala Arg Phe Glu Phe Arg Trp Asn Asp Gln Phe Ala Leu
485 490 495
tcc ctc gac ccc gac act gca atc gct tat cac gac gaa acc ctg ccg 1536
Ser Leu Asp Pro Asp Thr Ala Ile Ala Tyr His Asp Glu Thr Leu Pro
500 505 510
gca gag cct gcg aaa acc gca cac ttc tgt tca atg tgt ggc ccg aag 1584
Ala Glu Pro Ala Lys Thr Ala His Phe Cys Ser Met Cys Gly Pro Lys
515 520 525
ttc tgc tcc atg cga att agc cag gac att cgc gat atg ttt ggc gat 1632
Phe Cys Ser Met Arg Ile Ser Gln Asp Ile Arg Asp Met Phe Gly Asp
530 535 540
caa atc gcg gaa ttg ggg atg cct ggg gtt ggg gat tct tct agt gct 1680
Gln Ile Ala Glu Leu Gly Met Pro Gly Val Gly Asp Ser Ser Ser Ala
545 550 555 560
gtt gct tct agt ggg gca cgg gag ggg atg gct gag aaa tcc cgg gaa 1728
Val Ala Ser Ser Gly Ala Arg Glu Gly Met Ala Glu Lys Ser Arg Glu
565 570 575
ttt att gct ggt ggt gcg gag gtt tat cgg cgt tag 1764
Phe Ile Ala Gly Gly Ala Glu Val Tyr Arg Arg
580 585
<210> 205
<211> 587
<212> PRT
<213> Corynebacterium glutamicum
<400> 205
Met Thr Pro Thr Gln Asn Glu Ile His Pro Lys His Ser Tyr Ser Pro
1 5 10 15
Ile Arg Lys Asp Gly Leu Glu Val Pro Glu Thr Glu Ile Arg Leu Asp
20 25 30
Asp Ser Pro Ser Gly Pro Asn Glu Pro Phe Arg Ile Tyr Arg Thr Arg
35 40 45
Gly Pro Glu Thr Asn Pro Lys Gln Gly Leu Pro Arg Leu Arg Glu Ser
50 55 60
Trp Ile Thr Ala Arg Gly Asp Val Ala Thr Tyr Gln Gly Arg Glu Arg
65 70 75 80
Leu Leu Ile Asp Asp Gly Arg Ser Ala Met Arg Arg Gly Gln Ala Ser
85 90 95
Ala Glu Trp Lys Gly Gln Lys Pro Ala Pro Leu Lys Ala Leu Pro Gly
100 105 110
Lys Arg Val Thr Gln Met Ala Tyr Ala Arg Ala Gly Val Ile Thr Arg
115 120 125
Glu Met Glu Phe Val Ala Leu Arg Glu His Val Asp Ala Glu Phe Val
130 135 140
Arg Ser Glu Val Ala Arg Gly Arg Ala Ile Ile Pro Asn Asn Val Asn
145 150 155 160
His Pro Glu Ser Glu Pro Met Ile Ile Gly Arg Lys Phe Leu Thr Lys
165 170 175
Ile Asn Ala Asn Ile Gly Asn Ser Ala Val Thr Ser Ser Ile Glu Glu
180 185 190
Glu Val Ser Lys Leu Gln Trp Ala Thr Arg Trp Gly Ala Asp Thr Val
195 200 205
Met Asp Leu Ser Thr Gly Asp Asp Ile His Thr Thr Arg Glu Trp Ile
210 215 220
Ile Arg Asn Ser Pro Val Pro Ile Gly Thr Val Pro Ile Tyr Gln Ala
225 230 235 240
Leu Glu Lys Val Asn Gly Val Ala Ala Asp Leu Asn Trp Glu Val Phe
245 250 255
Arg Asp Thr Ile Ile Glu Gln Cys Glu Gln Gly Val Asp Tyr Met Thr
260 265 270
Ile His Ala Gly Val Leu Leu Ala Tyr Ile Pro Leu Thr Thr Arg Arg
275 280 285
Val Thr Gly Ile Val Ser Arg Gly Gly Ser Ile Met Ala Gly Trp Cys
290 295 300
Leu Ala His His Arg Glu Ser Phe Leu Tyr Glu His Phe Asp Glu Leu
305 310 315 320
Cys Glu Ile Phe Ala Gln Tyr Asp Val Ala Phe Ser Leu Gly Asp Gly
325 330 335
Leu Arg Pro Gly Ser Leu Ala Asp Ala Asn Asp Ala Ala Gln Phe Ala
340 345 350
Glu Leu Lys Thr Ile Gly Glu Leu Thr Gln Arg Ala Trp Glu Tyr Asp
355 360 365
Val Gln Val Met Val Glu Gly Pro Gly His Val Pro Leu Asn Met Ile
370 375 380
Gln Glu Asn Asn Glu Leu Glu Gln Lys Trp Ala Ala Asp Ala Pro Phe
385 390 395 400
Tyr Thr Leu Gly Pro Leu Val Thr Asp Ile Ala Pro Gly Tyr Asp His
405 410 415
Ile Thr Ser Ala Ile Gly Ala Ala His Ile Ala Met Gly Gly Thr Ala
420 425 430
Met Leu Cys Tyr Val Thr Pro Lys Glu His Leu Gly Leu Pro Asn Arg
435 440 445
Asp Asp Val Lys Thr Gly Val Ile Thr Tyr Lys Leu Ala Ala His Ala
450 455 460
Ala Asp Val Ala Lys Gly His Pro Gly Ala Arg Ala Trp Asp Asp Ala
465 470 475 480
Met Ser Lys Ala Arg Phe Glu Phe Arg Trp Asn Asp Gln Phe Ala Leu
485 490 495
Ser Leu Asp Pro Asp Thr Ala Ile Ala Tyr His Asp Glu Thr Leu Pro
500 505 510
Ala Glu Pro Ala Lys Thr Ala His Phe Cys Ser Met Cys Gly Pro Lys
515 520 525
Phe Cys Ser Met Arg Ile Ser Gln Asp Ile Arg Asp Met Phe Gly Asp
530 535 540
Gln Ile Ala Glu Leu Gly Met Pro Gly Val Gly Asp Ser Ser Ser Ala
545 550 555 560
Val Ala Ser Ser Gly Ala Arg Glu Gly Met Ala Glu Lys Ser Arg Glu
565 570 575
Phe Ile Ala Gly Gly Ala Glu Val Tyr Arg Arg
580 585
<210> 206
<211> 1869
<212> DNA
<213> Candidatus Baumannia cicadellinicola
<220>
<221> CDS
<222> (1)..(1869)
<223> ThiC gene from Candidatus Baumannia cicadellinicola
[WP_011520252] encoding HMP-P synthase
<400> 206
atg tca aga tca tca ata cct gct tca cgc cga gtg agc cgt gca aaa 48
Met Ser Arg Ser Ser Ile Pro Ala Ser Arg Arg Val Ser Arg Ala Lys
1 5 10 15
gca cag gct ttt atg gat agc tta aca ggt agt agc tat ttt cct aac 96
Ala Gln Ala Phe Met Asp Ser Leu Thr Gly Ser Ser Tyr Phe Pro Asn
20 25 30
tca aga agg ata tat tta caa ggt aaa aca cct tca gta cat gta cca 144
Ser Arg Arg Ile Tyr Leu Gln Gly Lys Thr Pro Ser Val His Val Pro
35 40 45
atg cgt gaa att aag cta cat cct aca ttg atc ggt aaa aac ggt gaa 192
Met Arg Glu Ile Lys Leu His Pro Thr Leu Ile Gly Lys Asn Gly Glu
50 55 60
cat tat gag gat aat caa cct ata cca gtt tat gat act tca ggt cct 240
His Tyr Glu Asp Asn Gln Pro Ile Pro Val Tyr Asp Thr Ser Gly Pro
65 70 75 80
tac ggt gat cct act ata gca att aac gta cgt aca ggt ctt aac cgg 288
Tyr Gly Asp Pro Thr Ile Ala Ile Asn Val Arg Thr Gly Leu Asn Arg
85 90 95
tta cgc gag ata tgg att ctt gca cga caa gat agt gag cca ata agt 336
Leu Arg Glu Ile Trp Ile Leu Ala Arg Gln Asp Ser Glu Pro Ile Ser
100 105 110
aat aat aat aac gat cgt cag agt tca gat aaa cag tta agt ttt act 384
Asn Asn Asn Asn Asp Arg Gln Ser Ser Asp Lys Gln Leu Ser Phe Thr
115 120 125
act aac tat aat cca cgc cga gct agc tat gga cgc tgt att aca caa 432
Thr Asn Tyr Asn Pro Arg Arg Ala Ser Tyr Gly Arg Cys Ile Thr Gln
130 135 140
tta cat tac gca cgt gcc ggt atc ata acg cca gaa atg gag ttt ata 480
Leu His Tyr Ala Arg Ala Gly Ile Ile Thr Pro Glu Met Glu Phe Ile
145 150 155 160
gct tta cgt gaa aat atg ggc cga gaa cgt att agt agc aac gtg cta 528
Ala Leu Arg Glu Asn Met Gly Arg Glu Arg Ile Ser Ser Asn Val Leu
165 170 175
cat cag cag cat tta ggt tct aac ttt ggt gct aaa aaa gct gat cat 576
His Gln Gln His Leu Gly Ser Asn Phe Gly Ala Lys Lys Ala Asp His
180 185 190
att aca gca gaa ttt gtc cgg cag gaa gta gca gca gga cgt gct att 624
Ile Thr Ala Glu Phe Val Arg Gln Glu Val Ala Ala Gly Arg Ala Ile
195 200 205
ata cct agt aat att aat cat cca gaa tct gag cca atg atc att ggc 672
Ile Pro Ser Asn Ile Asn His Pro Glu Ser Glu Pro Met Ile Ile Gly
210 215 220
cgt aat ttt ctc gta aaa gta aat gca aat att ggt aac tca gca gta 720
Arg Asn Phe Leu Val Lys Val Asn Ala Asn Ile Gly Asn Ser Ala Val
225 230 235 240
aca tct tct att gag gaa gaa gtc gaa aag tta gta tgg gct act cgt 768
Thr Ser Ser Ile Glu Glu Glu Val Glu Lys Leu Val Trp Ala Thr Arg
245 250 255
tgg gga gct gat aca gtc atg gac tta tct act ggt agt tat att cac 816
Trp Gly Ala Asp Thr Val Met Asp Leu Ser Thr Gly Ser Tyr Ile His
260 265 270
gaa act aga gaa tgg ata tta cgt aat agc cca gta cct ata ggt act 864
Glu Thr Arg Glu Trp Ile Leu Arg Asn Ser Pro Val Pro Ile Gly Thr
275 280 285
gta cct atc tat caa gcg tta gaa aaa gta aat gga gtc ata gaa aat 912
Val Pro Ile Tyr Gln Ala Leu Glu Lys Val Asn Gly Val Ile Glu Asn
290 295 300
ctt aat tgg gat att ttc tac gag aca tta tta gaa caa gct aac caa 960
Leu Asn Trp Asp Ile Phe Tyr Glu Thr Leu Leu Glu Gln Ala Asn Gln
305 310 315 320
gga gta gat tat ttt acg att cat gct ggc gta tta aaa cgt tat gtt 1008
Gly Val Asp Tyr Phe Thr Ile His Ala Gly Val Leu Lys Arg Tyr Val
325 330 335
cta cta aca gct agt agg tta act ggt atc gta tcg cgt ggt ggc tct 1056
Leu Leu Thr Ala Ser Arg Leu Thr Gly Ile Val Ser Arg Gly Gly Ser
340 345 350
att atg gct caa tgg agt tta gta cat aat cag gaa aac ttc ctt tat 1104
Ile Met Ala Gln Trp Ser Leu Val His Asn Gln Glu Asn Phe Leu Tyr
355 360 365
gag cat ttt agt gaa att tgc aag ctt tgt gct gct tat gat att gct 1152
Glu His Phe Ser Glu Ile Cys Lys Leu Cys Ala Ala Tyr Asp Ile Ala
370 375 380
cta tct ctt gga gat ggt cta aga ccc ggt tcc gta caa gat gct aat 1200
Leu Ser Leu Gly Asp Gly Leu Arg Pro Gly Ser Val Gln Asp Ala Asn
385 390 395 400
gat gaa gca caa ttt tct gag tta cat aca cta ggc gaa tta act aaa 1248
Asp Glu Ala Gln Phe Ser Glu Leu His Thr Leu Gly Glu Leu Thr Lys
405 410 415
att gcc tgg gaa tat gat gtg caa gta atg atc gaa gga cct ggt cat 1296
Ile Ala Trp Glu Tyr Asp Val Gln Val Met Ile Glu Gly Pro Gly His
420 425 430
att cca cta cat atg att gag cgt aat atg act gat caa ctt aaa tat 1344
Ile Pro Leu His Met Ile Glu Arg Asn Met Thr Asp Gln Leu Lys Tyr
435 440 445
tgc cac gaa gca cca ttc tac act ctc gga cca ctc aca aca gat att 1392
Cys His Glu Ala Pro Phe Tyr Thr Leu Gly Pro Leu Thr Thr Asp Ile
450 455 460
gct cct ggt tat gac cac ttt act tca ggt att ggt gcc gca cta ata 1440
Ala Pro Gly Tyr Asp His Phe Thr Ser Gly Ile Gly Ala Ala Leu Ile
465 470 475 480
ggc tgg ttt gga tgt gct atg ctg tgc tat gta act cct aaa gag cat 1488
Gly Trp Phe Gly Cys Ala Met Leu Cys Tyr Val Thr Pro Lys Glu His
485 490 495
cta ggt tta cct aat aag gaa gac gta aaa cag ggt tta att gcc tat 1536
Leu Gly Leu Pro Asn Lys Glu Asp Val Lys Gln Gly Leu Ile Ala Tyr
500 505 510
aaa att gcc gca cat gct gca gat cta gct aaa gga cat cct ggt gct 1584
Lys Ile Ala Ala His Ala Ala Asp Leu Ala Lys Gly His Pro Gly Ala
515 520 525
caa ata cgt gat aat gct atg tca aaa gct cgt ttc gaa ttt cgc tgg 1632
Gln Ile Arg Asp Asn Ala Met Ser Lys Ala Arg Phe Glu Phe Arg Trp
530 535 540
gaa gat caa ttt aac tta gct tta gat cct ttt acg gcg cgt atg tat 1680
Glu Asp Gln Phe Asn Leu Ala Leu Asp Pro Phe Thr Ala Arg Met Tyr
545 550 555 560
cac gat gaa act ata ccg caa aca gca gga aaa tta gca aat ttt tgc 1728
His Asp Glu Thr Ile Pro Gln Thr Ala Gly Lys Leu Ala Asn Phe Cys
565 570 575
tcg atg tgt ggt cct aag ttt tgt tct atg aag cta tca aaa aaa ata 1776
Ser Met Cys Gly Pro Lys Phe Cys Ser Met Lys Leu Ser Lys Lys Ile
580 585 590
cgt aat tac act aat atg aaa aat ata aaa act att agt aat agt ttc 1824
Arg Asn Tyr Thr Asn Met Lys Asn Ile Lys Thr Ile Ser Asn Ser Phe
595 600 605
atg aat aaa tta gat aat agc ggt att aaa aat gct gac cga taa 1869
Met Asn Lys Leu Asp Asn Ser Gly Ile Lys Asn Ala Asp Arg
610 615 620
<210> 207
<211> 622
<212> PRT
<213> Candidatus Baumannia cicadellinicola
<400> 207
Met Ser Arg Ser Ser Ile Pro Ala Ser Arg Arg Val Ser Arg Ala Lys
1 5 10 15
Ala Gln Ala Phe Met Asp Ser Leu Thr Gly Ser Ser Tyr Phe Pro Asn
20 25 30
Ser Arg Arg Ile Tyr Leu Gln Gly Lys Thr Pro Ser Val His Val Pro
35 40 45
Met Arg Glu Ile Lys Leu His Pro Thr Leu Ile Gly Lys Asn Gly Glu
50 55 60
His Tyr Glu Asp Asn Gln Pro Ile Pro Val Tyr Asp Thr Ser Gly Pro
65 70 75 80
Tyr Gly Asp Pro Thr Ile Ala Ile Asn Val Arg Thr Gly Leu Asn Arg
85 90 95
Leu Arg Glu Ile Trp Ile Leu Ala Arg Gln Asp Ser Glu Pro Ile Ser
100 105 110
Asn Asn Asn Asn Asp Arg Gln Ser Ser Asp Lys Gln Leu Ser Phe Thr
115 120 125
Thr Asn Tyr Asn Pro Arg Arg Ala Ser Tyr Gly Arg Cys Ile Thr Gln
130 135 140
Leu His Tyr Ala Arg Ala Gly Ile Ile Thr Pro Glu Met Glu Phe Ile
145 150 155 160
Ala Leu Arg Glu Asn Met Gly Arg Glu Arg Ile Ser Ser Asn Val Leu
165 170 175
His Gln Gln His Leu Gly Ser Asn Phe Gly Ala Lys Lys Ala Asp His
180 185 190
Ile Thr Ala Glu Phe Val Arg Gln Glu Val Ala Ala Gly Arg Ala Ile
195 200 205
Ile Pro Ser Asn Ile Asn His Pro Glu Ser Glu Pro Met Ile Ile Gly
210 215 220
Arg Asn Phe Leu Val Lys Val Asn Ala Asn Ile Gly Asn Ser Ala Val
225 230 235 240
Thr Ser Ser Ile Glu Glu Glu Val Glu Lys Leu Val Trp Ala Thr Arg
245 250 255
Trp Gly Ala Asp Thr Val Met Asp Leu Ser Thr Gly Ser Tyr Ile His
260 265 270
Glu Thr Arg Glu Trp Ile Leu Arg Asn Ser Pro Val Pro Ile Gly Thr
275 280 285
Val Pro Ile Tyr Gln Ala Leu Glu Lys Val Asn Gly Val Ile Glu Asn
290 295 300
Leu Asn Trp Asp Ile Phe Tyr Glu Thr Leu Leu Glu Gln Ala Asn Gln
305 310 315 320
Gly Val Asp Tyr Phe Thr Ile His Ala Gly Val Leu Lys Arg Tyr Val
325 330 335
Leu Leu Thr Ala Ser Arg Leu Thr Gly Ile Val Ser Arg Gly Gly Ser
340 345 350
Ile Met Ala Gln Trp Ser Leu Val His Asn Gln Glu Asn Phe Leu Tyr
355 360 365
Glu His Phe Ser Glu Ile Cys Lys Leu Cys Ala Ala Tyr Asp Ile Ala
370 375 380
Leu Ser Leu Gly Asp Gly Leu Arg Pro Gly Ser Val Gln Asp Ala Asn
385 390 395 400
Asp Glu Ala Gln Phe Ser Glu Leu His Thr Leu Gly Glu Leu Thr Lys
405 410 415
Ile Ala Trp Glu Tyr Asp Val Gln Val Met Ile Glu Gly Pro Gly His
420 425 430
Ile Pro Leu His Met Ile Glu Arg Asn Met Thr Asp Gln Leu Lys Tyr
435 440 445
Cys His Glu Ala Pro Phe Tyr Thr Leu Gly Pro Leu Thr Thr Asp Ile
450 455 460
Ala Pro Gly Tyr Asp His Phe Thr Ser Gly Ile Gly Ala Ala Leu Ile
465 470 475 480
Gly Trp Phe Gly Cys Ala Met Leu Cys Tyr Val Thr Pro Lys Glu His
485 490 495
Leu Gly Leu Pro Asn Lys Glu Asp Val Lys Gln Gly Leu Ile Ala Tyr
500 505 510
Lys Ile Ala Ala His Ala Ala Asp Leu Ala Lys Gly His Pro Gly Ala
515 520 525
Gln Ile Arg Asp Asn Ala Met Ser Lys Ala Arg Phe Glu Phe Arg Trp
530 535 540
Glu Asp Gln Phe Asn Leu Ala Leu Asp Pro Phe Thr Ala Arg Met Tyr
545 550 555 560
His Asp Glu Thr Ile Pro Gln Thr Ala Gly Lys Leu Ala Asn Phe Cys
565 570 575
Ser Met Cys Gly Pro Lys Phe Cys Ser Met Lys Leu Ser Lys Lys Ile
580 585 590
Arg Asn Tyr Thr Asn Met Lys Asn Ile Lys Thr Ile Ser Asn Ser Phe
595 600 605
Met Asn Lys Leu Asp Asn Ser Gly Ile Lys Asn Ala Asp Arg
610 615 620
<210> 208
<211> 636
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(636)
<223> ThiE gene from E. coli encoding thiamine phosphate synthase
<400> 208
atg tat cag cct gat ttt cct cct gta cct ttt cgt tca gga ctg tac 48
Met Tyr Gln Pro Asp Phe Pro Pro Val Pro Phe Arg Ser Gly Leu Tyr
1 5 10 15
ccg gtg gtg gac agc gta cag tgg atc gaa cgt ctg ttg gat gca ggc 96
Pro Val Val Asp Ser Val Gln Trp Ile Glu Arg Leu Leu Asp Ala Gly
20 25 30
gta cgt act ctc cag cta cgc atc aaa gat cgg cgc gat gaa gag gtg 144
Val Arg Thr Leu Gln Leu Arg Ile Lys Asp Arg Arg Asp Glu Glu Val
35 40 45
gaa gcc gat gtc gtg gcg gca att gcg ctg ggc cgc cgc tat aac gcg 192
Glu Ala Asp Val Val Ala Ala Ile Ala Leu Gly Arg Arg Tyr Asn Ala
50 55 60
cga ttg ttt atc aac gat tac tgg cgg ctg gcg atc aag cat cag gcg 240
Arg Leu Phe Ile Asn Asp Tyr Trp Arg Leu Ala Ile Lys His Gln Ala
65 70 75 80
tat ggc gtc cat ttg ggg cag gaa gat ttg caa gcc acc gat ctc aat 288
Tyr Gly Val His Leu Gly Gln Glu Asp Leu Gln Ala Thr Asp Leu Asn
85 90 95
gcc atc cgc gcg gca ggc ctg cgg ctg ggc gtt tcg aca cat gac gat 336
Ala Ile Arg Ala Ala Gly Leu Arg Leu Gly Val Ser Thr His Asp Asp
100 105 110
atg gaa atc gac gtc gcg ctg gca gca cgc ccc tct tat atc gcg ctg 384
Met Glu Ile Asp Val Ala Leu Ala Ala Arg Pro Ser Tyr Ile Ala Leu
115 120 125
gga cat gtg ttc ccg acg caa acc aaa cag atg cct tct gca ccg cag 432
Gly His Val Phe Pro Thr Gln Thr Lys Gln Met Pro Ser Ala Pro Gln
130 135 140
ggg ctg gaa cag ctg gca cgg cat gtt gag cga ctg gcg gat tat ccc 480
Gly Leu Glu Gln Leu Ala Arg His Val Glu Arg Leu Ala Asp Tyr Pro
145 150 155 160
acc gtg gcg att ggc ggt atc agt ctg gca cgc gcg cct gcg gtg ata 528
Thr Val Ala Ile Gly Gly Ile Ser Leu Ala Arg Ala Pro Ala Val Ile
165 170 175
gca acg ggt gtc ggc agt atc gcc gtc gtc agc gcc att act caa gcc 576
Ala Thr Gly Val Gly Ser Ile Ala Val Val Ser Ala Ile Thr Gln Ala
180 185 190
gca gac tgg cgt ttg gca acg gca cag ttg ctg gaa att gca gga gtt 624
Ala Asp Trp Arg Leu Ala Thr Ala Gln Leu Leu Glu Ile Ala Gly Val
195 200 205
ggc gat gaa tga 636
Gly Asp Glu
210
<210> 209
<211> 211
<212> PRT
<213> Escherichia coli
<400> 209
Met Tyr Gln Pro Asp Phe Pro Pro Val Pro Phe Arg Ser Gly Leu Tyr
1 5 10 15
Pro Val Val Asp Ser Val Gln Trp Ile Glu Arg Leu Leu Asp Ala Gly
20 25 30
Val Arg Thr Leu Gln Leu Arg Ile Lys Asp Arg Arg Asp Glu Glu Val
35 40 45
Glu Ala Asp Val Val Ala Ala Ile Ala Leu Gly Arg Arg Tyr Asn Ala
50 55 60
Arg Leu Phe Ile Asn Asp Tyr Trp Arg Leu Ala Ile Lys His Gln Ala
65 70 75 80
Tyr Gly Val His Leu Gly Gln Glu Asp Leu Gln Ala Thr Asp Leu Asn
85 90 95
Ala Ile Arg Ala Ala Gly Leu Arg Leu Gly Val Ser Thr His Asp Asp
100 105 110
Met Glu Ile Asp Val Ala Leu Ala Ala Arg Pro Ser Tyr Ile Ala Leu
115 120 125
Gly His Val Phe Pro Thr Gln Thr Lys Gln Met Pro Ser Ala Pro Gln
130 135 140
Gly Leu Glu Gln Leu Ala Arg His Val Glu Arg Leu Ala Asp Tyr Pro
145 150 155 160
Thr Val Ala Ile Gly Gly Ile Ser Leu Ala Arg Ala Pro Ala Val Ile
165 170 175
Ala Thr Gly Val Gly Ser Ile Ala Val Val Ser Ala Ile Thr Gln Ala
180 185 190
Ala Asp Trp Arg Leu Ala Thr Ala Gln Leu Leu Glu Ile Ala Gly Val
195 200 205
Gly Asp Glu
210
<210> 210
<211> 756
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(756)
<223> ThiF gene from E.coli encoding a ThiS adenylyltransferase
<400> 210
atg aat gac cgt gac ttt atg cgt tat agc cgc caa atc ctg ctc gac 48
Met Asn Asp Arg Asp Phe Met Arg Tyr Ser Arg Gln Ile Leu Leu Asp
1 5 10 15
gat atc gct ctg gac ggg cag caa aaa ctg ctc gac agc cag gtg ctg 96
Asp Ile Ala Leu Asp Gly Gln Gln Lys Leu Leu Asp Ser Gln Val Leu
20 25 30
att atc ggt ctg ggc ggg ctg ggt aca cct gct gcg ctg tac ctg gcg 144
Ile Ile Gly Leu Gly Gly Leu Gly Thr Pro Ala Ala Leu Tyr Leu Ala
35 40 45
ggc gct ggc gtc ggg acg ctg gta ctg gca gat gac gac gat gtg cat 192
Gly Ala Gly Val Gly Thr Leu Val Leu Ala Asp Asp Asp Asp Val His
50 55 60
tta agc aat ctg caa cga caa atc ctc ttt acc act gaa gat atc gat 240
Leu Ser Asn Leu Gln Arg Gln Ile Leu Phe Thr Thr Glu Asp Ile Asp
65 70 75 80
cgc ccg aaa tcg cag gtc agc caa cag cga ctg aca cag ttg aat ccc 288
Arg Pro Lys Ser Gln Val Ser Gln Gln Arg Leu Thr Gln Leu Asn Pro
85 90 95
gac att caa ctg aca gca tta caa caa cgg tta acg ggt gag gcg tta 336
Asp Ile Gln Leu Thr Ala Leu Gln Gln Arg Leu Thr Gly Glu Ala Leu
100 105 110
aaa gat gcg gtt gca cgg gcc gat gtg gtg ctc gac tgt acc gac aat 384
Lys Asp Ala Val Ala Arg Ala Asp Val Val Leu Asp Cys Thr Asp Asn
115 120 125
atg gcg act cgc cag gag att aat gcc gcc tgc gtg gca ctc aac acg 432
Met Ala Thr Arg Gln Glu Ile Asn Ala Ala Cys Val Ala Leu Asn Thr
130 135 140
ccg ctt atc acc gcc agc gcg gtc gga ttt ggc ggt cag ttg atg gta 480
Pro Leu Ile Thr Ala Ser Ala Val Gly Phe Gly Gly Gln Leu Met Val
145 150 155 160
ctg acg ccg ccc tgg gag cag ggg tgt tac cgc tgc ctg tgg cca gat 528
Leu Thr Pro Pro Trp Glu Gln Gly Cys Tyr Arg Cys Leu Trp Pro Asp
165 170 175
aac cag gag cca gaa cgc aac tgc cgc acg gcg ggc gtg gtt ggc ccg 576
Asn Gln Glu Pro Glu Arg Asn Cys Arg Thr Ala Gly Val Val Gly Pro
180 185 190
gtg gtc ggg gtt atg ggc act ttg cag gca ctg gaa gcc att aag tta 624
Val Val Gly Val Met Gly Thr Leu Gln Ala Leu Glu Ala Ile Lys Leu
195 200 205
tta agc ggt ata gag aca cct gcg gga gaa ctc cga ctg ttc gac ggt 672
Leu Ser Gly Ile Glu Thr Pro Ala Gly Glu Leu Arg Leu Phe Asp Gly
210 215 220
aaa tcg agc cag tgg cgc agc ctg gcg ttg cgc cgc gcc agt ggt tgc 720
Lys Ser Ser Gln Trp Arg Ser Leu Ala Leu Arg Arg Ala Ser Gly Cys
225 230 235 240
ccg gta tgc gga gga agc aat gca gat cct gtt taa 756
Pro Val Cys Gly Gly Ser Asn Ala Asp Pro Val
245 250
<210> 211
<211> 251
<212> PRT
<213> Escherichia coli
<400> 211
Met Asn Asp Arg Asp Phe Met Arg Tyr Ser Arg Gln Ile Leu Leu Asp
1 5 10 15
Asp Ile Ala Leu Asp Gly Gln Gln Lys Leu Leu Asp Ser Gln Val Leu
20 25 30
Ile Ile Gly Leu Gly Gly Leu Gly Thr Pro Ala Ala Leu Tyr Leu Ala
35 40 45
Gly Ala Gly Val Gly Thr Leu Val Leu Ala Asp Asp Asp Asp Val His
50 55 60
Leu Ser Asn Leu Gln Arg Gln Ile Leu Phe Thr Thr Glu Asp Ile Asp
65 70 75 80
Arg Pro Lys Ser Gln Val Ser Gln Gln Arg Leu Thr Gln Leu Asn Pro
85 90 95
Asp Ile Gln Leu Thr Ala Leu Gln Gln Arg Leu Thr Gly Glu Ala Leu
100 105 110
Lys Asp Ala Val Ala Arg Ala Asp Val Val Leu Asp Cys Thr Asp Asn
115 120 125
Met Ala Thr Arg Gln Glu Ile Asn Ala Ala Cys Val Ala Leu Asn Thr
130 135 140
Pro Leu Ile Thr Ala Ser Ala Val Gly Phe Gly Gly Gln Leu Met Val
145 150 155 160
Leu Thr Pro Pro Trp Glu Gln Gly Cys Tyr Arg Cys Leu Trp Pro Asp
165 170 175
Asn Gln Glu Pro Glu Arg Asn Cys Arg Thr Ala Gly Val Val Gly Pro
180 185 190
Val Val Gly Val Met Gly Thr Leu Gln Ala Leu Glu Ala Ile Lys Leu
195 200 205
Leu Ser Gly Ile Glu Thr Pro Ala Gly Glu Leu Arg Leu Phe Asp Gly
210 215 220
Lys Ser Ser Gln Trp Arg Ser Leu Ala Leu Arg Arg Ala Ser Gly Cys
225 230 235 240
Pro Val Cys Gly Gly Ser Asn Ala Asp Pro Val
245 250
<210> 212
<211> 201
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(201)
<223> ThiS gene from E. coli encoding Sulfur-carrier protein
<400> 212
atg cag atc ctg ttt aac gat caa gcg atg cag tgc gcc gcc ggg caa 48
Met Gln Ile Leu Phe Asn Asp Gln Ala Met Gln Cys Ala Ala Gly Gln
1 5 10 15
act gtt cac gaa cta ctg gag caa ctc gac caa cga caa gcg ggc gcg 96
Thr Val His Glu Leu Leu Glu Gln Leu Asp Gln Arg Gln Ala Gly Ala
20 25 30
gct ctg gcg att aat cag caa atc gtc ccg cgt gag cag tgg gcg caa 144
Ala Leu Ala Ile Asn Gln Gln Ile Val Pro Arg Glu Gln Trp Ala Gln
35 40 45
cat atc gtg cag gat ggc gac cag atc ctg ctt ttt cag gtt att gca 192
His Ile Val Gln Asp Gly Asp Gln Ile Leu Leu Phe Gln Val Ile Ala
50 55 60
ggg ggt tga 201
Gly Gly
65
<210> 213
<211> 66
<212> PRT
<213> Escherichia coli
<400> 213
Met Gln Ile Leu Phe Asn Asp Gln Ala Met Gln Cys Ala Ala Gly Gln
1 5 10 15
Thr Val His Glu Leu Leu Glu Gln Leu Asp Gln Arg Gln Ala Gly Ala
20 25 30
Ala Leu Ala Ile Asn Gln Gln Ile Val Pro Arg Glu Gln Trp Ala Gln
35 40 45
His Ile Val Gln Asp Gly Asp Gln Ile Leu Leu Phe Gln Val Ile Ala
50 55 60
Gly Gly
65
<210> 214
<211> 771
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(771)
<223> ThiG gene from E. coli encoding Thiazole synthase
<400> 214
atg tta cgt att gcg gac aaa acg ttt gat tca cat ctg ttt acc ggc 48
Met Leu Arg Ile Ala Asp Lys Thr Phe Asp Ser His Leu Phe Thr Gly
1 5 10 15
aca ggg aaa ttc gct tct tca caa ctg atg gtg gag gcg atc cgc gct 96
Thr Gly Lys Phe Ala Ser Ser Gln Leu Met Val Glu Ala Ile Arg Ala
20 25 30
tcc ggc agc cag ctg gtg aca ctg gcg atg aaa cgt gtc gac ttg cgc 144
Ser Gly Ser Gln Leu Val Thr Leu Ala Met Lys Arg Val Asp Leu Arg
35 40 45
cag cac aac gac gct atc ctc gaa ccg ctt atc gcg gcg ggt gtg acc 192
Gln His Asn Asp Ala Ile Leu Glu Pro Leu Ile Ala Ala Gly Val Thr
50 55 60
ctg ctg cca aat aca tcc ggg gcg aaa aca gcg gaa gaa gcc att ttc 240
Leu Leu Pro Asn Thr Ser Gly Ala Lys Thr Ala Glu Glu Ala Ile Phe
65 70 75 80
gcc gcc cat ctg gct cgt gaa gcg tta ggc aca aac tgg tta aaa tta 288
Ala Ala His Leu Ala Arg Glu Ala Leu Gly Thr Asn Trp Leu Lys Leu
85 90 95
gag att cac cct gac gcc cgc tgg ctg ttg ccc gat ccc atc gaa acc 336
Glu Ile His Pro Asp Ala Arg Trp Leu Leu Pro Asp Pro Ile Glu Thr
100 105 110
ctg aaa gcc gcc gaa acg ctg gta caa cag gga ttt gtc gtg ctg cct 384
Leu Lys Ala Ala Glu Thr Leu Val Gln Gln Gly Phe Val Val Leu Pro
115 120 125
tac tgc ggg gcc gat ccg gta ttg tgt aaa cgt ctg gaa gaa gtc ggc 432
Tyr Cys Gly Ala Asp Pro Val Leu Cys Lys Arg Leu Glu Glu Val Gly
130 135 140
tgt gca gcg gtg atg ccg ctc ggc gcg ccg att ggc tcg aat cag gga 480
Cys Ala Ala Val Met Pro Leu Gly Ala Pro Ile Gly Ser Asn Gln Gly
145 150 155 160
ctg gaa acc cgc gcc atg ctg gag att att atc cag cag gcc aca gtg 528
Leu Glu Thr Arg Ala Met Leu Glu Ile Ile Ile Gln Gln Ala Thr Val
165 170 175
ccg gtg gtt gtc gat gct ggc atc ggc gtt ccc agc cat gcc gcg cag 576
Pro Val Val Val Asp Ala Gly Ile Gly Val Pro Ser His Ala Ala Gln
180 185 190
gcg ctg gaa atg ggg gcc gac gcg gtg tta gtg aat acg gcg att gcc 624
Ala Leu Glu Met Gly Ala Asp Ala Val Leu Val Asn Thr Ala Ile Ala
195 200 205
gtc gcg gac gat ccc gtc aac atg gcg aag gca ttt cgt ctg gcg gta 672
Val Ala Asp Asp Pro Val Asn Met Ala Lys Ala Phe Arg Leu Ala Val
210 215 220
gaa gca ggc cta ctg gca cgt cag tcc gga ccg ggc agc cgc agt tat 720
Glu Ala Gly Leu Leu Ala Arg Gln Ser Gly Pro Gly Ser Arg Ser Tyr
225 230 235 240
ttt gct cat gcc acc agc ccg ctg acc gga ttt ctg gag gca tcg gca 768
Phe Ala His Ala Thr Ser Pro Leu Thr Gly Phe Leu Glu Ala Ser Ala
245 250 255
tga 771
<210> 215
<211> 256
<212> PRT
<213> Escherichia coli
<400> 215
Met Leu Arg Ile Ala Asp Lys Thr Phe Asp Ser His Leu Phe Thr Gly
1 5 10 15
Thr Gly Lys Phe Ala Ser Ser Gln Leu Met Val Glu Ala Ile Arg Ala
20 25 30
Ser Gly Ser Gln Leu Val Thr Leu Ala Met Lys Arg Val Asp Leu Arg
35 40 45
Gln His Asn Asp Ala Ile Leu Glu Pro Leu Ile Ala Ala Gly Val Thr
50 55 60
Leu Leu Pro Asn Thr Ser Gly Ala Lys Thr Ala Glu Glu Ala Ile Phe
65 70 75 80
Ala Ala His Leu Ala Arg Glu Ala Leu Gly Thr Asn Trp Leu Lys Leu
85 90 95
Glu Ile His Pro Asp Ala Arg Trp Leu Leu Pro Asp Pro Ile Glu Thr
100 105 110
Leu Lys Ala Ala Glu Thr Leu Val Gln Gln Gly Phe Val Val Leu Pro
115 120 125
Tyr Cys Gly Ala Asp Pro Val Leu Cys Lys Arg Leu Glu Glu Val Gly
130 135 140
Cys Ala Ala Val Met Pro Leu Gly Ala Pro Ile Gly Ser Asn Gln Gly
145 150 155 160
Leu Glu Thr Arg Ala Met Leu Glu Ile Ile Ile Gln Gln Ala Thr Val
165 170 175
Pro Val Val Val Asp Ala Gly Ile Gly Val Pro Ser His Ala Ala Gln
180 185 190
Ala Leu Glu Met Gly Ala Asp Ala Val Leu Val Asn Thr Ala Ile Ala
195 200 205
Val Ala Asp Asp Pro Val Asn Met Ala Lys Ala Phe Arg Leu Ala Val
210 215 220
Glu Ala Gly Leu Leu Ala Arg Gln Ser Gly Pro Gly Ser Arg Ser Tyr
225 230 235 240
Phe Ala His Ala Thr Ser Pro Leu Thr Gly Phe Leu Glu Ala Ser Ala
245 250 255
<210> 216
<211> 1134
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1134)
<223> ThiH gene from E. coli encoding 2-iminoacetate synthase
<400> 216
atg aaa acc ttc agc gat cgc tgg cga caa ctg gac tgg gac gac atc 48
Met Lys Thr Phe Ser Asp Arg Trp Arg Gln Leu Asp Trp Asp Asp Ile
1 5 10 15
cgc ctg cgt atc aac ggc aaa acg gct gct gac gta gag cgg gcg cta 96
Arg Leu Arg Ile Asn Gly Lys Thr Ala Ala Asp Val Glu Arg Ala Leu
20 25 30
aat gcc tcg caa ctc acc cgc gac gac atg atg gcg ctg tta tcg cct 144
Asn Ala Ser Gln Leu Thr Arg Asp Asp Met Met Ala Leu Leu Ser Pro
35 40 45
gcc gcc agt ggc tat ctg gaa caa ctg gcc caa cgg gcg cag cgt ctg 192
Ala Ala Ser Gly Tyr Leu Glu Gln Leu Ala Gln Arg Ala Gln Arg Leu
50 55 60
acc cgt cag cga ttt ggc aac aca gtt agt ttc tac gtc ccg ctt tat 240
Thr Arg Gln Arg Phe Gly Asn Thr Val Ser Phe Tyr Val Pro Leu Tyr
65 70 75 80
ctt tcc aat ctt tgc gct aac gac tgc acg tac tgt gga ttt tcc atg 288
Leu Ser Asn Leu Cys Ala Asn Asp Cys Thr Tyr Cys Gly Phe Ser Met
85 90 95
agt aat cgc atc aag cgc aaa acg ctg gat gaa gcg gat att gcc agg 336
Ser Asn Arg Ile Lys Arg Lys Thr Leu Asp Glu Ala Asp Ile Ala Arg
100 105 110
gaa agt gcc gct ata cgg gag atg ggc ttt gaa cat ctg ctg tta gtc 384
Glu Ser Ala Ala Ile Arg Glu Met Gly Phe Glu His Leu Leu Leu Val
115 120 125
act ggt gaa cat cag gcg aaa gtg ggg atg gat tac ttt cgt cgt cat 432
Thr Gly Glu His Gln Ala Lys Val Gly Met Asp Tyr Phe Arg Arg His
130 135 140
ctc cct gcc ctt cgt gaa cag ttc tct tca cta cag atg gaa gtg caa 480
Leu Pro Ala Leu Arg Glu Gln Phe Ser Ser Leu Gln Met Glu Val Gln
145 150 155 160
ccg ctg gcg gag acg gaa tac gcc gag tta aag caa ctt ggt ctg gat 528
Pro Leu Ala Glu Thr Glu Tyr Ala Glu Leu Lys Gln Leu Gly Leu Asp
165 170 175
ggc gtg atg gtt tat cag gag aca tat cac gag gcg act tat gcc cgc 576
Gly Val Met Val Tyr Gln Glu Thr Tyr His Glu Ala Thr Tyr Ala Arg
180 185 190
cat cat ctg aaa ggc aaa aaa cag gac ttc ttc tgg cgg ctg gaa acg 624
His His Leu Lys Gly Lys Lys Gln Asp Phe Phe Trp Arg Leu Glu Thr
195 200 205
ccg gat cgg ctg ggg cgt gcg ggg att gat aag ata ggc ctc ggc gcg 672
Pro Asp Arg Leu Gly Arg Ala Gly Ile Asp Lys Ile Gly Leu Gly Ala
210 215 220
cta att ggc ctt tcc gac aac tgg cgc gtt gac agc tat atg gtt gcc 720
Leu Ile Gly Leu Ser Asp Asn Trp Arg Val Asp Ser Tyr Met Val Ala
225 230 235 240
gaa cat ttg cta tgg ctg caa cag cat tac tgg caa agc cgt tac tct 768
Glu His Leu Leu Trp Leu Gln Gln His Tyr Trp Gln Ser Arg Tyr Ser
245 250 255
gtc tcc ttt ccg cgc ctg cgc ccg tgt act ggc ggc att gag cct gcg 816
Val Ser Phe Pro Arg Leu Arg Pro Cys Thr Gly Gly Ile Glu Pro Ala
260 265 270
tcg att atg gat gaa cgc cag tta gtg caa acc atc tgc gcc ttc cga 864
Ser Ile Met Asp Glu Arg Gln Leu Val Gln Thr Ile Cys Ala Phe Arg
275 280 285
ctg ctt gca ccg gag att gaa ctg tca ctc tcc acg cgg gaa tca ccg 912
Leu Leu Ala Pro Glu Ile Glu Leu Ser Leu Ser Thr Arg Glu Ser Pro
290 295 300
tgg ttt cgc gat cgc gtt att ccg ctg gcg atc aat aac gtc agc gcc 960
Trp Phe Arg Asp Arg Val Ile Pro Leu Ala Ile Asn Asn Val Ser Ala
305 310 315 320
ttc tcg aaa acg cag cca ggt ggc tat gcc gat aat cac ccc gag ttg 1008
Phe Ser Lys Thr Gln Pro Gly Gly Tyr Ala Asp Asn His Pro Glu Leu
325 330 335
gaa cag ttc tca ccg cac gac gat cgc aga ccg gaa gcg gtt gct gcc 1056
Glu Gln Phe Ser Pro His Asp Asp Arg Arg Pro Glu Ala Val Ala Ala
340 345 350
gcg tta acc gct cag ggt ttg cag ccg gta tgg aaa gac tgg gac agc 1104
Ala Leu Thr Ala Gln Gly Leu Gln Pro Val Trp Lys Asp Trp Asp Ser
355 360 365
tat ctg gga cgc gcc tcg caa aga cta tga 1134
Tyr Leu Gly Arg Ala Ser Gln Arg Leu
370 375
<210> 217
<211> 377
<212> PRT
<213> Escherichia coli
<400> 217
Met Lys Thr Phe Ser Asp Arg Trp Arg Gln Leu Asp Trp Asp Asp Ile
1 5 10 15
Arg Leu Arg Ile Asn Gly Lys Thr Ala Ala Asp Val Glu Arg Ala Leu
20 25 30
Asn Ala Ser Gln Leu Thr Arg Asp Asp Met Met Ala Leu Leu Ser Pro
35 40 45
Ala Ala Ser Gly Tyr Leu Glu Gln Leu Ala Gln Arg Ala Gln Arg Leu
50 55 60
Thr Arg Gln Arg Phe Gly Asn Thr Val Ser Phe Tyr Val Pro Leu Tyr
65 70 75 80
Leu Ser Asn Leu Cys Ala Asn Asp Cys Thr Tyr Cys Gly Phe Ser Met
85 90 95
Ser Asn Arg Ile Lys Arg Lys Thr Leu Asp Glu Ala Asp Ile Ala Arg
100 105 110
Glu Ser Ala Ala Ile Arg Glu Met Gly Phe Glu His Leu Leu Leu Val
115 120 125
Thr Gly Glu His Gln Ala Lys Val Gly Met Asp Tyr Phe Arg Arg His
130 135 140
Leu Pro Ala Leu Arg Glu Gln Phe Ser Ser Leu Gln Met Glu Val Gln
145 150 155 160
Pro Leu Ala Glu Thr Glu Tyr Ala Glu Leu Lys Gln Leu Gly Leu Asp
165 170 175
Gly Val Met Val Tyr Gln Glu Thr Tyr His Glu Ala Thr Tyr Ala Arg
180 185 190
His His Leu Lys Gly Lys Lys Gln Asp Phe Phe Trp Arg Leu Glu Thr
195 200 205
Pro Asp Arg Leu Gly Arg Ala Gly Ile Asp Lys Ile Gly Leu Gly Ala
210 215 220
Leu Ile Gly Leu Ser Asp Asn Trp Arg Val Asp Ser Tyr Met Val Ala
225 230 235 240
Glu His Leu Leu Trp Leu Gln Gln His Tyr Trp Gln Ser Arg Tyr Ser
245 250 255
Val Ser Phe Pro Arg Leu Arg Pro Cys Thr Gly Gly Ile Glu Pro Ala
260 265 270
Ser Ile Met Asp Glu Arg Gln Leu Val Gln Thr Ile Cys Ala Phe Arg
275 280 285
Leu Leu Ala Pro Glu Ile Glu Leu Ser Leu Ser Thr Arg Glu Ser Pro
290 295 300
Trp Phe Arg Asp Arg Val Ile Pro Leu Ala Ile Asn Asn Val Ser Ala
305 310 315 320
Phe Ser Lys Thr Gln Pro Gly Gly Tyr Ala Asp Asn His Pro Glu Leu
325 330 335
Glu Gln Phe Ser Pro His Asp Asp Arg Arg Pro Glu Ala Val Ala Ala
340 345 350
Ala Leu Thr Ala Gln Gly Leu Gln Pro Val Trp Lys Asp Trp Asp Ser
355 360 365
Tyr Leu Gly Arg Ala Ser Gln Arg Leu
370 375
<210> 218
<211> 1110
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(1110)
<223> ThiO gene from E. coli encoding Glycine oxidase
<400> 218
atg aaa agg cat tat gaa gca gtg gtg att gga ggc gga att atc ggt 48
Met Lys Arg His Tyr Glu Ala Val Val Ile Gly Gly Gly Ile Ile Gly
1 5 10 15
tcc gca att gct tat tat ttg gca aag gaa aac aaa aac acc gca ttg 96
Ser Ala Ile Ala Tyr Tyr Leu Ala Lys Glu Asn Lys Asn Thr Ala Leu
20 25 30
ttt gaa agc gga aca atg ggc ggc aga acg aca agt gcc gct gcc gga 144
Phe Glu Ser Gly Thr Met Gly Gly Arg Thr Thr Ser Ala Ala Ala Gly
35 40 45
atg ctg ggc gcc cat gcc gaa tgc gag gaa cgt gac gcg ttt ttt gat 192
Met Leu Gly Ala His Ala Glu Cys Glu Glu Arg Asp Ala Phe Phe Asp
50 55 60
ttc gct atg cac agt cag cgt ctg tac aaa ggt ctt gga gaa gag ctt 240
Phe Ala Met His Ser Gln Arg Leu Tyr Lys Gly Leu Gly Glu Glu Leu
65 70 75 80
tat gca tta tcc ggt gtg gat atc agg cag cat aac ggc ggt atg ttt 288
Tyr Ala Leu Ser Gly Val Asp Ile Arg Gln His Asn Gly Gly Met Phe
85 90 95
aag ctt gca ttt tct gaa gaa gat gtg ctg cag ctg aga cag atg gac 336
Lys Leu Ala Phe Ser Glu Glu Asp Val Leu Gln Leu Arg Gln Met Asp
100 105 110
gat ttg gac tct gtc agc tgg tat tca aaa gaa gag gtg tta gaa aaa 384
Asp Leu Asp Ser Val Ser Trp Tyr Ser Lys Glu Glu Val Leu Glu Lys
115 120 125
gag ccg tat gcg tct ggt gac atc ttt ggt gca tct ttt att cag gat 432
Glu Pro Tyr Ala Ser Gly Asp Ile Phe Gly Ala Ser Phe Ile Gln Asp
130 135 140
gat gtg cat gtg gag cct tat ttt gtt tgc aag gca tat gtg aaa gca 480
Asp Val His Val Glu Pro Tyr Phe Val Cys Lys Ala Tyr Val Lys Ala
145 150 155 160
gca aaa atg ctt ggg gcg gag att ttt gag cat acg ccc gtc ctg cat 528
Ala Lys Met Leu Gly Ala Glu Ile Phe Glu His Thr Pro Val Leu His
165 170 175
gtc gaa cgt gac ggt gaa gcc ctg ttc atc aag acc cct agc gga gac 576
Val Glu Arg Asp Gly Glu Ala Leu Phe Ile Lys Thr Pro Ser Gly Asp
180 185 190
gta tgg gct aat cat gtt gtc gtt gcc agc ggg gtg tgg agc gga atg 624
Val Trp Ala Asn His Val Val Val Ala Ser Gly Val Trp Ser Gly Met
195 200 205
ttt ttt aaa cag ctt gga ctg aac aat gct ttt ctc cct gta aaa ggg 672
Phe Phe Lys Gln Leu Gly Leu Asn Asn Ala Phe Leu Pro Val Lys Gly
210 215 220
gag tgc ctg tcc gtt tgg aat gat gat atc ccg ctg aca aaa acg ctt 720
Glu Cys Leu Ser Val Trp Asn Asp Asp Ile Pro Leu Thr Lys Thr Leu
225 230 235 240
tac cat gat cac tgc tat atc gta ccg aga aaa agc gga aga ctg gtt 768
Tyr His Asp His Cys Tyr Ile Val Pro Arg Lys Ser Gly Arg Leu Val
245 250 255
gtc ggc gcg aca atg aag ccg ggg gac tgg agt gaa aca ccg gat ctt 816
Val Gly Ala Thr Met Lys Pro Gly Asp Trp Ser Glu Thr Pro Asp Leu
260 265 270
ggc gga ttg gaa tct gtt atg aaa aaa gca aaa acg atg ctg ccg gct 864
Gly Gly Leu Glu Ser Val Met Lys Lys Ala Lys Thr Met Leu Pro Ala
275 280 285
ata cag aat atg aag gtg gat cgt ttt tgg gcg gga ctc cgt ccg gga 912
Ile Gln Asn Met Lys Val Asp Arg Phe Trp Ala Gly Leu Arg Pro Gly
290 295 300
aca aag gat gga aaa ccg tac atc ggc aga cat cct gag gac agc cgt 960
Thr Lys Asp Gly Lys Pro Tyr Ile Gly Arg His Pro Glu Asp Ser Arg
305 310 315 320
att tta ttt gcg gct ggc cat ttc aga aac ggg atc ctg ctt gct ccc 1008
Ile Leu Phe Ala Ala Gly His Phe Arg Asn Gly Ile Leu Leu Ala Pro
325 330 335
gca acg ggc gct ttg atc agt gat ctc atc atg aat aaa gag gtc aac 1056
Ala Thr Gly Ala Leu Ile Ser Asp Leu Ile Met Asn Lys Glu Val Asn
340 345 350
caa gac tgg ctg cac gca ttc cga att gat cgc aag gag gcg gtt cag 1104
Gln Asp Trp Leu His Ala Phe Arg Ile Asp Arg Lys Glu Ala Val Gln
355 360 365
ata tga 1110
Ile
<210> 219
<211> 369
<212> PRT
<213> Escherichia coli
<400> 219
Met Lys Arg His Tyr Glu Ala Val Val Ile Gly Gly Gly Ile Ile Gly
1 5 10 15
Ser Ala Ile Ala Tyr Tyr Leu Ala Lys Glu Asn Lys Asn Thr Ala Leu
20 25 30
Phe Glu Ser Gly Thr Met Gly Gly Arg Thr Thr Ser Ala Ala Ala Gly
35 40 45
Met Leu Gly Ala His Ala Glu Cys Glu Glu Arg Asp Ala Phe Phe Asp
50 55 60
Phe Ala Met His Ser Gln Arg Leu Tyr Lys Gly Leu Gly Glu Glu Leu
65 70 75 80
Tyr Ala Leu Ser Gly Val Asp Ile Arg Gln His Asn Gly Gly Met Phe
85 90 95
Lys Leu Ala Phe Ser Glu Glu Asp Val Leu Gln Leu Arg Gln Met Asp
100 105 110
Asp Leu Asp Ser Val Ser Trp Tyr Ser Lys Glu Glu Val Leu Glu Lys
115 120 125
Glu Pro Tyr Ala Ser Gly Asp Ile Phe Gly Ala Ser Phe Ile Gln Asp
130 135 140
Asp Val His Val Glu Pro Tyr Phe Val Cys Lys Ala Tyr Val Lys Ala
145 150 155 160
Ala Lys Met Leu Gly Ala Glu Ile Phe Glu His Thr Pro Val Leu His
165 170 175
Val Glu Arg Asp Gly Glu Ala Leu Phe Ile Lys Thr Pro Ser Gly Asp
180 185 190
Val Trp Ala Asn His Val Val Val Ala Ser Gly Val Trp Ser Gly Met
195 200 205
Phe Phe Lys Gln Leu Gly Leu Asn Asn Ala Phe Leu Pro Val Lys Gly
210 215 220
Glu Cys Leu Ser Val Trp Asn Asp Asp Ile Pro Leu Thr Lys Thr Leu
225 230 235 240
Tyr His Asp His Cys Tyr Ile Val Pro Arg Lys Ser Gly Arg Leu Val
245 250 255
Val Gly Ala Thr Met Lys Pro Gly Asp Trp Ser Glu Thr Pro Asp Leu
260 265 270
Gly Gly Leu Glu Ser Val Met Lys Lys Ala Lys Thr Met Leu Pro Ala
275 280 285
Ile Gln Asn Met Lys Val Asp Arg Phe Trp Ala Gly Leu Arg Pro Gly
290 295 300
Thr Lys Asp Gly Lys Pro Tyr Ile Gly Arg His Pro Glu Asp Ser Arg
305 310 315 320
Ile Leu Phe Ala Ala Gly His Phe Arg Asn Gly Ile Leu Leu Ala Pro
325 330 335
Ala Thr Gly Ala Leu Ile Ser Asp Leu Ile Met Asn Lys Glu Val Asn
340 345 350
Gln Asp Trp Leu His Ala Phe Arg Ile Asp Arg Lys Glu Ala Val Gln
355 360 365
Ile
<210> 220
<211> 1098
<212> DNA
<213> Pseudomonas putida
<220>
<221> CDS
<222> (1)..(1098)
<223> ThiO gene from Pseudomonas putida encoding a Glycine oxidase
<400> 220
atg agc aag caa gta gtg gtg gtc ggt ggc ggg gtc att ggc ctg ctg 48
Met Ser Lys Gln Val Val Val Val Gly Gly Gly Val Ile Gly Leu Leu
1 5 10 15
acg gca ttc aac ctg gcg gcg agc gtc gac cag gtg gtg gta tgc gac 96
Thr Ala Phe Asn Leu Ala Ala Ser Val Asp Gln Val Val Val Cys Asp
20 25 30
cag ggc gaa gta ggg cgc gag tcc tcc tgg gct ggg ggc ggt atc gtc 144
Gln Gly Glu Val Gly Arg Glu Ser Ser Trp Ala Gly Gly Gly Ile Val
35 40 45
tcg ccc ctg tat cct tgg cgc tac agc ccg gca gtg acc gcc ctg gcg 192
Ser Pro Leu Tyr Pro Trp Arg Tyr Ser Pro Ala Val Thr Ala Leu Ala
50 55 60
cat tgg tcg cag gac ttt tac cca cag ttg ggc gag cgc ttg ttc gcc 240
His Trp Ser Gln Asp Phe Tyr Pro Gln Leu Gly Glu Arg Leu Phe Ala
65 70 75 80
agc acg ggc ctg gat cct gag gtg cat acc acc ggg ctt tac tgg ctc 288
Ser Thr Gly Leu Asp Pro Glu Val His Thr Thr Gly Leu Tyr Trp Leu
85 90 95
gac ctg gat gac caa gcc cag gcc ttg gcg tgg gca ggc cgt cag cag 336
Asp Leu Asp Asp Gln Ala Gln Ala Leu Ala Trp Ala Gly Arg Gln Gln
100 105 110
cgt ccg ctg agc gcc gtg gat att tca gcg gtg tac gac gca gtc cct 384
Arg Pro Leu Ser Ala Val Asp Ile Ser Ala Val Tyr Asp Ala Val Pro
115 120 125
gtg ctg ggg cca ggc ttt gag cga gcc ctc tac atg gaa ggc gtg gcc 432
Val Leu Gly Pro Gly Phe Glu Arg Ala Leu Tyr Met Glu Gly Val Ala
130 135 140
aat gtg cgc aac ccg cgc ctg gtc aaa tcg ctg aag gcg gcg ttg ctg 480
Asn Val Arg Asn Pro Arg Leu Val Lys Ser Leu Lys Ala Ala Leu Leu
145 150 155 160
gca ttg ccc aat gtg agc gtg cgc gag cac tgc cag atc acg ggg ttc 528
Ala Leu Pro Asn Val Ser Val Arg Glu His Cys Gln Ile Thr Gly Phe
165 170 175
gtg cag cag ggc gct cgt atc att ggg gtg agc acc gct gaa ggc gag 576
Val Gln Gln Gly Ala Arg Ile Ile Gly Val Ser Thr Ala Glu Gly Glu
180 185 190
ctg gcc gcc gac gaa gtc gta ctg agc gcc ggt gcc tgg agc ggc gaa 624
Leu Ala Ala Asp Glu Val Val Leu Ser Ala Gly Ala Trp Ser Gly Glu
195 200 205
ctg ctg cgc cac ttg ggc ctt gag ctt cca gtc gag ccg gta aaa ggg 672
Leu Leu Arg His Leu Gly Leu Glu Leu Pro Val Glu Pro Val Lys Gly
210 215 220
cag atg atc ctg ttc aaa tgc gct gaa gat ttt ctg cca agc atg gtg 720
Gln Met Ile Leu Phe Lys Cys Ala Glu Asp Phe Leu Pro Ser Met Val
225 230 235 240
ctt gcc aaa ggt cgt tat gca att ccg cgt cgg gat ggt cac att ctg 768
Leu Ala Lys Gly Arg Tyr Ala Ile Pro Arg Arg Asp Gly His Ile Leu
245 250 255
gtg ggc agc acg ctg gag cat gcc ggc tac gac aag aca ccc acc gat 816
Val Gly Ser Thr Leu Glu His Ala Gly Tyr Asp Lys Thr Pro Thr Asp
260 265 270
gag gcg ttg gcc agc ctc aag gca tcg gcg gtg gat ctg ctg ccc ggc 864
Glu Ala Leu Ala Ser Leu Lys Ala Ser Ala Val Asp Leu Leu Pro Gly
275 280 285
ctg gaa ggc gcg cac gtg gtt gcc cac tgg gcc ggg ctg cgg cca ggt 912
Leu Glu Gly Ala His Val Val Ala His Trp Ala Gly Leu Arg Pro Gly
290 295 300
tcg cca gaa ggc gtt ccg ttt atc ggg ccg gta ccc ggc ttc gat ggg 960
Ser Pro Glu Gly Val Pro Phe Ile Gly Pro Val Pro Gly Phe Asp Gly
305 310 315 320
tta tgg ctg aac tgc ggc cat tac cga aac ggg ctg gtg ctg gcg ccc 1008
Leu Trp Leu Asn Cys Gly His Tyr Arg Asn Gly Leu Val Leu Ala Pro
325 330 335
gct tcg tgc caa ctg ctg gcc gat ttg ctc aat ggc gcc gag ccc atc 1056
Ala Ser Cys Gln Leu Leu Ala Asp Leu Leu Asn Gly Ala Glu Pro Ile
340 345 350
atc gac ccg tca ccc tac gcc ccg tct ggg cgc ctt ggc taa 1098
Ile Asp Pro Ser Pro Tyr Ala Pro Ser Gly Arg Leu Gly
355 360 365
<210> 221
<211> 365
<212> PRT
<213> Pseudomonas putida
<400> 221
Met Ser Lys Gln Val Val Val Val Gly Gly Gly Val Ile Gly Leu Leu
1 5 10 15
Thr Ala Phe Asn Leu Ala Ala Ser Val Asp Gln Val Val Val Cys Asp
20 25 30
Gln Gly Glu Val Gly Arg Glu Ser Ser Trp Ala Gly Gly Gly Ile Val
35 40 45
Ser Pro Leu Tyr Pro Trp Arg Tyr Ser Pro Ala Val Thr Ala Leu Ala
50 55 60
His Trp Ser Gln Asp Phe Tyr Pro Gln Leu Gly Glu Arg Leu Phe Ala
65 70 75 80
Ser Thr Gly Leu Asp Pro Glu Val His Thr Thr Gly Leu Tyr Trp Leu
85 90 95
Asp Leu Asp Asp Gln Ala Gln Ala Leu Ala Trp Ala Gly Arg Gln Gln
100 105 110
Arg Pro Leu Ser Ala Val Asp Ile Ser Ala Val Tyr Asp Ala Val Pro
115 120 125
Val Leu Gly Pro Gly Phe Glu Arg Ala Leu Tyr Met Glu Gly Val Ala
130 135 140
Asn Val Arg Asn Pro Arg Leu Val Lys Ser Leu Lys Ala Ala Leu Leu
145 150 155 160
Ala Leu Pro Asn Val Ser Val Arg Glu His Cys Gln Ile Thr Gly Phe
165 170 175
Val Gln Gln Gly Ala Arg Ile Ile Gly Val Ser Thr Ala Glu Gly Glu
180 185 190
Leu Ala Ala Asp Glu Val Val Leu Ser Ala Gly Ala Trp Ser Gly Glu
195 200 205
Leu Leu Arg His Leu Gly Leu Glu Leu Pro Val Glu Pro Val Lys Gly
210 215 220
Gln Met Ile Leu Phe Lys Cys Ala Glu Asp Phe Leu Pro Ser Met Val
225 230 235 240
Leu Ala Lys Gly Arg Tyr Ala Ile Pro Arg Arg Asp Gly His Ile Leu
245 250 255
Val Gly Ser Thr Leu Glu His Ala Gly Tyr Asp Lys Thr Pro Thr Asp
260 265 270
Glu Ala Leu Ala Ser Leu Lys Ala Ser Ala Val Asp Leu Leu Pro Gly
275 280 285
Leu Glu Gly Ala His Val Val Ala His Trp Ala Gly Leu Arg Pro Gly
290 295 300
Ser Pro Glu Gly Val Pro Phe Ile Gly Pro Val Pro Gly Phe Asp Gly
305 310 315 320
Leu Trp Leu Asn Cys Gly His Tyr Arg Asn Gly Leu Val Leu Ala Pro
325 330 335
Ala Ser Cys Gln Leu Leu Ala Asp Leu Leu Asn Gly Ala Glu Pro Ile
340 345 350
Ile Asp Pro Ser Pro Tyr Ala Pro Ser Gly Arg Leu Gly
355 360 365
<210> 222
<211> 1140
<212> DNA
<213> Synechococcus elongatus
<220>
<221> CDS
<222> (1)..(1140)
<223> ThiO gene from Synechococcus elongatus encoding a Glycine oxidase
<400> 222
atg gcg ttc gag gta gcc gtc ttt ggg ggc ggc gtc att ggc ttg gcg 48
Met Ala Phe Glu Val Ala Val Phe Gly Gly Gly Val Ile Gly Leu Ala
1 5 10 15
atc gcg cta gaa ctg cga tcg cga ggc gcg atg gtg cag gtc tac agt 96
Ile Ala Leu Glu Leu Arg Ser Arg Gly Ala Met Val Gln Val Tyr Ser
20 25 30
caa aac act cag gcg gcg gca ggt cgt gtg gca gca ggg atg ttg gcg 144
Gln Asn Thr Gln Ala Ala Ala Gly Arg Val Ala Ala Gly Met Leu Ala
35 40 45
ccc cag tcg gaa ggc atc gaa gtc ggg ccc atg ctg gat ctg ggg ctg 192
Pro Gln Ser Glu Gly Ile Glu Val Gly Pro Met Leu Asp Leu Gly Leu
50 55 60
cgc agc cga tcg ctc tac gcc cgc tgg acc cag caa ctc gaa caa ctc 240
Arg Ser Arg Ser Leu Tyr Ala Arg Trp Thr Gln Gln Leu Glu Gln Leu
65 70 75 80
agc ggt caa gac agt ggc tac tgg ccc tgc ggc att ttg gtg ccc ctg 288
Ser Gly Gln Asp Ser Gly Tyr Trp Pro Cys Gly Ile Leu Val Pro Leu
85 90 95
agt gag gcc aaa aat cgc gat cgc tat cct cat cca gca gaa tct ccg 336
Ser Glu Ala Lys Asn Arg Asp Arg Tyr Pro His Pro Ala Glu Ser Pro
100 105 110
ggg caa tgg ctc tcg gca gcg gac tta cga gac ttt cag ccc gca cta 384
Gly Gln Trp Leu Ser Ala Ala Asp Leu Arg Asp Phe Gln Pro Ala Leu
115 120 125
tgc tct gac cta atc ggt ggc tgg tgg ttt tcc caa gaa ggg caa gtt 432
Cys Ser Asp Leu Ile Gly Gly Trp Trp Phe Ser Gln Glu Gly Gln Val
130 135 140
gat agt cgc cgt gcc ctg tat cca gcg ctg cga gcc gcc gcg atc gcc 480
Asp Ser Arg Arg Ala Leu Tyr Pro Ala Leu Arg Ala Ala Ala Ile Ala
145 150 155 160
agt ggc gtc acg atc cat gaa agc gtg gcg ctg cgg gag tta tct gta 528
Ser Gly Val Thr Ile His Glu Ser Val Ala Leu Arg Glu Leu Ser Val
165 170 175
aca ggc gat cgc ctg caa tcc gcg atg acc gat cgc ggg cca gtt caa 576
Thr Gly Asp Arg Leu Gln Ser Ala Met Thr Asp Arg Gly Pro Val Gln
180 185 190
gct gac gcc tac gtt ctg gca acc ggc gct tgg tcc ggc gac tgg cta 624
Ala Asp Ala Tyr Val Leu Ala Thr Gly Ala Trp Ser Gly Asp Trp Leu
195 200 205
caa ctg ccg gtc tat ccc gtt aaa ggc caa atg ttc tcg ctg caa gct 672
Gln Leu Pro Val Tyr Pro Val Lys Gly Gln Met Phe Ser Leu Gln Ala
210 215 220
gac ccg cgt ttg ctg aac cac gtt ttg ttt ggt gag cgg gtg tat att 720
Asp Pro Arg Leu Leu Asn His Val Leu Phe Gly Glu Arg Val Tyr Ile
225 230 235 240
gtg ccg cgc cga gat ggt ctg att gtg gtc ggt gcc acc atg gaa gcg 768
Val Pro Arg Arg Asp Gly Leu Ile Val Val Gly Ala Thr Met Glu Ala
245 250 255
acg gcg gga ttc agg act ggc aac acc gct ggc ccc tta cag agc ttg 816
Thr Ala Gly Phe Arg Thr Gly Asn Thr Ala Gly Pro Leu Gln Ser Leu
260 265 270
atg gcc gag gcg atc gcc ctc gtt ccg gct ctg gcg gac tgt cca ctg 864
Met Ala Glu Ala Ile Ala Leu Val Pro Ala Leu Ala Asp Cys Pro Leu
275 280 285
gtt gaa act tgg tgg gga tac cgt ccc gcg aca cca gat gaa tgg ccg 912
Val Glu Thr Trp Trp Gly Tyr Arg Pro Ala Thr Pro Asp Glu Trp Pro
290 295 300
atc ctg ggg caa ggc ccc gct gag aac tta ttc ttg gcg acc ggc cac 960
Ile Leu Gly Gln Gly Pro Ala Glu Asn Leu Phe Leu Ala Thr Gly His
305 310 315 320
tac cgc aac ggt atg ctg ctc gcc cca att acc gct cag cta ctc gct 1008
Tyr Arg Asn Gly Met Leu Leu Ala Pro Ile Thr Ala Gln Leu Leu Ala
325 330 335
gac caa att ctc gac cac tgc acg gat caa ctg ctt cat gcc ttc cgt 1056
Asp Gln Ile Leu Asp His Cys Thr Asp Gln Leu Leu His Ala Phe Arg
340 345 350
tac gac cgc ttc tcc agc cat gac tcc agc acc cat caa ccc tta ccc 1104
Tyr Asp Arg Phe Ser Ser His Asp Ser Ser Thr His Gln Pro Leu Pro
355 360 365
gct ctt gca ggc ttg tca gcg tca acg ggt cag tga 1140
Ala Leu Ala Gly Leu Ser Ala Ser Thr Gly Gln
370 375
<210> 223
<211> 379
<212> PRT
<213> Synechococcus elongatus
<400> 223
Met Ala Phe Glu Val Ala Val Phe Gly Gly Gly Val Ile Gly Leu Ala
1 5 10 15
Ile Ala Leu Glu Leu Arg Ser Arg Gly Ala Met Val Gln Val Tyr Ser
20 25 30
Gln Asn Thr Gln Ala Ala Ala Gly Arg Val Ala Ala Gly Met Leu Ala
35 40 45
Pro Gln Ser Glu Gly Ile Glu Val Gly Pro Met Leu Asp Leu Gly Leu
50 55 60
Arg Ser Arg Ser Leu Tyr Ala Arg Trp Thr Gln Gln Leu Glu Gln Leu
65 70 75 80
Ser Gly Gln Asp Ser Gly Tyr Trp Pro Cys Gly Ile Leu Val Pro Leu
85 90 95
Ser Glu Ala Lys Asn Arg Asp Arg Tyr Pro His Pro Ala Glu Ser Pro
100 105 110
Gly Gln Trp Leu Ser Ala Ala Asp Leu Arg Asp Phe Gln Pro Ala Leu
115 120 125
Cys Ser Asp Leu Ile Gly Gly Trp Trp Phe Ser Gln Glu Gly Gln Val
130 135 140
Asp Ser Arg Arg Ala Leu Tyr Pro Ala Leu Arg Ala Ala Ala Ile Ala
145 150 155 160
Ser Gly Val Thr Ile His Glu Ser Val Ala Leu Arg Glu Leu Ser Val
165 170 175
Thr Gly Asp Arg Leu Gln Ser Ala Met Thr Asp Arg Gly Pro Val Gln
180 185 190
Ala Asp Ala Tyr Val Leu Ala Thr Gly Ala Trp Ser Gly Asp Trp Leu
195 200 205
Gln Leu Pro Val Tyr Pro Val Lys Gly Gln Met Phe Ser Leu Gln Ala
210 215 220
Asp Pro Arg Leu Leu Asn His Val Leu Phe Gly Glu Arg Val Tyr Ile
225 230 235 240
Val Pro Arg Arg Asp Gly Leu Ile Val Val Gly Ala Thr Met Glu Ala
245 250 255
Thr Ala Gly Phe Arg Thr Gly Asn Thr Ala Gly Pro Leu Gln Ser Leu
260 265 270
Met Ala Glu Ala Ile Ala Leu Val Pro Ala Leu Ala Asp Cys Pro Leu
275 280 285
Val Glu Thr Trp Trp Gly Tyr Arg Pro Ala Thr Pro Asp Glu Trp Pro
290 295 300
Ile Leu Gly Gln Gly Pro Ala Glu Asn Leu Phe Leu Ala Thr Gly His
305 310 315 320
Tyr Arg Asn Gly Met Leu Leu Ala Pro Ile Thr Ala Gln Leu Leu Ala
325 330 335
Asp Gln Ile Leu Asp His Cys Thr Asp Gln Leu Leu His Ala Phe Arg
340 345 350
Tyr Asp Arg Phe Ser Ser His Asp Ser Ser Thr His Gln Pro Leu Pro
355 360 365
Ala Leu Ala Gly Leu Ser Ala Ser Thr Gly Gln
370 375
<210> 224
<211> 801
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(801)
<223> ThiD gene from E. coli encoding phosphohydroxymethylpyrimidine
kinase
<400> 224
atg aaa cga att aac gct ctg acg att gcc ggt act gat ccg agt ggt 48
Met Lys Arg Ile Asn Ala Leu Thr Ile Ala Gly Thr Asp Pro Ser Gly
1 5 10 15
ggt gcg ggg att cag gcc gat ctt aaa acc ttc tcg gca ctt ggc gct 96
Gly Ala Gly Ile Gln Ala Asp Leu Lys Thr Phe Ser Ala Leu Gly Ala
20 25 30
tat ggt tgc tca gtt att act gca ctg gtg gcg caa aat acc cgt ggc 144
Tyr Gly Cys Ser Val Ile Thr Ala Leu Val Ala Gln Asn Thr Arg Gly
35 40 45
gta cag tcg gtg tat cgc att gag cct gat ttt gtc gcc gcc cag ctc 192
Val Gln Ser Val Tyr Arg Ile Glu Pro Asp Phe Val Ala Ala Gln Leu
50 55 60
gat tcg gtg ttc agc gat gtg cga atc gat acc act aaa atc ggt atg 240
Asp Ser Val Phe Ser Asp Val Arg Ile Asp Thr Thr Lys Ile Gly Met
65 70 75 80
ctg gcg gaa acc gat att gtt gaa gcg gtg gca gaa cgg ttg caa cgt 288
Leu Ala Glu Thr Asp Ile Val Glu Ala Val Ala Glu Arg Leu Gln Arg
85 90 95
tat cag atc caa aac gtg gta ctc gac acc gtt atg ctg gca aaa agc 336
Tyr Gln Ile Gln Asn Val Val Leu Asp Thr Val Met Leu Ala Lys Ser
100 105 110
ggc gac ccg ctg ctt tca cct tcg gcg gtt gct acg ctg cgc agt cga 384
Gly Asp Pro Leu Leu Ser Pro Ser Ala Val Ala Thr Leu Arg Ser Arg
115 120 125
tta ttg cca cag gtt tca tta ata acg cca aac ttg ccc gaa gct gcc 432
Leu Leu Pro Gln Val Ser Leu Ile Thr Pro Asn Leu Pro Glu Ala Ala
130 135 140
gcc ttg ctc gac gcg cca cac gcg cgc acc gaa cag gaa atg ctg gaa 480
Ala Leu Leu Asp Ala Pro His Ala Arg Thr Glu Gln Glu Met Leu Glu
145 150 155 160
caa ggg cga tcg ctg ttg gcg atg ggc tgt ggc gca gtg cta atg aaa 528
Gln Gly Arg Ser Leu Leu Ala Met Gly Cys Gly Ala Val Leu Met Lys
165 170 175
ggt ggt cat ctg gat gat gag caa agc ccg gac tgg ctg ttt acc cgc 576
Gly Gly His Leu Asp Asp Glu Gln Ser Pro Asp Trp Leu Phe Thr Arg
180 185 190
gag ggt gaa caa cgg ttt acc gca ccg cgc att atg acc aaa aac acc 624
Glu Gly Glu Gln Arg Phe Thr Ala Pro Arg Ile Met Thr Lys Asn Thr
195 200 205
cac ggc act ggt tgt aca ctc tct gcg gcg ttg gct gca cta cgc ccg 672
His Gly Thr Gly Cys Thr Leu Ser Ala Ala Leu Ala Ala Leu Arg Pro
210 215 220
cgc cat aca aac tgg gct gac acc gta cag gag gca aaa agc tgg ctt 720
Arg His Thr Asn Trp Ala Asp Thr Val Gln Glu Ala Lys Ser Trp Leu
225 230 235 240
tca tcg gcg tta gcc cag gcc gac acg ctg gaa gtt ggt cac ggt att 768
Ser Ser Ala Leu Ala Gln Ala Asp Thr Leu Glu Val Gly His Gly Ile
245 250 255
ggt ccg gtt cac cac ttc cac gcc tgg tgg tga 801
Gly Pro Val His His Phe His Ala Trp Trp
260 265
<210> 225
<211> 266
<212> PRT
<213> Escherichia coli
<400> 225
Met Lys Arg Ile Asn Ala Leu Thr Ile Ala Gly Thr Asp Pro Ser Gly
1 5 10 15
Gly Ala Gly Ile Gln Ala Asp Leu Lys Thr Phe Ser Ala Leu Gly Ala
20 25 30
Tyr Gly Cys Ser Val Ile Thr Ala Leu Val Ala Gln Asn Thr Arg Gly
35 40 45
Val Gln Ser Val Tyr Arg Ile Glu Pro Asp Phe Val Ala Ala Gln Leu
50 55 60
Asp Ser Val Phe Ser Asp Val Arg Ile Asp Thr Thr Lys Ile Gly Met
65 70 75 80
Leu Ala Glu Thr Asp Ile Val Glu Ala Val Ala Glu Arg Leu Gln Arg
85 90 95
Tyr Gln Ile Gln Asn Val Val Leu Asp Thr Val Met Leu Ala Lys Ser
100 105 110
Gly Asp Pro Leu Leu Ser Pro Ser Ala Val Ala Thr Leu Arg Ser Arg
115 120 125
Leu Leu Pro Gln Val Ser Leu Ile Thr Pro Asn Leu Pro Glu Ala Ala
130 135 140
Ala Leu Leu Asp Ala Pro His Ala Arg Thr Glu Gln Glu Met Leu Glu
145 150 155 160
Gln Gly Arg Ser Leu Leu Ala Met Gly Cys Gly Ala Val Leu Met Lys
165 170 175
Gly Gly His Leu Asp Asp Glu Gln Ser Pro Asp Trp Leu Phe Thr Arg
180 185 190
Glu Gly Glu Gln Arg Phe Thr Ala Pro Arg Ile Met Thr Lys Asn Thr
195 200 205
His Gly Thr Gly Cys Thr Leu Ser Ala Ala Leu Ala Ala Leu Arg Pro
210 215 220
Arg His Thr Asn Trp Ala Asp Thr Val Gln Glu Ala Lys Ser Trp Leu
225 230 235 240
Ser Ser Ala Leu Ala Gln Ala Asp Thr Leu Glu Val Gly His Gly Ile
245 250 255
Gly Pro Val His His Phe His Ala Trp Trp
260 265
<210> 226
<211> 789
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(789)
<223> ThiM gene from E. coli encoding a Hydroxyethylthiazole kinase
<400> 226
atg caa gtc gac ctg ctg ggt tca gcg caa tct gcg cac gcg tta cac 48
Met Gln Val Asp Leu Leu Gly Ser Ala Gln Ser Ala His Ala Leu His
1 5 10 15
ctt ttt cac caa cat tcc cct ctt gtg cac tgc atg acc aat gat gtg 96
Leu Phe His Gln His Ser Pro Leu Val His Cys Met Thr Asn Asp Val
20 25 30
gtg caa acc ttt acc gcc aat acc ttg ctg gcg ctc ggt gca tcg cca 144
Val Gln Thr Phe Thr Ala Asn Thr Leu Leu Ala Leu Gly Ala Ser Pro
35 40 45
gcg atg gtt atc gaa acc gaa gag gcc agt cag ttt gcg gct atc gcc 192
Ala Met Val Ile Glu Thr Glu Glu Ala Ser Gln Phe Ala Ala Ile Ala
50 55 60
agt gcc ttg ttg att aac gtt ggc aca ctg acg cag cca cgc gct cag 240
Ser Ala Leu Leu Ile Asn Val Gly Thr Leu Thr Gln Pro Arg Ala Gln
65 70 75 80
gcg atg cgt gct gcc gtt gag caa gca aaa agc tct caa aca ccc tgg 288
Ala Met Arg Ala Ala Val Glu Gln Ala Lys Ser Ser Gln Thr Pro Trp
85 90 95
acg ctt gat cca gta gcg gtg ggt gcg ctc gat tat cgc cgc cat ttt 336
Thr Leu Asp Pro Val Ala Val Gly Ala Leu Asp Tyr Arg Arg His Phe
100 105 110
tgt cat gaa ctt tta tct ttt aaa ccg gca gcg ata cgt ggt aat gct 384
Cys His Glu Leu Leu Ser Phe Lys Pro Ala Ala Ile Arg Gly Asn Ala
115 120 125
tcg gaa atc atg gca tta gct ggc att gct aat ggc gga cgg gga gtg 432
Ser Glu Ile Met Ala Leu Ala Gly Ile Ala Asn Gly Gly Arg Gly Val
130 135 140
gat acc act gac gcc gca gct aac gcg ata ccc gct gca caa aca ctg 480
Asp Thr Thr Asp Ala Ala Ala Asn Ala Ile Pro Ala Ala Gln Thr Leu
145 150 155 160
gca cgg gaa act ggc gca atc gtc gtg gtc act ggc gag atg gat tat 528
Ala Arg Glu Thr Gly Ala Ile Val Val Val Thr Gly Glu Met Asp Tyr
165 170 175
gtt acc gat gga cat cgt atc att ggt att cac ggt ggt gat ccg tta 576
Val Thr Asp Gly His Arg Ile Ile Gly Ile His Gly Gly Asp Pro Leu
180 185 190
atg acc aaa gtg gta gga act ggc tgt gca tta tcg gcg gtt gtc gct 624
Met Thr Lys Val Val Gly Thr Gly Cys Ala Leu Ser Ala Val Val Ala
195 200 205
gcc tgc tgt gcg tta cca ggc gat acg ctg gaa aat gtc gca tct gcc 672
Ala Cys Cys Ala Leu Pro Gly Asp Thr Leu Glu Asn Val Ala Ser Ala
210 215 220
tgt cac tgg atg aaa caa gcc gga gaa cgc gca gtc gcc aga agc gag 720
Cys His Trp Met Lys Gln Ala Gly Glu Arg Ala Val Ala Arg Ser Glu
225 230 235 240
ggg cca ggc agt ttt gtt cca cat ttc ctt gat gcg ctc tgg caa ttg 768
Gly Pro Gly Ser Phe Val Pro His Phe Leu Asp Ala Leu Trp Gln Leu
245 250 255
acg cag gag gtg cag gca tga 789
Thr Gln Glu Val Gln Ala
260
<210> 227
<211> 262
<212> PRT
<213> Escherichia coli
<400> 227
Met Gln Val Asp Leu Leu Gly Ser Ala Gln Ser Ala His Ala Leu His
1 5 10 15
Leu Phe His Gln His Ser Pro Leu Val His Cys Met Thr Asn Asp Val
20 25 30
Val Gln Thr Phe Thr Ala Asn Thr Leu Leu Ala Leu Gly Ala Ser Pro
35 40 45
Ala Met Val Ile Glu Thr Glu Glu Ala Ser Gln Phe Ala Ala Ile Ala
50 55 60
Ser Ala Leu Leu Ile Asn Val Gly Thr Leu Thr Gln Pro Arg Ala Gln
65 70 75 80
Ala Met Arg Ala Ala Val Glu Gln Ala Lys Ser Ser Gln Thr Pro Trp
85 90 95
Thr Leu Asp Pro Val Ala Val Gly Ala Leu Asp Tyr Arg Arg His Phe
100 105 110
Cys His Glu Leu Leu Ser Phe Lys Pro Ala Ala Ile Arg Gly Asn Ala
115 120 125
Ser Glu Ile Met Ala Leu Ala Gly Ile Ala Asn Gly Gly Arg Gly Val
130 135 140
Asp Thr Thr Asp Ala Ala Ala Asn Ala Ile Pro Ala Ala Gln Thr Leu
145 150 155 160
Ala Arg Glu Thr Gly Ala Ile Val Val Val Thr Gly Glu Met Asp Tyr
165 170 175
Val Thr Asp Gly His Arg Ile Ile Gly Ile His Gly Gly Asp Pro Leu
180 185 190
Met Thr Lys Val Val Gly Thr Gly Cys Ala Leu Ser Ala Val Val Ala
195 200 205
Ala Cys Cys Ala Leu Pro Gly Asp Thr Leu Glu Asn Val Ala Ser Ala
210 215 220
Cys His Trp Met Lys Gln Ala Gly Glu Arg Ala Val Ala Arg Ser Glu
225 230 235 240
Gly Pro Gly Ser Phe Val Pro His Phe Leu Asp Ala Leu Trp Gln Leu
245 250 255
Thr Gln Glu Val Gln Ala
260
<210> 228
<211> 978
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(978)
<223> ThiL gene from E. coli encoding thiamine-phosphate kinase (Note:
mutation at nucleotides 133-135 (GGT to GAC) encodes a G133D
substitution)
<400> 228
atg gca tgt ggc gag ttc tcc ctg att gcc cgt tat ttt gac cgt gta 48
Met Ala Cys Gly Glu Phe Ser Leu Ile Ala Arg Tyr Phe Asp Arg Val
1 5 10 15
aga agt tct cgt ctt gat gtc gaa ctg ggc atc ggc gac gat tgc gca 96
Arg Ser Ser Arg Leu Asp Val Glu Leu Gly Ile Gly Asp Asp Cys Ala
20 25 30
ctt ctc aat atc ccc gag aaa cag acc ctg gcg atc agc act gat acg 144
Leu Leu Asn Ile Pro Glu Lys Gln Thr Leu Ala Ile Ser Thr Asp Thr
35 40 45
ctg gtg gcg ggt aac cat ttc ctc cct gat atc gat cct gct gat ctg 192
Leu Val Ala Gly Asn His Phe Leu Pro Asp Ile Asp Pro Ala Asp Leu
50 55 60
gct tat aaa gca ctg gcg gtg aac cta agc gat ctg gca gcg atg ggg 240
Ala Tyr Lys Ala Leu Ala Val Asn Leu Ser Asp Leu Ala Ala Met Gly
65 70 75 80
gcc gat ccg gcc tgg ctg acg ctg gca tta acc tta ccg gac gta gac 288
Ala Asp Pro Ala Trp Leu Thr Leu Ala Leu Thr Leu Pro Asp Val Asp
85 90 95
gaa gcg tgg ctt gag tcc ttc agc gac agt ttg ttt gat ctt ctc aat 336
Glu Ala Trp Leu Glu Ser Phe Ser Asp Ser Leu Phe Asp Leu Leu Asn
100 105 110
tat tac gat atg caa ctc att ggc ggc gat acc acg cgt ggg cca tta 384
Tyr Tyr Asp Met Gln Leu Ile Gly Gly Asp Thr Thr Arg Gly Pro Leu
115 120 125
tca atg acg ttg ggt atc cac ggc ttt gtt ccg atg gga cga gcc tta 432
Ser Met Thr Leu Gly Ile His Gly Phe Val Pro Met Gly Arg Ala Leu
130 135 140
acg cgc tct ggg gcg aaa ccg ggt gac tgg atc tat gtg acc ggt aca 480
Thr Arg Ser Gly Ala Lys Pro Gly Asp Trp Ile Tyr Val Thr Gly Thr
145 150 155 160
ccg ggc gat agc gcc gcc ggg ctg gcg att ttg caa aac cgt ttg cag 528
Pro Gly Asp Ser Ala Ala Gly Leu Ala Ile Leu Gln Asn Arg Leu Gln
165 170 175
gtt gcc gat gct aaa gat gcg gac tac ttg atc aaa cgt cat ctc cgt 576
Val Ala Asp Ala Lys Asp Ala Asp Tyr Leu Ile Lys Arg His Leu Arg
180 185 190
cca tcg ccg cgt att tta cag ggg cag gca ctg cgc gat ctg gca aat 624
Pro Ser Pro Arg Ile Leu Gln Gly Gln Ala Leu Arg Asp Leu Ala Asn
195 200 205
tca gcc atc gat ctc tct gac ggt ttg att tcc gat ctc ggg cat atc 672
Ser Ala Ile Asp Leu Ser Asp Gly Leu Ile Ser Asp Leu Gly His Ile
210 215 220
gtg aaa gcc agc gac tgc ggc gca cgt att gac ctg gca ttg ctg ccg 720
Val Lys Ala Ser Asp Cys Gly Ala Arg Ile Asp Leu Ala Leu Leu Pro
225 230 235 240
ttt tct gat gcg ctt tct cgc cat gtt gaa ccg gaa cag gcg ctg cgc 768
Phe Ser Asp Ala Leu Ser Arg His Val Glu Pro Glu Gln Ala Leu Arg
245 250 255
tgg gcg ctc tct ggc ggt gaa gat tac gag ttg tgt ttc act gtg ccg 816
Trp Ala Leu Ser Gly Gly Glu Asp Tyr Glu Leu Cys Phe Thr Val Pro
260 265 270
gaa ctg aac cgt ggc gcg ctg gat gtg gct ctc gga cac ctg ggc gta 864
Glu Leu Asn Arg Gly Ala Leu Asp Val Ala Leu Gly His Leu Gly Val
275 280 285
ccg ttt acc tgt atc ggg caa atg acc gcc gat atc gaa ggg ctt tgt 912
Pro Phe Thr Cys Ile Gly Gln Met Thr Ala Asp Ile Glu Gly Leu Cys
290 295 300
ttt att cgt gac ggc gaa cct gtt aca tta gac tgg aaa gga tat gac 960
Phe Ile Arg Asp Gly Glu Pro Val Thr Leu Asp Trp Lys Gly Tyr Asp
305 310 315 320
cat ttt gcc acg cca taa 978
His Phe Ala Thr Pro
325
<210> 229
<211> 325
<212> PRT
<213> Escherichia coli
<400> 229
Met Ala Cys Gly Glu Phe Ser Leu Ile Ala Arg Tyr Phe Asp Arg Val
1 5 10 15
Arg Ser Ser Arg Leu Asp Val Glu Leu Gly Ile Gly Asp Asp Cys Ala
20 25 30
Leu Leu Asn Ile Pro Glu Lys Gln Thr Leu Ala Ile Ser Thr Asp Thr
35 40 45
Leu Val Ala Gly Asn His Phe Leu Pro Asp Ile Asp Pro Ala Asp Leu
50 55 60
Ala Tyr Lys Ala Leu Ala Val Asn Leu Ser Asp Leu Ala Ala Met Gly
65 70 75 80
Ala Asp Pro Ala Trp Leu Thr Leu Ala Leu Thr Leu Pro Asp Val Asp
85 90 95
Glu Ala Trp Leu Glu Ser Phe Ser Asp Ser Leu Phe Asp Leu Leu Asn
100 105 110
Tyr Tyr Asp Met Gln Leu Ile Gly Gly Asp Thr Thr Arg Gly Pro Leu
115 120 125
Ser Met Thr Leu Gly Ile His Gly Phe Val Pro Met Gly Arg Ala Leu
130 135 140
Thr Arg Ser Gly Ala Lys Pro Gly Asp Trp Ile Tyr Val Thr Gly Thr
145 150 155 160
Pro Gly Asp Ser Ala Ala Gly Leu Ala Ile Leu Gln Asn Arg Leu Gln
165 170 175
Val Ala Asp Ala Lys Asp Ala Asp Tyr Leu Ile Lys Arg His Leu Arg
180 185 190
Pro Ser Pro Arg Ile Leu Gln Gly Gln Ala Leu Arg Asp Leu Ala Asn
195 200 205
Ser Ala Ile Asp Leu Ser Asp Gly Leu Ile Ser Asp Leu Gly His Ile
210 215 220
Val Lys Ala Ser Asp Cys Gly Ala Arg Ile Asp Leu Ala Leu Leu Pro
225 230 235 240
Phe Ser Asp Ala Leu Ser Arg His Val Glu Pro Glu Gln Ala Leu Arg
245 250 255
Trp Ala Leu Ser Gly Gly Glu Asp Tyr Glu Leu Cys Phe Thr Val Pro
260 265 270
Glu Leu Asn Arg Gly Ala Leu Asp Val Ala Leu Gly His Leu Gly Val
275 280 285
Pro Phe Thr Cys Ile Gly Gln Met Thr Ala Asp Ile Glu Gly Leu Cys
290 295 300
Phe Ile Arg Asp Gly Glu Pro Val Thr Leu Asp Trp Lys Gly Tyr Asp
305 310 315 320
His Phe Ala Thr Pro
325
<210> 230
<211> 47
<212> DNA
<213> Escherichia coli
<220>
<221> promoter
<222> (1)..(47)
<223> apFAB46 promoter
<400> 230
aaaaagagta ttgacttcgc atctttttgt acctataata gattcat 47
<210> 231
<211> 37
<212> DNA
<213> Escherichia coli
<220>
<221> promoter
<222> (1)..(37)
<223> apFAB70 promoter
<400> 231
ttgacatcgc atctttttgt acctataatg tgtggat 37
<210> 232
<211> 37
<212> DNA
<213> Escherichia coli
<220>
<221> promoter
<222> (1)..(37)
<223> apFAB71 promoter
<400> 232
ttgacatcgc atctttttgt acctataata gattcat 37
<210> 233
<211> 29
<212> DNA
<213> Escherichia coli
<220>
<221> promoter
<222> (1)..(29)
<223> pBAD Ara promoter
<400> 233
ctgacgcttt ttatcgcaac tctctactg 29
<210> 234
<211> 74
<212> DNA
<213> Escherichia coli
<220>
<221> promoter
<222> (1)..(74)
<223> lac Promoter with lacO operator site
<400> 234
agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa 60
gcataaagtg taaa 74
<210> 235
<211> 91
<212> DNA
<213> Escherichia coli
<220>
<221> terminator
<222> (1)..(91)
<223> apFAB378 terminator
<400> 235
gagttggtag ctcttgatcc ggcaaacaaa ccaccgttgg tagcggtggt ttttttgttt 60
gcaagcagca gattacgcgc agaaaaaaag g 91
<210> 236
<211> 91
<212> DNA
<213> Escherichia coli
<220>
<221> terminator
<222> (1)..(91)
<223> apFAB377 terminator
<400> 236
atgaccatct acattactga gctaataaca ggcctgctgg taatcgcagg cctttttatt 60
tgggggagag ggaagtcatg aaaaaactaa c 91
<210> 237
<211> 90
<212> DNA
<213> Escherichia coli
<220>
<221> terminator
<222> (1)..(90)
<223> apFAB381 terminator
<400> 237
accctcaaga gaaaatgtaa ccaactcact ggctcacctt cacgggtggg cctttcttcg 60
ttccgggcat taaccctcac taacaggaga 90
<210> 238
<211> 258
<212> DNA
<213> Synthetic
<220>
<221> CDS
<222> (1)..(258)
<223> Synthetic gene encoding a E2 hybrid polypeptide (subunit of
pyruvate dehydrogenase)
<400> 238
atg gct atc gaa atc aaa gta ccg gac atc ggg gct gat gaa gtt gaa 48
Met Ala Ile Glu Ile Lys Val Pro Asp Ile Gly Ala Asp Glu Val Glu
1 5 10 15
atc acc gag atc ctg gtc aaa gtg ggc gac aaa gtt gaa gcc gaa cag 96
Ile Thr Glu Ile Leu Val Lys Val Gly Asp Lys Val Glu Ala Glu Gln
20 25 30
tcg ctg atc acc gta gaa ggc gac aaa gct tct atg gaa gtt ccg gcg 144
Ser Leu Ile Thr Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ala
35 40 45
ccg ttt gca ggc gtc gtg aag gaa ctg aaa gtc aac gtt ggc gat aaa 192
Pro Phe Ala Gly Val Val Lys Glu Leu Lys Val Asn Val Gly Asp Lys
50 55 60
gtg aaa act ggc tcg ctg att atg atc ttc gaa gtt gaa ggc gca gcg 240
Val Lys Thr Gly Ser Leu Ile Met Ile Phe Glu Val Glu Gly Ala Ala
65 70 75 80
cct gcg gca gct cct gcg 258
Pro Ala Ala Ala Pro Ala
85
<210> 239
<211> 86
<212> PRT
<213> Synthetic
<400> 239
Met Ala Ile Glu Ile Lys Val Pro Asp Ile Gly Ala Asp Glu Val Glu
1 5 10 15
Ile Thr Glu Ile Leu Val Lys Val Gly Asp Lys Val Glu Ala Glu Gln
20 25 30
Ser Leu Ile Thr Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ala
35 40 45
Pro Phe Ala Gly Val Val Lys Glu Leu Lys Val Asn Val Gly Asp Lys
50 55 60
Val Lys Thr Gly Ser Leu Ile Met Ile Phe Glu Val Glu Gly Ala Ala
65 70 75 80
Pro Ala Ala Ala Pro Ala
85
<210> 240
<211> 747
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(747)
<223> fpr gene_E. coli encoding a flavodoxin/ferredoxin reductase
<400> 240
atg gct gat tgg gta aca ggc aaa gtc act aaa gtg cag aac tgg acc 48
Met Ala Asp Trp Val Thr Gly Lys Val Thr Lys Val Gln Asn Trp Thr
1 5 10 15
gac gcc ctg ttt agt ctc acc gtt cac gcc ccc gtg ctt ccg ttt acc 96
Asp Ala Leu Phe Ser Leu Thr Val His Ala Pro Val Leu Pro Phe Thr
20 25 30
gcc ggg caa ttt acc aag ctt ggc ctt gaa atc gac ggc gaa cgc gtc 144
Ala Gly Gln Phe Thr Lys Leu Gly Leu Glu Ile Asp Gly Glu Arg Val
35 40 45
cag cgc gcc tac tcc tat gta aac tcg ccc gat aat ccc gat ctg gag 192
Gln Arg Ala Tyr Ser Tyr Val Asn Ser Pro Asp Asn Pro Asp Leu Glu
50 55 60
ttt tac ctg gtc acc gtc ccc gat ggc aaa tta agc cca cga ctg gcg 240
Phe Tyr Leu Val Thr Val Pro Asp Gly Lys Leu Ser Pro Arg Leu Ala
65 70 75 80
gca ctg aaa cca ggc gat gaa gtg cag gtg gtt agc gaa gcg gca gga 288
Ala Leu Lys Pro Gly Asp Glu Val Gln Val Val Ser Glu Ala Ala Gly
85 90 95
ttc ttt gtg ctc gat gaa gtg ccg cac tgc gaa acg cta tgg atg ctg 336
Phe Phe Val Leu Asp Glu Val Pro His Cys Glu Thr Leu Trp Met Leu
100 105 110
gca acc ggt aca gcg att ggc cct tat tta tcg att ctg caa cta ggt 384
Ala Thr Gly Thr Ala Ile Gly Pro Tyr Leu Ser Ile Leu Gln Leu Gly
115 120 125
aaa gat tta gat cgc ttc aaa aat ctg gtc ctg gtg cac gcc gca cgt 432
Lys Asp Leu Asp Arg Phe Lys Asn Leu Val Leu Val His Ala Ala Arg
130 135 140
tat gcc gcc gac tta agc tat ttg cca ctg atg cag gaa ctg gaa aaa 480
Tyr Ala Ala Asp Leu Ser Tyr Leu Pro Leu Met Gln Glu Leu Glu Lys
145 150 155 160
cgc tac gaa gga aaa ctg cgc att cag acg gtg gtc agt cgg gaa acg 528
Arg Tyr Glu Gly Lys Leu Arg Ile Gln Thr Val Val Ser Arg Glu Thr
165 170 175
gca gcg ggg tcg ctc acc gga cgg ata ccg gca tta att gaa agt ggg 576
Ala Ala Gly Ser Leu Thr Gly Arg Ile Pro Ala Leu Ile Glu Ser Gly
180 185 190
gaa ctg gaa agc acg att ggc ctg ccg atg aat aaa gaa acc agc cat 624
Glu Leu Glu Ser Thr Ile Gly Leu Pro Met Asn Lys Glu Thr Ser His
195 200 205
gtg atg ctg tgc ggc aat cca cag atg gtg cgc gat aca caa cag ttg 672
Val Met Leu Cys Gly Asn Pro Gln Met Val Arg Asp Thr Gln Gln Leu
210 215 220
ctg aaa gag acc cgg cag atg acg aaa cat tta cgt cgc cga ccg ggc 720
Leu Lys Glu Thr Arg Gln Met Thr Lys His Leu Arg Arg Arg Pro Gly
225 230 235 240
cat atg aca gcg gag cat tac tgg taa 747
His Met Thr Ala Glu His Tyr Trp
245
<210> 241
<211> 248
<212> PRT
<213> Escherichia coli
<400> 241
Met Ala Asp Trp Val Thr Gly Lys Val Thr Lys Val Gln Asn Trp Thr
1 5 10 15
Asp Ala Leu Phe Ser Leu Thr Val His Ala Pro Val Leu Pro Phe Thr
20 25 30
Ala Gly Gln Phe Thr Lys Leu Gly Leu Glu Ile Asp Gly Glu Arg Val
35 40 45
Gln Arg Ala Tyr Ser Tyr Val Asn Ser Pro Asp Asn Pro Asp Leu Glu
50 55 60
Phe Tyr Leu Val Thr Val Pro Asp Gly Lys Leu Ser Pro Arg Leu Ala
65 70 75 80
Ala Leu Lys Pro Gly Asp Glu Val Gln Val Val Ser Glu Ala Ala Gly
85 90 95
Phe Phe Val Leu Asp Glu Val Pro His Cys Glu Thr Leu Trp Met Leu
100 105 110
Ala Thr Gly Thr Ala Ile Gly Pro Tyr Leu Ser Ile Leu Gln Leu Gly
115 120 125
Lys Asp Leu Asp Arg Phe Lys Asn Leu Val Leu Val His Ala Ala Arg
130 135 140
Tyr Ala Ala Asp Leu Ser Tyr Leu Pro Leu Met Gln Glu Leu Glu Lys
145 150 155 160
Arg Tyr Glu Gly Lys Leu Arg Ile Gln Thr Val Val Ser Arg Glu Thr
165 170 175
Ala Ala Gly Ser Leu Thr Gly Arg Ile Pro Ala Leu Ile Glu Ser Gly
180 185 190
Glu Leu Glu Ser Thr Ile Gly Leu Pro Met Asn Lys Glu Thr Ser His
195 200 205
Val Met Leu Cys Gly Asn Pro Gln Met Val Arg Asp Thr Gln Gln Leu
210 215 220
Leu Lys Glu Thr Arg Gln Met Thr Lys His Leu Arg Arg Arg Pro Gly
225 230 235 240
His Met Thr Ala Glu His Tyr Trp
245
<210> 242
<211> 999
<212> DNA
<213> Bacillus subtilis
<220>
<221> CDS
<222> (1)..(999)
<223> YumC gene_B. subtilis encoding a flavodoxin/ferredoxin reductase
<400> 242
atg cga gag gat aca aag gtt tat gat att aca att ata ggc ggg gga 48
Met Arg Glu Asp Thr Lys Val Tyr Asp Ile Thr Ile Ile Gly Gly Gly
1 5 10 15
ccg gtc ggc tta ttc acc gct ttt tac ggc ggg atg aga cag gca agc 96
Pro Val Gly Leu Phe Thr Ala Phe Tyr Gly Gly Met Arg Gln Ala Ser
20 25 30
gtc aaa att atc gaa agc ctg cct cag ctc ggc gga cag ctt agc gcc 144
Val Lys Ile Ile Glu Ser Leu Pro Gln Leu Gly Gly Gln Leu Ser Ala
35 40 45
cta tac cct gag aag tat ata tat gat gta gcg gga ttc ccg aaa atc 192
Leu Tyr Pro Glu Lys Tyr Ile Tyr Asp Val Ala Gly Phe Pro Lys Ile
50 55 60
cgc gcg caa gag ctt atc aat aac cta aaa gag caa atg gcg aaa ttc 240
Arg Ala Gln Glu Leu Ile Asn Asn Leu Lys Glu Gln Met Ala Lys Phe
65 70 75 80
gac caa acc att tgt ctg gag caa gcg gtt gaa tct gtt gag aaa caa 288
Asp Gln Thr Ile Cys Leu Glu Gln Ala Val Glu Ser Val Glu Lys Gln
85 90 95
gcg gac ggc gtg ttt aag ctt gta aca aat gaa gaa acc cac tac tct 336
Ala Asp Gly Val Phe Lys Leu Val Thr Asn Glu Glu Thr His Tyr Ser
100 105 110
aaa acg gtc atc ata act gca gga aac ggc gca ttc aaa ccg aga aag 384
Lys Thr Val Ile Ile Thr Ala Gly Asn Gly Ala Phe Lys Pro Arg Lys
115 120 125
ctg gaa ctt gaa aat gcc gag cag tat gaa ggc aaa aac ctc cat tac 432
Leu Glu Leu Glu Asn Ala Glu Gln Tyr Glu Gly Lys Asn Leu His Tyr
130 135 140
ttc gtt gat gat ctg caa aaa ttc gcc ggc aga cgc gtt gcg atc ctt 480
Phe Val Asp Asp Leu Gln Lys Phe Ala Gly Arg Arg Val Ala Ile Leu
145 150 155 160
ggc ggt gga gat tcc gcg gtt gac tgg gcg ctt atg ctt gag cca atc 528
Gly Gly Gly Asp Ser Ala Val Asp Trp Ala Leu Met Leu Glu Pro Ile
165 170 175
gca aaa gaa gta tcg atc att cac cgc cgc gac aag ttc cga gcg cac 576
Ala Lys Glu Val Ser Ile Ile His Arg Arg Asp Lys Phe Arg Ala His
180 185 190
gag cac agt gtg gaa aac ctt cat gcg tcg aag gtt aat gtc ctg aca 624
Glu His Ser Val Glu Asn Leu His Ala Ser Lys Val Asn Val Leu Thr
195 200 205
cca ttc gtc cct gcg gag ctg atc ggc gaa gac aaa atc gaa cag cta 672
Pro Phe Val Pro Ala Glu Leu Ile Gly Glu Asp Lys Ile Glu Gln Leu
210 215 220
gtg ctt gaa gaa gtg aaa ggc gac cgc aaa gag att tta gaa att gat 720
Val Leu Glu Glu Val Lys Gly Asp Arg Lys Glu Ile Leu Glu Ile Asp
225 230 235 240
gac tta atc gtc aac tac ggt ttc gtt tca tct ctt gga ccg atc aaa 768
Asp Leu Ile Val Asn Tyr Gly Phe Val Ser Ser Leu Gly Pro Ile Lys
245 250 255
aac tgg ggc ctg gac atc gag aaa aat tcc att gtc gtg aaa tca aca 816
Asn Trp Gly Leu Asp Ile Glu Lys Asn Ser Ile Val Val Lys Ser Thr
260 265 270
atg gaa aca aat atc gaa ggc ttc ttt gca gca ggt gac att tgt aca 864
Met Glu Thr Asn Ile Glu Gly Phe Phe Ala Ala Gly Asp Ile Cys Thr
275 280 285
tac gaa gga aaa gtc aac ctg att gcc agc ggc ttc ggc gag gca ccg 912
Tyr Glu Gly Lys Val Asn Leu Ile Ala Ser Gly Phe Gly Glu Ala Pro
290 295 300
aca gca gtg aac aac gcc aag gct tac atg gac ccg aaa gcc cgc gta 960
Thr Ala Val Asn Asn Ala Lys Ala Tyr Met Asp Pro Lys Ala Arg Val
305 310 315 320
cag cct ctt cac tca aca agt ctt ttt gaa aat aaa taa 999
Gln Pro Leu His Ser Thr Ser Leu Phe Glu Asn Lys
325 330
<210> 243
<211> 332
<212> PRT
<213> Bacillus subtilis
<400> 243
Met Arg Glu Asp Thr Lys Val Tyr Asp Ile Thr Ile Ile Gly Gly Gly
1 5 10 15
Pro Val Gly Leu Phe Thr Ala Phe Tyr Gly Gly Met Arg Gln Ala Ser
20 25 30
Val Lys Ile Ile Glu Ser Leu Pro Gln Leu Gly Gly Gln Leu Ser Ala
35 40 45
Leu Tyr Pro Glu Lys Tyr Ile Tyr Asp Val Ala Gly Phe Pro Lys Ile
50 55 60
Arg Ala Gln Glu Leu Ile Asn Asn Leu Lys Glu Gln Met Ala Lys Phe
65 70 75 80
Asp Gln Thr Ile Cys Leu Glu Gln Ala Val Glu Ser Val Glu Lys Gln
85 90 95
Ala Asp Gly Val Phe Lys Leu Val Thr Asn Glu Glu Thr His Tyr Ser
100 105 110
Lys Thr Val Ile Ile Thr Ala Gly Asn Gly Ala Phe Lys Pro Arg Lys
115 120 125
Leu Glu Leu Glu Asn Ala Glu Gln Tyr Glu Gly Lys Asn Leu His Tyr
130 135 140
Phe Val Asp Asp Leu Gln Lys Phe Ala Gly Arg Arg Val Ala Ile Leu
145 150 155 160
Gly Gly Gly Asp Ser Ala Val Asp Trp Ala Leu Met Leu Glu Pro Ile
165 170 175
Ala Lys Glu Val Ser Ile Ile His Arg Arg Asp Lys Phe Arg Ala His
180 185 190
Glu His Ser Val Glu Asn Leu His Ala Ser Lys Val Asn Val Leu Thr
195 200 205
Pro Phe Val Pro Ala Glu Leu Ile Gly Glu Asp Lys Ile Glu Gln Leu
210 215 220
Val Leu Glu Glu Val Lys Gly Asp Arg Lys Glu Ile Leu Glu Ile Asp
225 230 235 240
Asp Leu Ile Val Asn Tyr Gly Phe Val Ser Ser Leu Gly Pro Ile Lys
245 250 255
Asn Trp Gly Leu Asp Ile Glu Lys Asn Ser Ile Val Val Lys Ser Thr
260 265 270
Met Glu Thr Asn Ile Glu Gly Phe Phe Ala Ala Gly Asp Ile Cys Thr
275 280 285
Tyr Glu Gly Lys Val Asn Leu Ile Ala Ser Gly Phe Gly Glu Ala Pro
290 295 300
Thr Ala Val Asn Asn Ala Lys Ala Tyr Met Asp Pro Lys Ala Arg Val
305 310 315 320
Gln Pro Leu His Ser Thr Ser Leu Phe Glu Asn Lys
325 330
<210> 244
<211> 780
<212> DNA
<213> Pseudomonas pudita
<220>
<221> CDS
<222> (1)..(780)
<223> fpr-1 gene from Pseudomonas putida KT2440 encoding a
flavodoxin/ferredoxin reductase
<400> 244
atg agc aac atg aac cac gaa cgt gtc ctc agt gtg cac cac tgg aac 48
Met Ser Asn Met Asn His Glu Arg Val Leu Ser Val His His Trp Asn
1 5 10 15
gac acc ctg ttc agc ttc aag tgc acc cgc gac ccg ggc ctg cgc ttc 96
Asp Thr Leu Phe Ser Phe Lys Cys Thr Arg Asp Pro Gly Leu Arg Phe
20 25 30
gag aac ggt cag ttc gtg atg atc ggc ctg cag cag gac aac ggc cgt 144
Glu Asn Gly Gln Phe Val Met Ile Gly Leu Gln Gln Asp Asn Gly Arg
35 40 45
ccg ctc atg cgt gcc tac tcc atc gct tcg cca aac tgg gaa gag cac 192
Pro Leu Met Arg Ala Tyr Ser Ile Ala Ser Pro Asn Trp Glu Glu His
50 55 60
ctt gaa ttc ttc agc atc aag gtg ccg gac ggc ccg ctg acc tcg cag 240
Leu Glu Phe Phe Ser Ile Lys Val Pro Asp Gly Pro Leu Thr Ser Gln
65 70 75 80
ctg cag cac ctg aag gaa ggc gat gag atc atc atc agc aag aag cct 288
Leu Gln His Leu Lys Glu Gly Asp Glu Ile Ile Ile Ser Lys Lys Pro
85 90 95
acc ggc acc ctg gtc ctc gac gac ctg aat cct ggc aag cac ctg tac 336
Thr Gly Thr Leu Val Leu Asp Asp Leu Asn Pro Gly Lys His Leu Tyr
100 105 110
ctg ctg agc acc ggc act ggt ctg gcg ccg ttc atg agc gtc atc cag 384
Leu Leu Ser Thr Gly Thr Gly Leu Ala Pro Phe Met Ser Val Ile Gln
115 120 125
gac ccg gaa acc tac gag cgc ttt gaa aaa gtg atc ctg gtg cac ggc 432
Asp Pro Glu Thr Tyr Glu Arg Phe Glu Lys Val Ile Leu Val His Gly
130 135 140
gtg cgc tat gtg aac gaa gtg gcc tac cgc gag ttc atc acc gag cac 480
Val Arg Tyr Val Asn Glu Val Ala Tyr Arg Glu Phe Ile Thr Glu His
145 150 155 160
ctg ccg cag aac gag ttc ttc ggt gag tcg gtt cgc gac aag ctg atc 528
Leu Pro Gln Asn Glu Phe Phe Gly Glu Ser Val Arg Asp Lys Leu Ile
165 170 175
tac tac ccg acc gtg acc cgc gag ccg ttc gaa aac cag ggc cgt ctg 576
Tyr Tyr Pro Thr Val Thr Arg Glu Pro Phe Glu Asn Gln Gly Arg Leu
180 185 190
acc gac ctg atg cgc agc ggc aag ctg ttc agc gac atc ggc ctg ccg 624
Thr Asp Leu Met Arg Ser Gly Lys Leu Phe Ser Asp Ile Gly Leu Pro
195 200 205
ccg atc aac ccg caa gac gac cgc gcg atg atc tgc ggc agc ccg agc 672
Pro Ile Asn Pro Gln Asp Asp Arg Ala Met Ile Cys Gly Ser Pro Ser
210 215 220
atg ctc gac gag acc agc gaa gtg ctg gac agc ttc ggc ctg aag atc 720
Met Leu Asp Glu Thr Ser Glu Val Leu Asp Ser Phe Gly Leu Lys Ile
225 230 235 240
tcc ccg cgc atg cgc gag ccg ggt gac tac ctg atc gaa cgt gcc ttc 768
Ser Pro Arg Met Arg Glu Pro Gly Asp Tyr Leu Ile Glu Arg Ala Phe
245 250 255
gtc gag aag taa 780
Val Glu Lys
<210> 245
<211> 259
<212> PRT
<213> Pseudomonas pudita
<400> 245
Met Ser Asn Met Asn His Glu Arg Val Leu Ser Val His His Trp Asn
1 5 10 15
Asp Thr Leu Phe Ser Phe Lys Cys Thr Arg Asp Pro Gly Leu Arg Phe
20 25 30
Glu Asn Gly Gln Phe Val Met Ile Gly Leu Gln Gln Asp Asn Gly Arg
35 40 45
Pro Leu Met Arg Ala Tyr Ser Ile Ala Ser Pro Asn Trp Glu Glu His
50 55 60
Leu Glu Phe Phe Ser Ile Lys Val Pro Asp Gly Pro Leu Thr Ser Gln
65 70 75 80
Leu Gln His Leu Lys Glu Gly Asp Glu Ile Ile Ile Ser Lys Lys Pro
85 90 95
Thr Gly Thr Leu Val Leu Asp Asp Leu Asn Pro Gly Lys His Leu Tyr
100 105 110
Leu Leu Ser Thr Gly Thr Gly Leu Ala Pro Phe Met Ser Val Ile Gln
115 120 125
Asp Pro Glu Thr Tyr Glu Arg Phe Glu Lys Val Ile Leu Val His Gly
130 135 140
Val Arg Tyr Val Asn Glu Val Ala Tyr Arg Glu Phe Ile Thr Glu His
145 150 155 160
Leu Pro Gln Asn Glu Phe Phe Gly Glu Ser Val Arg Asp Lys Leu Ile
165 170 175
Tyr Tyr Pro Thr Val Thr Arg Glu Pro Phe Glu Asn Gln Gly Arg Leu
180 185 190
Thr Asp Leu Met Arg Ser Gly Lys Leu Phe Ser Asp Ile Gly Leu Pro
195 200 205
Pro Ile Asn Pro Gln Asp Asp Arg Ala Met Ile Cys Gly Ser Pro Ser
210 215 220
Met Leu Asp Glu Thr Ser Glu Val Leu Asp Ser Phe Gly Leu Lys Ile
225 230 235 240
Ser Pro Arg Met Arg Glu Pro Gly Asp Tyr Leu Ile Glu Arg Ala Phe
245 250 255
Val Glu Lys
<210> 246
<211> 936
<212> DNA
<213> Streptomyces venezuelae
<220>
<221> CDS
<222> (1)..(936)
<223> SVEN_0113 gene from Streptomyces venezuelae encoding a
flavodoxin/ferredoxin reductase
<400> 246
atg agc gag aac ccg ctg caa ctg atc gtc cac cgc atg aca cgg gag 48
Met Ser Glu Asn Pro Leu Gln Leu Ile Val His Arg Met Thr Arg Glu
1 5 10 15
gcc gag ggc gta ctg tcc gtc gaa ctc gcc cac ccc gac ggc aag ccg 96
Ala Glu Gly Val Leu Ser Val Glu Leu Ala His Pro Asp Gly Lys Pro
20 25 30
ctg ccc gcc tgg acg ccg ggc gcc cac atc gac gtc cac gtc ggg ggc 144
Leu Pro Ala Trp Thr Pro Gly Ala His Ile Asp Val His Val Gly Gly
35 40 45
cac gtc cgc cag tac agc ctg tgc ggc gac ccg cac gac cag ggc gcg 192
His Val Arg Gln Tyr Ser Leu Cys Gly Asp Pro His Asp Gln Gly Ala
50 55 60
tac cgg atc ggc gtc ctc gac gaa ccc gcc tca cgc ggc ggt tcg cgc 240
Tyr Arg Ile Gly Val Leu Asp Glu Pro Ala Ser Arg Gly Gly Ser Arg
65 70 75 80
ttc gtg cac acc gca ctg cgc ccc ggc cag acc ctc acg gtc tcc gca 288
Phe Val His Thr Ala Leu Arg Pro Gly Gln Thr Leu Thr Val Ser Ala
85 90 95
ccc cgc aac cac ttc gcc ctc gag gac gcc gcc gcg tac gtc ctc gtc 336
Pro Arg Asn His Phe Ala Leu Glu Asp Ala Ala Ala Tyr Val Leu Val
100 105 110
gcc ggc ggc atc ggc atc acg ccc ctg ctc gcc atg gcc cgc gag gcg 384
Ala Gly Gly Ile Gly Ile Thr Pro Leu Leu Ala Met Ala Arg Glu Ala
115 120 125
gcc cgc cgg ggc gcc gag tgg cgc ctg gtc tac ggc ggc cgg agc cgg 432
Ala Arg Arg Gly Ala Glu Trp Arg Leu Val Tyr Gly Gly Arg Ser Arg
130 135 140
gcg tcg atg gcc ttc acc gcc gaa ctg gcc ctg ctc ggc ggc gag gtg 480
Ala Ser Met Ala Phe Thr Ala Glu Leu Ala Leu Leu Gly Gly Glu Val
145 150 155 160
acc ctc gtc ccg cag gac gaa cgc ggc cac atc gac ctg gac gcc gag 528
Thr Leu Val Pro Gln Asp Glu Arg Gly His Ile Asp Leu Asp Ala Glu
165 170 175
ctg tcc cgg ctg ccc gac ggc gcc ctc gtc tac gcc tgc ggc ccg gaa 576
Leu Ser Arg Leu Pro Asp Gly Ala Leu Val Tyr Ala Cys Gly Pro Glu
180 185 190
ccc ctc ctc gcg gcc gtc gag gaa cgc tgt ccg caa gga cag ctg cgc 624
Pro Leu Leu Ala Ala Val Glu Glu Arg Cys Pro Gln Gly Gln Leu Arg
195 200 205
acc gaa cgg ttc acc gcc ccc acc gtc gaa cgc gca gaa gac gac gga 672
Thr Glu Arg Phe Thr Ala Pro Thr Val Glu Arg Ala Glu Asp Asp Gly
210 215 220
gag ttc gag gtc gag tgc cgc acc tcg ggc ctg acg ctc cgg gtc gac 720
Glu Phe Glu Val Glu Cys Arg Thr Ser Gly Leu Thr Leu Arg Val Asp
225 230 235 240
gca cac tcc tcg atc ctc gac gcc gcc gag aac gcc ggg atc gcc gtc 768
Ala His Ser Ser Ile Leu Asp Ala Ala Glu Asn Ala Gly Ile Ala Val
245 250 255
gac agc tcc tgc cgc gac ggc atc tgc ggc tcc tgc gag acc cgc gtc 816
Asp Ser Ser Cys Arg Asp Gly Ile Cys Gly Ser Cys Glu Thr Arg Val
260 265 270
ctc gaa ggc acc ccg gac cac cgc gac ttc ctc ctc agc gag gcg gaa 864
Leu Glu Gly Thr Pro Asp His Arg Asp Phe Leu Leu Ser Glu Ala Glu
275 280 285
cag gcc gcc ggc gcc acc atg atg atc tgc gtc tcg cgg tgc gcc tcc 912
Gln Ala Ala Gly Ala Thr Met Met Ile Cys Val Ser Arg Cys Ala Ser
290 295 300
ggc cgg ctc gtc ctc gac ctg tga 936
Gly Arg Leu Val Leu Asp Leu
305 310
<210> 247
<211> 311
<212> PRT
<213> Streptomyces venezuelae
<400> 247
Met Ser Glu Asn Pro Leu Gln Leu Ile Val His Arg Met Thr Arg Glu
1 5 10 15
Ala Glu Gly Val Leu Ser Val Glu Leu Ala His Pro Asp Gly Lys Pro
20 25 30
Leu Pro Ala Trp Thr Pro Gly Ala His Ile Asp Val His Val Gly Gly
35 40 45
His Val Arg Gln Tyr Ser Leu Cys Gly Asp Pro His Asp Gln Gly Ala
50 55 60
Tyr Arg Ile Gly Val Leu Asp Glu Pro Ala Ser Arg Gly Gly Ser Arg
65 70 75 80
Phe Val His Thr Ala Leu Arg Pro Gly Gln Thr Leu Thr Val Ser Ala
85 90 95
Pro Arg Asn His Phe Ala Leu Glu Asp Ala Ala Ala Tyr Val Leu Val
100 105 110
Ala Gly Gly Ile Gly Ile Thr Pro Leu Leu Ala Met Ala Arg Glu Ala
115 120 125
Ala Arg Arg Gly Ala Glu Trp Arg Leu Val Tyr Gly Gly Arg Ser Arg
130 135 140
Ala Ser Met Ala Phe Thr Ala Glu Leu Ala Leu Leu Gly Gly Glu Val
145 150 155 160
Thr Leu Val Pro Gln Asp Glu Arg Gly His Ile Asp Leu Asp Ala Glu
165 170 175
Leu Ser Arg Leu Pro Asp Gly Ala Leu Val Tyr Ala Cys Gly Pro Glu
180 185 190
Pro Leu Leu Ala Ala Val Glu Glu Arg Cys Pro Gln Gly Gln Leu Arg
195 200 205
Thr Glu Arg Phe Thr Ala Pro Thr Val Glu Arg Ala Glu Asp Asp Gly
210 215 220
Glu Phe Glu Val Glu Cys Arg Thr Ser Gly Leu Thr Leu Arg Val Asp
225 230 235 240
Ala His Ser Ser Ile Leu Asp Ala Ala Glu Asn Ala Gly Ile Ala Val
245 250 255
Asp Ser Ser Cys Arg Asp Gly Ile Cys Gly Ser Cys Glu Thr Arg Val
260 265 270
Leu Glu Gly Thr Pro Asp His Arg Asp Phe Leu Leu Ser Glu Ala Glu
275 280 285
Gln Ala Ala Gly Ala Thr Met Met Ile Cys Val Ser Arg Cys Ala Ser
290 295 300
Gly Arg Leu Val Leu Asp Leu
305 310
<210> 248
<211> 978
<212> DNA
<213> Corynebacterium glutamicum
<220>
<221> CDS
<222> (1)..(978)
<223> Cgl2384 gene from Corynebacterium glutamicum encoding a
flavodoxin/ferredoxin reductase
<400> 248
atg aac tcg caa tgg caa gat gca cat gtt gtt tcc agc gaa atc atc 48
Met Asn Ser Gln Trp Gln Asp Ala His Val Val Ser Ser Glu Ile Ile
1 5 10 15
gct gca gac att cgg cga ata gaa cta tcc ccg aaa ttt gcg att cca 96
Ala Ala Asp Ile Arg Arg Ile Glu Leu Ser Pro Lys Phe Ala Ile Pro
20 25 30
gta aaa ccc ggc gaa cat ctc aag atc atg gtg ccc cta aaa act gga 144
Val Lys Pro Gly Glu His Leu Lys Ile Met Val Pro Leu Lys Thr Gly
35 40 45
cag gaa aag aga tcg tac tcc atc gtt gac gct cgt cac gac ggt tcg 192
Gln Glu Lys Arg Ser Tyr Ser Ile Val Asp Ala Arg His Asp Gly Ser
50 55 60
act ctc gcc ctg agc gta ctc aaa acc aga aac tcc cgt gga gga tct 240
Thr Leu Ala Leu Ser Val Leu Lys Thr Arg Asn Ser Arg Gly Gly Ser
65 70 75 80
gag ttc atg cat acg ctt cga gct gga gac aca gtt act gtc tcc agg 288
Glu Phe Met His Thr Leu Arg Ala Gly Asp Thr Val Thr Val Ser Arg
85 90 95
ccg tct cag gat ttt cct ctc cgc gtg ggt gcg cct gag tat gta ctt 336
Pro Ser Gln Asp Phe Pro Leu Arg Val Gly Ala Pro Glu Tyr Val Leu
100 105 110
gtt gcc ggc gga att gga atc aca gcg atc cgt tca atg gca tct tta 384
Val Ala Gly Gly Ile Gly Ile Thr Ala Ile Arg Ser Met Ala Ser Leu
115 120 125
tta aag aaa ttg gga gcg aac tac cgc atc cat ttc gca gca cgc agc 432
Leu Lys Lys Leu Gly Ala Asn Tyr Arg Ile His Phe Ala Ala Arg Ser
130 135 140
ctt gat gcc atg gct tac aaa gat gag ctc gtg gca gaa cac ggc gac 480
Leu Asp Ala Met Ala Tyr Lys Asp Glu Leu Val Ala Glu His Gly Asp
145 150 155 160
aag ctg cac ctg cat cta gat tct gaa ggc acc acc atc gat gtc cca 528
Lys Leu His Leu His Leu Asp Ser Glu Gly Thr Thr Ile Asp Val Pro
165 170 175
gca ttg atc gaa acc tta aac ccc cac act gag ctt tat atg tgc ggc 576
Ala Leu Ile Glu Thr Leu Asn Pro His Thr Glu Leu Tyr Met Cys Gly
180 185 190
ccc atc cgc ttg atg gat gcc atc cgg cgc gca tgg aac acc cgc gga 624
Pro Ile Arg Leu Met Asp Ala Ile Arg Arg Ala Trp Asn Thr Arg Gly
195 200 205
ctt gac ccc acc aat ctg cgt ttc gaa acg ttt gga aac agt gga tgg 672
Leu Asp Pro Thr Asn Leu Arg Phe Glu Thr Phe Gly Asn Ser Gly Trp
210 215 220
ttc tcc cca gag gtt ttc cac atc caa gta cca gag ctg ggg ctt cac 720
Phe Ser Pro Glu Val Phe His Ile Gln Val Pro Glu Leu Gly Leu His
225 230 235 240
gcc aca gtc aac aag gat gaa agc atg ctg gag gct ttg caa aag gct 768
Ala Thr Val Asn Lys Asp Glu Ser Met Leu Glu Ala Leu Gln Lys Ala
245 250 255
ggg gcg aat atg atg ttt gat tgt cga aaa ggc gaa tgt ggt ttg tgc 816
Gly Ala Asn Met Met Phe Asp Cys Arg Lys Gly Glu Cys Gly Leu Cys
260 265 270
cag gtt cgc gtt cta gaa gtc gat ggc cag gtt gat cac cgc gat gtg 864
Gln Val Arg Val Leu Glu Val Asp Gly Gln Val Asp His Arg Asp Val
275 280 285
ttc ttc tct gat cgt caa aaa gaa tcc gac gca aag gca tgc gcc tgc 912
Phe Phe Ser Asp Arg Gln Lys Glu Ser Asp Ala Lys Ala Cys Ala Cys
290 295 300
gtg tct cga gta gtc tcc tcc cct tcc tcg tcc cca acc tcg acc att 960
Val Ser Arg Val Val Ser Ser Pro Ser Ser Ser Pro Thr Ser Thr Ile
305 310 315 320
acg gtc gcc ctc tcc taa 978
Thr Val Ala Leu Ser
325
<210> 249
<211> 325
<212> PRT
<213> Corynebacterium glutamicum
<400> 249
Met Asn Ser Gln Trp Gln Asp Ala His Val Val Ser Ser Glu Ile Ile
1 5 10 15
Ala Ala Asp Ile Arg Arg Ile Glu Leu Ser Pro Lys Phe Ala Ile Pro
20 25 30
Val Lys Pro Gly Glu His Leu Lys Ile Met Val Pro Leu Lys Thr Gly
35 40 45
Gln Glu Lys Arg Ser Tyr Ser Ile Val Asp Ala Arg His Asp Gly Ser
50 55 60
Thr Leu Ala Leu Ser Val Leu Lys Thr Arg Asn Ser Arg Gly Gly Ser
65 70 75 80
Glu Phe Met His Thr Leu Arg Ala Gly Asp Thr Val Thr Val Ser Arg
85 90 95
Pro Ser Gln Asp Phe Pro Leu Arg Val Gly Ala Pro Glu Tyr Val Leu
100 105 110
Val Ala Gly Gly Ile Gly Ile Thr Ala Ile Arg Ser Met Ala Ser Leu
115 120 125
Leu Lys Lys Leu Gly Ala Asn Tyr Arg Ile His Phe Ala Ala Arg Ser
130 135 140
Leu Asp Ala Met Ala Tyr Lys Asp Glu Leu Val Ala Glu His Gly Asp
145 150 155 160
Lys Leu His Leu His Leu Asp Ser Glu Gly Thr Thr Ile Asp Val Pro
165 170 175
Ala Leu Ile Glu Thr Leu Asn Pro His Thr Glu Leu Tyr Met Cys Gly
180 185 190
Pro Ile Arg Leu Met Asp Ala Ile Arg Arg Ala Trp Asn Thr Arg Gly
195 200 205
Leu Asp Pro Thr Asn Leu Arg Phe Glu Thr Phe Gly Asn Ser Gly Trp
210 215 220
Phe Ser Pro Glu Val Phe His Ile Gln Val Pro Glu Leu Gly Leu His
225 230 235 240
Ala Thr Val Asn Lys Asp Glu Ser Met Leu Glu Ala Leu Gln Lys Ala
245 250 255
Gly Ala Asn Met Met Phe Asp Cys Arg Lys Gly Glu Cys Gly Leu Cys
260 265 270
Gln Val Arg Val Leu Glu Val Asp Gly Gln Val Asp His Arg Asp Val
275 280 285
Phe Phe Ser Asp Arg Gln Lys Glu Ser Asp Ala Lys Ala Cys Ala Cys
290 295 300
Val Ser Arg Val Val Ser Ser Pro Ser Ser Ser Pro Thr Ser Thr Ile
305 310 315 320
Thr Val Ala Leu Ser
325
<210> 250
<211> 921
<212> DNA
<213> Sphingobacterium sp
<220>
<221> CDS
<222> (1)..(921)
<223> SJN15614.1 from Sphingobacterium sp. JB170 encoding a
flavodoxin/ferredoxin reductase
<400> 250
atg ttt ggc gat cgc gag gtt cgt aga tcg tat tcc ttt agt agc tcg 48
Met Phe Gly Asp Arg Glu Val Arg Arg Ser Tyr Ser Phe Ser Ser Ser
1 5 10 15
cct gca gtt gtt gag ccg cta gcc att acc gta aaa aga gtg gat aat 96
Pro Ala Val Val Glu Pro Leu Ala Ile Thr Val Lys Arg Val Asp Asn
20 25 30
ggg gaa att tcc cgc ctg ttg cat cat cgt aca cgc gtt ggg gat ctt 144
Gly Glu Ile Ser Arg Leu Leu His His Arg Thr Arg Val Gly Asp Leu
35 40 45
gtt gat gtt cta gcc ccg cag gga tta ttt aca tac gaa cct gac ccc 192
Val Asp Val Leu Ala Pro Gln Gly Leu Phe Thr Tyr Glu Pro Asp Pro
50 55 60
act aca gct cga aca tta ttt tta ttt ggc gcc ggc gtt ggt gtt act 240
Thr Thr Ala Arg Thr Leu Phe Leu Phe Gly Ala Gly Val Gly Val Thr
65 70 75 80
ccg tta ttt tcc atc ctg aaa act gcg ctg tcc aca gaa ccc aaa acg 288
Pro Leu Phe Ser Ile Leu Lys Thr Ala Leu Ser Thr Glu Pro Lys Thr
85 90 95
aag gtt gtc ctc att tat agc aac agt tca ccc gat agg aca gtt ttt 336
Lys Val Val Leu Ile Tyr Ser Asn Ser Ser Pro Asp Arg Thr Val Phe
100 105 110
aaa gtt gaa ctt gaa cat tgg caa caa ctg tat gcc gat cgg ctt gaa 384
Lys Val Glu Leu Glu His Trp Gln Gln Leu Tyr Ala Asp Arg Leu Glu
115 120 125
att ata tgg att tac tcc aat tca aaa aat ctg tta aat gca cac cta 432
Ile Ile Trp Ile Tyr Ser Asn Ser Lys Asn Leu Leu Asn Ala His Leu
130 135 140
aac cgc gag aac tta tta cgc ttt gtc aat gaa cgc atg cct gag gat 480
Asn Arg Glu Asn Leu Leu Arg Phe Val Asn Glu Arg Met Pro Glu Asp
145 150 155 160
aac aat gct ata ttt ttc acc tgt ggt ccg gta ttt tac atg gac tta 528
Asn Asn Ala Ile Phe Phe Thr Cys Gly Pro Val Phe Tyr Met Asp Leu
165 170 175
gta cgc ttc acg tta ctc ggt ctt gga atc ccc gac gag gat atc cgc 576
Val Arg Phe Thr Leu Leu Gly Leu Gly Ile Pro Asp Glu Asp Ile Arg
180 185 190
aag gag aca ttt cat ttt cct gaa gaa gaa gat gat gaa gat gag aaa 624
Lys Glu Thr Phe His Phe Pro Glu Glu Glu Asp Asp Glu Asp Glu Lys
195 200 205
gaa gac gat ccg gtt gat acg aca gcg tac aac ata ttg ctc agg ttt 672
Glu Asp Asp Pro Val Asp Thr Thr Ala Tyr Asn Ile Leu Leu Arg Phe
210 215 220
caa gga caa gaa tac ccg ttg aca att cca tac aac aaa aca atc ttg 720
Gln Gly Gln Glu Tyr Pro Leu Thr Ile Pro Tyr Asn Lys Thr Ile Leu
225 230 235 240
cag gcc gga ctg gat aat aat att aaa ctt ccc tat tct tgt aaa tcg 768
Gln Ala Gly Leu Asp Asn Asn Ile Lys Leu Pro Tyr Ser Cys Lys Ser
245 250 255
ggg atg tgt agt act tgt atc tca caa tgt tcc agc ggg tcc gtc cgg 816
Gly Met Cys Ser Thr Cys Ile Ser Gln Cys Ser Ser Gly Ser Val Arg
260 265 270
atg gat tac aat gag gtt cta aca gat cgt gag gtt gaa aat ggc cgt 864
Met Asp Tyr Asn Glu Val Leu Thr Asp Arg Glu Val Glu Asn Gly Arg
275 280 285
tgt ttg att tgt act tcc cac ccg tta gaa gat ggt acg acg att gat 912
Cys Leu Ile Cys Thr Ser His Pro Leu Glu Asp Gly Thr Thr Ile Asp
290 295 300
gta gtt taa 921
Val Val
305
<210> 251
<211> 306
<212> PRT
<213> Sphingobacterium sp
<400> 251
Met Phe Gly Asp Arg Glu Val Arg Arg Ser Tyr Ser Phe Ser Ser Ser
1 5 10 15
Pro Ala Val Val Glu Pro Leu Ala Ile Thr Val Lys Arg Val Asp Asn
20 25 30
Gly Glu Ile Ser Arg Leu Leu His His Arg Thr Arg Val Gly Asp Leu
35 40 45
Val Asp Val Leu Ala Pro Gln Gly Leu Phe Thr Tyr Glu Pro Asp Pro
50 55 60
Thr Thr Ala Arg Thr Leu Phe Leu Phe Gly Ala Gly Val Gly Val Thr
65 70 75 80
Pro Leu Phe Ser Ile Leu Lys Thr Ala Leu Ser Thr Glu Pro Lys Thr
85 90 95
Lys Val Val Leu Ile Tyr Ser Asn Ser Ser Pro Asp Arg Thr Val Phe
100 105 110
Lys Val Glu Leu Glu His Trp Gln Gln Leu Tyr Ala Asp Arg Leu Glu
115 120 125
Ile Ile Trp Ile Tyr Ser Asn Ser Lys Asn Leu Leu Asn Ala His Leu
130 135 140
Asn Arg Glu Asn Leu Leu Arg Phe Val Asn Glu Arg Met Pro Glu Asp
145 150 155 160
Asn Asn Ala Ile Phe Phe Thr Cys Gly Pro Val Phe Tyr Met Asp Leu
165 170 175
Val Arg Phe Thr Leu Leu Gly Leu Gly Ile Pro Asp Glu Asp Ile Arg
180 185 190
Lys Glu Thr Phe His Phe Pro Glu Glu Glu Asp Asp Glu Asp Glu Lys
195 200 205
Glu Asp Asp Pro Val Asp Thr Thr Ala Tyr Asn Ile Leu Leu Arg Phe
210 215 220
Gln Gly Gln Glu Tyr Pro Leu Thr Ile Pro Tyr Asn Lys Thr Ile Leu
225 230 235 240
Gln Ala Gly Leu Asp Asn Asn Ile Lys Leu Pro Tyr Ser Cys Lys Ser
245 250 255
Gly Met Cys Ser Thr Cys Ile Ser Gln Cys Ser Ser Gly Ser Val Arg
260 265 270
Met Asp Tyr Asn Glu Val Leu Thr Asp Arg Glu Val Glu Asn Gly Arg
275 280 285
Cys Leu Ile Cys Thr Ser His Pro Leu Glu Asp Gly Thr Thr Ile Asp
290 295 300
Val Val
305
<210> 252
<211> 3525
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(3525)
<223> ydbK gene from E. coli encoding pyruvate-flavodoxin/ferredoxin
oxidoreductase
<400> 252
atg att act att gac ggt aat ggc gcg gtt gct tcg gtc gca ttt cgc 48
Met Ile Thr Ile Asp Gly Asn Gly Ala Val Ala Ser Val Ala Phe Arg
1 5 10 15
acc agt gaa gtt atc gcc atc tac cct att acc ccc agt tcc acg atg 96
Thr Ser Glu Val Ile Ala Ile Tyr Pro Ile Thr Pro Ser Ser Thr Met
20 25 30
gca gaa cag gct gat gcc tgg gcc gga aac ggc tta aag aac gtt tgg 144
Ala Glu Gln Ala Asp Ala Trp Ala Gly Asn Gly Leu Lys Asn Val Trp
35 40 45
gga gac aca cca cgc gtg gtt gaa atg cag tcg gaa gcg ggt gct atc 192
Gly Asp Thr Pro Arg Val Val Glu Met Gln Ser Glu Ala Gly Ala Ile
50 55 60
gct acc gtg cat ggc gct ttg cag acg ggt gcc ctt tca aca tcg ttt 240
Ala Thr Val His Gly Ala Leu Gln Thr Gly Ala Leu Ser Thr Ser Phe
65 70 75 80
acg tca tcg cag ggt ttg ctg ctg atg atc ccg acg ctg tac aaa ctg 288
Thr Ser Ser Gln Gly Leu Leu Leu Met Ile Pro Thr Leu Tyr Lys Leu
85 90 95
gca ggc gaa cta aca ccg ttt gtc ctg cat gta gcg gca cgt acc gtt 336
Ala Gly Glu Leu Thr Pro Phe Val Leu His Val Ala Ala Arg Thr Val
100 105 110
gcc aca cat gca ctc tct att ttt ggc gat cat tcc gac gtt atg gcg 384
Ala Thr His Ala Leu Ser Ile Phe Gly Asp His Ser Asp Val Met Ala
115 120 125
gtg cgc cag acg ggt tgc gcg atg ttg tgt gca gca aac gtc cag gaa 432
Val Arg Gln Thr Gly Cys Ala Met Leu Cys Ala Ala Asn Val Gln Glu
130 135 140
gcg caa gac ttt gct ctc att tcg caa atc gcg acg ctg aaa agc cgc 480
Ala Gln Asp Phe Ala Leu Ile Ser Gln Ile Ala Thr Leu Lys Ser Arg
145 150 155 160
gtg cca ttt att cat ttc ttt gat ggt ttc cgc acg tcc cac gaa atc 528
Val Pro Phe Ile His Phe Phe Asp Gly Phe Arg Thr Ser His Glu Ile
165 170 175
aat aaa att gtc ccg ctg gcc gat gac acg att ctt gat ctc atg ccg 576
Asn Lys Ile Val Pro Leu Ala Asp Asp Thr Ile Leu Asp Leu Met Pro
180 185 190
cag gtc gaa att gat gct cat cgc gcc cgg gca ctc aac ccg gaa cat 624
Gln Val Glu Ile Asp Ala His Arg Ala Arg Ala Leu Asn Pro Glu His
195 200 205
ccg gtg atc cgc ggt acg tcc gcc aat cct gac act tat ttc cag tct 672
Pro Val Ile Arg Gly Thr Ser Ala Asn Pro Asp Thr Tyr Phe Gln Ser
210 215 220
cgc gaa gcc acc aac cca tgg tac aac gcg gtc tat gac cat gtt gaa 720
Arg Glu Ala Thr Asn Pro Trp Tyr Asn Ala Val Tyr Asp His Val Glu
225 230 235 240
cag gcg atg aat gat ttc tct gcc gcg aca ggt cgt cag tat cag ccg 768
Gln Ala Met Asn Asp Phe Ser Ala Ala Thr Gly Arg Gln Tyr Gln Pro
245 250 255
ttt gaa tat tac ggg cat ccg caa gcg gaa cgg gtg att atc ctg atg 816
Phe Glu Tyr Tyr Gly His Pro Gln Ala Glu Arg Val Ile Ile Leu Met
260 265 270
ggc tct gcc att ggc acc tgt gaa gaa gtg gtt gat gaa ttg cta acc 864
Gly Ser Ala Ile Gly Thr Cys Glu Glu Val Val Asp Glu Leu Leu Thr
275 280 285
cgt ggc gaa aaa gtt ggc gtg ctg aaa gtt cgc ctg tac cgc ccc ttc 912
Arg Gly Glu Lys Val Gly Val Leu Lys Val Arg Leu Tyr Arg Pro Phe
290 295 300
tcc gct aaa cat tta ctg caa gct ctg ccg gga tcc gta cgc agc gtg 960
Ser Ala Lys His Leu Leu Gln Ala Leu Pro Gly Ser Val Arg Ser Val
305 310 315 320
gcg gta ctg gac aga acc aaa gaa ccc ggt gcc cag gca gaa ccg ctc 1008
Ala Val Leu Asp Arg Thr Lys Glu Pro Gly Ala Gln Ala Glu Pro Leu
325 330 335
tat ctg gat gta atg acc gca ctg gca gaa gcc ttt aat aat ggc gag 1056
Tyr Leu Asp Val Met Thr Ala Leu Ala Glu Ala Phe Asn Asn Gly Glu
340 345 350
cgc gaa act ctg ccc cgt gtc att ggt ggg cgc tat ggt ctt tca tcc 1104
Arg Glu Thr Leu Pro Arg Val Ile Gly Gly Arg Tyr Gly Leu Ser Ser
355 360 365
aaa gaa ttt ggc cca gac tgt gta ctg gcg gta ttt gcc gag ctc aac 1152
Lys Glu Phe Gly Pro Asp Cys Val Leu Ala Val Phe Ala Glu Leu Asn
370 375 380
gcg gct aaa ccg aaa gcg cgc ttt acg gtt ggt att tac gat gat gtg 1200
Ala Ala Lys Pro Lys Ala Arg Phe Thr Val Gly Ile Tyr Asp Asp Val
385 390 395 400
acc aat ctg tca ctg ccg ttg ccg gaa aac acc ctg cca aac tcg gcg 1248
Thr Asn Leu Ser Leu Pro Leu Pro Glu Asn Thr Leu Pro Asn Ser Ala
405 410 415
aaa ctg gaa gcc ttg ttt tat ggc ctt ggt agt gat ggc agc gtt tcc 1296
Lys Leu Glu Ala Leu Phe Tyr Gly Leu Gly Ser Asp Gly Ser Val Ser
420 425 430
gcg acc aaa aac aat atc aag att atc ggt aat tcc acg ccg tgg tac 1344
Ala Thr Lys Asn Asn Ile Lys Ile Ile Gly Asn Ser Thr Pro Trp Tyr
435 440 445
gca cag ggc tat ttt gtt tac gac tcc aaa aag gcg ggc ggc ctg acg 1392
Ala Gln Gly Tyr Phe Val Tyr Asp Ser Lys Lys Ala Gly Gly Leu Thr
450 455 460
gtt tct cac ctt cga gtg agc gaa cag ccg att cgt tcc gct tat ctc 1440
Val Ser His Leu Arg Val Ser Glu Gln Pro Ile Arg Ser Ala Tyr Leu
465 470 475 480
att tcc cag gct gat ttt gtt ggc tgc cac cag ttg cag ttt atc gat 1488
Ile Ser Gln Ala Asp Phe Val Gly Cys His Gln Leu Gln Phe Ile Asp
485 490 495
aaa tat cag atg gct gag cgt tta aaa cct ggc ggc att ttc ctg ctc 1536
Lys Tyr Gln Met Ala Glu Arg Leu Lys Pro Gly Gly Ile Phe Leu Leu
500 505 510
aac acg ccg tac agc gca gat gaa gtg tgg tcg cgc ttg ccg caa gaa 1584
Asn Thr Pro Tyr Ser Ala Asp Glu Val Trp Ser Arg Leu Pro Gln Glu
515 520 525
gtt cag gcc gtg tta aac cag aaa aaa gcg cgc ttc tat gtg att aac 1632
Val Gln Ala Val Leu Asn Gln Lys Lys Ala Arg Phe Tyr Val Ile Asn
530 535 540
gcg gcg aaa atc gcc cgc gaa tgt ggc ctg gcg gcc cgt att aat acc 1680
Ala Ala Lys Ile Ala Arg Glu Cys Gly Leu Ala Ala Arg Ile Asn Thr
545 550 555 560
gtc atg cag atg gct ttt ttc cat ctg acg caa att ctg cct ggc gat 1728
Val Met Gln Met Ala Phe Phe His Leu Thr Gln Ile Leu Pro Gly Asp
565 570 575
agc gcc ctc gca gaa ttg cag ggt gcg att gcc aaa agt tac agt agc 1776
Ser Ala Leu Ala Glu Leu Gln Gly Ala Ile Ala Lys Ser Tyr Ser Ser
580 585 590
aaa ggc cag gat ctg gtg gaa cgc aac tgg cag gct ctg gcg ctg gcg 1824
Lys Gly Gln Asp Leu Val Glu Arg Asn Trp Gln Ala Leu Ala Leu Ala
595 600 605
cgt gaa tcc gta gaa gaa gtt ccg ttg caa ccg gta aat ccg cac agc 1872
Arg Glu Ser Val Glu Glu Val Pro Leu Gln Pro Val Asn Pro His Ser
610 615 620
gcc aat cga ccg cca gtg gtt tcc gat gcc gcc cct gat ttc gtg aaa 1920
Ala Asn Arg Pro Pro Val Val Ser Asp Ala Ala Pro Asp Phe Val Lys
625 630 635 640
acc gta acc gct gcg atg ctc gcc ggg ctt ggt gac gcc ctc ccc gtt 1968
Thr Val Thr Ala Ala Met Leu Ala Gly Leu Gly Asp Ala Leu Pro Val
645 650 655
tcg gcg ctg ccg cca gac ggc acc tgg ccg atg ggc act acg cgc tgg 2016
Ser Ala Leu Pro Pro Asp Gly Thr Trp Pro Met Gly Thr Thr Arg Trp
660 665 670
gaa aaa cgc aat atc gcc gaa gag atc ccc atc tgg aaa gag gaa ctc 2064
Glu Lys Arg Asn Ile Ala Glu Glu Ile Pro Ile Trp Lys Glu Glu Leu
675 680 685
tgt acc caa tgt aac cac tgc gtt gcc gct tgc cca cac tca gct att 2112
Cys Thr Gln Cys Asn His Cys Val Ala Ala Cys Pro His Ser Ala Ile
690 695 700
cgc gca aaa gtg gtg ccg cct gaa gcg atg gaa aac gcc cct gcc agc 2160
Arg Ala Lys Val Val Pro Pro Glu Ala Met Glu Asn Ala Pro Ala Ser
705 710 715 720
ctg cat tcg ctg gat gtg aaa tcg cgt gat atg cgc ggg cag aaa tat 2208
Leu His Ser Leu Asp Val Lys Ser Arg Asp Met Arg Gly Gln Lys Tyr
725 730 735
gtc ttg cag gtg gca ccg gaa gat tgc acc ggt tgt aac ctg tgc gtc 2256
Val Leu Gln Val Ala Pro Glu Asp Cys Thr Gly Cys Asn Leu Cys Val
740 745 750
gaa gtt tgc ccg gcg aaa gac cgt cag aat cca gag att aaa gcc atc 2304
Glu Val Cys Pro Ala Lys Asp Arg Gln Asn Pro Glu Ile Lys Ala Ile
755 760 765
aat atg atg tct cgc ctg gaa cat gtc gaa gaa gag aaa atc aat tac 2352
Asn Met Met Ser Arg Leu Glu His Val Glu Glu Glu Lys Ile Asn Tyr
770 775 780
gat ttc ttc ctc aac ctg cca gaa atc gac cgt agc aaa ctg gaa cgt 2400
Asp Phe Phe Leu Asn Leu Pro Glu Ile Asp Arg Ser Lys Leu Glu Arg
785 790 795 800
att gat att cgt aca tcg cag ctg att aca ccg ctg ttt gaa tat tca 2448
Ile Asp Ile Arg Thr Ser Gln Leu Ile Thr Pro Leu Phe Glu Tyr Ser
805 810 815
ggt gct tgc tcc ggt tgt ggc gag acg ccg tat att aaa tta ctg act 2496
Gly Ala Cys Ser Gly Cys Gly Glu Thr Pro Tyr Ile Lys Leu Leu Thr
820 825 830
cag ctc tat ggc gac cgg atg ttg atc gct aac gcc act ggc tgt tct 2544
Gln Leu Tyr Gly Asp Arg Met Leu Ile Ala Asn Ala Thr Gly Cys Ser
835 840 845
tca att tat ggc ggt aac ctg ccc tct aca ccg tat acc acc gat gcc 2592
Ser Ile Tyr Gly Gly Asn Leu Pro Ser Thr Pro Tyr Thr Thr Asp Ala
850 855 860
aac ggt cgt ggg ccg gca tgg gcg aac tct cta ttt gaa gat aat gcc 2640
Asn Gly Arg Gly Pro Ala Trp Ala Asn Ser Leu Phe Glu Asp Asn Ala
865 870 875 880
gaa ttt ggc ctt ggt ttc cgc ctg acg gtc gat caa cac cgt gtc cgc 2688
Glu Phe Gly Leu Gly Phe Arg Leu Thr Val Asp Gln His Arg Val Arg
885 890 895
gtg ctg cgt ctg ctg gat caa ttt gcc gat aaa atc ccg gcg gaa tta 2736
Val Leu Arg Leu Leu Asp Gln Phe Ala Asp Lys Ile Pro Ala Glu Leu
900 905 910
ctg acg gcg ttg aaa tca gac gcc acg cca gag gtt cgt cgt gaa cag 2784
Leu Thr Ala Leu Lys Ser Asp Ala Thr Pro Glu Val Arg Arg Glu Gln
915 920 925
gtt gca gct tta cgc cag caa ctc aac gat gtt gcc gaa gca cat gaa 2832
Val Ala Ala Leu Arg Gln Gln Leu Asn Asp Val Ala Glu Ala His Glu
930 935 940
ctg cta cgt gat gca gat gca ctg gtg gaa aaa tca atc tgg ctg att 2880
Leu Leu Arg Asp Ala Asp Ala Leu Val Glu Lys Ser Ile Trp Leu Ile
945 950 955 960
ggt ggt gat ggc tgg gct tac gat atc ggc ttt ggc ggt ctg gat cat 2928
Gly Gly Asp Gly Trp Ala Tyr Asp Ile Gly Phe Gly Gly Leu Asp His
965 970 975
gta ttg agt ttg acg gaa aac gtc aac att ctg gtg ctg gat acg caa 2976
Val Leu Ser Leu Thr Glu Asn Val Asn Ile Leu Val Leu Asp Thr Gln
980 985 990
tgc tat tcc aac acc ggt ggt cag gcg tcg aaa gcg aca ccg ctg ggt 3024
Cys Tyr Ser Asn Thr Gly Gly Gln Ala Ser Lys Ala Thr Pro Leu Gly
995 1000 1005
gca gta act aaa ttt ggc gag cac ggc aaa cgt aaa gcg cgt aaa 3069
Ala Val Thr Lys Phe Gly Glu His Gly Lys Arg Lys Ala Arg Lys
1010 1015 1020
gat ctt ggc gtc agt atg atg atg tac ggt cat gtt tat gtg gcg 3114
Asp Leu Gly Val Ser Met Met Met Tyr Gly His Val Tyr Val Ala
1025 1030 1035
cag att tct ctc ggc gcg cag ctg aac cag acg gtg aaa gcg att 3159
Gln Ile Ser Leu Gly Ala Gln Leu Asn Gln Thr Val Lys Ala Ile
1040 1045 1050
cag gaa gcg gaa gcg tat ccg ggg cca tcg ctg atc att gct tat 3204
Gln Glu Ala Glu Ala Tyr Pro Gly Pro Ser Leu Ile Ile Ala Tyr
1055 1060 1065
agc ccg tgt gaa gag cat ggt tac gat ctg gca ctc agc cac gac 3249
Ser Pro Cys Glu Glu His Gly Tyr Asp Leu Ala Leu Ser His Asp
1070 1075 1080
cag atg cgc caa ctc aca gct acc ggc ttc tgg ccg cta tat cgc 3294
Gln Met Arg Gln Leu Thr Ala Thr Gly Phe Trp Pro Leu Tyr Arg
1085 1090 1095
ttt gat ccg cgt cgt gcc gat gaa ggc aaa ctg ccg ctg gcc ttg 3339
Phe Asp Pro Arg Arg Ala Asp Glu Gly Lys Leu Pro Leu Ala Leu
1100 1105 1110
gat tca cgc ccg ccg tca gaa gca ccg gaa gaa acg tta ctt cac 3384
Asp Ser Arg Pro Pro Ser Glu Ala Pro Glu Glu Thr Leu Leu His
1115 1120 1125
gag caa cgt ttc cgt cgg ctg aat tcg cag cag cca gaa gtg gca 3429
Glu Gln Arg Phe Arg Arg Leu Asn Ser Gln Gln Pro Glu Val Ala
1130 1135 1140
gaa cag tta tgg aaa gat gct gca gct gat ttg caa aaa cgc tat 3474
Glu Gln Leu Trp Lys Asp Ala Ala Ala Asp Leu Gln Lys Arg Tyr
1145 1150 1155
gac ttc ctg gca caa atg gcc gga aaa gcg gaa aaa agc aac acc 3519
Asp Phe Leu Ala Gln Met Ala Gly Lys Ala Glu Lys Ser Asn Thr
1160 1165 1170
gat taa 3525
Asp
<210> 253
<211> 1174
<212> PRT
<213> Escherichia coli
<400> 253
Met Ile Thr Ile Asp Gly Asn Gly Ala Val Ala Ser Val Ala Phe Arg
1 5 10 15
Thr Ser Glu Val Ile Ala Ile Tyr Pro Ile Thr Pro Ser Ser Thr Met
20 25 30
Ala Glu Gln Ala Asp Ala Trp Ala Gly Asn Gly Leu Lys Asn Val Trp
35 40 45
Gly Asp Thr Pro Arg Val Val Glu Met Gln Ser Glu Ala Gly Ala Ile
50 55 60
Ala Thr Val His Gly Ala Leu Gln Thr Gly Ala Leu Ser Thr Ser Phe
65 70 75 80
Thr Ser Ser Gln Gly Leu Leu Leu Met Ile Pro Thr Leu Tyr Lys Leu
85 90 95
Ala Gly Glu Leu Thr Pro Phe Val Leu His Val Ala Ala Arg Thr Val
100 105 110
Ala Thr His Ala Leu Ser Ile Phe Gly Asp His Ser Asp Val Met Ala
115 120 125
Val Arg Gln Thr Gly Cys Ala Met Leu Cys Ala Ala Asn Val Gln Glu
130 135 140
Ala Gln Asp Phe Ala Leu Ile Ser Gln Ile Ala Thr Leu Lys Ser Arg
145 150 155 160
Val Pro Phe Ile His Phe Phe Asp Gly Phe Arg Thr Ser His Glu Ile
165 170 175
Asn Lys Ile Val Pro Leu Ala Asp Asp Thr Ile Leu Asp Leu Met Pro
180 185 190
Gln Val Glu Ile Asp Ala His Arg Ala Arg Ala Leu Asn Pro Glu His
195 200 205
Pro Val Ile Arg Gly Thr Ser Ala Asn Pro Asp Thr Tyr Phe Gln Ser
210 215 220
Arg Glu Ala Thr Asn Pro Trp Tyr Asn Ala Val Tyr Asp His Val Glu
225 230 235 240
Gln Ala Met Asn Asp Phe Ser Ala Ala Thr Gly Arg Gln Tyr Gln Pro
245 250 255
Phe Glu Tyr Tyr Gly His Pro Gln Ala Glu Arg Val Ile Ile Leu Met
260 265 270
Gly Ser Ala Ile Gly Thr Cys Glu Glu Val Val Asp Glu Leu Leu Thr
275 280 285
Arg Gly Glu Lys Val Gly Val Leu Lys Val Arg Leu Tyr Arg Pro Phe
290 295 300
Ser Ala Lys His Leu Leu Gln Ala Leu Pro Gly Ser Val Arg Ser Val
305 310 315 320
Ala Val Leu Asp Arg Thr Lys Glu Pro Gly Ala Gln Ala Glu Pro Leu
325 330 335
Tyr Leu Asp Val Met Thr Ala Leu Ala Glu Ala Phe Asn Asn Gly Glu
340 345 350
Arg Glu Thr Leu Pro Arg Val Ile Gly Gly Arg Tyr Gly Leu Ser Ser
355 360 365
Lys Glu Phe Gly Pro Asp Cys Val Leu Ala Val Phe Ala Glu Leu Asn
370 375 380
Ala Ala Lys Pro Lys Ala Arg Phe Thr Val Gly Ile Tyr Asp Asp Val
385 390 395 400
Thr Asn Leu Ser Leu Pro Leu Pro Glu Asn Thr Leu Pro Asn Ser Ala
405 410 415
Lys Leu Glu Ala Leu Phe Tyr Gly Leu Gly Ser Asp Gly Ser Val Ser
420 425 430
Ala Thr Lys Asn Asn Ile Lys Ile Ile Gly Asn Ser Thr Pro Trp Tyr
435 440 445
Ala Gln Gly Tyr Phe Val Tyr Asp Ser Lys Lys Ala Gly Gly Leu Thr
450 455 460
Val Ser His Leu Arg Val Ser Glu Gln Pro Ile Arg Ser Ala Tyr Leu
465 470 475 480
Ile Ser Gln Ala Asp Phe Val Gly Cys His Gln Leu Gln Phe Ile Asp
485 490 495
Lys Tyr Gln Met Ala Glu Arg Leu Lys Pro Gly Gly Ile Phe Leu Leu
500 505 510
Asn Thr Pro Tyr Ser Ala Asp Glu Val Trp Ser Arg Leu Pro Gln Glu
515 520 525
Val Gln Ala Val Leu Asn Gln Lys Lys Ala Arg Phe Tyr Val Ile Asn
530 535 540
Ala Ala Lys Ile Ala Arg Glu Cys Gly Leu Ala Ala Arg Ile Asn Thr
545 550 555 560
Val Met Gln Met Ala Phe Phe His Leu Thr Gln Ile Leu Pro Gly Asp
565 570 575
Ser Ala Leu Ala Glu Leu Gln Gly Ala Ile Ala Lys Ser Tyr Ser Ser
580 585 590
Lys Gly Gln Asp Leu Val Glu Arg Asn Trp Gln Ala Leu Ala Leu Ala
595 600 605
Arg Glu Ser Val Glu Glu Val Pro Leu Gln Pro Val Asn Pro His Ser
610 615 620
Ala Asn Arg Pro Pro Val Val Ser Asp Ala Ala Pro Asp Phe Val Lys
625 630 635 640
Thr Val Thr Ala Ala Met Leu Ala Gly Leu Gly Asp Ala Leu Pro Val
645 650 655
Ser Ala Leu Pro Pro Asp Gly Thr Trp Pro Met Gly Thr Thr Arg Trp
660 665 670
Glu Lys Arg Asn Ile Ala Glu Glu Ile Pro Ile Trp Lys Glu Glu Leu
675 680 685
Cys Thr Gln Cys Asn His Cys Val Ala Ala Cys Pro His Ser Ala Ile
690 695 700
Arg Ala Lys Val Val Pro Pro Glu Ala Met Glu Asn Ala Pro Ala Ser
705 710 715 720
Leu His Ser Leu Asp Val Lys Ser Arg Asp Met Arg Gly Gln Lys Tyr
725 730 735
Val Leu Gln Val Ala Pro Glu Asp Cys Thr Gly Cys Asn Leu Cys Val
740 745 750
Glu Val Cys Pro Ala Lys Asp Arg Gln Asn Pro Glu Ile Lys Ala Ile
755 760 765
Asn Met Met Ser Arg Leu Glu His Val Glu Glu Glu Lys Ile Asn Tyr
770 775 780
Asp Phe Phe Leu Asn Leu Pro Glu Ile Asp Arg Ser Lys Leu Glu Arg
785 790 795 800
Ile Asp Ile Arg Thr Ser Gln Leu Ile Thr Pro Leu Phe Glu Tyr Ser
805 810 815
Gly Ala Cys Ser Gly Cys Gly Glu Thr Pro Tyr Ile Lys Leu Leu Thr
820 825 830
Gln Leu Tyr Gly Asp Arg Met Leu Ile Ala Asn Ala Thr Gly Cys Ser
835 840 845
Ser Ile Tyr Gly Gly Asn Leu Pro Ser Thr Pro Tyr Thr Thr Asp Ala
850 855 860
Asn Gly Arg Gly Pro Ala Trp Ala Asn Ser Leu Phe Glu Asp Asn Ala
865 870 875 880
Glu Phe Gly Leu Gly Phe Arg Leu Thr Val Asp Gln His Arg Val Arg
885 890 895
Val Leu Arg Leu Leu Asp Gln Phe Ala Asp Lys Ile Pro Ala Glu Leu
900 905 910
Leu Thr Ala Leu Lys Ser Asp Ala Thr Pro Glu Val Arg Arg Glu Gln
915 920 925
Val Ala Ala Leu Arg Gln Gln Leu Asn Asp Val Ala Glu Ala His Glu
930 935 940
Leu Leu Arg Asp Ala Asp Ala Leu Val Glu Lys Ser Ile Trp Leu Ile
945 950 955 960
Gly Gly Asp Gly Trp Ala Tyr Asp Ile Gly Phe Gly Gly Leu Asp His
965 970 975
Val Leu Ser Leu Thr Glu Asn Val Asn Ile Leu Val Leu Asp Thr Gln
980 985 990
Cys Tyr Ser Asn Thr Gly Gly Gln Ala Ser Lys Ala Thr Pro Leu Gly
995 1000 1005
Ala Val Thr Lys Phe Gly Glu His Gly Lys Arg Lys Ala Arg Lys
1010 1015 1020
Asp Leu Gly Val Ser Met Met Met Tyr Gly His Val Tyr Val Ala
1025 1030 1035
Gln Ile Ser Leu Gly Ala Gln Leu Asn Gln Thr Val Lys Ala Ile
1040 1045 1050
Gln Glu Ala Glu Ala Tyr Pro Gly Pro Ser Leu Ile Ile Ala Tyr
1055 1060 1065
Ser Pro Cys Glu Glu His Gly Tyr Asp Leu Ala Leu Ser His Asp
1070 1075 1080
Gln Met Arg Gln Leu Thr Ala Thr Gly Phe Trp Pro Leu Tyr Arg
1085 1090 1095
Phe Asp Pro Arg Arg Ala Asp Glu Gly Lys Leu Pro Leu Ala Leu
1100 1105 1110
Asp Ser Arg Pro Pro Ser Glu Ala Pro Glu Glu Thr Leu Leu His
1115 1120 1125
Glu Gln Arg Phe Arg Arg Leu Asn Ser Gln Gln Pro Glu Val Ala
1130 1135 1140
Glu Gln Leu Trp Lys Asp Ala Ala Ala Asp Leu Gln Lys Arg Tyr
1145 1150 1155
Asp Phe Leu Ala Gln Met Ala Gly Lys Ala Glu Lys Ser Asn Thr
1160 1165 1170
Asp
<210> 254
<211> 3588
<212> DNA
<213> Geobacter sulfurreducens
<220>
<221> CDS
<222> (1)..(3588)
<223> por gene from Geobacter sulfurreducens AM-1 encoding
pyruvate-flavodoxin/ferredoxin oxidoreductase
<400> 254
atg agt cgc aaa atg gta acc atc gac ggc aat acc gcg gct gcc cac 48
Met Ser Arg Lys Met Val Thr Ile Asp Gly Asn Thr Ala Ala Ala His
1 5 10 15
gtg gcg cac gcc acc aac gag gtc att gcc atc tac ccc att acc cct 96
Val Ala His Ala Thr Asn Glu Val Ile Ala Ile Tyr Pro Ile Thr Pro
20 25 30
tcg tcg gtc atg ggt gag att tcc gac atc aag agc gcc atg ggc gag 144
Ser Ser Val Met Gly Glu Ile Ser Asp Ile Lys Ser Ala Met Gly Glu
35 40 45
aaa aac atc tgg gga acc gta ccg tcg gtt gtc gag atg cag tcg gaa 192
Lys Asn Ile Trp Gly Thr Val Pro Ser Val Val Glu Met Gln Ser Glu
50 55 60
ggc ggc gct gcc ggt gcc gtg cac ggt gcc ctc cag gca ggt gcg ctg 240
Gly Gly Ala Ala Gly Ala Val His Gly Ala Leu Gln Ala Gly Ala Leu
65 70 75 80
acc acc act ttt acc gcc agc cag ggt ctg ctc ctg atg atc ccg aac 288
Thr Thr Thr Phe Thr Ala Ser Gln Gly Leu Leu Leu Met Ile Pro Asn
85 90 95
atg ttc aag atc gcc ggc gag ctg acc tct acg gtc ttc cat gtc tcc 336
Met Phe Lys Ile Ala Gly Glu Leu Thr Ser Thr Val Phe His Val Ser
100 105 110
gcc cgc gcc atc gcg gcc cag gcc ctc tcc atc ttt ggc gac cat tcg 384
Ala Arg Ala Ile Ala Ala Gln Ala Leu Ser Ile Phe Gly Asp His Ser
115 120 125
gac gtc atg tcc tgc cgt tcc acc ggt tgg gcc atg ctc tgc tcc aac 432
Asp Val Met Ser Cys Arg Ser Thr Gly Trp Ala Met Leu Cys Ser Asn
130 135 140
aac tcc cag gag gtc atg gac ttc gcc ctg att gcc cag tcc gcg acg 480
Asn Ser Gln Glu Val Met Asp Phe Ala Leu Ile Ala Gln Ser Ala Thr
145 150 155 160
ctt cgt tcc cgg gtg ccg ttc ctc cat ttc ttc gac ggc ttc cgg acc 528
Leu Arg Ser Arg Val Pro Phe Leu His Phe Phe Asp Gly Phe Arg Thr
165 170 175
tcc cac gag gtt ctc aag gtg gag gag ctg act ttc gac gac atg cgc 576
Ser His Glu Val Leu Lys Val Glu Glu Leu Thr Phe Asp Asp Met Arg
180 185 190
gcc atg ctg gac gac gaa ctg atc gcc gcc cac aag gcc cgg ggc ctc 624
Ala Met Leu Asp Asp Glu Leu Ile Ala Ala His Lys Ala Arg Gly Leu
195 200 205
tct ccg gac cat ccc gtc atg cgc ggc acc gcc cag aac cct gac gtc 672
Ser Pro Asp His Pro Val Met Arg Gly Thr Ala Gln Asn Pro Asp Val
210 215 220
tac ttc cag ggg cgc gag acc gtt aac ccc ttc tac ccg aaa tgc atc 720
Tyr Phe Gln Gly Arg Glu Thr Val Asn Pro Phe Tyr Pro Lys Cys Ile
225 230 235 240
gag atc gtg gca gag gag atg gac aag ttc gcc aag atc acg ggc cgc 768
Glu Ile Val Ala Glu Glu Met Asp Lys Phe Ala Lys Ile Thr Gly Arg
245 250 255
cag tac aag ctg gtg gac tac gtg ggc gcc ccc gac gcc gac cgg gtc 816
Gln Tyr Lys Leu Val Asp Tyr Val Gly Ala Pro Asp Ala Asp Arg Val
260 265 270
atc gtc atc atg gga tcc ggc gcc gac acg gtg cag gag acc gtg gag 864
Ile Val Ile Met Gly Ser Gly Ala Asp Thr Val Gln Glu Thr Val Glu
275 280 285
cac ctg aac acc aag ggt gag aag atc ggc gtg gtg aag gtc cac ctc 912
His Leu Asn Thr Lys Gly Glu Lys Ile Gly Val Val Lys Val His Leu
290 295 300
tac cgg ccg ttc ccc atc gat gcc ttc att gcc gcc ctg ccc aag acc 960
Tyr Arg Pro Phe Pro Ile Asp Ala Phe Ile Ala Ala Leu Pro Lys Thr
305 310 315 320
gtg aag aag atc gcg gtc ctc gac cgg acc aag gag ccc ggc gcc ctg 1008
Val Lys Lys Ile Ala Val Leu Asp Arg Thr Lys Glu Pro Gly Ala Leu
325 330 335
ggc gag ccc ctg tac ctg gat gtc cgc act gcc atc ggc gag gcc atg 1056
Gly Glu Pro Leu Tyr Leu Asp Val Arg Thr Ala Ile Gly Glu Ala Met
340 345 350
gcc gac ggg aag tgc cag ttc gac ggc tac ccg gtc atc gtg ggc ggt 1104
Ala Asp Gly Lys Cys Gln Phe Asp Gly Tyr Pro Val Ile Val Gly Gly
355 360 365
cgc tac ggc ctt ggt tcc aag gag ttc acc ccg gcc cag gcc aag gcg 1152
Arg Tyr Gly Leu Gly Ser Lys Glu Phe Thr Pro Ala Gln Ala Lys Ala
370 375 380
gtg ttc gat aac cta gcc act gcc aag ccg cag aac aag ttc gtg gtc 1200
Val Phe Asp Asn Leu Ala Thr Ala Lys Pro Gln Asn Lys Phe Val Val
385 390 395 400
ggc atc acc gag gac gtg acc aac agc agc ctc ccg tgt gat ccg tcc 1248
Gly Ile Thr Glu Asp Val Thr Asn Ser Ser Leu Pro Cys Asp Pro Ser
405 410 415
ttc ttc aac ccg atg gaa ggg gcc tac cag gcc atg ttc ttc ggc ctc 1296
Phe Phe Asn Pro Met Glu Gly Ala Tyr Gln Ala Met Phe Phe Gly Leu
420 425 430
ggc tcc gac ggt acc gtg ggc gcc aac aag aac tcc atc aag atc atc 1344
Gly Ser Asp Gly Thr Val Gly Ala Asn Lys Asn Ser Ile Lys Ile Ile
435 440 445
ggc gag atg acc gat aac aac gcc cag gcc tac ttc gtc tac gac tcc 1392
Gly Glu Met Thr Asp Asn Asn Ala Gln Ala Tyr Phe Val Tyr Asp Ser
450 455 460
aag aag gcc ggc tcc atg acc acc tcg cac ctg cgc ttc ggc aag aag 1440
Lys Lys Ala Gly Ser Met Thr Thr Ser His Leu Arg Phe Gly Lys Lys
465 470 475 480
tac atc aga gcg ccg tac ctg gtg cag gag gcc gac ttc gtg gcc tgc 1488
Tyr Ile Arg Ala Pro Tyr Leu Val Gln Glu Ala Asp Phe Val Ala Cys
485 490 495
cac aac ttc gcc ttc gtg gaa aag tac gac atg ctg gcc aag gcc aag 1536
His Asn Phe Ala Phe Val Glu Lys Tyr Asp Met Leu Ala Lys Ala Lys
500 505 510
cag ggt gcc acg ttc ctc ctg aac gcc cct tac gac cac aac gag gtg 1584
Gln Gly Ala Thr Phe Leu Leu Asn Ala Pro Tyr Asp His Asn Glu Val
515 520 525
tgg gac agg ctc ccc gcc gac atg cag cag cag atc atc gac aag aag 1632
Trp Asp Arg Leu Pro Ala Asp Met Gln Gln Gln Ile Ile Asp Lys Lys
530 535 540
ctc aag ttc ttc gtg atc gat ggg gta cgc ctc ggc aac gag atc ggg 1680
Leu Lys Phe Phe Val Ile Asp Gly Val Arg Leu Gly Asn Glu Ile Gly
545 550 555 560
ctc ggt ccc cgg atc aac gtg atc atg cag acc gcc ttc ttc aag ata 1728
Leu Gly Pro Arg Ile Asn Val Ile Met Gln Thr Ala Phe Phe Lys Ile
565 570 575
tcc aac atc atc ccg ctg gat cag gcc att gac gag atc aag gac gct 1776
Ser Asn Ile Ile Pro Leu Asp Gln Ala Ile Asp Glu Ile Lys Asp Ala
580 585 590
atc aag aaa acc tat ggc aag gca ggc gag aag gtc gtg gag atg aac 1824
Ile Lys Lys Thr Tyr Gly Lys Ala Gly Glu Lys Val Val Glu Met Asn
595 600 605
tac aag gcg gtt gag gcc ggc ctc aac aac ttc tac gag gta acg gta 1872
Tyr Lys Ala Val Glu Ala Gly Leu Asn Asn Phe Tyr Glu Val Thr Val
610 615 620
ccg gca acg gca acc agt acc ctc cag aag cct ccc gtg gtc agc gcc 1920
Pro Ala Thr Ala Thr Ser Thr Leu Gln Lys Pro Pro Val Val Ser Ala
625 630 635 640
agg gcc ccc cag ttc gtg cag gag acc acc gcc ccc atc atc gcc ggc 1968
Arg Ala Pro Gln Phe Val Gln Glu Thr Thr Ala Pro Ile Ile Ala Gly
645 650 655
ctc ggc gac gac ctg ccg gtg tcc aag atg ccg gcc gac ggc acc ttc 2016
Leu Gly Asp Asp Leu Pro Val Ser Lys Met Pro Ala Asp Gly Thr Phe
660 665 670
ccg acg gcg acc tcc cag ttc gaa aag cgg aac atc gcc gtg gag atc 2064
Pro Thr Ala Thr Ser Gln Phe Glu Lys Arg Asn Ile Ala Val Glu Ile
675 680 685
ccc gtg tgg gac gag cag ctc tgc atc cag tgc ggc atc tgc tcc ttc 2112
Pro Val Trp Asp Glu Gln Leu Cys Ile Gln Cys Gly Ile Cys Ser Phe
690 695 700
gtc tgc ccc cac gcc acc atc agg atg aag gcc tat gac gcc tcc gcc 2160
Val Cys Pro His Ala Thr Ile Arg Met Lys Ala Tyr Asp Ala Ser Ala
705 710 715 720
ctt gcc ggc gcc ccg gca gcg ttc aag tcg gtt gac tgc aag att ccc 2208
Leu Ala Gly Ala Pro Ala Ala Phe Lys Ser Val Asp Cys Lys Ile Pro
725 730 735
gag ttc aag ggg cag aag ttc acc atc cag gta gcc ccg gaa gac tgc 2256
Glu Phe Lys Gly Gln Lys Phe Thr Ile Gln Val Ala Pro Glu Asp Cys
740 745 750
acc ggc tgc ggc gcc tgc gtc cac aac tgc ccg gcc aag agt aag gaa 2304
Thr Gly Cys Gly Ala Cys Val His Asn Cys Pro Ala Lys Ser Lys Glu
755 760 765
gac ccg aac cac aag gcc atc aat atg gca tac cag ccg ccc ctg cgt 2352
Asp Pro Asn His Lys Ala Ile Asn Met Ala Tyr Gln Pro Pro Leu Arg
770 775 780
tct caa gag gtc gag aac tgg gac ttc ttc ctc acc atc ccg gac gtg 2400
Ser Gln Glu Val Glu Asn Trp Asp Phe Phe Leu Thr Ile Pro Asp Val
785 790 795 800
gac ccc acc gta gcc aag ctg gac acg gtc cgc ggt tcc cag ttg gtg 2448
Asp Pro Thr Val Ala Lys Leu Asp Thr Val Arg Gly Ser Gln Leu Val
805 810 815
cgg ccg ctg ttc gaa ttc tcc ggc gcc tgc ctc ggc tgc ggc gag acc 2496
Arg Pro Leu Phe Glu Phe Ser Gly Ala Cys Leu Gly Cys Gly Glu Thr
820 825 830
ccg tac ctg aag ctc ctg acc cag ctc ttc ggc gac cgg acc gtc att 2544
Pro Tyr Leu Lys Leu Leu Thr Gln Leu Phe Gly Asp Arg Thr Val Ile
835 840 845
gcc aac gcc acc ggc tgc tcc tcc atc tac ggc gga aac ctg ccc acc 2592
Ala Asn Ala Thr Gly Cys Ser Ser Ile Tyr Gly Gly Asn Leu Pro Thr
850 855 860
acc ccc tat gcc cag cgg gcc gac ggg ctt ggg ccg gca tgg tcg aac 2640
Thr Pro Tyr Ala Gln Arg Ala Asp Gly Leu Gly Pro Ala Trp Ser Asn
865 870 875 880
tcc ctg ttc gag gac aac gcc gag ttc ggc tac ggc atg cgt ctg gcc 2688
Ser Leu Phe Glu Asp Asn Ala Glu Phe Gly Tyr Gly Met Arg Leu Ala
885 890 895
gtg gat aaa ttc aac gcc atg gcc ctt gag ctg gtc gac aag ctt tcg 2736
Val Asp Lys Phe Asn Ala Met Ala Leu Glu Leu Val Asp Lys Leu Ser
900 905 910
tct tcc tgc tcc tgc tct tcc tgc acg agc gcg gtg ccc ctc atg aac 2784
Ser Ser Cys Ser Cys Ser Ser Cys Thr Ser Ala Val Pro Leu Met Asn
915 920 925
gag atc aag ggc gcc gac cag tcg agc cag gcc ggc atc gag gcc cag 2832
Glu Ile Lys Gly Ala Asp Gln Ser Ser Gln Ala Gly Ile Glu Ala Gln
930 935 940
cgg gcc cgg gtg gcg gag ctg aag aag acc ctt gct tcc tgt ccc gag 2880
Arg Ala Arg Val Ala Glu Leu Lys Lys Thr Leu Ala Ser Cys Pro Glu
945 950 955 960
ccg gat gcc aag cgc ctg ctc acc gtg gcc gac tac ctg gtc aag aag 2928
Pro Asp Ala Lys Arg Leu Leu Thr Val Ala Asp Tyr Leu Val Lys Lys
965 970 975
tcc gtc tgg tgc atc ggc ggc gac ggc tgg gcg tac gat atc ggc tac 2976
Ser Val Trp Cys Ile Gly Gly Asp Gly Trp Ala Tyr Asp Ile Gly Tyr
980 985 990
ggc ggc ctc gac cac gtc atc gcc agc ggc aag aac atc aat ctg ctg 3024
Gly Gly Leu Asp His Val Ile Ala Ser Gly Lys Asn Ile Asn Leu Leu
995 1000 1005
gtg ctc gac acc gag gtc tac tcc aac acc ggc ggc cag gct tcc 3069
Val Leu Asp Thr Glu Val Tyr Ser Asn Thr Gly Gly Gln Ala Ser
1010 1015 1020
aag tcg acc ccg ctg ggc gcc gtg gcc cag ttc gcc gcc ggc ggt 3114
Lys Ser Thr Pro Leu Gly Ala Val Ala Gln Phe Ala Ala Gly Gly
1025 1030 1035
aag ccg gtc tcc aag aag gat ctc ggc atg atg gcc atg gcc tac 3159
Lys Pro Val Ser Lys Lys Asp Leu Gly Met Met Ala Met Ala Tyr
1040 1045 1050
ggg tcg gtc tac gtg gcc act gtc tcc ctc gcc aat ccg gcc cag 3204
Gly Ser Val Tyr Val Ala Thr Val Ser Leu Ala Asn Pro Ala Gln
1055 1060 1065
tgc atc aag gcg ttc ctg gag gcc gag gcc tat gac ggt ccg tcg 3249
Cys Ile Lys Ala Phe Leu Glu Ala Glu Ala Tyr Asp Gly Pro Ser
1070 1075 1080
ctc atc atc gcc tat gcc cac tgc atc gcc cac ggc atc gac atg 3294
Leu Ile Ile Ala Tyr Ala His Cys Ile Ala His Gly Ile Asp Met
1085 1090 1095
acc agc ggc gtg gat gcc cag aag cgg gcg gtt cag tcc ggt tac 3339
Thr Ser Gly Val Asp Ala Gln Lys Arg Ala Val Gln Ser Gly Tyr
1100 1105 1110
tgg ccc ctc tac cgc tat aat ccg cag ctg gcc gcc gag tgc aag 3384
Trp Pro Leu Tyr Arg Tyr Asn Pro Gln Leu Ala Ala Glu Cys Lys
1115 1120 1125
aac ccg ctc cag ctc gac agc aag gcc ccg acc atc gcc ttt gaa 3429
Asn Pro Leu Gln Leu Asp Ser Lys Ala Pro Thr Ile Ala Phe Glu
1130 1135 1140
gag tac gtc aac agc gag aac cgc tac cgc gtc ctc aag aag aac 3474
Glu Tyr Val Asn Ser Glu Asn Arg Tyr Arg Val Leu Lys Lys Asn
1145 1150 1155
aac ccg aaa ggg tac gag gat ctc atg aga aaa gcg gcg gca tgg 3519
Asn Pro Lys Gly Tyr Glu Asp Leu Met Arg Lys Ala Ala Ala Trp
1160 1165 1170
tcc aag gcc cac ttc agc tac tac cag aag ctg gcg gcc ctc aac 3564
Ser Lys Ala His Phe Ser Tyr Tyr Gln Lys Leu Ala Ala Leu Asn
1175 1180 1185
ttc gag gat acc tgc gag aag tag 3588
Phe Glu Asp Thr Cys Glu Lys
1190 1195
<210> 255
<211> 1195
<212> PRT
<213> Geobacter sulfurreducens
<400> 255
Met Ser Arg Lys Met Val Thr Ile Asp Gly Asn Thr Ala Ala Ala His
1 5 10 15
Val Ala His Ala Thr Asn Glu Val Ile Ala Ile Tyr Pro Ile Thr Pro
20 25 30
Ser Ser Val Met Gly Glu Ile Ser Asp Ile Lys Ser Ala Met Gly Glu
35 40 45
Lys Asn Ile Trp Gly Thr Val Pro Ser Val Val Glu Met Gln Ser Glu
50 55 60
Gly Gly Ala Ala Gly Ala Val His Gly Ala Leu Gln Ala Gly Ala Leu
65 70 75 80
Thr Thr Thr Phe Thr Ala Ser Gln Gly Leu Leu Leu Met Ile Pro Asn
85 90 95
Met Phe Lys Ile Ala Gly Glu Leu Thr Ser Thr Val Phe His Val Ser
100 105 110
Ala Arg Ala Ile Ala Ala Gln Ala Leu Ser Ile Phe Gly Asp His Ser
115 120 125
Asp Val Met Ser Cys Arg Ser Thr Gly Trp Ala Met Leu Cys Ser Asn
130 135 140
Asn Ser Gln Glu Val Met Asp Phe Ala Leu Ile Ala Gln Ser Ala Thr
145 150 155 160
Leu Arg Ser Arg Val Pro Phe Leu His Phe Phe Asp Gly Phe Arg Thr
165 170 175
Ser His Glu Val Leu Lys Val Glu Glu Leu Thr Phe Asp Asp Met Arg
180 185 190
Ala Met Leu Asp Asp Glu Leu Ile Ala Ala His Lys Ala Arg Gly Leu
195 200 205
Ser Pro Asp His Pro Val Met Arg Gly Thr Ala Gln Asn Pro Asp Val
210 215 220
Tyr Phe Gln Gly Arg Glu Thr Val Asn Pro Phe Tyr Pro Lys Cys Ile
225 230 235 240
Glu Ile Val Ala Glu Glu Met Asp Lys Phe Ala Lys Ile Thr Gly Arg
245 250 255
Gln Tyr Lys Leu Val Asp Tyr Val Gly Ala Pro Asp Ala Asp Arg Val
260 265 270
Ile Val Ile Met Gly Ser Gly Ala Asp Thr Val Gln Glu Thr Val Glu
275 280 285
His Leu Asn Thr Lys Gly Glu Lys Ile Gly Val Val Lys Val His Leu
290 295 300
Tyr Arg Pro Phe Pro Ile Asp Ala Phe Ile Ala Ala Leu Pro Lys Thr
305 310 315 320
Val Lys Lys Ile Ala Val Leu Asp Arg Thr Lys Glu Pro Gly Ala Leu
325 330 335
Gly Glu Pro Leu Tyr Leu Asp Val Arg Thr Ala Ile Gly Glu Ala Met
340 345 350
Ala Asp Gly Lys Cys Gln Phe Asp Gly Tyr Pro Val Ile Val Gly Gly
355 360 365
Arg Tyr Gly Leu Gly Ser Lys Glu Phe Thr Pro Ala Gln Ala Lys Ala
370 375 380
Val Phe Asp Asn Leu Ala Thr Ala Lys Pro Gln Asn Lys Phe Val Val
385 390 395 400
Gly Ile Thr Glu Asp Val Thr Asn Ser Ser Leu Pro Cys Asp Pro Ser
405 410 415
Phe Phe Asn Pro Met Glu Gly Ala Tyr Gln Ala Met Phe Phe Gly Leu
420 425 430
Gly Ser Asp Gly Thr Val Gly Ala Asn Lys Asn Ser Ile Lys Ile Ile
435 440 445
Gly Glu Met Thr Asp Asn Asn Ala Gln Ala Tyr Phe Val Tyr Asp Ser
450 455 460
Lys Lys Ala Gly Ser Met Thr Thr Ser His Leu Arg Phe Gly Lys Lys
465 470 475 480
Tyr Ile Arg Ala Pro Tyr Leu Val Gln Glu Ala Asp Phe Val Ala Cys
485 490 495
His Asn Phe Ala Phe Val Glu Lys Tyr Asp Met Leu Ala Lys Ala Lys
500 505 510
Gln Gly Ala Thr Phe Leu Leu Asn Ala Pro Tyr Asp His Asn Glu Val
515 520 525
Trp Asp Arg Leu Pro Ala Asp Met Gln Gln Gln Ile Ile Asp Lys Lys
530 535 540
Leu Lys Phe Phe Val Ile Asp Gly Val Arg Leu Gly Asn Glu Ile Gly
545 550 555 560
Leu Gly Pro Arg Ile Asn Val Ile Met Gln Thr Ala Phe Phe Lys Ile
565 570 575
Ser Asn Ile Ile Pro Leu Asp Gln Ala Ile Asp Glu Ile Lys Asp Ala
580 585 590
Ile Lys Lys Thr Tyr Gly Lys Ala Gly Glu Lys Val Val Glu Met Asn
595 600 605
Tyr Lys Ala Val Glu Ala Gly Leu Asn Asn Phe Tyr Glu Val Thr Val
610 615 620
Pro Ala Thr Ala Thr Ser Thr Leu Gln Lys Pro Pro Val Val Ser Ala
625 630 635 640
Arg Ala Pro Gln Phe Val Gln Glu Thr Thr Ala Pro Ile Ile Ala Gly
645 650 655
Leu Gly Asp Asp Leu Pro Val Ser Lys Met Pro Ala Asp Gly Thr Phe
660 665 670
Pro Thr Ala Thr Ser Gln Phe Glu Lys Arg Asn Ile Ala Val Glu Ile
675 680 685
Pro Val Trp Asp Glu Gln Leu Cys Ile Gln Cys Gly Ile Cys Ser Phe
690 695 700
Val Cys Pro His Ala Thr Ile Arg Met Lys Ala Tyr Asp Ala Ser Ala
705 710 715 720
Leu Ala Gly Ala Pro Ala Ala Phe Lys Ser Val Asp Cys Lys Ile Pro
725 730 735
Glu Phe Lys Gly Gln Lys Phe Thr Ile Gln Val Ala Pro Glu Asp Cys
740 745 750
Thr Gly Cys Gly Ala Cys Val His Asn Cys Pro Ala Lys Ser Lys Glu
755 760 765
Asp Pro Asn His Lys Ala Ile Asn Met Ala Tyr Gln Pro Pro Leu Arg
770 775 780
Ser Gln Glu Val Glu Asn Trp Asp Phe Phe Leu Thr Ile Pro Asp Val
785 790 795 800
Asp Pro Thr Val Ala Lys Leu Asp Thr Val Arg Gly Ser Gln Leu Val
805 810 815
Arg Pro Leu Phe Glu Phe Ser Gly Ala Cys Leu Gly Cys Gly Glu Thr
820 825 830
Pro Tyr Leu Lys Leu Leu Thr Gln Leu Phe Gly Asp Arg Thr Val Ile
835 840 845
Ala Asn Ala Thr Gly Cys Ser Ser Ile Tyr Gly Gly Asn Leu Pro Thr
850 855 860
Thr Pro Tyr Ala Gln Arg Ala Asp Gly Leu Gly Pro Ala Trp Ser Asn
865 870 875 880
Ser Leu Phe Glu Asp Asn Ala Glu Phe Gly Tyr Gly Met Arg Leu Ala
885 890 895
Val Asp Lys Phe Asn Ala Met Ala Leu Glu Leu Val Asp Lys Leu Ser
900 905 910
Ser Ser Cys Ser Cys Ser Ser Cys Thr Ser Ala Val Pro Leu Met Asn
915 920 925
Glu Ile Lys Gly Ala Asp Gln Ser Ser Gln Ala Gly Ile Glu Ala Gln
930 935 940
Arg Ala Arg Val Ala Glu Leu Lys Lys Thr Leu Ala Ser Cys Pro Glu
945 950 955 960
Pro Asp Ala Lys Arg Leu Leu Thr Val Ala Asp Tyr Leu Val Lys Lys
965 970 975
Ser Val Trp Cys Ile Gly Gly Asp Gly Trp Ala Tyr Asp Ile Gly Tyr
980 985 990
Gly Gly Leu Asp His Val Ile Ala Ser Gly Lys Asn Ile Asn Leu Leu
995 1000 1005
Val Leu Asp Thr Glu Val Tyr Ser Asn Thr Gly Gly Gln Ala Ser
1010 1015 1020
Lys Ser Thr Pro Leu Gly Ala Val Ala Gln Phe Ala Ala Gly Gly
1025 1030 1035
Lys Pro Val Ser Lys Lys Asp Leu Gly Met Met Ala Met Ala Tyr
1040 1045 1050
Gly Ser Val Tyr Val Ala Thr Val Ser Leu Ala Asn Pro Ala Gln
1055 1060 1065
Cys Ile Lys Ala Phe Leu Glu Ala Glu Ala Tyr Asp Gly Pro Ser
1070 1075 1080
Leu Ile Ile Ala Tyr Ala His Cys Ile Ala His Gly Ile Asp Met
1085 1090 1095
Thr Ser Gly Val Asp Ala Gln Lys Arg Ala Val Gln Ser Gly Tyr
1100 1105 1110
Trp Pro Leu Tyr Arg Tyr Asn Pro Gln Leu Ala Ala Glu Cys Lys
1115 1120 1125
Asn Pro Leu Gln Leu Asp Ser Lys Ala Pro Thr Ile Ala Phe Glu
1130 1135 1140
Glu Tyr Val Asn Ser Glu Asn Arg Tyr Arg Val Leu Lys Lys Asn
1145 1150 1155
Asn Pro Lys Gly Tyr Glu Asp Leu Met Arg Lys Ala Ala Ala Trp
1160 1165 1170
Ser Lys Ala His Phe Ser Tyr Tyr Gln Lys Leu Ala Ala Leu Asn
1175 1180 1185
Phe Glu Asp Thr Cys Glu Lys
1190 1195
<210> 256
<211> 1956
<212> DNA
<213> Streptomyces pratensis
<220>
<221> CDS
<222> (1)..(1956)
<223> Sfla_2592 gene from Streptomyces pratensis ATCC 33331encoding
pyruvate-flavodoxin/ferredoxin oxidoreductase
<400> 256
atg acc agc cag gtc agt agc cca gcc gga aag tcc gat gag gcc agc 48
Met Thr Ser Gln Val Ser Ser Pro Ala Gly Lys Ser Asp Glu Ala Ser
1 5 10 15
gag gct gtc gtc ggg gaa cag cgc gcc ccg cac atc gcc ggt gcg ggt 96
Glu Ala Val Val Gly Glu Gln Arg Ala Pro His Ile Ala Gly Ala Gly
20 25 30
ggc acg gag aag gaa atc cgc cgt ctg gac cgg gtg atc atc cgt ttc 144
Gly Thr Glu Lys Glu Ile Arg Arg Leu Asp Arg Val Ile Ile Arg Phe
35 40 45
gcg ggt gac tcg ggt gac ggt atg cag ttg acg ggc gac cgt ttc acg 192
Ala Gly Asp Ser Gly Asp Gly Met Gln Leu Thr Gly Asp Arg Phe Thr
50 55 60
tcg gag acg gcg tcg ttc ggg aac gac ctg tcg aca ctg ccc aac ttc 240
Ser Glu Thr Ala Ser Phe Gly Asn Asp Leu Ser Thr Leu Pro Asn Phe
65 70 75 80
ccg gcc gag atc cgg gca ccc gcc ggc acc ctg ccc ggg gtg tcg tcg 288
Pro Ala Glu Ile Arg Ala Pro Ala Gly Thr Leu Pro Gly Val Ser Ser
85 90 95
ttc cag ctg cat ttc gcg gac cac gac atc ctg aca ccg ggc gac gcg 336
Phe Gln Leu His Phe Ala Asp His Asp Ile Leu Thr Pro Gly Asp Ala
100 105 110
ccg aac gtc ctg gtc gcg atg aat ccc gcc gcg ctg aag gcg aat atc 384
Pro Asn Val Leu Val Ala Met Asn Pro Ala Ala Leu Lys Ala Asn Ile
115 120 125
gcc gat gtg ccg cgc ggg gcc gac atc atc gtg aac acg gac gag ttc 432
Ala Asp Val Pro Arg Gly Ala Asp Ile Ile Val Asn Thr Asp Glu Phe
130 135 140
acg aag cgc ccg atg gcg aaa gtc gga tat gcg gaa tcc cct ttg gag 480
Thr Lys Arg Pro Met Ala Lys Val Gly Tyr Ala Glu Ser Pro Leu Glu
145 150 155 160
gac ggt tcc ctc gag gcg tac aac gtg cat ccg gtg ccg ttg acg acg 528
Asp Gly Ser Leu Glu Ala Tyr Asn Val His Pro Val Pro Leu Thr Thr
165 170 175
ttg acg atc gag gct ttg aag gag ttc ggg ctt tcc cgc aag gag gcc 576
Leu Thr Ile Glu Ala Leu Lys Glu Phe Gly Leu Ser Arg Lys Glu Ala
180 185 190
gag cgg tcg aag aac atg ttc gcg ctc ggg ctt ctg tcc tgg atg tac 624
Glu Arg Ser Lys Asn Met Phe Ala Leu Gly Leu Leu Ser Trp Met Tyr
195 200 205
aac cgt ccg acc gag ggt acg gag aag ttc ctg cgg tcg aag ttc gcc 672
Asn Arg Pro Thr Glu Gly Thr Glu Lys Phe Leu Arg Ser Lys Phe Ala
210 215 220
agg aag ccg gag atc gcc gag gcc aat gtg gcg gct ttc cgc gcg ggc 720
Arg Lys Pro Glu Ile Ala Glu Ala Asn Val Ala Ala Phe Arg Ala Gly
225 230 235 240
tgg aat ttc ggt gag acg acg gag gat ttc gct gtc tcc tac gag gtc 768
Trp Asn Phe Gly Glu Thr Thr Glu Asp Phe Ala Val Ser Tyr Glu Val
245 250 255
gca ccg gcg tca cag gat ttc ccg acg ggc acc tac cgc aat atc tcc 816
Ala Pro Ala Ser Gln Asp Phe Pro Thr Gly Thr Tyr Arg Asn Ile Ser
260 265 270
ggg aat ctc gca ctg tcg tac ggg ctg atc gcg gcg gga cgg cag gcc 864
Gly Asn Leu Ala Leu Ser Tyr Gly Leu Ile Ala Ala Gly Arg Gln Ala
275 280 285
gat ctg ccg gtg tat ctc ggc tcg tat ccg atc act ccg gcg tcc gac 912
Asp Leu Pro Val Tyr Leu Gly Ser Tyr Pro Ile Thr Pro Ala Ser Asp
290 295 300
atc ctg cac gag ctc agc aag cac aag aac ttc ggt gtg cgg acc ttc 960
Ile Leu His Glu Leu Ser Lys His Lys Asn Phe Gly Val Arg Thr Phe
305 310 315 320
cag gcg gag gac gag atc gcc ggg atc ggt gcg gcc ctg ggc gcg tcg 1008
Gln Ala Glu Asp Glu Ile Ala Gly Ile Gly Ala Ala Leu Gly Ala Ser
325 330 335
ttc ggc ggt tca ctg ggt gtg acg acg acg tcg ggc ccg ggt gtg gcg 1056
Phe Gly Gly Ser Leu Gly Val Thr Thr Thr Ser Gly Pro Gly Val Ala
340 345 350
ctg aag tcg gag acg atc ggc ctg gcg gtg tca ctg gaa ctg ccg ctg 1104
Leu Lys Ser Glu Thr Ile Gly Leu Ala Val Ser Leu Glu Leu Pro Leu
355 360 365
ctg atc atc gac atc cag cgc ggc ggc ccc tcc acc ggc ctg ccg acc 1152
Leu Ile Ile Asp Ile Gln Arg Gly Gly Pro Ser Thr Gly Leu Pro Thr
370 375 380
aag acc gag cag gcc gac ctg ctc cag gcc atg tac ggg cgc aac ggc 1200
Lys Thr Glu Gln Ala Asp Leu Leu Gln Ala Met Tyr Gly Arg Asn Gly
385 390 395 400
gag gcc ccg gtc ccg atc gtg gca ccg agg act ccg gcg gac tgc ttc 1248
Glu Ala Pro Val Pro Ile Val Ala Pro Arg Thr Pro Ala Asp Cys Phe
405 410 415
gac gcc gcc ctg gac gcg gcg cgg atc gcg ctg acc tac cgc acc ccg 1296
Asp Ala Ala Leu Asp Ala Ala Arg Ile Ala Leu Thr Tyr Arg Thr Pro
420 425 430
gtc ttc ctg ctc tcg gac ggg tac ctc gcg aac ggc tcc gag ccg tgg 1344
Val Phe Leu Leu Ser Asp Gly Tyr Leu Ala Asn Gly Ser Glu Pro Trp
435 440 445
cgg atc ccc gag gcc gac agc ctc ccc gac ctg cgg aca cgg ttc gcg 1392
Arg Ile Pro Glu Ala Asp Ser Leu Pro Asp Leu Arg Thr Arg Phe Ala
450 455 460
acc ggc ccg aat cac gaa ctc gcg gac ggc acc gag gtg ttc tgg ccc 1440
Thr Gly Pro Asn His Glu Leu Ala Asp Gly Thr Glu Val Phe Trp Pro
465 470 475 480
tac aag agg gac ccc gag acg ctg gcc cgc ccg tgg gcg gtg ccc ggc 1488
Tyr Lys Arg Asp Pro Glu Thr Leu Ala Arg Pro Trp Ala Val Pro Gly
485 490 495
acc ccg ggt ctg gag cac cgg atc ggc ggg atc gag aag cag gac ggc 1536
Thr Pro Gly Leu Glu His Arg Ile Gly Gly Ile Glu Lys Gln Asp Gly
500 505 510
acg ggg aac atc tcc tac gat ccg gcc aac cac gac ttc atg gtc cgc 1584
Thr Gly Asn Ile Ser Tyr Asp Pro Ala Asn His Asp Phe Met Val Arg
515 520 525
acc cgc cag gcc aag atc gac ggc atc cgg gtc ccc gac ctg gag gtc 1632
Thr Arg Gln Ala Lys Ile Asp Gly Ile Arg Val Pro Asp Leu Glu Val
530 535 540
gac gac ccg gcc ggc gcg acg acc ctg gtc ctg ggc tgg ggt tcg acg 1680
Asp Asp Pro Ala Gly Ala Thr Thr Leu Val Leu Gly Trp Gly Ser Thr
545 550 555 560
tac ggg ccg atc acc gcc gcc gtg cgc cgt ctc cgc gcg gcc ggc gag 1728
Tyr Gly Pro Ile Thr Ala Ala Val Arg Arg Leu Arg Ala Ala Gly Glu
565 570 575
acg atc gca cag gca cat ctg cgc cac ctc aat ccc ttc ccc gcc aat 1776
Thr Ile Ala Gln Ala His Leu Arg His Leu Asn Pro Phe Pro Ala Asn
580 585 590
ctc ggt gag gta ctg cgg cgc tac gac aag gtc gtc gtc ccc gag atg 1824
Leu Gly Glu Val Leu Arg Arg Tyr Asp Lys Val Val Val Pro Glu Met
595 600 605
aac ctc ggt cag ctc gcc ctg ctg ctg aga gcc aag tac ctc gtg gac 1872
Asn Leu Gly Gln Leu Ala Leu Leu Leu Arg Ala Lys Tyr Leu Val Asp
610 615 620
gcg cag agt ttc aac cag gtc aac gga atg ccc ttc aag gcg gag cag 1920
Ala Gln Ser Phe Asn Gln Val Asn Gly Met Pro Phe Lys Ala Glu Gln
625 630 635 640
ctc gcc aca gcc ctc aag gag gcc atc gat gcc tga 1956
Leu Ala Thr Ala Leu Lys Glu Ala Ile Asp Ala
645 650
<210> 257
<211> 651
<212> PRT
<213> Streptomyces pratensis
<400> 257
Met Thr Ser Gln Val Ser Ser Pro Ala Gly Lys Ser Asp Glu Ala Ser
1 5 10 15
Glu Ala Val Val Gly Glu Gln Arg Ala Pro His Ile Ala Gly Ala Gly
20 25 30
Gly Thr Glu Lys Glu Ile Arg Arg Leu Asp Arg Val Ile Ile Arg Phe
35 40 45
Ala Gly Asp Ser Gly Asp Gly Met Gln Leu Thr Gly Asp Arg Phe Thr
50 55 60
Ser Glu Thr Ala Ser Phe Gly Asn Asp Leu Ser Thr Leu Pro Asn Phe
65 70 75 80
Pro Ala Glu Ile Arg Ala Pro Ala Gly Thr Leu Pro Gly Val Ser Ser
85 90 95
Phe Gln Leu His Phe Ala Asp His Asp Ile Leu Thr Pro Gly Asp Ala
100 105 110
Pro Asn Val Leu Val Ala Met Asn Pro Ala Ala Leu Lys Ala Asn Ile
115 120 125
Ala Asp Val Pro Arg Gly Ala Asp Ile Ile Val Asn Thr Asp Glu Phe
130 135 140
Thr Lys Arg Pro Met Ala Lys Val Gly Tyr Ala Glu Ser Pro Leu Glu
145 150 155 160
Asp Gly Ser Leu Glu Ala Tyr Asn Val His Pro Val Pro Leu Thr Thr
165 170 175
Leu Thr Ile Glu Ala Leu Lys Glu Phe Gly Leu Ser Arg Lys Glu Ala
180 185 190
Glu Arg Ser Lys Asn Met Phe Ala Leu Gly Leu Leu Ser Trp Met Tyr
195 200 205
Asn Arg Pro Thr Glu Gly Thr Glu Lys Phe Leu Arg Ser Lys Phe Ala
210 215 220
Arg Lys Pro Glu Ile Ala Glu Ala Asn Val Ala Ala Phe Arg Ala Gly
225 230 235 240
Trp Asn Phe Gly Glu Thr Thr Glu Asp Phe Ala Val Ser Tyr Glu Val
245 250 255
Ala Pro Ala Ser Gln Asp Phe Pro Thr Gly Thr Tyr Arg Asn Ile Ser
260 265 270
Gly Asn Leu Ala Leu Ser Tyr Gly Leu Ile Ala Ala Gly Arg Gln Ala
275 280 285
Asp Leu Pro Val Tyr Leu Gly Ser Tyr Pro Ile Thr Pro Ala Ser Asp
290 295 300
Ile Leu His Glu Leu Ser Lys His Lys Asn Phe Gly Val Arg Thr Phe
305 310 315 320
Gln Ala Glu Asp Glu Ile Ala Gly Ile Gly Ala Ala Leu Gly Ala Ser
325 330 335
Phe Gly Gly Ser Leu Gly Val Thr Thr Thr Ser Gly Pro Gly Val Ala
340 345 350
Leu Lys Ser Glu Thr Ile Gly Leu Ala Val Ser Leu Glu Leu Pro Leu
355 360 365
Leu Ile Ile Asp Ile Gln Arg Gly Gly Pro Ser Thr Gly Leu Pro Thr
370 375 380
Lys Thr Glu Gln Ala Asp Leu Leu Gln Ala Met Tyr Gly Arg Asn Gly
385 390 395 400
Glu Ala Pro Val Pro Ile Val Ala Pro Arg Thr Pro Ala Asp Cys Phe
405 410 415
Asp Ala Ala Leu Asp Ala Ala Arg Ile Ala Leu Thr Tyr Arg Thr Pro
420 425 430
Val Phe Leu Leu Ser Asp Gly Tyr Leu Ala Asn Gly Ser Glu Pro Trp
435 440 445
Arg Ile Pro Glu Ala Asp Ser Leu Pro Asp Leu Arg Thr Arg Phe Ala
450 455 460
Thr Gly Pro Asn His Glu Leu Ala Asp Gly Thr Glu Val Phe Trp Pro
465 470 475 480
Tyr Lys Arg Asp Pro Glu Thr Leu Ala Arg Pro Trp Ala Val Pro Gly
485 490 495
Thr Pro Gly Leu Glu His Arg Ile Gly Gly Ile Glu Lys Gln Asp Gly
500 505 510
Thr Gly Asn Ile Ser Tyr Asp Pro Ala Asn His Asp Phe Met Val Arg
515 520 525
Thr Arg Gln Ala Lys Ile Asp Gly Ile Arg Val Pro Asp Leu Glu Val
530 535 540
Asp Asp Pro Ala Gly Ala Thr Thr Leu Val Leu Gly Trp Gly Ser Thr
545 550 555 560
Tyr Gly Pro Ile Thr Ala Ala Val Arg Arg Leu Arg Ala Ala Gly Glu
565 570 575
Thr Ile Ala Gln Ala His Leu Arg His Leu Asn Pro Phe Pro Ala Asn
580 585 590
Leu Gly Glu Val Leu Arg Arg Tyr Asp Lys Val Val Val Pro Glu Met
595 600 605
Asn Leu Gly Gln Leu Ala Leu Leu Leu Arg Ala Lys Tyr Leu Val Asp
610 615 620
Ala Gln Ser Phe Asn Gln Val Asn Gly Met Pro Phe Lys Ala Glu Gln
625 630 635 640
Leu Ala Thr Ala Leu Lys Glu Ala Ile Asp Ala
645 650
<210> 258
<211> 3768
<212> DNA
<213> Propionibacterium freudenreichii
<220>
<221> CDS
<222> (1)..(3768)
<223> RM25_0186 gene from Propionibacterium freudenreichii DSM
20271encoding pyruvate-flavodoxin/ferredoxin oxidoreductase
<400> 258
atg act aca act acc cgt ggg ccg gtt ccc ggc tcg aat ggc atg ccc 48
Met Thr Thr Thr Thr Arg Gly Pro Val Pro Gly Ser Asn Gly Met Pro
1 5 10 15
gcc aat cca ggt ctg agc ggc gag gcc gcc acc gca acc ccg tca ccc 96
Ala Asn Pro Gly Leu Ser Gly Glu Ala Ala Thr Ala Thr Pro Ser Pro
20 25 30
gtt gac gtc gct gcc ggc gcc aag gac gct gcc gat gag ctg gcc cag 144
Val Asp Val Ala Ala Gly Ala Lys Asp Ala Ala Asp Glu Leu Ala Gln
35 40 45
tca cga cgc gag cag gac atc acc cat cag atg atc tgc gac ggc aac 192
Ser Arg Arg Glu Gln Asp Ile Thr His Gln Met Ile Cys Asp Gly Asn
50 55 60
acc gcc gcc tct gat gtg gcc ttc cgc atc aat gag ctg tgc tcg atc 240
Thr Ala Ala Ser Asp Val Ala Phe Arg Ile Asn Glu Leu Cys Ser Ile
65 70 75 80
tac ccg atc acg ccg agc tcc ccg atg gcc gaa ctg gcc gac gag tgg 288
Tyr Pro Ile Thr Pro Ser Ser Pro Met Ala Glu Leu Ala Asp Glu Trp
85 90 95
agt gcc cgc gac cgc atg aac atc tgg ggc cag gtg ccc cat gtg atg 336
Ser Ala Arg Asp Arg Met Asn Ile Trp Gly Gln Val Pro His Val Met
100 105 110
gag atg cag tcg gag gcc ggc gcg gcc ggt gcc atg cac ggc tcc ctg 384
Glu Met Gln Ser Glu Ala Gly Ala Ala Gly Ala Met His Gly Ser Leu
115 120 125
cag ggc ggc gcc ctg gcg acc acc ttc acg gcg tcg cag ggc ctg ctg 432
Gln Gly Gly Ala Leu Ala Thr Thr Phe Thr Ala Ser Gln Gly Leu Leu
130 135 140
ctg atg atc ccg aac atg tac aag atc gcc ggt gag ctc acc tcc acg 480
Leu Met Ile Pro Asn Met Tyr Lys Ile Ala Gly Glu Leu Thr Ser Thr
145 150 155 160
gtg atg cac gtc gcc gcg cgc tcg ctg gcc acc cag ggc ctg tcg atc 528
Val Met His Val Ala Ala Arg Ser Leu Ala Thr Gln Gly Leu Ser Ile
165 170 175
ttc ggt gat cac cag gac gtg atg gcc tgt cgc cag acc ggt tgg gcg 576
Phe Gly Asp His Gln Asp Val Met Ala Cys Arg Gln Thr Gly Trp Ala
180 185 190
atg ctg tgc tcc acc ggc gtg cag cag tgc cat gac aat gcc ctg atc 624
Met Leu Cys Ser Thr Gly Val Gln Gln Cys His Asp Asn Ala Leu Ile
195 200 205
tcc cag gtc gcc acg ctg cgt tcg cgc gtg ccg ttc atg cac ttc ttc 672
Ser Gln Val Ala Thr Leu Arg Ser Arg Val Pro Phe Met His Phe Phe
210 215 220
gac ggc ttc cgc acc agc cat gag ctc aac acc tgc atc cag ctc acc 720
Asp Gly Phe Arg Thr Ser His Glu Leu Asn Thr Cys Ile Gln Leu Thr
225 230 235 240
gac gac cag ctg cgt tcg atg gtg ccc gat gcg ctc gtg cgc gag cac 768
Asp Asp Gln Leu Arg Ser Met Val Pro Asp Ala Leu Val Arg Glu His
245 250 255
cgc gag cgg gcc ctg tcg ccc gac aac ccg ttc atc cgt ggc acc gcc 816
Arg Glu Arg Ala Leu Ser Pro Asp Asn Pro Phe Ile Arg Gly Thr Ala
260 265 270
cag aac gcc gac gtg tac ttc cag ggc cgc gag gcc ggc aac aag tac 864
Gln Asn Ala Asp Val Tyr Phe Gln Gly Arg Glu Ala Gly Asn Lys Tyr
275 280 285
tac gac tcg gtt ccg ggc atc gtg cag gac gcg atg gac gag ttc gcc 912
Tyr Asp Ser Val Pro Gly Ile Val Gln Asp Ala Met Asp Glu Phe Ala
290 295 300
gcc atg acc ggc cgc cag tac cac ctg gcc gac tac tac ggc gcg ccc 960
Ala Met Thr Gly Arg Gln Tyr His Leu Ala Asp Tyr Tyr Gly Ala Pro
305 310 315 320
gac gcc gat cgc gtc atc gtg atc atg ggc tcg ggt gcc gag acc gtg 1008
Asp Ala Asp Arg Val Ile Val Ile Met Gly Ser Gly Ala Glu Thr Val
325 330 335
cag cag acc gtc agc aag ctc aat gag cag ggc gag aag gtc ggc ctg 1056
Gln Gln Thr Val Ser Lys Leu Asn Glu Gln Gly Glu Lys Val Gly Leu
340 345 350
gtg gtc atc cgc ctg tac cgt ccg ttc ccg acg cag gcc gtg ctg gac 1104
Val Val Ile Arg Leu Tyr Arg Pro Phe Pro Thr Gln Ala Val Leu Asp
355 360 365
tgc att ccc gca tcg gtc aag aag atc gcc gtg ctc gac cgc acc aag 1152
Cys Ile Pro Ala Ser Val Lys Lys Ile Ala Val Leu Asp Arg Thr Lys
370 375 380
gag ccg ggc tcc aac ggt gag ccc ctg ttc ctc gac gtg gtc tcg gca 1200
Glu Pro Gly Ser Asn Gly Glu Pro Leu Phe Leu Asp Val Val Ser Ala
385 390 395 400
gtc tcc gag gcc tat tcg aac ggc gag cgc gac aac ctg ccc gcc atc 1248
Val Ser Glu Ala Tyr Ser Asn Gly Glu Arg Asp Asn Leu Pro Ala Ile
405 410 415
atc ggt ggc cgc tac ggc ctg tcg agc aag gag ttc acg ccg ggc atg 1296
Ile Gly Gly Arg Tyr Gly Leu Ser Ser Lys Glu Phe Thr Pro Gly Met
420 425 430
tgc gcc gcc gtg tac gac gag ctc gcc aag gac aag ccg aag cgt cgc 1344
Cys Ala Ala Val Tyr Asp Glu Leu Ala Lys Asp Lys Pro Lys Arg Arg
435 440 445
ttc acc gtc ggc atc acc gac gat gtg acg cac ctg tcg atc ccg tgg 1392
Phe Thr Val Gly Ile Thr Asp Asp Val Thr His Leu Ser Ile Pro Trp
450 455 460
gac gcc tcg ctc gac ctg gag gac ccc gag acc tcg cgc gca gtg ttc 1440
Asp Ala Ser Leu Asp Leu Glu Asp Pro Glu Thr Ser Arg Ala Val Phe
465 470 475 480
tac ggc atc ggt gct gac ggc acc gtc ggc gcc aac aag aac acc atc 1488
Tyr Gly Ile Gly Ala Asp Gly Thr Val Gly Ala Asn Lys Asn Thr Ile
485 490 495
aag atc ctc ggc tcc gag ccg ggc acc tac gcg cag ggc tac ttc gtc 1536
Lys Ile Leu Gly Ser Glu Pro Gly Thr Tyr Ala Gln Gly Tyr Phe Val
500 505 510
tac gac tcg aag aag tcc ggc ggc cgc acc acc tcg cac ctt cgc ttc 1584
Tyr Asp Ser Lys Lys Ser Gly Gly Arg Thr Thr Ser His Leu Arg Phe
515 520 525
gga ccc gat ccg atc aag gcc ccc tac ctg gtg aac cag gcc ggc ttc 1632
Gly Pro Asp Pro Ile Lys Ala Pro Tyr Leu Val Asn Gln Ala Gly Phe
530 535 540
atc ggc gtg cac cac tgg gcc gac ctt gag cgc atc gac gtg ctg gcg 1680
Ile Gly Val His His Trp Ala Asp Leu Glu Arg Ile Asp Val Leu Ala
545 550 555 560
ttc gcc cgc aag ggc acc acg gtg ctg atc aac agc ccg tac ccc gcc 1728
Phe Ala Arg Lys Gly Thr Thr Val Leu Ile Asn Ser Pro Tyr Pro Ala
565 570 575
gag gac gtc tgg ggc cat ctg ccg gcc ccg atg cag aag aag atc atc 1776
Glu Asp Val Trp Gly His Leu Pro Ala Pro Met Gln Lys Lys Ile Ile
580 585 590
gac ctc gac ctg cag gtg tat gcg atc gac gcc ggt gag gtg gcc cgt 1824
Asp Leu Asp Leu Gln Val Tyr Ala Ile Asp Ala Gly Glu Val Ala Arg
595 600 605
tcg gtg ggc ctg ggc aac cgc acc aac acg gtg ctg cag acc tgc tac 1872
Ser Val Gly Leu Gly Asn Arg Thr Asn Thr Val Leu Gln Thr Cys Tyr
610 615 620
ttc aag atc agt ggc gtg ctt ccc gag gac cac gcg atc gag gcc atc 1920
Phe Lys Ile Ser Gly Val Leu Pro Glu Asp His Ala Ile Glu Ala Ile
625 630 635 640
aag aac tcg atc acc aag acc tac gcg aag aag tcg atg gag atc gtg 1968
Lys Asn Ser Ile Thr Lys Thr Tyr Ala Lys Lys Ser Met Glu Ile Val
645 650 655
gag aag aac cac gcc gcc gtc gac gcc gcc ctg gag cac ctg cac aag 2016
Glu Lys Asn His Ala Ala Val Asp Ala Ala Leu Glu His Leu His Lys
660 665 670
atc gac gtg ccg gcc aag gtc acc tcc acc gag gac tac ctg ccg ccc 2064
Ile Asp Val Pro Ala Lys Val Thr Ser Thr Glu Asp Tyr Leu Pro Pro
675 680 685
gtg ccg tcg ttc gcg cct gac ttc gtc aag gac gtc acc gcg gcc atg 2112
Val Pro Ser Phe Ala Pro Asp Phe Val Lys Asp Val Thr Ala Ala Met
690 695 700
atg acc gag cag ggc gag tcg ctg ccg gtg agc aag ctg ccg gcc gat 2160
Met Thr Glu Gln Gly Glu Ser Leu Pro Val Ser Lys Leu Pro Ala Asp
705 710 715 720
ggt tcg ttc ccc tcg ggc acc acg cag tac gag aag cgc aat gtg tcc 2208
Gly Ser Phe Pro Ser Gly Thr Thr Gln Tyr Glu Lys Arg Asn Val Ser
725 730 735
gag atc atc gcg gtc tgg gac cag gac aac tgc atc cag tgc ggc aac 2256
Glu Ile Ile Ala Val Trp Asp Gln Asp Asn Cys Ile Gln Cys Gly Asn
740 745 750
tgc gcc ttc gtc tgc ccg cac ggc gtg ctg agg gcc aag tac tac aag 2304
Cys Ala Phe Val Cys Pro His Gly Val Leu Arg Ala Lys Tyr Tyr Lys
755 760 765
ccc gat gtg ctc gac gat gcg ccg aag tcg ttc cag gcg gtt ccg ctg 2352
Pro Asp Val Leu Asp Asp Ala Pro Lys Ser Phe Gln Ala Val Pro Leu
770 775 780
aat gcg gcc ggc ctg ccc gac gag atg tac acc ctg cag gtg ttc gcc 2400
Asn Ala Ala Gly Leu Pro Asp Glu Met Tyr Thr Leu Gln Val Phe Ala
785 790 795 800
gag gac tgc acc ggt tgt ggc ctg tgc gtc gag gcc tgc ccc gtg cat 2448
Glu Asp Cys Thr Gly Cys Gly Leu Cys Val Glu Ala Cys Pro Val His
805 810 815
ccc atc ggt ggc gac ccc gaa tgc aag gcg atc aac ctg gat tcc gtg 2496
Pro Ile Gly Gly Asp Pro Glu Cys Lys Ala Ile Asn Leu Asp Ser Val
820 825 830
ctc gac cgc acc aac gag cgg gcg aac gtg gag ttc ttc cag aag atc 2544
Leu Asp Arg Thr Asn Glu Arg Ala Asn Val Glu Phe Phe Gln Lys Ile
835 840 845
ccc gag ccc ccg cgc acc cgc gtg aac tac ggt gcc gtg cgt ggc gcc 2592
Pro Glu Pro Pro Arg Thr Arg Val Asn Tyr Gly Ala Val Arg Gly Ala
850 855 860
cag ttc ctg cag ccg ctg ttc gag ttc agc ggt gcc tgc ccg ggt tgt 2640
Gln Phe Leu Gln Pro Leu Phe Glu Phe Ser Gly Ala Cys Pro Gly Cys
865 870 875 880
ggc gag acg ccg tac ctc aag ctg ctc acc cag ctg ttc ggc gac cgc 2688
Gly Glu Thr Pro Tyr Leu Lys Leu Leu Thr Gln Leu Phe Gly Asp Arg
885 890 895
gcc acc gtg gcg aat gcc acc ggc tgc tcg tcc atc tac ggc ggc aac 2736
Ala Thr Val Ala Asn Ala Thr Gly Cys Ser Ser Ile Tyr Gly Gly Asn
900 905 910
ctg ccg acc acc ccg tgg gcg aag aac aag gag gga cgc ggc ccg gcc 2784
Leu Pro Thr Thr Pro Trp Ala Lys Asn Lys Glu Gly Arg Gly Pro Ala
915 920 925
tgg agc aac tca ttg ttc gag gac aac gcc gag ttc ggc ctt ggc atg 2832
Trp Ser Asn Ser Leu Phe Glu Asp Asn Ala Glu Phe Gly Leu Gly Met
930 935 940
cgc ctg gcg gcc gac ctg cac aac gaa ctg gcc cgt cag cgc gtt gac 2880
Arg Leu Ala Ala Asp Leu His Asn Glu Leu Ala Arg Gln Arg Val Asp
945 950 955 960
gag ctg tcc gat gcg atc aac gac ccc gag ctg gtc gat cag ctg ctg 2928
Glu Leu Ser Asp Ala Ile Asn Asp Pro Glu Leu Val Asp Gln Leu Leu
965 970 975
aac gcc ccg cag gcg cag gag tcc gat ctg cac gcc cag gcc gag cgc 2976
Asn Ala Pro Gln Ala Gln Glu Ser Asp Leu His Ala Gln Ala Glu Arg
980 985 990
gtc gac gcc ctg cag gat cgc ctg acc gac ctg gtc aac gat ccg aac 3024
Val Asp Ala Leu Gln Asp Arg Leu Thr Asp Leu Val Asn Asp Pro Asn
995 1000 1005
gtg gac gcc gac acc aag gcc aag gtc gag gac ctg cgg tcg gtg 3069
Val Asp Ala Asp Thr Lys Ala Lys Val Glu Asp Leu Arg Ser Val
1010 1015 1020
gcc gac aac ctg ctg cgt cgt tcc gtg tgg atc gtc ggc ggc gac 3114
Ala Asp Asn Leu Leu Arg Arg Ser Val Trp Ile Val Gly Gly Asp
1025 1030 1035
ggt tgg gcc tac gac atc ggt tcg ggc ggc ctt gac cat gtg ctg 3159
Gly Trp Ala Tyr Asp Ile Gly Ser Gly Gly Leu Asp His Val Leu
1040 1045 1050
tcc acc gga cgc aat gtc aat gtg ctg gtg ctc gac acc gag gtc 3204
Ser Thr Gly Arg Asn Val Asn Val Leu Val Leu Asp Thr Glu Val
1055 1060 1065
tac tcc aat acc ggt ggc cag gcc tcg aag tcg tcg ccc atg ggt 3249
Tyr Ser Asn Thr Gly Gly Gln Ala Ser Lys Ser Ser Pro Met Gly
1070 1075 1080
gcg atc gcg aag ttc gcg acc gcc ggc aag cgc acg aac aag aag 3294
Ala Ile Ala Lys Phe Ala Thr Ala Gly Lys Arg Thr Asn Lys Lys
1085 1090 1095
gac atc gcc atg cag gcc gtg tcc tac ggc gac gtc tat gtc gcc 3339
Asp Ile Ala Met Gln Ala Val Ser Tyr Gly Asp Val Tyr Val Ala
1100 1105 1110
cgc gtg gcg ttc ggt gcc gac ccg gag cag acg ctg aag gca ttc 3384
Arg Val Ala Phe Gly Ala Asp Pro Glu Gln Thr Leu Lys Ala Phe
1115 1120 1125
cgt gag gcc gag gcc tac ccc ggc ccc agc ctg atc atc gcc tac 3429
Arg Glu Ala Glu Ala Tyr Pro Gly Pro Ser Leu Ile Ile Ala Tyr
1130 1135 1140
agc cac tgc atc agc cat ggc tac aac ctg cgc aag ggc ctg gac 3474
Ser His Cys Ile Ser His Gly Tyr Asn Leu Arg Lys Gly Leu Asp
1145 1150 1155
cag cag tac aag gca gtg gcc tcc ggt cac tgg ccg ctg atc cgg 3519
Gln Gln Tyr Lys Ala Val Ala Ser Gly His Trp Pro Leu Ile Arg
1160 1165 1170
tac aac ccg gag gtt cgc gac tcg ggt ggc aac ccg ttc ctg ctc 3564
Tyr Asn Pro Glu Val Arg Asp Ser Gly Gly Asn Pro Phe Leu Leu
1175 1180 1185
gac tcg gcc cgt ccg cgc atc tcg ctg atg gac tac cgc aag acc 3609
Asp Ser Ala Arg Pro Arg Ile Ser Leu Met Asp Tyr Arg Lys Thr
1190 1195 1200
gag ctg cgc ttc aag atg ctg atg gtc aag gat ccg gaa gag gcc 3654
Glu Leu Arg Phe Lys Met Leu Met Val Lys Asp Pro Glu Glu Ala
1205 1210 1215
aag cac ctc aat gac ctc agc cag gag cag gtg acc agg cgt ttc 3699
Lys His Leu Asn Asp Leu Ser Gln Glu Gln Val Thr Arg Arg Phe
1220 1225 1230
gcc gac tac gag gaa atg gcc tca cgt ccg gcc gag atg ttc gcc 3744
Ala Asp Tyr Glu Glu Met Ala Ser Arg Pro Ala Glu Met Phe Ala
1235 1240 1245
acc gac gca cgg agg gat gtc tga 3768
Thr Asp Ala Arg Arg Asp Val
1250 1255
<210> 259
<211> 1255
<212> PRT
<213> Propionibacterium freudenreichii
<400> 259
Met Thr Thr Thr Thr Arg Gly Pro Val Pro Gly Ser Asn Gly Met Pro
1 5 10 15
Ala Asn Pro Gly Leu Ser Gly Glu Ala Ala Thr Ala Thr Pro Ser Pro
20 25 30
Val Asp Val Ala Ala Gly Ala Lys Asp Ala Ala Asp Glu Leu Ala Gln
35 40 45
Ser Arg Arg Glu Gln Asp Ile Thr His Gln Met Ile Cys Asp Gly Asn
50 55 60
Thr Ala Ala Ser Asp Val Ala Phe Arg Ile Asn Glu Leu Cys Ser Ile
65 70 75 80
Tyr Pro Ile Thr Pro Ser Ser Pro Met Ala Glu Leu Ala Asp Glu Trp
85 90 95
Ser Ala Arg Asp Arg Met Asn Ile Trp Gly Gln Val Pro His Val Met
100 105 110
Glu Met Gln Ser Glu Ala Gly Ala Ala Gly Ala Met His Gly Ser Leu
115 120 125
Gln Gly Gly Ala Leu Ala Thr Thr Phe Thr Ala Ser Gln Gly Leu Leu
130 135 140
Leu Met Ile Pro Asn Met Tyr Lys Ile Ala Gly Glu Leu Thr Ser Thr
145 150 155 160
Val Met His Val Ala Ala Arg Ser Leu Ala Thr Gln Gly Leu Ser Ile
165 170 175
Phe Gly Asp His Gln Asp Val Met Ala Cys Arg Gln Thr Gly Trp Ala
180 185 190
Met Leu Cys Ser Thr Gly Val Gln Gln Cys His Asp Asn Ala Leu Ile
195 200 205
Ser Gln Val Ala Thr Leu Arg Ser Arg Val Pro Phe Met His Phe Phe
210 215 220
Asp Gly Phe Arg Thr Ser His Glu Leu Asn Thr Cys Ile Gln Leu Thr
225 230 235 240
Asp Asp Gln Leu Arg Ser Met Val Pro Asp Ala Leu Val Arg Glu His
245 250 255
Arg Glu Arg Ala Leu Ser Pro Asp Asn Pro Phe Ile Arg Gly Thr Ala
260 265 270
Gln Asn Ala Asp Val Tyr Phe Gln Gly Arg Glu Ala Gly Asn Lys Tyr
275 280 285
Tyr Asp Ser Val Pro Gly Ile Val Gln Asp Ala Met Asp Glu Phe Ala
290 295 300
Ala Met Thr Gly Arg Gln Tyr His Leu Ala Asp Tyr Tyr Gly Ala Pro
305 310 315 320
Asp Ala Asp Arg Val Ile Val Ile Met Gly Ser Gly Ala Glu Thr Val
325 330 335
Gln Gln Thr Val Ser Lys Leu Asn Glu Gln Gly Glu Lys Val Gly Leu
340 345 350
Val Val Ile Arg Leu Tyr Arg Pro Phe Pro Thr Gln Ala Val Leu Asp
355 360 365
Cys Ile Pro Ala Ser Val Lys Lys Ile Ala Val Leu Asp Arg Thr Lys
370 375 380
Glu Pro Gly Ser Asn Gly Glu Pro Leu Phe Leu Asp Val Val Ser Ala
385 390 395 400
Val Ser Glu Ala Tyr Ser Asn Gly Glu Arg Asp Asn Leu Pro Ala Ile
405 410 415
Ile Gly Gly Arg Tyr Gly Leu Ser Ser Lys Glu Phe Thr Pro Gly Met
420 425 430
Cys Ala Ala Val Tyr Asp Glu Leu Ala Lys Asp Lys Pro Lys Arg Arg
435 440 445
Phe Thr Val Gly Ile Thr Asp Asp Val Thr His Leu Ser Ile Pro Trp
450 455 460
Asp Ala Ser Leu Asp Leu Glu Asp Pro Glu Thr Ser Arg Ala Val Phe
465 470 475 480
Tyr Gly Ile Gly Ala Asp Gly Thr Val Gly Ala Asn Lys Asn Thr Ile
485 490 495
Lys Ile Leu Gly Ser Glu Pro Gly Thr Tyr Ala Gln Gly Tyr Phe Val
500 505 510
Tyr Asp Ser Lys Lys Ser Gly Gly Arg Thr Thr Ser His Leu Arg Phe
515 520 525
Gly Pro Asp Pro Ile Lys Ala Pro Tyr Leu Val Asn Gln Ala Gly Phe
530 535 540
Ile Gly Val His His Trp Ala Asp Leu Glu Arg Ile Asp Val Leu Ala
545 550 555 560
Phe Ala Arg Lys Gly Thr Thr Val Leu Ile Asn Ser Pro Tyr Pro Ala
565 570 575
Glu Asp Val Trp Gly His Leu Pro Ala Pro Met Gln Lys Lys Ile Ile
580 585 590
Asp Leu Asp Leu Gln Val Tyr Ala Ile Asp Ala Gly Glu Val Ala Arg
595 600 605
Ser Val Gly Leu Gly Asn Arg Thr Asn Thr Val Leu Gln Thr Cys Tyr
610 615 620
Phe Lys Ile Ser Gly Val Leu Pro Glu Asp His Ala Ile Glu Ala Ile
625 630 635 640
Lys Asn Ser Ile Thr Lys Thr Tyr Ala Lys Lys Ser Met Glu Ile Val
645 650 655
Glu Lys Asn His Ala Ala Val Asp Ala Ala Leu Glu His Leu His Lys
660 665 670
Ile Asp Val Pro Ala Lys Val Thr Ser Thr Glu Asp Tyr Leu Pro Pro
675 680 685
Val Pro Ser Phe Ala Pro Asp Phe Val Lys Asp Val Thr Ala Ala Met
690 695 700
Met Thr Glu Gln Gly Glu Ser Leu Pro Val Ser Lys Leu Pro Ala Asp
705 710 715 720
Gly Ser Phe Pro Ser Gly Thr Thr Gln Tyr Glu Lys Arg Asn Val Ser
725 730 735
Glu Ile Ile Ala Val Trp Asp Gln Asp Asn Cys Ile Gln Cys Gly Asn
740 745 750
Cys Ala Phe Val Cys Pro His Gly Val Leu Arg Ala Lys Tyr Tyr Lys
755 760 765
Pro Asp Val Leu Asp Asp Ala Pro Lys Ser Phe Gln Ala Val Pro Leu
770 775 780
Asn Ala Ala Gly Leu Pro Asp Glu Met Tyr Thr Leu Gln Val Phe Ala
785 790 795 800
Glu Asp Cys Thr Gly Cys Gly Leu Cys Val Glu Ala Cys Pro Val His
805 810 815
Pro Ile Gly Gly Asp Pro Glu Cys Lys Ala Ile Asn Leu Asp Ser Val
820 825 830
Leu Asp Arg Thr Asn Glu Arg Ala Asn Val Glu Phe Phe Gln Lys Ile
835 840 845
Pro Glu Pro Pro Arg Thr Arg Val Asn Tyr Gly Ala Val Arg Gly Ala
850 855 860
Gln Phe Leu Gln Pro Leu Phe Glu Phe Ser Gly Ala Cys Pro Gly Cys
865 870 875 880
Gly Glu Thr Pro Tyr Leu Lys Leu Leu Thr Gln Leu Phe Gly Asp Arg
885 890 895
Ala Thr Val Ala Asn Ala Thr Gly Cys Ser Ser Ile Tyr Gly Gly Asn
900 905 910
Leu Pro Thr Thr Pro Trp Ala Lys Asn Lys Glu Gly Arg Gly Pro Ala
915 920 925
Trp Ser Asn Ser Leu Phe Glu Asp Asn Ala Glu Phe Gly Leu Gly Met
930 935 940
Arg Leu Ala Ala Asp Leu His Asn Glu Leu Ala Arg Gln Arg Val Asp
945 950 955 960
Glu Leu Ser Asp Ala Ile Asn Asp Pro Glu Leu Val Asp Gln Leu Leu
965 970 975
Asn Ala Pro Gln Ala Gln Glu Ser Asp Leu His Ala Gln Ala Glu Arg
980 985 990
Val Asp Ala Leu Gln Asp Arg Leu Thr Asp Leu Val Asn Asp Pro Asn
995 1000 1005
Val Asp Ala Asp Thr Lys Ala Lys Val Glu Asp Leu Arg Ser Val
1010 1015 1020
Ala Asp Asn Leu Leu Arg Arg Ser Val Trp Ile Val Gly Gly Asp
1025 1030 1035
Gly Trp Ala Tyr Asp Ile Gly Ser Gly Gly Leu Asp His Val Leu
1040 1045 1050
Ser Thr Gly Arg Asn Val Asn Val Leu Val Leu Asp Thr Glu Val
1055 1060 1065
Tyr Ser Asn Thr Gly Gly Gln Ala Ser Lys Ser Ser Pro Met Gly
1070 1075 1080
Ala Ile Ala Lys Phe Ala Thr Ala Gly Lys Arg Thr Asn Lys Lys
1085 1090 1095
Asp Ile Ala Met Gln Ala Val Ser Tyr Gly Asp Val Tyr Val Ala
1100 1105 1110
Arg Val Ala Phe Gly Ala Asp Pro Glu Gln Thr Leu Lys Ala Phe
1115 1120 1125
Arg Glu Ala Glu Ala Tyr Pro Gly Pro Ser Leu Ile Ile Ala Tyr
1130 1135 1140
Ser His Cys Ile Ser His Gly Tyr Asn Leu Arg Lys Gly Leu Asp
1145 1150 1155
Gln Gln Tyr Lys Ala Val Ala Ser Gly His Trp Pro Leu Ile Arg
1160 1165 1170
Tyr Asn Pro Glu Val Arg Asp Ser Gly Gly Asn Pro Phe Leu Leu
1175 1180 1185
Asp Ser Ala Arg Pro Arg Ile Ser Leu Met Asp Tyr Arg Lys Thr
1190 1195 1200
Glu Leu Arg Phe Lys Met Leu Met Val Lys Asp Pro Glu Glu Ala
1205 1210 1215
Lys His Leu Asn Asp Leu Ser Gln Glu Gln Val Thr Arg Arg Phe
1220 1225 1230
Ala Asp Tyr Glu Glu Met Ala Ser Arg Pro Ala Glu Met Phe Ala
1235 1240 1245
Thr Asp Ala Arg Arg Asp Val
1250 1255
<210> 260
<211> 3600
<212> DNA
<213> Synechocystis PCC6803
<220>
<221> CDS
<222> (1)..(3600)
<223> nifJ gene from Synechocystis sp. PCC 6803 encoding
pyruvate-flavodoxin/ferredoxin oxidoreductase
<400> 260
atg agt tta cct acc tat gcc acc ctc gac ggt aat gaa gcg gtg gcc 48
Met Ser Leu Pro Thr Tyr Ala Thr Leu Asp Gly Asn Glu Ala Val Ala
1 5 10 15
cgt gtg gcc tac ctg ctc agt gaa gtg att gcc att tat ccc atc acc 96
Arg Val Ala Tyr Leu Leu Ser Glu Val Ile Ala Ile Tyr Pro Ile Thr
20 25 30
cct tcc tcg ccc atg ggg gaa tgg tcc gat gct tgg gca gca gaa cac 144
Pro Ser Ser Pro Met Gly Glu Trp Ser Asp Ala Trp Ala Ala Glu His
35 40 45
cgg ccc aat ttg tgg ggc acc gta cca ttg gtg gtg gaa atg caa agc 192
Arg Pro Asn Leu Trp Gly Thr Val Pro Leu Val Val Glu Met Gln Ser
50 55 60
gag ggg gga gcc gcc ggt act gtc cat ggc gct ctg caa tcg gga gct 240
Glu Gly Gly Ala Ala Gly Thr Val His Gly Ala Leu Gln Ser Gly Ala
65 70 75 80
ttg acc aca aca ttt acc gct tcc cag ggc tta atg ttg atg ttg ccc 288
Leu Thr Thr Thr Phe Thr Ala Ser Gln Gly Leu Met Leu Met Leu Pro
85 90 95
aat atg cac aaa att gct ggg gaa tta aca gcc atg gtt ttg cat gtg 336
Asn Met His Lys Ile Ala Gly Glu Leu Thr Ala Met Val Leu His Val
100 105 110
gcg gcc cgt tct tta gcg gcc cag ggc cta tct att ttt ggg gat cac 384
Ala Ala Arg Ser Leu Ala Ala Gln Gly Leu Ser Ile Phe Gly Asp His
115 120 125
agt gat gtg atg gcg gcc aga aat acg ggc ttt gcc atg tta agt tcc 432
Ser Asp Val Met Ala Ala Arg Asn Thr Gly Phe Ala Met Leu Ser Ser
130 135 140
aat tct gtc cag gaa gcc cac gat ttt gcc ctc att gcc acg gcc acc 480
Asn Ser Val Gln Glu Ala His Asp Phe Ala Leu Ile Ala Thr Ala Thr
145 150 155 160
agc ttt gcc acc agg ata ccg gga ctg cac ttt ttt gat ggt ttt cgc 528
Ser Phe Ala Thr Arg Ile Pro Gly Leu His Phe Phe Asp Gly Phe Arg
165 170 175
act tcc cac gaa gaa caa aaa att gag ctt tta ccc cag gaa gta ctc 576
Thr Ser His Glu Glu Gln Lys Ile Glu Leu Leu Pro Gln Glu Val Leu
180 185 190
cgt ggt ttg att aag gat gag gat gtg cta gcc cac cgg gga cgg gct 624
Arg Gly Leu Ile Lys Asp Glu Asp Val Leu Ala His Arg Gly Arg Ala
195 200 205
ttg acc ccc gat cgc ccg aag ttg cgg ggg acg gcc caa aat ccg gat 672
Leu Thr Pro Asp Arg Pro Lys Leu Arg Gly Thr Ala Gln Asn Pro Asp
210 215 220
gtc tat ttc caa gct agg gaa acg gtt aat ccc ttt tat gcc agt tat 720
Val Tyr Phe Gln Ala Arg Glu Thr Val Asn Pro Phe Tyr Ala Ser Tyr
225 230 235 240
ccc aac gtg ctg gag cag gtg atg gaa caa ttt ggc cag cta acc ggc 768
Pro Asn Val Leu Glu Gln Val Met Glu Gln Phe Gly Gln Leu Thr Gly
245 250 255
cgc cat tac cgt ccc tat gaa tat tgt ggc cat ccg gaa gcg gaa cgg 816
Arg His Tyr Arg Pro Tyr Glu Tyr Cys Gly His Pro Glu Ala Glu Arg
260 265 270
gtg att gtg ctg atg ggt tct ggt gcg gaa acg gcc cag gaa acg gtg 864
Val Ile Val Leu Met Gly Ser Gly Ala Glu Thr Ala Gln Glu Thr Val
275 280 285
gat ttt cta act gcc caa ggg gaa aag gtt ggt tta ctg aaa gta cgc 912
Asp Phe Leu Thr Ala Gln Gly Glu Lys Val Gly Leu Leu Lys Val Arg
290 295 300
ctc tat cgg ccc ttt gct ggc gat cgc ctg gtt aat gct cta cca aaa 960
Leu Tyr Arg Pro Phe Ala Gly Asp Arg Leu Val Asn Ala Leu Pro Lys
305 310 315 320
acg gtg caa aaa ata gcg gtg ctg gac cgg tgt aag gaa ccg ggg agc 1008
Thr Val Gln Lys Ile Ala Val Leu Asp Arg Cys Lys Glu Pro Gly Ser
325 330 335
att ggg gaa ccc ctc tat cag gat gtg ctg acg gcc ttt ttt gaa gcg 1056
Ile Gly Glu Pro Leu Tyr Gln Asp Val Leu Thr Ala Phe Phe Glu Ala
340 345 350
ggc atg atg ccg aaa att att ggt ggc cgt tac ggt ctg tca tcc aag 1104
Gly Met Met Pro Lys Ile Ile Gly Gly Arg Tyr Gly Leu Ser Ser Lys
355 360 365
gaa ttt acc ccc gcc atg gtt aaa ggg gtg ttg gac cat tta aat caa 1152
Glu Phe Thr Pro Ala Met Val Lys Gly Val Leu Asp His Leu Asn Gln
370 375 380
acc aac ccc aaa aac cat ttc acc gta ggc att aac gat gat ttg agc 1200
Thr Asn Pro Lys Asn His Phe Thr Val Gly Ile Asn Asp Asp Leu Ser
385 390 395 400
cac acc agc atc gac tat gac ccc agt ttt tcc acg gaa gca gat tct 1248
His Thr Ser Ile Asp Tyr Asp Pro Ser Phe Ser Thr Glu Ala Asp Ser
405 410 415
gtc gtc cgg gca att ttc tac ggt ctc ggt tcc gac ggt acg gtg ggg 1296
Val Val Arg Ala Ile Phe Tyr Gly Leu Gly Ser Asp Gly Thr Val Gly
420 425 430
gcc aat aag aac tcc atc aaa atc att ggc gaa gat acg gat aac tac 1344
Ala Asn Lys Asn Ser Ile Lys Ile Ile Gly Glu Asp Thr Asp Asn Tyr
435 440 445
gcc cag ggt tat ttt gtt tac gac tcg aaa aaa tcc ggt tct gta acc 1392
Ala Gln Gly Tyr Phe Val Tyr Asp Ser Lys Lys Ser Gly Ser Val Thr
450 455 460
gtt tcc cat ctg cgc ttt ggc cct aat ccc atc ctg tcc act tac ctg 1440
Val Ser His Leu Arg Phe Gly Pro Asn Pro Ile Leu Ser Thr Tyr Leu
465 470 475 480
att agc caa gcc aat ttt gtc gcc tgt cac cag tgg gaa ttt ttg gaa 1488
Ile Ser Gln Ala Asn Phe Val Ala Cys His Gln Trp Glu Phe Leu Glu
485 490 495
cag ttt gaa gtc ttg gaa cca gcc gtt gat ggc ggc gtt ttc ctg gtc 1536
Gln Phe Glu Val Leu Glu Pro Ala Val Asp Gly Gly Val Phe Leu Val
500 505 510
aat agc ccc tac ggc cca gag gaa att tgg cga gag ttt ccc cgc aaa 1584
Asn Ser Pro Tyr Gly Pro Glu Glu Ile Trp Arg Glu Phe Pro Arg Lys
515 520 525
gta caa cag gaa att att gac aaa aat ctc aag gtt tac acc atc aat 1632
Val Gln Gln Glu Ile Ile Asp Lys Asn Leu Lys Val Tyr Thr Ile Asn
530 535 540
gcc aat gac gta gcc agg gat gcg ggc atg ggc cgc cgc acc aac aca 1680
Ala Asn Asp Val Ala Arg Asp Ala Gly Met Gly Arg Arg Thr Asn Thr
545 550 555 560
gtc atg caa acc tgt ttc ttt gcc cta gcg gga gtg tta ccc cgg gaa 1728
Val Met Gln Thr Cys Phe Phe Ala Leu Ala Gly Val Leu Pro Arg Glu
565 570 575
gag gcg atc gcc aaa att aag cag tcg gtc caa aaa acc tac ggc aaa 1776
Glu Ala Ile Ala Lys Ile Lys Gln Ser Val Gln Lys Thr Tyr Gly Lys
580 585 590
aag ggt cag gaa att gtc gag atg aat att aaa gcg gtg gat tcc acc 1824
Lys Gly Gln Glu Ile Val Glu Met Asn Ile Lys Ala Val Asp Ser Thr
595 600 605
ctg gcc cat ctc tat gaa gtg tcc gta ccg gaa acg gtg agc gac gat 1872
Leu Ala His Leu Tyr Glu Val Ser Val Pro Glu Thr Val Ser Asp Asp
610 615 620
gcc cct gct atg cgg ccg gtg gtg cct gat aac gcc ccg gtg ttt gtg 1920
Ala Pro Ala Met Arg Pro Val Val Pro Asp Asn Ala Pro Val Phe Val
625 630 635 640
cgg gaa gtg tta gga aaa atc atg gcc cgg caa ggg gat gat ctc ccg 1968
Arg Glu Val Leu Gly Lys Ile Met Ala Arg Gln Gly Asp Asp Leu Pro
645 650 655
gtc agt gct tta ccc tgc gat ggc acc tat ccc acc gcc act acc caa 2016
Val Ser Ala Leu Pro Cys Asp Gly Thr Tyr Pro Thr Ala Thr Thr Gln
660 665 670
tgg gaa aaa cgc aac gtg ggc cac gaa att ccc gtt tgg gac ccc gat 2064
Trp Glu Lys Arg Asn Val Gly His Glu Ile Pro Val Trp Asp Pro Asp
675 680 685
gtt tgt gtg caa tgc ggc aaa tgc gtc att gtt tgt ccc cat gct gtg 2112
Val Cys Val Gln Cys Gly Lys Cys Val Ile Val Cys Pro His Ala Val
690 695 700
att cgg ggc aaa gtt tac gag gag gca gaa ttg gcc aat gct ccg gtc 2160
Ile Arg Gly Lys Val Tyr Glu Glu Ala Glu Leu Ala Asn Ala Pro Val
705 710 715 720
agt ttc aaa ttt acc aat gcc aaa gac cat gat tgg caa ggt tct aag 2208
Ser Phe Lys Phe Thr Asn Ala Lys Asp His Asp Trp Gln Gly Ser Lys
725 730 735
ttc acc atc cag gta gcc ccg gaa gat tgc acc ggt tgc ggc atc tgt 2256
Phe Thr Ile Gln Val Ala Pro Glu Asp Cys Thr Gly Cys Gly Ile Cys
740 745 750
gtg gac gta tgc ccg gct aaa aat aaa tcc cag cct cgt tta agg gcg 2304
Val Asp Val Cys Pro Ala Lys Asn Lys Ser Gln Pro Arg Leu Arg Ala
755 760 765
att aat atg gct ccc cag tta ccc ttg cgg gaa cag gaa cgg gag aat 2352
Ile Asn Met Ala Pro Gln Leu Pro Leu Arg Glu Gln Glu Arg Glu Asn
770 775 780
tgg gac ttt ttc cta gat ttg ccc aac ccc gat cgc ctc agt ttg aat 2400
Trp Asp Phe Phe Leu Asp Leu Pro Asn Pro Asp Arg Leu Ser Leu Asn
785 790 795 800
ttg aac aaa atc agc cat caa cag atg cag gag ccg tta ttt gaa ttt 2448
Leu Asn Lys Ile Ser His Gln Gln Met Gln Glu Pro Leu Phe Glu Phe
805 810 815
tct gga gcc tgt gcc ggt tgt ggg gaa acc cct tat ttg aaa ctg gtc 2496
Ser Gly Ala Cys Ala Gly Cys Gly Glu Thr Pro Tyr Leu Lys Leu Val
820 825 830
agt caa tta ttt ggc gat cgc atg tta gtg gcc aac gcc acc ggt tgc 2544
Ser Gln Leu Phe Gly Asp Arg Met Leu Val Ala Asn Ala Thr Gly Cys
835 840 845
tct tcc atc tat ggc ggc aac tta ccg aca act ccc tgg gcc caa aat 2592
Ser Ser Ile Tyr Gly Gly Asn Leu Pro Thr Thr Pro Trp Ala Gln Asn
850 855 860
gct gag ggt cgc ggt ccc gct tgg tcc aat tcc ctg ttt gaa gat aac 2640
Ala Glu Gly Arg Gly Pro Ala Trp Ser Asn Ser Leu Phe Glu Asp Asn
865 870 875 880
gct gaa ttt ggc ctt ggt ttc cga gtg gcg atc gac aag caa acg gaa 2688
Ala Glu Phe Gly Leu Gly Phe Arg Val Ala Ile Asp Lys Gln Thr Glu
885 890 895
ttt gca ggg gaa ttg cta aaa acc ttt gct ggg gag ttg gga gac agt 2736
Phe Ala Gly Glu Leu Leu Lys Thr Phe Ala Gly Glu Leu Gly Asp Ser
900 905 910
ttg gta agt gaa att ctc aac aat gcc caa acc act gaa gcg gat att 2784
Leu Val Ser Glu Ile Leu Asn Asn Ala Gln Thr Thr Glu Ala Asp Ile
915 920 925
ttt gaa caa cgg caa ttg gta gaa cag gtt aag caa cgt ttg caa aat 2832
Phe Glu Gln Arg Gln Leu Val Glu Gln Val Lys Gln Arg Leu Gln Asn
930 935 940
ctg gaa act ccc caa gcc caa atg ttc ctt tct gta gcg gat tac ctc 2880
Leu Glu Thr Pro Gln Ala Gln Met Phe Leu Ser Val Ala Asp Tyr Leu
945 950 955 960
gtg aag aaa agc gtt tgg att att ggt ggc gat ggc tgg gcc tac gac 2928
Val Lys Lys Ser Val Trp Ile Ile Gly Gly Asp Gly Trp Ala Tyr Asp
965 970 975
att ggg tac ggc ggt ttg gat cac gtc ctc gcc agt ggg cgt aat gtc 2976
Ile Gly Tyr Gly Gly Leu Asp His Val Leu Ala Ser Gly Arg Asn Val
980 985 990
aat atc ttg gtg atg gat acg gaa gtc tat tcc aac acc ggg ggc caa 3024
Asn Ile Leu Val Met Asp Thr Glu Val Tyr Ser Asn Thr Gly Gly Gln
995 1000 1005
gcc tcc aaa gcc act ccc cgg gcc gct gta gct aaa ttc gcc gct 3069
Ala Ser Lys Ala Thr Pro Arg Ala Ala Val Ala Lys Phe Ala Ala
1010 1015 1020
ggg ggt aaa ccc tct ccc aaa aaa gat ttg ggc tta atg gcc atg 3114
Gly Gly Lys Pro Ser Pro Lys Lys Asp Leu Gly Leu Met Ala Met
1025 1030 1035
acc tac ggc aac gtc tat gtg gcc agt atc gcc atg gga gcc aaa 3159
Thr Tyr Gly Asn Val Tyr Val Ala Ser Ile Ala Met Gly Ala Lys
1040 1045 1050
aat gag cag tcc att aaa gcc ttt atg gaa gcg gaa gcc tat ccc 3204
Asn Glu Gln Ser Ile Lys Ala Phe Met Glu Ala Glu Ala Tyr Pro
1055 1060 1065
ggt gtc tcg tta att att gcc tac tcc cac tgc att gcc cac ggc 3249
Gly Val Ser Leu Ile Ile Ala Tyr Ser His Cys Ile Ala His Gly
1070 1075 1080
att aat atg acc acc gcg atg aac cat caa aaa gag ttg gtg gac 3294
Ile Asn Met Thr Thr Ala Met Asn His Gln Lys Glu Leu Val Asp
1085 1090 1095
agc ggt cgt tgg ttg ctc tac cgc tat aac cct ttg ttg gcg gat 3339
Ser Gly Arg Trp Leu Leu Tyr Arg Tyr Asn Pro Leu Leu Ala Asp
1100 1105 1110
gaa ggt aaa aat ccc ctg caa ttg gat atg gga tcg cca aaa gta 3384
Glu Gly Lys Asn Pro Leu Gln Leu Asp Met Gly Ser Pro Lys Val
1115 1120 1125
gcc att gac aaa acg gtc tat tcg gaa aat cgc ttt gcc atg ctc 3429
Ala Ile Asp Lys Thr Val Tyr Ser Glu Asn Arg Phe Ala Met Leu
1130 1135 1140
acc cgc agt caa cca gag gag gcc aaa cgc tta atg aag tta gct 3474
Thr Arg Ser Gln Pro Glu Glu Ala Lys Arg Leu Met Lys Leu Ala
1145 1150 1155
caa ggg gat gtg aac act cgc tgg gcc atg tac gaa tat ctg gcg 3519
Gln Gly Asp Val Asn Thr Arg Trp Ala Met Tyr Glu Tyr Leu Ala
1160 1165 1170
aaa cgt tct ctg ggt ggg gaa att aac ggt aac aac cat ggt gtt 3564
Lys Arg Ser Leu Gly Gly Glu Ile Asn Gly Asn Asn His Gly Val
1175 1180 1185
tcc cca tct ccg gag gta att gct aaa tct gtt tag 3600
Ser Pro Ser Pro Glu Val Ile Ala Lys Ser Val
1190 1195
<210> 261
<211> 1199
<212> PRT
<213> Synechocystis PCC6803
<400> 261
Met Ser Leu Pro Thr Tyr Ala Thr Leu Asp Gly Asn Glu Ala Val Ala
1 5 10 15
Arg Val Ala Tyr Leu Leu Ser Glu Val Ile Ala Ile Tyr Pro Ile Thr
20 25 30
Pro Ser Ser Pro Met Gly Glu Trp Ser Asp Ala Trp Ala Ala Glu His
35 40 45
Arg Pro Asn Leu Trp Gly Thr Val Pro Leu Val Val Glu Met Gln Ser
50 55 60
Glu Gly Gly Ala Ala Gly Thr Val His Gly Ala Leu Gln Ser Gly Ala
65 70 75 80
Leu Thr Thr Thr Phe Thr Ala Ser Gln Gly Leu Met Leu Met Leu Pro
85 90 95
Asn Met His Lys Ile Ala Gly Glu Leu Thr Ala Met Val Leu His Val
100 105 110
Ala Ala Arg Ser Leu Ala Ala Gln Gly Leu Ser Ile Phe Gly Asp His
115 120 125
Ser Asp Val Met Ala Ala Arg Asn Thr Gly Phe Ala Met Leu Ser Ser
130 135 140
Asn Ser Val Gln Glu Ala His Asp Phe Ala Leu Ile Ala Thr Ala Thr
145 150 155 160
Ser Phe Ala Thr Arg Ile Pro Gly Leu His Phe Phe Asp Gly Phe Arg
165 170 175
Thr Ser His Glu Glu Gln Lys Ile Glu Leu Leu Pro Gln Glu Val Leu
180 185 190
Arg Gly Leu Ile Lys Asp Glu Asp Val Leu Ala His Arg Gly Arg Ala
195 200 205
Leu Thr Pro Asp Arg Pro Lys Leu Arg Gly Thr Ala Gln Asn Pro Asp
210 215 220
Val Tyr Phe Gln Ala Arg Glu Thr Val Asn Pro Phe Tyr Ala Ser Tyr
225 230 235 240
Pro Asn Val Leu Glu Gln Val Met Glu Gln Phe Gly Gln Leu Thr Gly
245 250 255
Arg His Tyr Arg Pro Tyr Glu Tyr Cys Gly His Pro Glu Ala Glu Arg
260 265 270
Val Ile Val Leu Met Gly Ser Gly Ala Glu Thr Ala Gln Glu Thr Val
275 280 285
Asp Phe Leu Thr Ala Gln Gly Glu Lys Val Gly Leu Leu Lys Val Arg
290 295 300
Leu Tyr Arg Pro Phe Ala Gly Asp Arg Leu Val Asn Ala Leu Pro Lys
305 310 315 320
Thr Val Gln Lys Ile Ala Val Leu Asp Arg Cys Lys Glu Pro Gly Ser
325 330 335
Ile Gly Glu Pro Leu Tyr Gln Asp Val Leu Thr Ala Phe Phe Glu Ala
340 345 350
Gly Met Met Pro Lys Ile Ile Gly Gly Arg Tyr Gly Leu Ser Ser Lys
355 360 365
Glu Phe Thr Pro Ala Met Val Lys Gly Val Leu Asp His Leu Asn Gln
370 375 380
Thr Asn Pro Lys Asn His Phe Thr Val Gly Ile Asn Asp Asp Leu Ser
385 390 395 400
His Thr Ser Ile Asp Tyr Asp Pro Ser Phe Ser Thr Glu Ala Asp Ser
405 410 415
Val Val Arg Ala Ile Phe Tyr Gly Leu Gly Ser Asp Gly Thr Val Gly
420 425 430
Ala Asn Lys Asn Ser Ile Lys Ile Ile Gly Glu Asp Thr Asp Asn Tyr
435 440 445
Ala Gln Gly Tyr Phe Val Tyr Asp Ser Lys Lys Ser Gly Ser Val Thr
450 455 460
Val Ser His Leu Arg Phe Gly Pro Asn Pro Ile Leu Ser Thr Tyr Leu
465 470 475 480
Ile Ser Gln Ala Asn Phe Val Ala Cys His Gln Trp Glu Phe Leu Glu
485 490 495
Gln Phe Glu Val Leu Glu Pro Ala Val Asp Gly Gly Val Phe Leu Val
500 505 510
Asn Ser Pro Tyr Gly Pro Glu Glu Ile Trp Arg Glu Phe Pro Arg Lys
515 520 525
Val Gln Gln Glu Ile Ile Asp Lys Asn Leu Lys Val Tyr Thr Ile Asn
530 535 540
Ala Asn Asp Val Ala Arg Asp Ala Gly Met Gly Arg Arg Thr Asn Thr
545 550 555 560
Val Met Gln Thr Cys Phe Phe Ala Leu Ala Gly Val Leu Pro Arg Glu
565 570 575
Glu Ala Ile Ala Lys Ile Lys Gln Ser Val Gln Lys Thr Tyr Gly Lys
580 585 590
Lys Gly Gln Glu Ile Val Glu Met Asn Ile Lys Ala Val Asp Ser Thr
595 600 605
Leu Ala His Leu Tyr Glu Val Ser Val Pro Glu Thr Val Ser Asp Asp
610 615 620
Ala Pro Ala Met Arg Pro Val Val Pro Asp Asn Ala Pro Val Phe Val
625 630 635 640
Arg Glu Val Leu Gly Lys Ile Met Ala Arg Gln Gly Asp Asp Leu Pro
645 650 655
Val Ser Ala Leu Pro Cys Asp Gly Thr Tyr Pro Thr Ala Thr Thr Gln
660 665 670
Trp Glu Lys Arg Asn Val Gly His Glu Ile Pro Val Trp Asp Pro Asp
675 680 685
Val Cys Val Gln Cys Gly Lys Cys Val Ile Val Cys Pro His Ala Val
690 695 700
Ile Arg Gly Lys Val Tyr Glu Glu Ala Glu Leu Ala Asn Ala Pro Val
705 710 715 720
Ser Phe Lys Phe Thr Asn Ala Lys Asp His Asp Trp Gln Gly Ser Lys
725 730 735
Phe Thr Ile Gln Val Ala Pro Glu Asp Cys Thr Gly Cys Gly Ile Cys
740 745 750
Val Asp Val Cys Pro Ala Lys Asn Lys Ser Gln Pro Arg Leu Arg Ala
755 760 765
Ile Asn Met Ala Pro Gln Leu Pro Leu Arg Glu Gln Glu Arg Glu Asn
770 775 780
Trp Asp Phe Phe Leu Asp Leu Pro Asn Pro Asp Arg Leu Ser Leu Asn
785 790 795 800
Leu Asn Lys Ile Ser His Gln Gln Met Gln Glu Pro Leu Phe Glu Phe
805 810 815
Ser Gly Ala Cys Ala Gly Cys Gly Glu Thr Pro Tyr Leu Lys Leu Val
820 825 830
Ser Gln Leu Phe Gly Asp Arg Met Leu Val Ala Asn Ala Thr Gly Cys
835 840 845
Ser Ser Ile Tyr Gly Gly Asn Leu Pro Thr Thr Pro Trp Ala Gln Asn
850 855 860
Ala Glu Gly Arg Gly Pro Ala Trp Ser Asn Ser Leu Phe Glu Asp Asn
865 870 875 880
Ala Glu Phe Gly Leu Gly Phe Arg Val Ala Ile Asp Lys Gln Thr Glu
885 890 895
Phe Ala Gly Glu Leu Leu Lys Thr Phe Ala Gly Glu Leu Gly Asp Ser
900 905 910
Leu Val Ser Glu Ile Leu Asn Asn Ala Gln Thr Thr Glu Ala Asp Ile
915 920 925
Phe Glu Gln Arg Gln Leu Val Glu Gln Val Lys Gln Arg Leu Gln Asn
930 935 940
Leu Glu Thr Pro Gln Ala Gln Met Phe Leu Ser Val Ala Asp Tyr Leu
945 950 955 960
Val Lys Lys Ser Val Trp Ile Ile Gly Gly Asp Gly Trp Ala Tyr Asp
965 970 975
Ile Gly Tyr Gly Gly Leu Asp His Val Leu Ala Ser Gly Arg Asn Val
980 985 990
Asn Ile Leu Val Met Asp Thr Glu Val Tyr Ser Asn Thr Gly Gly Gln
995 1000 1005
Ala Ser Lys Ala Thr Pro Arg Ala Ala Val Ala Lys Phe Ala Ala
1010 1015 1020
Gly Gly Lys Pro Ser Pro Lys Lys Asp Leu Gly Leu Met Ala Met
1025 1030 1035
Thr Tyr Gly Asn Val Tyr Val Ala Ser Ile Ala Met Gly Ala Lys
1040 1045 1050
Asn Glu Gln Ser Ile Lys Ala Phe Met Glu Ala Glu Ala Tyr Pro
1055 1060 1065
Gly Val Ser Leu Ile Ile Ala Tyr Ser His Cys Ile Ala His Gly
1070 1075 1080
Ile Asn Met Thr Thr Ala Met Asn His Gln Lys Glu Leu Val Asp
1085 1090 1095
Ser Gly Arg Trp Leu Leu Tyr Arg Tyr Asn Pro Leu Leu Ala Asp
1100 1105 1110
Glu Gly Lys Asn Pro Leu Gln Leu Asp Met Gly Ser Pro Lys Val
1115 1120 1125
Ala Ile Asp Lys Thr Val Tyr Ser Glu Asn Arg Phe Ala Met Leu
1130 1135 1140
Thr Arg Ser Gln Pro Glu Glu Ala Lys Arg Leu Met Lys Leu Ala
1145 1150 1155
Gln Gly Asp Val Asn Thr Arg Trp Ala Met Tyr Glu Tyr Leu Ala
1160 1165 1170
Lys Arg Ser Leu Gly Gly Glu Ile Asn Gly Asn Asn His Gly Val
1175 1180 1185
Ser Pro Ser Pro Glu Val Ile Ala Lys Ser Val
1190 1195
<210> 262
<211> 531
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(531)
<223> fldA gene from E. coli encoding flavodoxin
<400> 262
atg gct atc act ggc atc ttt ttc ggc agc gac acc ggt aat acc gaa 48
Met Ala Ile Thr Gly Ile Phe Phe Gly Ser Asp Thr Gly Asn Thr Glu
1 5 10 15
aat atc gca aaa atg att caa aaa cag ctt ggt aaa gac gtt gcc gat 96
Asn Ile Ala Lys Met Ile Gln Lys Gln Leu Gly Lys Asp Val Ala Asp
20 25 30
gtc cat gac att gca aaa agc agc aaa gaa gat ctg gaa gct tat gac 144
Val His Asp Ile Ala Lys Ser Ser Lys Glu Asp Leu Glu Ala Tyr Asp
35 40 45
att ctg ctg ctg ggc atc cca acc tgg tat tac ggc gaa gcg cag tgt 192
Ile Leu Leu Leu Gly Ile Pro Thr Trp Tyr Tyr Gly Glu Ala Gln Cys
50 55 60
gac tgg gat gac ttc ttc ccg act ctc gaa gag att gat ttc aac ggc 240
Asp Trp Asp Asp Phe Phe Pro Thr Leu Glu Glu Ile Asp Phe Asn Gly
65 70 75 80
aaa ctg gtt gcg ctg ttt ggt tgt ggt gac cag gaa gat tac gcc gaa 288
Lys Leu Val Ala Leu Phe Gly Cys Gly Asp Gln Glu Asp Tyr Ala Glu
85 90 95
tat ttc tgc gac gca ttg ggc acc atc cgc gac atc att gaa ccg cgc 336
Tyr Phe Cys Asp Ala Leu Gly Thr Ile Arg Asp Ile Ile Glu Pro Arg
100 105 110
ggt gca acc atc gtt ggt cac tgg cca act gcg ggc tat cat ttc gaa 384
Gly Ala Thr Ile Val Gly His Trp Pro Thr Ala Gly Tyr His Phe Glu
115 120 125
gca tca aaa ggt ctg gca gat gac gac cac ttt gtc ggt ctg gct atc 432
Ala Ser Lys Gly Leu Ala Asp Asp Asp His Phe Val Gly Leu Ala Ile
130 135 140
gac gaa gac cgt cag ccg gaa ctg acc gct gaa cgt gta gaa aaa tgg 480
Asp Glu Asp Arg Gln Pro Glu Leu Thr Ala Glu Arg Val Glu Lys Trp
145 150 155 160
gtt aaa cag att tct gaa gag ttg cat ctc gac gaa att ctc aat gcc 528
Val Lys Gln Ile Ser Glu Glu Leu His Leu Asp Glu Ile Leu Asn Ala
165 170 175
tga 531
<210> 263
<211> 176
<212> PRT
<213> Escherichia coli
<400> 263
Met Ala Ile Thr Gly Ile Phe Phe Gly Ser Asp Thr Gly Asn Thr Glu
1 5 10 15
Asn Ile Ala Lys Met Ile Gln Lys Gln Leu Gly Lys Asp Val Ala Asp
20 25 30
Val His Asp Ile Ala Lys Ser Ser Lys Glu Asp Leu Glu Ala Tyr Asp
35 40 45
Ile Leu Leu Leu Gly Ile Pro Thr Trp Tyr Tyr Gly Glu Ala Gln Cys
50 55 60
Asp Trp Asp Asp Phe Phe Pro Thr Leu Glu Glu Ile Asp Phe Asn Gly
65 70 75 80
Lys Leu Val Ala Leu Phe Gly Cys Gly Asp Gln Glu Asp Tyr Ala Glu
85 90 95
Tyr Phe Cys Asp Ala Leu Gly Thr Ile Arg Asp Ile Ile Glu Pro Arg
100 105 110
Gly Ala Thr Ile Val Gly His Trp Pro Thr Ala Gly Tyr His Phe Glu
115 120 125
Ala Ser Lys Gly Leu Ala Asp Asp Asp His Phe Val Gly Leu Ala Ile
130 135 140
Asp Glu Asp Arg Gln Pro Glu Leu Thr Ala Glu Arg Val Glu Lys Trp
145 150 155 160
Val Lys Gln Ile Ser Glu Glu Leu His Leu Asp Glu Ile Leu Asn Ala
165 170 175
<210> 264
<211> 522
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(522)
<223> fldB gene from E. coli encoding flavodoxin
<400> 264
atg aat atg ggt ctt ttt tac ggt tcc agc acc tgt tac acc gaa atg 48
Met Asn Met Gly Leu Phe Tyr Gly Ser Ser Thr Cys Tyr Thr Glu Met
1 5 10 15
gcg gca gaa aaa atc cgc gat att atc ggc cca gaa ctg gtg acc tta 96
Ala Ala Glu Lys Ile Arg Asp Ile Ile Gly Pro Glu Leu Val Thr Leu
20 25 30
cat aac ctc aag gac gac tcc ccg aaa tta atg gag cag tac gat gtg 144
His Asn Leu Lys Asp Asp Ser Pro Lys Leu Met Glu Gln Tyr Asp Val
35 40 45
ctc att ctg ggt atc ccg acc tgg gat ttt ggt gaa atc cag gaa gac 192
Leu Ile Leu Gly Ile Pro Thr Trp Asp Phe Gly Glu Ile Gln Glu Asp
50 55 60
tgg gaa gcc gtc tgg gat cag ctc gac gac ctg aac ctt gaa ggt aaa 240
Trp Glu Ala Val Trp Asp Gln Leu Asp Asp Leu Asn Leu Glu Gly Lys
65 70 75 80
att gtt gcg ctg tat ggg ctt ggc gat caa ctg gga tac ggc gag tgg 288
Ile Val Ala Leu Tyr Gly Leu Gly Asp Gln Leu Gly Tyr Gly Glu Trp
85 90 95
ttc ctc gat gcg ctc ggt atg ctg cat gac aaa ctc tcg acc aaa ggc 336
Phe Leu Asp Ala Leu Gly Met Leu His Asp Lys Leu Ser Thr Lys Gly
100 105 110
gtg aag ttc gtc ggc tac tgg cca acg gaa gga tat gaa ttt acc agc 384
Val Lys Phe Val Gly Tyr Trp Pro Thr Glu Gly Tyr Glu Phe Thr Ser
115 120 125
ccg aaa ccg gtg att gct gac ggg caa ctg ttc gtg ggt ctg gcg ctg 432
Pro Lys Pro Val Ile Ala Asp Gly Gln Leu Phe Val Gly Leu Ala Leu
130 135 140
gat gaa act aac cag tat gac ctt agc gac gag cgt att cag agc tgg 480
Asp Glu Thr Asn Gln Tyr Asp Leu Ser Asp Glu Arg Ile Gln Ser Trp
145 150 155 160
tgc gag caa atc ctc aac gaa atg gca gag cat tac gcc tga 522
Cys Glu Gln Ile Leu Asn Glu Met Ala Glu His Tyr Ala
165 170
<210> 265
<211> 173
<212> PRT
<213> Escherichia coli
<400> 265
Met Asn Met Gly Leu Phe Tyr Gly Ser Ser Thr Cys Tyr Thr Glu Met
1 5 10 15
Ala Ala Glu Lys Ile Arg Asp Ile Ile Gly Pro Glu Leu Val Thr Leu
20 25 30
His Asn Leu Lys Asp Asp Ser Pro Lys Leu Met Glu Gln Tyr Asp Val
35 40 45
Leu Ile Leu Gly Ile Pro Thr Trp Asp Phe Gly Glu Ile Gln Glu Asp
50 55 60
Trp Glu Ala Val Trp Asp Gln Leu Asp Asp Leu Asn Leu Glu Gly Lys
65 70 75 80
Ile Val Ala Leu Tyr Gly Leu Gly Asp Gln Leu Gly Tyr Gly Glu Trp
85 90 95
Phe Leu Asp Ala Leu Gly Met Leu His Asp Lys Leu Ser Thr Lys Gly
100 105 110
Val Lys Phe Val Gly Tyr Trp Pro Thr Glu Gly Tyr Glu Phe Thr Ser
115 120 125
Pro Lys Pro Val Ile Ala Asp Gly Gln Leu Phe Val Gly Leu Ala Leu
130 135 140
Asp Glu Thr Asn Gln Tyr Asp Leu Ser Asp Glu Arg Ile Gln Ser Trp
145 150 155 160
Cys Glu Gln Ile Leu Asn Glu Met Ala Glu His Tyr Ala
165 170
<210> 266
<211> 477
<212> DNA
<213> Bacillus subtilis
<220>
<221> CDS
<222> (1)..(477)
<223> ykuN gene from Bacillus subtilis encoding flavodoxin
<400> 266
atg gct aaa gcc ttg att aca tat gcc agc atg tca gga aat aca gaa 48
Met Ala Lys Ala Leu Ile Thr Tyr Ala Ser Met Ser Gly Asn Thr Glu
1 5 10 15
gac att gcc ttc ata ata aaa gat acg ctt cag gaa tat gag ttg gat 96
Asp Ile Ala Phe Ile Ile Lys Asp Thr Leu Gln Glu Tyr Glu Leu Asp
20 25 30
atc gat tgt gtc gag ata aat gat atg gat gcg tct tgt tta acc tcc 144
Ile Asp Cys Val Glu Ile Asn Asp Met Asp Ala Ser Cys Leu Thr Ser
35 40 45
tat gat tat gta ctg att ggc acc tat aca tgg ggg gac ggc gat ttg 192
Tyr Asp Tyr Val Leu Ile Gly Thr Tyr Thr Trp Gly Asp Gly Asp Leu
50 55 60
ccc tac gaa gcg gag gat ttt ttc gaa gag gtc aaa cag att cag ctt 240
Pro Tyr Glu Ala Glu Asp Phe Phe Glu Glu Val Lys Gln Ile Gln Leu
65 70 75 80
aat ggt tta aaa aca gcc tgc ttc ggg tct ggc gat tat tct tat cca 288
Asn Gly Leu Lys Thr Ala Cys Phe Gly Ser Gly Asp Tyr Ser Tyr Pro
85 90 95
aag ttt tgc gaa gcg gtg aat ttg ttc aat gtc atg ctg caa gag gcg 336
Lys Phe Cys Glu Ala Val Asn Leu Phe Asn Val Met Leu Gln Glu Ala
100 105 110
gga gct gct gtt tac cag gaa aca cta aaa att gaa tta gcg cct gaa 384
Gly Ala Ala Val Tyr Gln Glu Thr Leu Lys Ile Glu Leu Ala Pro Glu
115 120 125
aca gat gaa gat gtg gaa agc tgc cga gcg ttt gcg aga ggt ttt ctt 432
Thr Asp Glu Asp Val Glu Ser Cys Arg Ala Phe Ala Arg Gly Phe Leu
130 135 140
gca tgg gca gat tat atg aac aag gaa aaa atc cat gtt tca taa 477
Ala Trp Ala Asp Tyr Met Asn Lys Glu Lys Ile His Val Ser
145 150 155
<210> 267
<211> 158
<212> PRT
<213> Bacillus subtilis
<400> 267
Met Ala Lys Ala Leu Ile Thr Tyr Ala Ser Met Ser Gly Asn Thr Glu
1 5 10 15
Asp Ile Ala Phe Ile Ile Lys Asp Thr Leu Gln Glu Tyr Glu Leu Asp
20 25 30
Ile Asp Cys Val Glu Ile Asn Asp Met Asp Ala Ser Cys Leu Thr Ser
35 40 45
Tyr Asp Tyr Val Leu Ile Gly Thr Tyr Thr Trp Gly Asp Gly Asp Leu
50 55 60
Pro Tyr Glu Ala Glu Asp Phe Phe Glu Glu Val Lys Gln Ile Gln Leu
65 70 75 80
Asn Gly Leu Lys Thr Ala Cys Phe Gly Ser Gly Asp Tyr Ser Tyr Pro
85 90 95
Lys Phe Cys Glu Ala Val Asn Leu Phe Asn Val Met Leu Gln Glu Ala
100 105 110
Gly Ala Ala Val Tyr Gln Glu Thr Leu Lys Ile Glu Leu Ala Pro Glu
115 120 125
Thr Asp Glu Asp Val Glu Ser Cys Arg Ala Phe Ala Arg Gly Phe Leu
130 135 140
Ala Trp Ala Asp Tyr Met Asn Lys Glu Lys Ile His Val Ser
145 150 155
<210> 268
<211> 513
<212> DNA
<213> Synechocystis PCC6803
<220>
<221> CDS
<222> (1)..(513)
<223> isiB gene from Synechocystis encoding flavodoxin
<400> 268
atg aca aaa att gga ctt ttt tac ggt act caa acc ggc aac act gaa 48
Met Thr Lys Ile Gly Leu Phe Tyr Gly Thr Gln Thr Gly Asn Thr Glu
1 5 10 15
acc att gct gaa ctg att caa aaa gaa atg ggc ggc gat agt gtg gtc 96
Thr Ile Ala Glu Leu Ile Gln Lys Glu Met Gly Gly Asp Ser Val Val
20 25 30
gat atg atg gat ata tcc cag gct gat gtt gat gat ttt agg caa tat 144
Asp Met Met Asp Ile Ser Gln Ala Asp Val Asp Asp Phe Arg Gln Tyr
35 40 45
agt tgc ctg att atc ggt tgt ccc acc tgg aat gtg ggg gaa ctc cag 192
Ser Cys Leu Ile Ile Gly Cys Pro Thr Trp Asn Val Gly Glu Leu Gln
50 55 60
agt gat tgg gaa ggc ttt tat gac caa tta gac gaa att gat ttt aat 240
Ser Asp Trp Glu Gly Phe Tyr Asp Gln Leu Asp Glu Ile Asp Phe Asn
65 70 75 80
ggc aaa aaa gta gcc tat ttt ggt gct ggc gat cag gtt ggt tat gca 288
Gly Lys Lys Val Ala Tyr Phe Gly Ala Gly Asp Gln Val Gly Tyr Ala
85 90 95
gat aat ttt caa gac gcc atg ggc att tta gaa gaa aaa atc agt gga 336
Asp Asn Phe Gln Asp Ala Met Gly Ile Leu Glu Glu Lys Ile Ser Gly
100 105 110
tta ggc ggt aaa aca gtg ggg ttt tgg ccc acc gct ggc tat gat ttt 384
Leu Gly Gly Lys Thr Val Gly Phe Trp Pro Thr Ala Gly Tyr Asp Phe
115 120 125
gac gaa tca aaa gcg gtg aaa aat ggg aaa ttt gtt ggt tta gct ttg 432
Asp Glu Ser Lys Ala Val Lys Asn Gly Lys Phe Val Gly Leu Ala Leu
130 135 140
gac gaa gat aat cag cca gag tta aca gaa tta aga gta aag aca tgg 480
Asp Glu Asp Asn Gln Pro Glu Leu Thr Glu Leu Arg Val Lys Thr Trp
145 150 155 160
gta agt gaa att aaa cca att ttg caa tcc tag 513
Val Ser Glu Ile Lys Pro Ile Leu Gln Ser
165 170
<210> 269
<211> 170
<212> PRT
<213> Synechocystis PCC6803
<400> 269
Met Thr Lys Ile Gly Leu Phe Tyr Gly Thr Gln Thr Gly Asn Thr Glu
1 5 10 15
Thr Ile Ala Glu Leu Ile Gln Lys Glu Met Gly Gly Asp Ser Val Val
20 25 30
Asp Met Met Asp Ile Ser Gln Ala Asp Val Asp Asp Phe Arg Gln Tyr
35 40 45
Ser Cys Leu Ile Ile Gly Cys Pro Thr Trp Asn Val Gly Glu Leu Gln
50 55 60
Ser Asp Trp Glu Gly Phe Tyr Asp Gln Leu Asp Glu Ile Asp Phe Asn
65 70 75 80
Gly Lys Lys Val Ala Tyr Phe Gly Ala Gly Asp Gln Val Gly Tyr Ala
85 90 95
Asp Asn Phe Gln Asp Ala Met Gly Ile Leu Glu Glu Lys Ile Ser Gly
100 105 110
Leu Gly Gly Lys Thr Val Gly Phe Trp Pro Thr Ala Gly Tyr Asp Phe
115 120 125
Asp Glu Ser Lys Ala Val Lys Asn Gly Lys Phe Val Gly Leu Ala Leu
130 135 140
Asp Glu Asp Asn Gln Pro Glu Leu Thr Glu Leu Arg Val Lys Thr Trp
145 150 155 160
Val Ser Glu Ile Lys Pro Ile Leu Gln Ser
165 170
<210> 270
<211> 585
<212> DNA
<213> Streptomyces venezuelae
<220>
<221> CDS
<222> (1)..(585)
<223> wrbA gene from Streptomyces venezuelae encoding flavodoxin
<400> 270
atg acc acc ccc gtc gtc tcc atc gcc tac cac tcc ggc tac ggc cac 48
Met Thr Thr Pro Val Val Ser Ile Ala Tyr His Ser Gly Tyr Gly His
1 5 10 15
acc gcg gtc ctg gcc gag gcc gtc cgt gac ggc gcc gcc gac gcg ggc 96
Thr Ala Val Leu Ala Glu Ala Val Arg Asp Gly Ala Ala Asp Ala Gly
20 25 30
gcc acc gtc cac ctg atc aag gtc gac ggg atc acc gag gcg gag tgg 144
Ala Thr Val His Leu Ile Lys Val Asp Gly Ile Thr Glu Ala Glu Trp
35 40 45
gag ctg ctc gac gcc tcc gac gcg atc gtc ttc ggc tcc ccg acc tac 192
Glu Leu Leu Asp Ala Ser Asp Ala Ile Val Phe Gly Ser Pro Thr Tyr
50 55 60
atg ggc acc gcc tcc ggt gcc ttc cac cag ttc gcc gag gac tcc tcg 240
Met Gly Thr Ala Ser Gly Ala Phe His Gln Phe Ala Glu Asp Ser Ser
65 70 75 80
aag cgc tgg ttc ggc gac gtc tgg ctg gac aag ctc gcc gcc ggc ttc 288
Lys Arg Trp Phe Gly Asp Val Trp Leu Asp Lys Leu Ala Ala Gly Phe
85 90 95
acc aac tcc ggc tcc aag agc ggc gac aag ctg cac acc ctg cag tac 336
Thr Asn Ser Gly Ser Lys Ser Gly Asp Lys Leu His Thr Leu Gln Tyr
100 105 110
ttc cag atc ctc gcc ggc cag cac ggc atg cac tgg gtc aac ctc ggc 384
Phe Gln Ile Leu Ala Gly Gln His Gly Met His Trp Val Asn Leu Gly
115 120 125
ctg aag ccc ggc tgg aac acc agc gag gcc tcc gag aac gac atc aac 432
Leu Lys Pro Gly Trp Asn Thr Ser Glu Ala Ser Glu Asn Asp Ile Asn
130 135 140
cgc ctc ggc ttc ttc tcc ggc gcc gcc ggc cag acc ccc gcg gac ctg 480
Arg Leu Gly Phe Phe Ser Gly Ala Ala Gly Gln Thr Pro Ala Asp Leu
145 150 155 160
ggc ccc gag gcc gtc cac aag gcc gac gtc gcc acc gcc gaa cac ctc 528
Gly Pro Glu Ala Val His Lys Ala Asp Val Ala Thr Ala Glu His Leu
165 170 175
ggc cgc cgc gtc gcc gag acc gcc cgc acc ttc gcg gcc ggc aag gcc 576
Gly Arg Arg Val Ala Glu Thr Ala Arg Thr Phe Ala Ala Gly Lys Ala
180 185 190
gcc gcc tga 585
Ala Ala
<210> 271
<211> 194
<212> PRT
<213> Streptomyces venezuelae
<400> 271
Met Thr Thr Pro Val Val Ser Ile Ala Tyr His Ser Gly Tyr Gly His
1 5 10 15
Thr Ala Val Leu Ala Glu Ala Val Arg Asp Gly Ala Ala Asp Ala Gly
20 25 30
Ala Thr Val His Leu Ile Lys Val Asp Gly Ile Thr Glu Ala Glu Trp
35 40 45
Glu Leu Leu Asp Ala Ser Asp Ala Ile Val Phe Gly Ser Pro Thr Tyr
50 55 60
Met Gly Thr Ala Ser Gly Ala Phe His Gln Phe Ala Glu Asp Ser Ser
65 70 75 80
Lys Arg Trp Phe Gly Asp Val Trp Leu Asp Lys Leu Ala Ala Gly Phe
85 90 95
Thr Asn Ser Gly Ser Lys Ser Gly Asp Lys Leu His Thr Leu Gln Tyr
100 105 110
Phe Gln Ile Leu Ala Gly Gln His Gly Met His Trp Val Asn Leu Gly
115 120 125
Leu Lys Pro Gly Trp Asn Thr Ser Glu Ala Ser Glu Asn Asp Ile Asn
130 135 140
Arg Leu Gly Phe Phe Ser Gly Ala Ala Gly Gln Thr Pro Ala Asp Leu
145 150 155 160
Gly Pro Glu Ala Val His Lys Ala Asp Val Ala Thr Ala Glu His Leu
165 170 175
Gly Arg Arg Val Ala Glu Thr Ala Arg Thr Phe Ala Ala Gly Lys Ala
180 185 190
Ala Ala
<210> 272
<211> 459
<212> DNA
<213> Methanococcus aeolicus
<220>
<221> CDS
<222> (1)..(459)
<223> PRK06242 gene from Methanococcus aeolicus encoding flavodoxin
<400> 272
atg aaa ata tta att att tgt aaa tcc gta cac cat gga aac act aaa 48
Met Lys Ile Leu Ile Ile Cys Lys Ser Val His His Gly Asn Thr Lys
1 5 10 15
aaa ata gca gat gcc atg gca gag gtt tta aat gca gag gtt att gca 96
Lys Ile Ala Asp Ala Met Ala Glu Val Leu Asn Ala Glu Val Ile Ala
20 25 30
cct gaa aat gta agt tcc gaa gat atc aaa aaa tat gat ttg gtg gga 144
Pro Glu Asn Val Ser Ser Glu Asp Ile Lys Lys Tyr Asp Leu Val Gly
35 40 45
ttt ggc tct gga ata tat att ggg aaa cat cat aaa aag cta tta aaa 192
Phe Gly Ser Gly Ile Tyr Ile Gly Lys His His Lys Lys Leu Leu Lys
50 55 60
ctt gcg gat aat ctt cca aat gga gaa aat aaa aca gta ttt gta ttt 240
Leu Ala Asp Asn Leu Pro Asn Gly Glu Asn Lys Thr Val Phe Val Phe
65 70 75 80
tcc aca agc gat aac tgg aag caa aat tac cat aag cca tta atg gat 288
Ser Thr Ser Asp Asn Trp Lys Gln Asn Tyr His Lys Pro Leu Met Asp
85 90 95
aaa cta aat tcc aga gga tat aaa aca gta gga gaa ttc aac tgt aaa 336
Lys Leu Asn Ser Arg Gly Tyr Lys Thr Val Gly Glu Phe Asn Cys Lys
100 105 110
ggg ttt gat gac tgg ttt ata ttt aaa tta att ggt ggt aga aat aaa 384
Gly Phe Asp Asp Trp Phe Ile Phe Lys Leu Ile Gly Gly Arg Asn Lys
115 120 125
gga cat cca aat aaa aaa gat att gaa aat gca aaa aaa ttt gct gaa 432
Gly His Pro Asn Lys Lys Asp Ile Glu Asn Ala Lys Lys Phe Ala Glu
130 135 140
aat ata aag aat ata gaa aat ata tag 459
Asn Ile Lys Asn Ile Glu Asn Ile
145 150
<210> 273
<211> 152
<212> PRT
<213> Methanococcus aeolicus
<400> 273
Met Lys Ile Leu Ile Ile Cys Lys Ser Val His His Gly Asn Thr Lys
1 5 10 15
Lys Ile Ala Asp Ala Met Ala Glu Val Leu Asn Ala Glu Val Ile Ala
20 25 30
Pro Glu Asn Val Ser Ser Glu Asp Ile Lys Lys Tyr Asp Leu Val Gly
35 40 45
Phe Gly Ser Gly Ile Tyr Ile Gly Lys His His Lys Lys Leu Leu Lys
50 55 60
Leu Ala Asp Asn Leu Pro Asn Gly Glu Asn Lys Thr Val Phe Val Phe
65 70 75 80
Ser Thr Ser Asp Asn Trp Lys Gln Asn Tyr His Lys Pro Leu Met Asp
85 90 95
Lys Leu Asn Ser Arg Gly Tyr Lys Thr Val Gly Glu Phe Asn Cys Lys
100 105 110
Gly Phe Asp Asp Trp Phe Ile Phe Lys Leu Ile Gly Gly Arg Asn Lys
115 120 125
Gly His Pro Asn Lys Lys Asp Ile Glu Asn Ala Lys Lys Phe Ala Glu
130 135 140
Asn Ile Lys Asn Ile Glu Asn Ile
145 150
<210> 274
<211> 336
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(336)
<223> fdx gene from E. coli encoding ferredoxin
<400> 274
atg cca aag att gtt att ttg cct cat cag gat ctc tgc cct gat ggc 48
Met Pro Lys Ile Val Ile Leu Pro His Gln Asp Leu Cys Pro Asp Gly
1 5 10 15
gct gtt ctg gaa gct aat agc ggt gaa acc att ctc gac gca gct ctg 96
Ala Val Leu Glu Ala Asn Ser Gly Glu Thr Ile Leu Asp Ala Ala Leu
20 25 30
cgt aac ggt atc gag att gaa cac gcc tgt gaa aaa tcc tgt gct tgc 144
Arg Asn Gly Ile Glu Ile Glu His Ala Cys Glu Lys Ser Cys Ala Cys
35 40 45
acc acc tgc cac tgc atc gtt cgt gaa ggt ttt gac tca ctg ccg gaa 192
Thr Thr Cys His Cys Ile Val Arg Glu Gly Phe Asp Ser Leu Pro Glu
50 55 60
agc tca gag cag gaa gac gac atg ctg gac aaa gcc tgg gga ctg gag 240
Ser Ser Glu Gln Glu Asp Asp Met Leu Asp Lys Ala Trp Gly Leu Glu
65 70 75 80
ccg gaa agc cgt tta agc tgc cag gcg cgc gtt acc gac gaa gat tta 288
Pro Glu Ser Arg Leu Ser Cys Gln Ala Arg Val Thr Asp Glu Asp Leu
85 90 95
gta gtc gaa atc ccg cgt tac act atc aac cat gcg cgt gag cat taa 336
Val Val Glu Ile Pro Arg Tyr Thr Ile Asn His Ala Arg Glu His
100 105 110
<210> 275
<211> 111
<212> PRT
<213> Escherichia coli
<400> 275
Met Pro Lys Ile Val Ile Leu Pro His Gln Asp Leu Cys Pro Asp Gly
1 5 10 15
Ala Val Leu Glu Ala Asn Ser Gly Glu Thr Ile Leu Asp Ala Ala Leu
20 25 30
Arg Asn Gly Ile Glu Ile Glu His Ala Cys Glu Lys Ser Cys Ala Cys
35 40 45
Thr Thr Cys His Cys Ile Val Arg Glu Gly Phe Asp Ser Leu Pro Glu
50 55 60
Ser Ser Glu Gln Glu Asp Asp Met Leu Asp Lys Ala Trp Gly Leu Glu
65 70 75 80
Pro Glu Ser Arg Leu Ser Cys Gln Ala Arg Val Thr Asp Glu Asp Leu
85 90 95
Val Val Glu Ile Pro Arg Tyr Thr Ile Asn His Ala Arg Glu His
100 105 110
<210> 276
<211> 249
<212> DNA
<213> Bacillus subtilis
<220>
<221> CDS
<222> (1)..(249)
<223> fer gene from B. subtilis encoding ferredoxin
<400> 276
atg gca aag tac aca atc gta gac aaa gat aca tgt att gca tgc ggc 48
Met Ala Lys Tyr Thr Ile Val Asp Lys Asp Thr Cys Ile Ala Cys Gly
1 5 10 15
gct tgc gga gct gct gca cca gac att tac gat tac gat gat gaa ggc 96
Ala Cys Gly Ala Ala Ala Pro Asp Ile Tyr Asp Tyr Asp Asp Glu Gly
20 25 30
atc gcg ttc gta acg ctt gat gaa aac aaa ggt gtt gtc gaa gtt cct 144
Ile Ala Phe Val Thr Leu Asp Glu Asn Lys Gly Val Val Glu Val Pro
35 40 45
gag gta ctg gaa gaa gat atg att gac gca ttt gaa gga tgc cct act 192
Glu Val Leu Glu Glu Asp Met Ile Asp Ala Phe Glu Gly Cys Pro Thr
50 55 60
gat tcc atc aaa gtg gcg gat gag cca ttt gaa ggc gac ccg ctt aaa 240
Asp Ser Ile Lys Val Ala Asp Glu Pro Phe Glu Gly Asp Pro Leu Lys
65 70 75 80
ttt gaa tag 249
Phe Glu
<210> 277
<211> 82
<212> PRT
<213> Bacillus subtilis
<400> 277
Met Ala Lys Tyr Thr Ile Val Asp Lys Asp Thr Cys Ile Ala Cys Gly
1 5 10 15
Ala Cys Gly Ala Ala Ala Pro Asp Ile Tyr Asp Tyr Asp Asp Glu Gly
20 25 30
Ile Ala Phe Val Thr Leu Asp Glu Asn Lys Gly Val Val Glu Val Pro
35 40 45
Glu Val Leu Glu Glu Asp Met Ile Asp Ala Phe Glu Gly Cys Pro Thr
50 55 60
Asp Ser Ile Lys Val Ala Asp Glu Pro Phe Glu Gly Asp Pro Leu Lys
65 70 75 80
Phe Glu
<210> 278
<211> 321
<212> DNA
<213> Corynebacterium glutamicum
<220>
<221> CDS
<222> (1)..(321)
<223> fdxB gene from Corynebacterium glutamicum encoding ferredoxin
<400> 278
atg tct act att cat ttc att gat cat gct ggc aaa acc cgc acc atc 48
Met Ser Thr Ile His Phe Ile Asp His Ala Gly Lys Thr Arg Thr Ile
1 5 10 15
gag gcg act gtt ggt gat tca gta atg gag acc gca gtc cga aac gga 96
Glu Ala Thr Val Gly Asp Ser Val Met Glu Thr Ala Val Arg Asn Gly
20 25 30
gtg cct gga att gtt gct gaa tgc ggc ggt tcc tta tcg tgt gca acc 144
Val Pro Gly Ile Val Ala Glu Cys Gly Gly Ser Leu Ser Cys Ala Thr
35 40 45
tgc cat gtg ttt gtt gac cct gca cag tat gat gcg ctt ccc cca atg 192
Cys His Val Phe Val Asp Pro Ala Gln Tyr Asp Ala Leu Pro Pro Met
50 55 60
gag gag atg gaa gat gaa atg ctg tgg ggt gct gcc gtg gac cgt gag 240
Glu Glu Met Glu Asp Glu Met Leu Trp Gly Ala Ala Val Asp Arg Glu
65 70 75 80
gat tgc tcc cgt ttg tct tgc caa atc aag gtc acc gaa ggc atg gat 288
Asp Cys Ser Arg Leu Ser Cys Gln Ile Lys Val Thr Glu Gly Met Asp
85 90 95
ctt tcg ttg acc acg cca gaa acg caa gtg tga 321
Leu Ser Leu Thr Thr Pro Glu Thr Gln Val
100 105
<210> 279
<211> 106
<212> PRT
<213> Corynebacterium glutamicum
<400> 279
Met Ser Thr Ile His Phe Ile Asp His Ala Gly Lys Thr Arg Thr Ile
1 5 10 15
Glu Ala Thr Val Gly Asp Ser Val Met Glu Thr Ala Val Arg Asn Gly
20 25 30
Val Pro Gly Ile Val Ala Glu Cys Gly Gly Ser Leu Ser Cys Ala Thr
35 40 45
Cys His Val Phe Val Asp Pro Ala Gln Tyr Asp Ala Leu Pro Pro Met
50 55 60
Glu Glu Met Glu Asp Glu Met Leu Trp Gly Ala Ala Val Asp Arg Glu
65 70 75 80
Asp Cys Ser Arg Leu Ser Cys Gln Ile Lys Val Thr Glu Gly Met Asp
85 90 95
Leu Ser Leu Thr Thr Pro Glu Thr Gln Val
100 105
<210> 280
<211> 561
<212> DNA
<213> Synechocystis PCC6803
<220>
<221> CDS
<222> (1)..(561)
<223> fdx gene from Synechocystis encoding ferredoxin
<400> 280
atg acc atg cca cca tta tgg aat tgc tct gtc gcc aac agg gtt aat 48
Met Thr Met Pro Pro Leu Trp Asn Cys Ser Val Ala Asn Arg Val Asn
1 5 10 15
gcc att gtt gcc agt act aag gag gat tgt gtg gct aaa act att aag 96
Ala Ile Val Ala Ser Thr Lys Glu Asp Cys Val Ala Lys Thr Ile Lys
20 25 30
ctc gac ccc att gat tta aaa gtc gcc atc gag acc aac gat aac ctg 144
Leu Asp Pro Ile Asp Leu Lys Val Ala Ile Glu Thr Asn Asp Asn Leu
35 40 45
ctc tcg ggg ttg ctc ggt cag gat tta cgg atc atg aag gag tgt ggt 192
Leu Ser Gly Leu Leu Gly Gln Asp Leu Arg Ile Met Lys Glu Cys Gly
50 55 60
ggt cgg ggt atg tgt gcc act tgt cac gtt tac atc acc gct ggg atg 240
Gly Arg Gly Met Cys Ala Thr Cys His Val Tyr Ile Thr Ala Gly Met
65 70 75 80
gag agt ctt tct ccc ctc aac cgt cgg gag cag cgc acc cta gag gtg 288
Glu Ser Leu Ser Pro Leu Asn Arg Arg Glu Gln Arg Thr Leu Glu Val
85 90 95
atc acc acc cac aat cgt tat tcc cgt ttg gct tgc caa gcc cgg gtg 336
Ile Thr Thr His Asn Arg Tyr Ser Arg Leu Ala Cys Gln Ala Arg Val
100 105 110
ttg gat gaa ggc gtg gtg gtg gaa ttg ccc gct ggg atg tac gtc agt 384
Leu Asp Glu Gly Val Val Val Glu Leu Pro Ala Gly Met Tyr Val Ser
115 120 125
gaa att gag gac atc gag gag ctg att ggc cgt cga gcg gag gaa aat 432
Glu Ile Glu Asp Ile Glu Glu Leu Ile Gly Arg Arg Ala Glu Glu Asn
130 135 140
att ctc aat cct cgg gat ggg agc atc cta gtg gaa aaa ggt aag tta 480
Ile Leu Asn Pro Arg Asp Gly Ser Ile Leu Val Glu Lys Gly Lys Leu
145 150 155 160
att acc cgt tcc atg att agt caa cta gat gac cag tta cag gcg gcc 528
Ile Thr Arg Ser Met Ile Ser Gln Leu Asp Asp Gln Leu Gln Ala Ala
165 170 175
aaa att cag att gtc aac gat acc gat gaa taa 561
Lys Ile Gln Ile Val Asn Asp Thr Asp Glu
180 185
<210> 281
<211> 186
<212> PRT
<213> Synechocystis PCC6803
<400> 281
Met Thr Met Pro Pro Leu Trp Asn Cys Ser Val Ala Asn Arg Val Asn
1 5 10 15
Ala Ile Val Ala Ser Thr Lys Glu Asp Cys Val Ala Lys Thr Ile Lys
20 25 30
Leu Asp Pro Ile Asp Leu Lys Val Ala Ile Glu Thr Asn Asp Asn Leu
35 40 45
Leu Ser Gly Leu Leu Gly Gln Asp Leu Arg Ile Met Lys Glu Cys Gly
50 55 60
Gly Arg Gly Met Cys Ala Thr Cys His Val Tyr Ile Thr Ala Gly Met
65 70 75 80
Glu Ser Leu Ser Pro Leu Asn Arg Arg Glu Gln Arg Thr Leu Glu Val
85 90 95
Ile Thr Thr His Asn Arg Tyr Ser Arg Leu Ala Cys Gln Ala Arg Val
100 105 110
Leu Asp Glu Gly Val Val Val Glu Leu Pro Ala Gly Met Tyr Val Ser
115 120 125
Glu Ile Glu Asp Ile Glu Glu Leu Ile Gly Arg Arg Ala Glu Glu Asn
130 135 140
Ile Leu Asn Pro Arg Asp Gly Ser Ile Leu Val Glu Lys Gly Lys Leu
145 150 155 160
Ile Thr Arg Ser Met Ile Ser Gln Leu Asp Asp Gln Leu Gln Ala Ala
165 170 175
Lys Ile Gln Ile Val Asn Asp Thr Asp Glu
180 185
<210> 282
<211> 294
<212> DNA
<213> Streptomyces venezuelae
<220>
<221> CDS
<222> (1)..(294)
<223> SVEN_7039 gene from Streptomyces venezuelae encoding ferredoxin
<400> 282
atg gcg tac gtc gtc acc gac gag tgc atc ggc tgc aag tac acg gac 48
Met Ala Tyr Val Val Thr Asp Glu Cys Ile Gly Cys Lys Tyr Thr Asp
1 5 10 15
tgt gtg gac gtc tgc ccc gtg agc tgt ttc cac gag ggc ccc gag atg 96
Cys Val Asp Val Cys Pro Val Ser Cys Phe His Glu Gly Pro Glu Met
20 25 30
ctc tac atc aac ccc gag gaa tgc atc gac tgc aac gcg tgc gtc gcc 144
Leu Tyr Ile Asn Pro Glu Glu Cys Ile Asp Cys Asn Ala Cys Val Ala
35 40 45
gag tgc ccg ccc gag gcc atc tgg gcg gac gtc gac ctg ccg gag gac 192
Glu Cys Pro Pro Glu Ala Ile Trp Ala Asp Val Asp Leu Pro Glu Asp
50 55 60
aag ctc cag tgg atc gag atc aac gga gag atg agt gcc aag tac ccg 240
Lys Leu Gln Trp Ile Glu Ile Asn Gly Glu Met Ser Ala Lys Tyr Pro
65 70 75 80
gtt ctc cac gag agc cgg ggc ccc cac gga cag ccc tcc agc cag cct 288
Val Leu His Glu Ser Arg Gly Pro His Gly Gln Pro Ser Ser Gln Pro
85 90 95
tcc tga 294
Ser
<210> 283
<211> 97
<212> PRT
<213> Streptomyces venezuelae
<400> 283
Met Ala Tyr Val Val Thr Asp Glu Cys Ile Gly Cys Lys Tyr Thr Asp
1 5 10 15
Cys Val Asp Val Cys Pro Val Ser Cys Phe His Glu Gly Pro Glu Met
20 25 30
Leu Tyr Ile Asn Pro Glu Glu Cys Ile Asp Cys Asn Ala Cys Val Ala
35 40 45
Glu Cys Pro Pro Glu Ala Ile Trp Ala Asp Val Asp Leu Pro Glu Asp
50 55 60
Lys Leu Gln Trp Ile Glu Ile Asn Gly Glu Met Ser Ala Lys Tyr Pro
65 70 75 80
Val Leu His Glu Ser Arg Gly Pro His Gly Gln Pro Ser Ser Gln Pro
85 90 95
Ser
<210> 284
<211> 1746
<212> DNA
<213> Methanococcus aeolicus
<220>
<221> CDS
<222> (1)..(1746)
<223> fdx gene from Methanococcus aeolicus encoding ferredoxin
<400> 284
atg ggg ggt gtt atg atg tat aat att aca tac ata aaa gag gat gga 48
Met Gly Gly Val Met Met Tyr Asn Ile Thr Tyr Ile Lys Glu Asp Gly
1 5 10 15
act aaa aaa tca att aaa gtt aaa gaa gga acc aca ata ctt gaa gga 96
Thr Lys Lys Ser Ile Lys Val Lys Glu Gly Thr Thr Ile Leu Glu Gly
20 25 30
gcg ata aaa gcg gga gtt tat att gat gct cca tgt gga acg ggg aaa 144
Ala Ile Lys Ala Gly Val Tyr Ile Asp Ala Pro Cys Gly Thr Gly Lys
35 40 45
tgt ggt aag tgt aaa gtt tta gtg gag aaa ggt tta gaa aat att gat 192
Cys Gly Lys Cys Lys Val Leu Val Glu Lys Gly Leu Glu Asn Ile Asp
50 55 60
aag gat agt att gtg gaa gat gag tat gca ctg gca tgt gtg gca aaa 240
Lys Asp Ser Ile Val Glu Asp Glu Tyr Ala Leu Ala Cys Val Ala Lys
65 70 75 80
gtt tat ggg gac ata tca att aat gtt cca aat ttc caa ggt gtg gtt 288
Val Tyr Gly Asp Ile Ser Ile Asn Val Pro Asn Phe Gln Gly Val Val
85 90 95
tgt aag gat atc acc aac gaa gtt ggt gag cta caa act cga aga gtt 336
Cys Lys Asp Ile Thr Asn Glu Val Gly Glu Leu Gln Thr Arg Arg Val
100 105 110
tgt tca att acc gaa caa tgt aaa ggt gag cta caa aat ctc gag gga 384
Cys Ser Ile Thr Glu Gln Cys Lys Gly Glu Leu Gln Asn Leu Glu Gly
115 120 125
ttt cat ccg atg tct tta aac ccc gat att gga ata aat aaa att act 432
Phe His Pro Met Ser Leu Asn Pro Asp Ile Gly Ile Asn Lys Ile Thr
130 135 140
aca aca gta ttg gaa tca tct aac tat aac cta aca tta gat gca ata 480
Thr Thr Val Leu Glu Ser Ser Asn Tyr Asn Leu Thr Leu Asp Ala Ile
145 150 155 160
aat aag ctt aat tct atg aag tta tcg gac gaa gta act tta ata tta 528
Asn Lys Leu Asn Ser Met Lys Leu Ser Asp Glu Val Thr Leu Ile Leu
165 170 175
aag gga gat aat gtc gtt aat gta gaa aaa gat ttt tct gga att tat 576
Lys Gly Asp Asn Val Val Asn Val Glu Lys Asp Phe Ser Gly Ile Tyr
180 185 190
ggg ctt tca att gat att ggg act aca tct gtt gtt gta tat ctt gtt 624
Gly Leu Ser Ile Asp Ile Gly Thr Thr Ser Val Val Val Tyr Leu Val
195 200 205
gat att tct aaa ggt att gtt tta gat aat att tct ttt tta aat cct 672
Asp Ile Ser Lys Gly Ile Val Leu Asp Asn Ile Ser Phe Leu Asn Pro
210 215 220
cag agg cag ttt ggg gca gat gtt gtt tca aga ata gca tac aac aac 720
Gln Arg Gln Phe Gly Ala Asp Val Val Ser Arg Ile Ala Tyr Asn Asn
225 230 235 240
gga att tta ctg caa aaa aca ctt ata act gaa tta aac gat tct ata 768
Gly Ile Leu Leu Gln Lys Thr Leu Ile Thr Glu Leu Asn Asp Ser Ile
245 250 255
tca aaa tta tgt tca aac aat aac ata aaa atg gat aat att tat gaa 816
Ser Lys Leu Cys Ser Asn Asn Asn Ile Lys Met Asp Asn Ile Tyr Glu
260 265 270
gtt agt gtg gta gga aac act gct atg ata cac ttc ttt tat gga ata 864
Val Ser Val Val Gly Asn Thr Ala Met Ile His Phe Phe Tyr Gly Ile
275 280 285
gtc cca aaa aat ctt gca acc cat cct tat gtt cca aca ttt aaa aac 912
Val Pro Lys Asn Leu Ala Thr His Pro Tyr Val Pro Thr Phe Lys Asn
290 295 300
tca cca tat ctt cct gca aaa gag ttg ggg cta aac cta aga aac gca 960
Ser Pro Tyr Leu Pro Ala Lys Glu Leu Gly Leu Asn Leu Arg Asn Ala
305 310 315 320
tac att tac aca ctt ccg ata ata gga ggt tat gtt ggg gca gac aca 1008
Tyr Ile Tyr Thr Leu Pro Ile Ile Gly Gly Tyr Val Gly Ala Asp Thr
325 330 335
gtt gga gca att tta tca tct gaa atg cat aaa aaa gat gat ata agt 1056
Val Gly Ala Ile Leu Ser Ser Glu Met His Lys Lys Asp Asp Ile Ser
340 345 350
ctc ctt ata gat att ggc aca aat ggg gaa att gtt tta ggg aat aaa 1104
Leu Leu Ile Asp Ile Gly Thr Asn Gly Glu Ile Val Leu Gly Asn Lys
355 360 365
gaa aag tta tta acc tgt tca tgt gca gca ggt cct gca ttt gag ggt 1152
Glu Lys Leu Leu Thr Cys Ser Cys Ala Ala Gly Pro Ala Phe Glu Gly
370 375 380
gtc agc ata gag cat ggg aca aat gct aga gag ggg gca gta tgt aga 1200
Val Ser Ile Glu His Gly Thr Asn Ala Arg Glu Gly Ala Val Cys Arg
385 390 395 400
gta aaa ata gat gaa aat aac ata tac tat gag acc ata gga aat aaa 1248
Val Lys Ile Asp Glu Asn Asn Ile Tyr Tyr Glu Thr Ile Gly Asn Lys
405 410 415
acg ccc cct att gga ata tgc ggg tct gga ata ata gat att gta gct 1296
Thr Pro Pro Ile Gly Ile Cys Gly Ser Gly Ile Ile Asp Ile Val Ala
420 425 430
gaa ttt tta aaa tcc gga tta att aat aaa acc ggt aga ttt act gga 1344
Glu Phe Leu Lys Ser Gly Leu Ile Asn Lys Thr Gly Arg Phe Thr Gly
435 440 445
gaa cat aaa aac tta aag gaa aat aaa ttt atc att gaa gat tct att 1392
Glu His Lys Asn Leu Lys Glu Asn Lys Phe Ile Ile Glu Asp Ser Ile
450 455 460
tat ttc aca cag ggc gat att agg gaa gta cag ctt gca aaa ggg gca 1440
Tyr Phe Thr Gln Gly Asp Ile Arg Glu Val Gln Leu Ala Lys Gly Ala
465 470 475 480
ata tat gca gga ata aaa att ctc tgt tat gaa tat gga ata agt atg 1488
Ile Tyr Ala Gly Ile Lys Ile Leu Cys Tyr Glu Tyr Gly Ile Ser Met
485 490 495
gaa gat ata tct aat gta tat gtt act gga gca ttt gga tgt cat atc 1536
Glu Asp Ile Ser Asn Val Tyr Val Thr Gly Ala Phe Gly Cys His Ile
500 505 510
gat gtt gaa aat gca aag att atc gga ctt tta ccg gat ttg gat aat 1584
Asp Val Glu Asn Ala Lys Ile Ile Gly Leu Leu Pro Asp Leu Asp Asn
515 520 525
ata ttg agt att gat aat gct gct gga agg ggg act ata atg gct tta 1632
Ile Leu Ser Ile Asp Asn Ala Ala Gly Arg Gly Thr Ile Met Ala Leu
530 535 540
cta tct aaa aaa att aga aat gaa gcc gat aag ttg gca aaa aat acg 1680
Leu Ser Lys Lys Ile Arg Asn Glu Ala Asp Lys Leu Ala Lys Asn Thr
545 550 555 560
aaa tat att gaa tta agt agt cat gat aat ttt gaa agt gag ttc ata 1728
Lys Tyr Ile Glu Leu Ser Ser His Asp Asn Phe Glu Ser Glu Phe Ile
565 570 575
tct gcc ctt ggg ttt taa 1746
Ser Ala Leu Gly Phe
580
<210> 285
<211> 581
<212> PRT
<213> Methanococcus aeolicus
<400> 285
Met Gly Gly Val Met Met Tyr Asn Ile Thr Tyr Ile Lys Glu Asp Gly
1 5 10 15
Thr Lys Lys Ser Ile Lys Val Lys Glu Gly Thr Thr Ile Leu Glu Gly
20 25 30
Ala Ile Lys Ala Gly Val Tyr Ile Asp Ala Pro Cys Gly Thr Gly Lys
35 40 45
Cys Gly Lys Cys Lys Val Leu Val Glu Lys Gly Leu Glu Asn Ile Asp
50 55 60
Lys Asp Ser Ile Val Glu Asp Glu Tyr Ala Leu Ala Cys Val Ala Lys
65 70 75 80
Val Tyr Gly Asp Ile Ser Ile Asn Val Pro Asn Phe Gln Gly Val Val
85 90 95
Cys Lys Asp Ile Thr Asn Glu Val Gly Glu Leu Gln Thr Arg Arg Val
100 105 110
Cys Ser Ile Thr Glu Gln Cys Lys Gly Glu Leu Gln Asn Leu Glu Gly
115 120 125
Phe His Pro Met Ser Leu Asn Pro Asp Ile Gly Ile Asn Lys Ile Thr
130 135 140
Thr Thr Val Leu Glu Ser Ser Asn Tyr Asn Leu Thr Leu Asp Ala Ile
145 150 155 160
Asn Lys Leu Asn Ser Met Lys Leu Ser Asp Glu Val Thr Leu Ile Leu
165 170 175
Lys Gly Asp Asn Val Val Asn Val Glu Lys Asp Phe Ser Gly Ile Tyr
180 185 190
Gly Leu Ser Ile Asp Ile Gly Thr Thr Ser Val Val Val Tyr Leu Val
195 200 205
Asp Ile Ser Lys Gly Ile Val Leu Asp Asn Ile Ser Phe Leu Asn Pro
210 215 220
Gln Arg Gln Phe Gly Ala Asp Val Val Ser Arg Ile Ala Tyr Asn Asn
225 230 235 240
Gly Ile Leu Leu Gln Lys Thr Leu Ile Thr Glu Leu Asn Asp Ser Ile
245 250 255
Ser Lys Leu Cys Ser Asn Asn Asn Ile Lys Met Asp Asn Ile Tyr Glu
260 265 270
Val Ser Val Val Gly Asn Thr Ala Met Ile His Phe Phe Tyr Gly Ile
275 280 285
Val Pro Lys Asn Leu Ala Thr His Pro Tyr Val Pro Thr Phe Lys Asn
290 295 300
Ser Pro Tyr Leu Pro Ala Lys Glu Leu Gly Leu Asn Leu Arg Asn Ala
305 310 315 320
Tyr Ile Tyr Thr Leu Pro Ile Ile Gly Gly Tyr Val Gly Ala Asp Thr
325 330 335
Val Gly Ala Ile Leu Ser Ser Glu Met His Lys Lys Asp Asp Ile Ser
340 345 350
Leu Leu Ile Asp Ile Gly Thr Asn Gly Glu Ile Val Leu Gly Asn Lys
355 360 365
Glu Lys Leu Leu Thr Cys Ser Cys Ala Ala Gly Pro Ala Phe Glu Gly
370 375 380
Val Ser Ile Glu His Gly Thr Asn Ala Arg Glu Gly Ala Val Cys Arg
385 390 395 400
Val Lys Ile Asp Glu Asn Asn Ile Tyr Tyr Glu Thr Ile Gly Asn Lys
405 410 415
Thr Pro Pro Ile Gly Ile Cys Gly Ser Gly Ile Ile Asp Ile Val Ala
420 425 430
Glu Phe Leu Lys Ser Gly Leu Ile Asn Lys Thr Gly Arg Phe Thr Gly
435 440 445
Glu His Lys Asn Leu Lys Glu Asn Lys Phe Ile Ile Glu Asp Ser Ile
450 455 460
Tyr Phe Thr Gln Gly Asp Ile Arg Glu Val Gln Leu Ala Lys Gly Ala
465 470 475 480
Ile Tyr Ala Gly Ile Lys Ile Leu Cys Tyr Glu Tyr Gly Ile Ser Met
485 490 495
Glu Asp Ile Ser Asn Val Tyr Val Thr Gly Ala Phe Gly Cys His Ile
500 505 510
Asp Val Glu Asn Ala Lys Ile Ile Gly Leu Leu Pro Asp Leu Asp Asn
515 520 525
Ile Leu Ser Ile Asp Asn Ala Ala Gly Arg Gly Thr Ile Met Ala Leu
530 535 540
Leu Ser Lys Lys Ile Arg Asn Glu Ala Asp Lys Leu Ala Lys Asn Thr
545 550 555 560
Lys Tyr Ile Glu Leu Ser Ser His Asp Asn Phe Glu Ser Glu Phe Ile
565 570 575
Ser Ala Leu Gly Phe
580
<210> 286
<211> 720
<212> DNA
<213> Synthetic
<220>
<221> CDS
<222> (1)..(720)
<223> Synthetic gpf gene encoding GFP
<400> 286
atg cgt aaa ggc gaa gag ctg ttc act ggt gtc gtc cct att ctg gtg 48
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1 5 10 15
gaa ctg gat ggt gat gtc aac ggt cat aag ttt tcc gtg cgt ggc gag 96
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
20 25 30
ggt gaa ggt gac gca act aat ggt aaa ctg acg ctg aag ttc atc tgt 144
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
35 40 45
act act ggt aaa ctg ccg gta cct tgg ccg act ctg gta acg acg ctg 192
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
50 55 60
act tat ggt gtt cag tgc ttt gct cgt tat ccg gac cat atg aag cag 240
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65 70 75 80
cat gac ttc ttc aag tcc gcc atg ccg gaa ggc tat gtg cag gaa cgc 288
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
85 90 95
acg att tcc ttt aag gat gac ggc acg tac aaa acg cgt gcg gaa gtg 336
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
100 105 110
aaa ttt gaa ggc gat acc ctg gta aac cgc att gag ctg aaa ggc att 384
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
115 120 125
gac ttt aaa gaa gac ggc aat atc ctg ggc cat aag ctg gaa tac aat 432
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
130 135 140
ttt aac agc cac aat gtt tac atc acc gcc gat aaa caa aaa aat ggc 480
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145 150 155 160
att aaa gcg aat ttt aaa att cgc cac aac gtg gag gat ggc agc gtg 528
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
165 170 175
cag ctg gct gat cac tac cag caa aac act cca atc ggt gat ggt cct 576
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
180 185 190
gtt ctg ctg cca gac aat cac tat ctg agc acg caa agc gtt ctg tct 624
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
195 200 205
aaa gat ccg aac gag aaa cgc gat cat atg gtt ctg ctg gag ttc gta 672
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
210 215 220
acc gca gcg ggc atc acg cat ggt atg gat gaa ctg tac aaa tga tga 720
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225 230 235
<210> 287
<211> 238
<212> PRT
<213> Synthetic
<400> 287
Met Arg Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
1 5 10 15
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu
20 25 30
Gly Glu Gly Asp Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys
35 40 45
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
50 55 60
Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln
65 70 75 80
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
85 90 95
Thr Ile Ser Phe Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val
100 105 110
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
115 120 125
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
130 135 140
Phe Asn Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly
145 150 155 160
Ile Lys Ala Asn Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val
165 170 175
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
180 185 190
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Val Leu Ser
195 200 205
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
210 215 220
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225 230 235
<210> 288
<211> 1486
<212> DNA
<213> Escherichia coli
<220>
<221> promoter
<222> (1)..(36)
<223> apFAB306 promoter
<220>
<221> CDS
<222> (73)..(603)
<223> fldA gene encoding flavodoxin
<220>
<221> CDS
<222> (620)..(1366)
<223> flpr gene encoding flavodoxin/ferredoxin reductase
<220>
<221> terminator
<222> (1400)..(1486)
<400> 288
ttgacaatta atcatccggc tcgtagtgtt tgtggatggc agtggctagc ggccgcagag 60
gttatttcac tc atg gct atc act ggc atc ttt ttc ggc agc gac acc ggt 111
Met Ala Ile Thr Gly Ile Phe Phe Gly Ser Asp Thr Gly
1 5 10
aat acc gaa aat atc gca aaa atg att caa aaa cag ctt ggt aaa gac 159
Asn Thr Glu Asn Ile Ala Lys Met Ile Gln Lys Gln Leu Gly Lys Asp
15 20 25
gtt gcc gat gtc cat gac att gca aaa agc agc aaa gaa gat ctg gaa 207
Val Ala Asp Val His Asp Ile Ala Lys Ser Ser Lys Glu Asp Leu Glu
30 35 40 45
gct tat gac att ctg ctg ctg ggc atc cca acc tgg tat tac ggc gaa 255
Ala Tyr Asp Ile Leu Leu Leu Gly Ile Pro Thr Trp Tyr Tyr Gly Glu
50 55 60
gcg cag tgt gac tgg gat gac ttc ttc ccg act ctc gaa gag att gat 303
Ala Gln Cys Asp Trp Asp Asp Phe Phe Pro Thr Leu Glu Glu Ile Asp
65 70 75
ttc aac ggc aaa ctg gtt gcg ctg ttt ggt tgt ggt gac cag gaa gat 351
Phe Asn Gly Lys Leu Val Ala Leu Phe Gly Cys Gly Asp Gln Glu Asp
80 85 90
tac gcc gaa tat ttc tgc gac gca ttg ggc acc atc cgc gac atc att 399
Tyr Ala Glu Tyr Phe Cys Asp Ala Leu Gly Thr Ile Arg Asp Ile Ile
95 100 105
gaa ccg cgc ggt gca acc atc gtt ggt cac tgg cca act gcg ggc tat 447
Glu Pro Arg Gly Ala Thr Ile Val Gly His Trp Pro Thr Ala Gly Tyr
110 115 120 125
cat ttc gaa gca tca aaa ggt ctg gca gat gac gac cac ttt gtc ggt 495
His Phe Glu Ala Ser Lys Gly Leu Ala Asp Asp Asp His Phe Val Gly
130 135 140
ctg gct atc gac gaa gac cgt cag ccg gaa ctg acc gct gaa cgt gta 543
Leu Ala Ile Asp Glu Asp Arg Gln Pro Glu Leu Thr Ala Glu Arg Val
145 150 155
gaa aaa tgg gtt aaa cag att tct gaa gag ttg cat ctc gac gaa att 591
Glu Lys Trp Val Lys Gln Ile Ser Glu Glu Leu His Leu Asp Glu Ile
160 165 170
ctc aat gcc tga aaaacaggag aaaaac atg gct gat tgg gta aca ggc aaa 643
Leu Asn Ala Met Ala Asp Trp Val Thr Gly Lys
175 180
gtc act aaa gtg cag aac tgg acc gac gcc ctg ttt agt ctc acc gtt 691
Val Thr Lys Val Gln Asn Trp Thr Asp Ala Leu Phe Ser Leu Thr Val
185 190 195 200
cac gcc ccc gtg ctt ccg ttt acc gcc ggg caa ttt acc aag ctt ggc 739
His Ala Pro Val Leu Pro Phe Thr Ala Gly Gln Phe Thr Lys Leu Gly
205 210 215
ctt gaa atc gac ggc gaa cgc gtc cag cgc gcc tac tcc tat gta aac 787
Leu Glu Ile Asp Gly Glu Arg Val Gln Arg Ala Tyr Ser Tyr Val Asn
220 225 230
tcg ccc gat aat ccc gat ctg gag ttt tac ctg gtc acc gtc ccc gat 835
Ser Pro Asp Asn Pro Asp Leu Glu Phe Tyr Leu Val Thr Val Pro Asp
235 240 245
ggc aaa tta agc cca cga ctg gcg gca ctg aaa cca ggc gat gaa gtg 883
Gly Lys Leu Ser Pro Arg Leu Ala Ala Leu Lys Pro Gly Asp Glu Val
250 255 260
cag gtg gtt agc gaa gcg gca gga ttc ttt gtg ctc gat gaa gtg ccg 931
Gln Val Val Ser Glu Ala Ala Gly Phe Phe Val Leu Asp Glu Val Pro
265 270 275 280
cac tgc gaa acg cta tgg atg ctg gca acc ggt aca gcg att ggc cct 979
His Cys Glu Thr Leu Trp Met Leu Ala Thr Gly Thr Ala Ile Gly Pro
285 290 295
tat tta tcg att ctg caa cta ggt aaa gat tta gat cgc ttc aaa aat 1027
Tyr Leu Ser Ile Leu Gln Leu Gly Lys Asp Leu Asp Arg Phe Lys Asn
300 305 310
ctg gtc ctg gtg cac gcc gca cgt tat gcc gcc gac tta agc tat ttg 1075
Leu Val Leu Val His Ala Ala Arg Tyr Ala Ala Asp Leu Ser Tyr Leu
315 320 325
cca ctg atg cag gaa ctg gaa aaa cgc tac gaa gga aaa ctg cgc att 1123
Pro Leu Met Gln Glu Leu Glu Lys Arg Tyr Glu Gly Lys Leu Arg Ile
330 335 340
cag acg gtg gtc agt cgg gaa acg gca gcg ggg tcg ctc acc gga cgg 1171
Gln Thr Val Val Ser Arg Glu Thr Ala Ala Gly Ser Leu Thr Gly Arg
345 350 355 360
ata ccg gca tta att gaa agt ggg gaa ctg gaa agc acg att ggc ctg 1219
Ile Pro Ala Leu Ile Glu Ser Gly Glu Leu Glu Ser Thr Ile Gly Leu
365 370 375
ccg atg aat aaa gaa acc agc cat gtg atg ctg tgc ggc aat cca cag 1267
Pro Met Asn Lys Glu Thr Ser His Val Met Leu Cys Gly Asn Pro Gln
380 385 390
atg gtg cgc gat aca caa cag ttg ctg aaa gag acc cgg cag atg acg 1315
Met Val Arg Asp Thr Gln Gln Leu Leu Lys Glu Thr Arg Gln Met Thr
395 400 405
aaa cat tta cgt cgc cga ccg ggc cat atg aca gcg gag cat tac tgg 1363
Lys His Leu Arg Arg Arg Pro Gly His Met Thr Ala Glu His Tyr Trp
410 415 420
taa tagcttcata tggtccacag gacactcgtt gctttcacca tgcgtaaagc 1416
aatcagatac ccagcccgcc taatgagcgg gctttttttt gaacaaaatt agagaataac 1476
aatgcaaaca 1486
<210> 289
<211> 176
<212> PRT
<213> Escherichia coli
<400> 289
Met Ala Ile Thr Gly Ile Phe Phe Gly Ser Asp Thr Gly Asn Thr Glu
1 5 10 15
Asn Ile Ala Lys Met Ile Gln Lys Gln Leu Gly Lys Asp Val Ala Asp
20 25 30
Val His Asp Ile Ala Lys Ser Ser Lys Glu Asp Leu Glu Ala Tyr Asp
35 40 45
Ile Leu Leu Leu Gly Ile Pro Thr Trp Tyr Tyr Gly Glu Ala Gln Cys
50 55 60
Asp Trp Asp Asp Phe Phe Pro Thr Leu Glu Glu Ile Asp Phe Asn Gly
65 70 75 80
Lys Leu Val Ala Leu Phe Gly Cys Gly Asp Gln Glu Asp Tyr Ala Glu
85 90 95
Tyr Phe Cys Asp Ala Leu Gly Thr Ile Arg Asp Ile Ile Glu Pro Arg
100 105 110
Gly Ala Thr Ile Val Gly His Trp Pro Thr Ala Gly Tyr His Phe Glu
115 120 125
Ala Ser Lys Gly Leu Ala Asp Asp Asp His Phe Val Gly Leu Ala Ile
130 135 140
Asp Glu Asp Arg Gln Pro Glu Leu Thr Ala Glu Arg Val Glu Lys Trp
145 150 155 160
Val Lys Gln Ile Ser Glu Glu Leu His Leu Asp Glu Ile Leu Asn Ala
165 170 175
<210> 290
<211> 248
<212> PRT
<213> Escherichia coli
<400> 290
Met Ala Asp Trp Val Thr Gly Lys Val Thr Lys Val Gln Asn Trp Thr
1 5 10 15
Asp Ala Leu Phe Ser Leu Thr Val His Ala Pro Val Leu Pro Phe Thr
20 25 30
Ala Gly Gln Phe Thr Lys Leu Gly Leu Glu Ile Asp Gly Glu Arg Val
35 40 45
Gln Arg Ala Tyr Ser Tyr Val Asn Ser Pro Asp Asn Pro Asp Leu Glu
50 55 60
Phe Tyr Leu Val Thr Val Pro Asp Gly Lys Leu Ser Pro Arg Leu Ala
65 70 75 80
Ala Leu Lys Pro Gly Asp Glu Val Gln Val Val Ser Glu Ala Ala Gly
85 90 95
Phe Phe Val Leu Asp Glu Val Pro His Cys Glu Thr Leu Trp Met Leu
100 105 110
Ala Thr Gly Thr Ala Ile Gly Pro Tyr Leu Ser Ile Leu Gln Leu Gly
115 120 125
Lys Asp Leu Asp Arg Phe Lys Asn Leu Val Leu Val His Ala Ala Arg
130 135 140
Tyr Ala Ala Asp Leu Ser Tyr Leu Pro Leu Met Gln Glu Leu Glu Lys
145 150 155 160
Arg Tyr Glu Gly Lys Leu Arg Ile Gln Thr Val Val Ser Arg Glu Thr
165 170 175
Ala Ala Gly Ser Leu Thr Gly Arg Ile Pro Ala Leu Ile Glu Ser Gly
180 185 190
Glu Leu Glu Ser Thr Ile Gly Leu Pro Met Asn Lys Glu Thr Ser His
195 200 205
Val Met Leu Cys Gly Asn Pro Gln Met Val Arg Asp Thr Gln Gln Leu
210 215 220
Leu Lys Glu Thr Arg Gln Met Thr Lys His Leu Arg Arg Arg Pro Gly
225 230 235 240
His Met Thr Ala Glu His Tyr Trp
245
<210> 291
<211> 36
<212> DNA
<213> Synthetic
<220>
<221> promoter
<222> (1)..(36)
<223> apFAB309 promoter
<400> 291
ttgacaatta atcatccggc tcgtagtgtc tgtgga 36
<210> 292
<211> 87
<212> DNA
<213> Synthetic
<220>
<221> terminator
<222> (1)..(87)
<223> apFAB378 terminator
<400> 292
ttcaccatgc gtaaagcaat cagataccca gcccgcctaa tgagcgggct tttttttgaa 60
caaaattaga gaataacaat gcaaaca 87
Claims (18)
- 하기를 포함하는, 비오틴 또는 리포산 또는 티아민의 증가된 생산을 위한 유전자 변형 박테리아:
a) 돌연변이 IscR 폴리펩티드를 코딩하는 유전자 변형 내인성 iscR 유전자로서,
상기 돌연변이 IscR 폴리펩티드의 아미노산 서열은 서열번호 2, 4, 6, 8, 10, 12 및 14로 이루어진 군으로부터 선택된 서열과 적어도 80% 아미노산 서열 상동성을 갖는 것이고, 상기 아미노산 서열은 i) L15X, C92X, C98X, C104X, 및 H107X로 이루어진 군으로부터 선택된 적어도 하나의 아미노산 치환을 갖는 것이고; 상기 X는 서열번호 2, 4, 6, 8, 10, 12 및 14에 상응하는 아미노산 잔기 이외의 임의의 아미노산인 것인, 유전자; 및
b) 적어도 하나의 전이 유전자로서,
ii) 비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드;
iii) 리포산 신타아제 (EC2.8.1.8) 활성을 갖는 폴리펩티드;
iv) HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 폴리펩티드; 및
v) 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 폴리펩티드로 이루어진 군으로부터 선택된 폴리펩티드를 코딩하는 적어도 하나의 전이 유전자. - 청구항 1에 있어서, 상기 돌연변이 IscR 폴리펩티드에서 상기 적어도 하나의 아미노산 치환은 다음으로 이루어진 군으로부터 선택되는 것인, 비오틴 또는 리포산 또는 티아민의 증가된 생산을 위한 유전자 변형 박테리아:
a. L15X, 상기 X는 F, Y, M 및 W 중 어느 하나임;
b. C92X, 상기 X는 Y, A, M, F 및 W 중 어느 하나임;
c. C98X, 상기 X는 A, V, I, L, F 및 W 중 어느 하나임;
d. C104X, 상기 X는 A V, I, L, F 및 W 중 어느 하나임; 및
e. H107X; 상기 X는 A, Y, V, I, 및 L 중 어느 하나임. - 청구항 1 또는 2에 있어서, 상기 비오틴 신타아제 활성을 갖는 폴리펩티드 (EC 2.8.1.6)를 코딩하는 적어도 하나의 전이 유전자는 다음으로 이루어진 군으로부터 선택된 하나 이상의 폴리펩티드를 코딩하는 추가적인 전이 유전자를 더 포함하고:
a. SAM (S-아데노실메티오닌)-의존적 메틸트랜스퍼라제 (EC 2.1.1.197) 활성을 갖는 폴리펩티드;
b. 7-케토-8-아미노펠라르곤산 (KAPA) 신타아제 (EC 2.3.1.47) 활성을 갖는 폴리펩티드;
c. 7,8-디아미노펠라르곤산 (DAPA) 신타아제 (EC:2.6.1.62 또는 EC2.6.1.105) 활성을 갖는 폴리펩티드; 및
d. 데티오비오틴 (DTB) 신타아제 (EC 6.3.3.3) 활성을 갖는 폴리펩티드;
e. 피멜로일-[아실-운반 단백질] 메틸 에스테르 에스테라제 (EC 3.1.1.85)를 갖는 폴리펩티드; 및
f. 6-카복시헥사노에이트-CoA 리가제 (EC 6.2.1.14) 활성을 갖는 폴리펩티드
상기 박테리아는 비오틴의 증가된 생산을 위한 것인 유전자 변형 박테리아. - 청구항 1 또는 2에 있어서, 상기 적어도 하나의 전이 유전자는 리포산 신타아제 (EC 2.8.1.8) 활성을 갖는 폴리펩티드를 코딩하고, 다음으로 이루어진 군으로부터 선택된 하나 이상의 폴리펩티드를 코딩하는 추가적인 전이 유전자를 더 포함하는 것이고:
a. 옥타노일트랜스퍼라제 (EC 2.3.1.181) 활성을 갖는 폴리펩티드;
b. 피루브산 탈수소효소 (EC 2.3.1.12)의 디하이드로리포일라이신-잔기 아세틸트랜스퍼라제 성분을 포함하는 폴리펩티드; 및
c. 리포에이트-단백질 리가아제 A (EC:6.3.1.20) 활성을 갖는 폴리펩티드;
상기 박테리아는 리포산의 증가된 생산을 위한 것인 유전자 변형 박테리아. - 청구항 1 또는 2에 있어서, 상기 HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 폴리펩티드를 코딩하는 적어도 하나의 전이 유전자 및/또는 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 폴리펩티드를 코딩하는 적어도 하나의 전이 유전자는 다음으로 이루어진 군으로부터 선택된 하나 이상의 폴리펩티드를 코딩하는 추가적인 전이 유전자를 더 포함하는 것이고:
a. ThiS 아데닐트랜스퍼라제 (EC2.7.7.73) 활성을 갖는 ThiF 폴리펩티드;
b. 티아민 포스페이트 신타아제 (EC 2.5.1.3) 활성을 갖는 ThiE 폴리펩티드;
c. 티아졸 신타아제 활성 (E.C.2.8.1.10) 활성을 갖는 ThiG 폴리펩티드;
d. 포스포하이드록시메틸피리미딘 키나아제 (EC 2.7.4.7) 활성을 갖는 ThiD 폴리펩티드;
e. 글리신 옥시다아제 (EC 1.4.3.19) 활성을 갖는 ThiO 폴리펩티드;
f. 황-운반 단백질 활성을 갖는 ThiS 폴리펩티드;
g. 하이드록시에틸티아졸 키나아제 (EC2.7.1.50) 활성을 갖는 ThiM 폴리펩티드; 및
h. 모노-포스페이트 포스파타아제 (E.C. 3.1.3.-) 활성을 갖는 폴리펩티드;
상기 박테리아는 티아민의 증가된 생산을 위한 것인 유전자 변형 박테리아. - 청구항 5에 있어서, 상기 박테리아는 다음을 코딩하는 추가적인 전이 유전자를 포함하는 것인 유전자 변형 박테리아:
a. HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 ThiC 폴리펩티드;
b. 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 ThiH 폴리펩티드 또는 글리신 옥시다아제 (EC 1.4.3.19) 활성을 갖는 ThiO 폴리펩티드;
c. ThiS 아데닐트랜스퍼라제 (EC 2.7.7.73) 활성을 갖는 ThiF 폴리펩티드;
d. 티아민 포스페이트 신타아제 (EC 2.5.1.3) 활성을 갖는 ThiE 폴리펩티드;
e. 티아졸 신타아제 (E.C.2.8.1.10) 활성을 갖는 ThiG 폴리펩티드;
f. 포스포하이드록시메틸피리미딘 키나아제 (EC 2.7.4.7) 활성을 갖는 ThiD 폴리펩티드;
g. 황-운반 단백질 활성을 갖는 ThiS 폴리펩티드; 및
h. 티아민 모노-포스페이트 포스파타아제 (E.C. 3.1.3.-) 활성을 갖는 폴리펩티드. - 청구항 1 내지 6 중 어느 한 항에 있어서, 상기 적어도 하나의 전이 유전자 및 상기 하나 이상의 추가적인 전이 유전자는 항시성 프로모터(constitutive promoter)에 작동 가능하게 연결된 것인 유전자 변형 박테리아.
- 청구항 1 내지 7 중 어느 한 항에 있어서, 상기 박테리아는 에셔리키아 (Escherichia), 바실러스 (Bacillus), 브레비박테리움 (Brevibacterium), 버크홀데리아 (Burkholderia), 캄필로박터 (Campylobacter), 코리네박테리움 (Corynebacterium), 슈도모나스 (Pseudomonas), 셀라티아 (Serratia), 락토바실러스 (Lactobacillus), 락토코커스 (Lactocooccus), 아시네토박터 (Acinetobacter), 슈도모나스 (Pseudomonas), 및 아세토박터 (Acetobacter)로 이루어진 군으로부터 선택된 박테리아의 속인 것인 유전자 변형 박테리아.
- 하기 단계를 포함하는 비오틴을 생산하는 방법:
a. 청구항 1 내지 3, 7 및 8 중 어느 한 항에 따른 유전자 변형 박테리아를 배양물을 생산하기 위한 증식 배지 내로 도입하는 단계로서, 여기에서 상기 적어도 하나의 전이 유전자는 비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드를 코딩하는 것인 단계;
b. 상기 배양물을 배양하는 단계; 및
c. 상기 배양에 의해 생산된 비오틴을 회수하고, 선택적으로 회수된 비오틴을 정제하는 단계. - 리포산을 생산하는 방법으로서,
a. 청구항 1, 2, 4, 7 및 8 중 어느 한 항에 따른 유전자 변형 박테리아를 배양물을 생산하기 위한 증식 배지 내로 도입하는 단계로서, 여기에서 상기 적어도 하나의 전이 유전자는 리포산 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드를 코딩하는 것인 단계;
b. 상기 배양물을 배양하는 단계; 및
c. 상기 배양에 의해 생산된 리포산을 회수하고, 선택적으로 회수된 리포산을 정제하는 단계를 포함하는 방법. - 티아민을 생산하는 방법으로서,
a. 청구항 1, 2, 및 5 내지 8 중 어느 한 항에 따른 유전자 변형 박테리아를 배양물을 생산하기 위한 증식 배지 내로 도입하는 단계로서, 여기에서 상기 적어도 하나의 전이 유전자는 HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 폴리펩티드 및/또는 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 폴리펩티드를 코딩하는 것인 단계;
b. 상기 배양물을 배양하는 단계; 및
c. 상기 배양에 의해 생산된 티아민을 회수하고, 선택적으로 회수된 티아민을 정제하는 단계를 포함하는 방법. - 청구항 9 내지 11 중 어느 한 항에 있어서, 상기 증식 배지는 글루코스, 말토스, 갈락토스, 프럭토스, 수크로스, 아라비노스, 자일로스, 라피노스, 만노스,및 락토스, 또는 이들의 임의의 조합으로부터 선택된 탄소원을 포함하는 것인, 비오틴, 리포산 및 티아민 중 어느 하나를 생산하는 방법.
- 유전자 변형 박테리아에서 비오틴, 리포산 또는 티아민 중 어느 하나의 생산을 증가시키기 위한 돌연변이 iscR 폴리펩티드를 코딩하는 유전적으로 변형된 유전자의 용도로서, 상기 박테리아는:
i. 비오틴 신타아제 (EC 2.8.1.6) 활성을 갖는 폴리펩티드;
ii. 리포산 신타아제 (EC 2.8.1.8) 활성을 갖는 폴리펩티드;
iii. HMP-P 신타아제 (EC 4.1.99.17) 활성을 갖는 폴리펩티드; 및
iv. 티로신 리아제 (EC 4.1.99.19) 활성을 갖는 폴리펩티드로 이루어진 군으로부터 선택된 폴리펩티드를 코딩하는 적어도 하나의 전이유전자를 포함하고 발현하는 것이고;
상기 유전적으로 변형된 유전자는 돌연변이 IscR 폴리펩티드를 코딩하는 내인성 iscR 유전자이고, 여기에서 상기 돌연변이 IscR 폴리펩티드의 아미노산 서열은 서열번호 2, 4, 6, 8, 10, 12 및 14와 적어도 80% 아미노산 서열 상동성을 갖는 것이고, 상기 아미노산 서열은: L15X, C92X, C98X, C104X, 및 H107X로 이루어진 군으로부터 선택된 적어도 하나의 아미산 치환을 갖는 것이고; 상기 X는 서열번호 2, 4, 6, 8, 10, 12 및 14에 상응하는 아미노산 잔기 이외의 임의의 아미노산인 것인 용도. - 청구항 13에 있어서, 상기 돌연변이 IscR 폴리펩티드에서 상기 적어도 하나의 아미노산 치환은 다음으로 이루어진 군으로부터 선택되는 것인, 유전자 변형 박테리아에서 비오틴, 리포산 또는 티아민 중 어느 하나의 생산을 증가시키기 위한 돌연변이 iscR 폴리펩티드를 코딩하는 유전적으로 변형된 유전자의 용도:
a. L15X, 상기 X는 F, Y, M 및 W 중 어느 하나임;
b. C92X, 상기 X는 Y, A, M, F 및 W 중 어느 하나임;
c. C98X, 상기 X는 A, V, I, L, F 및 W 중 어느 하나임;
d. C104X, 상기 X는 A V, I, L, F 및 W 중 어느 하나임; 및
e. H107X; 상기 X는 A, Y, V, I, 및 L 중 어느 하나임. - 비오틴, 리포산 또는 티아민 중 어느 하나의 증가된 생산을 위한, 청구항 1 내지 8 중 어느 한 항에 따른 유전자 변형 박테리아의 용도.
- 청구항 1 내지 8 중 어느 한 항에 있어서, 상기 박테리아는 하기 군으로부터 선택된 하나 이상의 유전자를 더 포함하는 것인 유전자 변형 박테리아:
a. 플라보독신/페레독신-NADP 환원 효소 (EC:1.18.1.2 및 EC 1.19.1.1)를 코딩하는 유전자;
b. 피루브산-플라보독신/페레독신 산화 환원 효소 (EC 1.2.7)를 코딩하는 유전자;
c. 플라보독신을 코딩하는 유전자;
d. 페레독신을 코딩하는 유전자; 및
e. 플라보독신 및 페레독신-NADP 환원 효소를 코딩하는 유전자;
여기서, 상기 하나 이상의 유전자는 상기 박테리아에서 상기 하나 이상의 유전자의 발현을 증가시킬 수 있는 비-천연(non-native) 프로모터에 작동 가능하게-연결되는 것이고, 상기 하나 이상의 유전자는 천연 유전자(native gene) 또는 전이 유전자일 수 있다. - 비오틴, 리포산 또는 티아민 중 어느 하나의 증가된 생산을 위한, 청구항 16에 따른 유전자 변형 박테리아의 용도.
- 청구항 9 내지 12 중 어느 한 항에 있어서, , 상기 유전자 변형 박테리아는 다음의 군으로부터 선택된 하나 이상의 유전자를 더 포함하는 것인 비오틴, 리포산 및 티아민 중 어느 하나를 생산하는 방법:
a. 플라보독신/페레독신-NADP 환원 효소 (EC:1.18.1.2 및 EC 1.19.1.1)를 코딩하는 유전자;
b. 피루브산-플라보독신/페레독신 산화 환원 효소 (EC 1.2.7)를 코딩하는 유전자;
c. 플라보독신을 코딩하는 유전자;
d. 페레독신을 코딩하는 유전자; 및
e. 플라보독신 및 페레독신-NADP 환원 효소;
상기 하나 이상의 유전자는 상기 박테리아에서 상기 하나 이상의 유전자의 발현을 증가시킬 수 있는 비-천연 프로모터에 작동 가능하게-연결된 것이고, 상기 하나 이상의 유전자는 천연 유전자 또는 전이 유전자일 수 있다.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP17181503 | 2017-07-14 | ||
EP17181503.8 | 2017-07-14 | ||
PCT/EP2018/068989 WO2019012058A1 (en) | 2017-07-14 | 2018-07-12 | CELL FACTORY HAVING IMPROVED DISTRIBUTION OF IRON SULFUR AMAS |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20200075813A true KR20200075813A (ko) | 2020-06-26 |
Family
ID=59362984
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020207002518A KR20200075813A (ko) | 2017-07-14 | 2018-07-12 | 개선된 철-황 클러스터 전달을 갖는 세포 공장 |
Country Status (9)
Country | Link |
---|---|
US (1) | US11851461B2 (ko) |
EP (1) | EP3652198B1 (ko) |
JP (2) | JP2020527359A (ko) |
KR (1) | KR20200075813A (ko) |
CN (1) | CN110869384A (ko) |
AU (1) | AU2018300754B2 (ko) |
BR (1) | BR112020000548A2 (ko) |
CA (1) | CA3069650A1 (ko) |
WO (1) | WO2019012058A1 (ko) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108414772B (zh) * | 2018-03-28 | 2021-02-09 | 河南科技大学 | 一种用于研究细菌中类泛素系统的试剂盒及其应用 |
EP3938518A4 (en) * | 2019-03-12 | 2023-01-11 | Terra Bioworks, Inc. | EXPRESSION VECTOR |
CN115135762A (zh) * | 2019-12-20 | 2022-09-30 | 巴斯夫欧洲公司 | 降低萜烯的毒性和增加微生物的生产潜力 |
EP4168566A1 (en) | 2020-06-18 | 2023-04-26 | Biosyntia ApS | Methods for producing biotin in genetically modified microorganisms |
EP4294209A1 (en) | 2021-02-22 | 2023-12-27 | Symrise AG | Biotechnological production of meat-like flavourings |
WO2023285585A2 (en) | 2021-07-16 | 2023-01-19 | Biosyntia Aps | Microbial cell factories producing vitamin b compounds |
GB202203725D0 (en) * | 2022-03-17 | 2022-05-04 | Univ Nottingham | Bio-manufacturing process |
WO2024013212A1 (en) * | 2022-07-15 | 2024-01-18 | Biosyntia Aps | Microbial cell factories producing thiamine |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1390470A4 (en) | 2001-04-20 | 2004-08-18 | Cargill Inc | PRODUCTION OF ALPHA-LIPOIC ACID |
CN1894424B (zh) * | 2003-06-02 | 2012-05-23 | 帝斯曼知识产权资产管理有限公司 | 通过发酵生产硫胺的方法 |
US7423136B2 (en) | 2005-10-19 | 2008-09-09 | Stephen F. Austin State University | Nucleic acid for biotin production |
WO2009091582A1 (en) * | 2008-01-17 | 2009-07-23 | Indigene Pharmaceuticals, Inc. | PRODUCTION OF R-α-LIPOIC ACID BY FERMENTATION USING GENETICALLY ENGINEERED MICROORGANISMS |
JP2011160673A (ja) * | 2010-02-04 | 2011-08-25 | Fujirebio Inc | 水素ガスの生産方法及びそのための微生物 |
CN102782119B (zh) * | 2010-02-17 | 2016-03-16 | 布特马斯先进生物燃料有限责任公司 | 改善需Fe-S簇蛋白质的活性 |
CN104450762B (zh) * | 2013-09-17 | 2018-03-20 | 中国科学院广州生物医药与健康研究院 | α‑硫辛酸的生物合成方法、工程菌株及其制备方法 |
ES2806732T3 (es) * | 2015-12-18 | 2021-02-18 | Biosyntia Aps | Fábrica de células bacterianas modificadas genéticamente para la producción de tiamina |
CN106086052B (zh) | 2016-07-01 | 2019-11-01 | 福建师范大学 | 生产吡咯喹啉醌的细菌及其应用 |
EP3683227A1 (en) * | 2019-01-16 | 2020-07-22 | Biosyntia ApS | Cell factories for improved production of compounds and proteins dependent on iron sulfur clusters |
-
2018
- 2018-07-12 AU AU2018300754A patent/AU2018300754B2/en active Active
- 2018-07-12 BR BR112020000548-7A patent/BR112020000548A2/pt unknown
- 2018-07-12 EP EP18740207.8A patent/EP3652198B1/en active Active
- 2018-07-12 WO PCT/EP2018/068989 patent/WO2019012058A1/en active Search and Examination
- 2018-07-12 CN CN201880045992.6A patent/CN110869384A/zh active Pending
- 2018-07-12 US US16/630,203 patent/US11851461B2/en active Active
- 2018-07-12 KR KR1020207002518A patent/KR20200075813A/ko not_active Application Discontinuation
- 2018-07-12 CA CA3069650A patent/CA3069650A1/en active Pending
- 2018-07-12 JP JP2020523046A patent/JP2020527359A/ja active Pending
-
2023
- 2023-04-05 JP JP2023061066A patent/JP2023093532A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
EP3652198B1 (en) | 2021-10-06 |
EP3652198A1 (en) | 2020-05-20 |
US11851461B2 (en) | 2023-12-26 |
US20230192778A1 (en) | 2023-06-22 |
CN110869384A (zh) | 2020-03-06 |
AU2018300754A1 (en) | 2020-01-16 |
CA3069650A1 (en) | 2019-01-17 |
BR112020000548A2 (pt) | 2020-07-21 |
JP2020527359A (ja) | 2020-09-10 |
AU2018300754B2 (en) | 2023-02-23 |
WO2019012058A1 (en) | 2019-01-17 |
JP2023093532A (ja) | 2023-07-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR20200075813A (ko) | 개선된 철-황 클러스터 전달을 갖는 세포 공장 | |
Schwentner et al. | Metabolic engineering to guide evolution–Creating a novel mode for L-valine production with Corynebacterium glutamicum | |
KR101915819B1 (ko) | 메티오닌 생산을 위한 균주 및 방법 | |
Christensen et al. | A novel amidotransferase required for lipoic acid cofactor assembly in Bacillus subtilis | |
US20230080311A1 (en) | Method of improving methyltransferase activity | |
US10696992B2 (en) | Genetically modified bacterial cell factory for thiamine production | |
Flynn et al. | Decreased coenzyme A levels in ridA mutant strains of S almonella enterica result from inactivated serine hydroxymethyltransferase | |
US20190071680A1 (en) | Microbial production of nicotinic acid riboside | |
Hermes et al. | The role of the Saccharomyces cerevisiae lipoate protein ligase homologue, Lip3, in lipoic acid synthesis | |
Ernst et al. | L‐2, 3‐diaminopropionate generates diverse metabolic stresses in Salmonella enterica | |
US20220127311A1 (en) | Cell factories for improved production of compounds and proteins dependent on iron sulfur clusters | |
Jojima et al. | Identification of a HAD superfamily phosphatase, HdpA, involved in 1, 3-dihydroxyacetone production during sugar catabolism in Corynebacterium glutamicum | |
Pfaff et al. | Chorismate pyruvate-lyase and 4-hydroxy-3-solanesylbenzoate decarboxylase are required for plastoquinone biosynthesis in the cyanobacterium Synechocystis sp. PCC6803 | |
von Borzyskowski et al. | Implementation of the β-hydroxyaspartate cycle increases growth performance of Pseudomonas putida on the PET monomer ethylene glycol | |
WO2014049382A2 (en) | Ethylenediamine fermentative production by a recombinant microorganism | |
Stock et al. | Disruption and complementation of the selenocysteine biosynthesis pathway reveals a hierarchy of selenoprotein gene expression in the archaeon Methanococcus maripaludis | |
Delmas et al. | Genetic and biocatalytic basis of formate dependent growth of Escherichia coli strains evolved in continuous culture | |
Buss et al. | Clustering of isochorismate synthase genes menF and entC and channeling of isochorismate in Escherichia coli | |
Chakauya et al. | Pantothenate biosynthesis in higher plants: advances and challenges | |
Wei et al. | Discovery and biochemical characterization of UDP-glucose dehydrogenase from Granulibacter bethesdensis | |
Lako et al. | Cloning, expression and characterization of thermostable YdaP from Bacillus licheniformis 9A | |
EP4370653A2 (en) | Microbial cell factories producing vitamin b compounds | |
CN117897476A (zh) | 生产维生素b化合物的微生物细胞工厂 | |
Keasling et al. | Engineering controllable alteration of malonyl-CoA levels to enhance polyketide production and versatility in E. coli | |
Gómez-Coronado et al. | Implementation of the β-h drox aspartate c cle increases gro th performance of Pseudomonas putida on the PET monomer eth lene gl col |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E902 | Notification of reason for refusal | ||
E601 | Decision to refuse application | ||
E902 | Notification of reason for refusal |