WO2011071982A2 - Synthèse de composés prazoles - Google Patents
Synthèse de composés prazoles Download PDFInfo
- Publication number
- WO2011071982A2 WO2011071982A2 PCT/US2010/059398 US2010059398W WO2011071982A2 WO 2011071982 A2 WO2011071982 A2 WO 2011071982A2 US 2010059398 W US2010059398 W US 2010059398W WO 2011071982 A2 WO2011071982 A2 WO 2011071982A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- residue
- position corresponding
- amino acid
- compound
- seq
- Prior art date
Links
- 150000001875 compounds Chemical class 0.000 title claims abstract description 331
- 230000015572 biosynthetic process Effects 0.000 title description 19
- 238000003786 synthesis reaction Methods 0.000 title description 5
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 433
- 229920001184 polypeptide Polymers 0.000 claims abstract description 429
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 429
- 238000000034 method Methods 0.000 claims abstract description 136
- 108010074633 Mixed Function Oxygenases Proteins 0.000 claims abstract description 130
- 102000008109 Mixed Function Oxygenases Human genes 0.000 claims abstract description 130
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 69
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 69
- 239000002157 polynucleotide Substances 0.000 claims abstract description 69
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 173
- 238000006243 chemical reaction Methods 0.000 claims description 170
- 150000001413 amino acids Chemical class 0.000 claims description 129
- 230000008569 process Effects 0.000 claims description 105
- 239000000758 substrate Substances 0.000 claims description 84
- -1 omeprazole analog compound Chemical class 0.000 claims description 68
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims description 55
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical group NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 claims description 55
- XURCIPRUUASYLR-UHFFFAOYSA-N Omeprazole sulfide Chemical compound N=1C2=CC(OC)=CC=C2NC=1SCC1=NC=C(C)C(OC)=C1C XURCIPRUUASYLR-UHFFFAOYSA-N 0.000 claims description 52
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 claims description 47
- 101001110310 Lentilactobacillus kefiri NADP-dependent (R)-specific alcohol dehydrogenase Proteins 0.000 claims description 34
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 claims description 33
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 claims description 30
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 28
- SECXISVLQFMRJM-UHFFFAOYSA-N N-Methylpyrrolidone Chemical compound CN1CCCC1=O SECXISVLQFMRJM-UHFFFAOYSA-N 0.000 claims description 24
- 229910052720 vanadium Inorganic materials 0.000 claims description 24
- 125000003118 aryl group Chemical group 0.000 claims description 23
- 125000001072 heteroaryl group Chemical group 0.000 claims description 21
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 claims description 18
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 claims description 18
- 125000000217 alkyl group Chemical group 0.000 claims description 18
- 229910052700 potassium Inorganic materials 0.000 claims description 17
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 16
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical group OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 15
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 claims description 15
- 239000013604 expression vector Substances 0.000 claims description 15
- 108010021809 Alcohol dehydrogenase Proteins 0.000 claims description 14
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 claims description 14
- 229910052799 carbon Inorganic materials 0.000 claims description 14
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 claims description 14
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 claims description 14
- 229910052757 nitrogen Inorganic materials 0.000 claims description 14
- 230000001172 regenerating effect Effects 0.000 claims description 14
- 241000588724 Escherichia coli Species 0.000 claims description 13
- BAWFJGJZGIEFAR-NNYOXOHSSA-O NAD(+) Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-O 0.000 claims description 13
- 229910052740 iodine Inorganic materials 0.000 claims description 13
- 229960005019 pantoprazole Drugs 0.000 claims description 13
- 239000002773 nucleotide Substances 0.000 claims description 12
- 125000003729 nucleotide group Chemical group 0.000 claims description 12
- SUBDBMMJDZJVOS-UHFFFAOYSA-N 5-methoxy-2-{[(4-methoxy-3,5-dimethylpyridin-2-yl)methyl]sulfinyl}-1H-benzimidazole Chemical compound N=1C2=CC(OC)=CC=C2NC=1S(=O)CC1=NC=C(C)C(OC)=C1C SUBDBMMJDZJVOS-UHFFFAOYSA-N 0.000 claims description 11
- 239000006184 cosolvent Substances 0.000 claims description 11
- 125000000753 cycloalkyl group Chemical group 0.000 claims description 10
- YREYEVIYCVEVJK-VWLOTQADSA-N 2-[(s)-[4-(3-methoxypropoxy)-3-methylpyridin-2-yl]methylsulfinyl]-1h-benzimidazole Chemical compound COCCCOC1=CC=NC(C[S@](=O)C=2NC3=CC=CC=C3N=2)=C1C YREYEVIYCVEVJK-VWLOTQADSA-N 0.000 claims description 9
- ZBFDAUIVDSSISP-DEOSSOPVSA-N 5-methoxy-2-[(s)-(4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl]-1h-imidazo[4,5-b]pyridine Chemical compound C([S@](=O)C=1NC2=CC=C(N=C2N=1)OC)C1=NC=C(C)C(OC)=C1C ZBFDAUIVDSSISP-DEOSSOPVSA-N 0.000 claims description 9
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 claims description 9
- 125000004404 heteroalkyl group Chemical group 0.000 claims description 9
- CCHLMSUZHFPSFC-UHFFFAOYSA-N 2-[[3-methyl-4-(2,2,2-trifluoroethoxy)pyridin-2-yl]methylsulfanyl]-1h-benzimidazole Chemical compound CC1=C(OCC(F)(F)F)C=CN=C1CSC1=NC2=CC=CC=C2N1 CCHLMSUZHFPSFC-UHFFFAOYSA-N 0.000 claims description 8
- 125000000592 heterocycloalkyl group Chemical group 0.000 claims description 8
- 108010050375 Glucose 1-Dehydrogenase Proteins 0.000 claims description 7
- 229910019142 PO4 Inorganic materials 0.000 claims description 7
- 150000002460 imidazoles Chemical class 0.000 claims description 7
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 claims description 7
- 239000010452 phosphate Substances 0.000 claims description 7
- BSXAHDOWMOSVAP-UHFFFAOYSA-N 2-[[4-(3-methoxypropoxy)-3-methylpyridin-2-yl]methylsulfanyl]-1h-benzimidazole Chemical compound COCCCOC1=CC=NC(CSC=2NC3=CC=CC=C3N=2)=C1C BSXAHDOWMOSVAP-UHFFFAOYSA-N 0.000 claims description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 6
- FXHOOIRPVKKKFG-UHFFFAOYSA-N N,N-Dimethylacetamide Chemical compound CN(C)C(C)=O FXHOOIRPVKKKFG-UHFFFAOYSA-N 0.000 claims description 6
- 239000002202 Polyethylene glycol Substances 0.000 claims description 6
- MJIHNNLFOKEZEW-RUZDIDTESA-N dexlansoprazole Chemical compound CC1=C(OCC(F)(F)F)C=CN=C1C[S@@](=O)C1=NC2=CC=CC=C2N1 MJIHNNLFOKEZEW-RUZDIDTESA-N 0.000 claims description 6
- 229920001223 polyethylene glycol Polymers 0.000 claims description 6
- 125000004076 pyridyl group Chemical group 0.000 claims description 6
- ZZRLRRBNMPMTIL-UHFFFAOYSA-N 5-methoxy-2-[(4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfanyl]-1h-imidazo[4,5-b]pyridine Chemical compound N=1C2=NC(OC)=CC=C2NC=1SCC1=NC=C(C)C(OC)=C1C ZZRLRRBNMPMTIL-UHFFFAOYSA-N 0.000 claims description 5
- UKILEIRWOYBGEJ-UHFFFAOYSA-N 6-(difluoromethoxy)-2-[(3,4-dimethoxypyridin-2-yl)methylsulfanyl]-1h-benzimidazole Chemical compound COC1=CC=NC(CSC=2NC3=CC(OC(F)F)=CC=C3N=2)=C1OC UKILEIRWOYBGEJ-UHFFFAOYSA-N 0.000 claims description 5
- 239000008103 glucose Substances 0.000 claims description 5
- 229950007395 leminoprazole Drugs 0.000 claims description 5
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 claims description 5
- 229950007657 saviprazole Drugs 0.000 claims description 5
- 150000003333 secondary alcohols Chemical class 0.000 claims description 5
- ZBFDAUIVDSSISP-XMMPIXPASA-N 5-methoxy-2-[(r)-(4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl]-1h-imidazo[4,5-b]pyridine Chemical compound C([S@@](=O)C=1NC2=CC=C(N=C2N=1)OC)C1=NC=C(C)C(OC)=C1C ZBFDAUIVDSSISP-XMMPIXPASA-N 0.000 claims description 4
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 4
- GUVUOGQBMYCBQP-UHFFFAOYSA-N dmpu Chemical compound CN1CCCN(C)C1=O GUVUOGQBMYCBQP-UHFFFAOYSA-N 0.000 claims description 4
- 229950008491 ilaprazole Drugs 0.000 claims description 4
- 230000036961 partial effect Effects 0.000 claims description 4
- AVQQQNCBBIEMEU-UHFFFAOYSA-N 1,1,3,3-tetramethylurea Chemical compound CN(C)C(=O)N(C)C AVQQQNCBBIEMEU-UHFFFAOYSA-N 0.000 claims description 3
- MJIHNNLFOKEZEW-VWLOTQADSA-N 2-[(s)-[3-methyl-4-(2,2,2-trifluoroethoxy)pyridin-2-yl]methylsulfinyl]-1h-benzimidazole Chemical compound CC1=C(OCC(F)(F)F)C=CN=C1C[S@](=O)C1=NC2=CC=CC=C2N1 MJIHNNLFOKEZEW-VWLOTQADSA-N 0.000 claims description 3
- 229960003568 dexlansoprazole Drugs 0.000 claims description 3
- GNOIPBMMFNIUFM-UHFFFAOYSA-N hexamethylphosphoric triamide Chemical compound CN(C)P(=O)(N(C)C)N(C)C GNOIPBMMFNIUFM-UHFFFAOYSA-N 0.000 claims description 3
- 125000002883 imidazolyl group Chemical group 0.000 claims description 3
- 125000003107 substituted aryl group Chemical group 0.000 claims description 3
- 125000005346 substituted cycloalkyl group Chemical group 0.000 claims description 3
- ZUHZGEOKBKGPSW-UHFFFAOYSA-N tetraglyme Chemical compound COCCOCCOCCOCCOC ZUHZGEOKBKGPSW-UHFFFAOYSA-N 0.000 claims description 3
- 125000001544 thienyl group Chemical group 0.000 claims description 3
- 229910052721 tungsten Inorganic materials 0.000 claims description 3
- 229910052717 sulfur Inorganic materials 0.000 claims description 2
- 235000001014 amino acid Nutrition 0.000 description 135
- 229940024606 amino acid Drugs 0.000 description 133
- 125000000539 amino acid group Chemical group 0.000 description 96
- 108010058646 cyclohexanone oxygenase Proteins 0.000 description 92
- 210000004027 cell Anatomy 0.000 description 72
- 239000000047 product Substances 0.000 description 71
- 102000004190 Enzymes Human genes 0.000 description 68
- 108090000790 Enzymes Proteins 0.000 description 68
- 229940088598 enzyme Drugs 0.000 description 61
- 230000000694 effects Effects 0.000 description 51
- SUBDBMMJDZJVOS-DEOSSOPVSA-N esomeprazole Chemical compound C([S@](=O)C1=NC2=CC=C(C=C2N1)OC)C1=NC=C(C)C(OC)=C1C SUBDBMMJDZJVOS-DEOSSOPVSA-N 0.000 description 48
- 229960004770 esomeprazole Drugs 0.000 description 44
- 238000006467 substitution reaction Methods 0.000 description 41
- 108090000623 proteins and genes Proteins 0.000 description 40
- 230000014509 gene expression Effects 0.000 description 37
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 36
- 238000004128 high performance liquid chromatography Methods 0.000 description 33
- 238000002360 preparation method Methods 0.000 description 31
- 239000006227 byproduct Substances 0.000 description 30
- 238000003556 assay Methods 0.000 description 26
- 239000000243 solution Substances 0.000 description 24
- 239000002904 solvent Substances 0.000 description 22
- UCKMPCXJQFINFW-UHFFFAOYSA-N Sulphide Chemical compound [S-2] UCKMPCXJQFINFW-UHFFFAOYSA-N 0.000 description 21
- 239000000203 mixture Substances 0.000 description 20
- 125000001424 substituent group Chemical group 0.000 description 20
- 238000004519 manufacturing process Methods 0.000 description 19
- 239000002243 precursor Substances 0.000 description 19
- 108020004705 Codon Proteins 0.000 description 18
- 102000004169 proteins and genes Human genes 0.000 description 18
- 230000002378 acidificating effect Effects 0.000 description 16
- 125000001931 aliphatic group Chemical group 0.000 description 16
- 229960000381 omeprazole Drugs 0.000 description 16
- 239000011541 reaction mixture Substances 0.000 description 16
- 239000000523 sample Substances 0.000 description 16
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 15
- 239000008363 phosphate buffer Substances 0.000 description 15
- 239000013598 vector Substances 0.000 description 15
- 235000018417 cysteine Nutrition 0.000 description 14
- 238000012217 deletion Methods 0.000 description 14
- 230000037430 deletion Effects 0.000 description 14
- 230000002255 enzymatic effect Effects 0.000 description 14
- 239000000843 powder Substances 0.000 description 14
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 13
- 238000007792 addition Methods 0.000 description 13
- 230000002210 biocatalytic effect Effects 0.000 description 13
- 239000013592 cell lysate Substances 0.000 description 13
- 230000008929 regeneration Effects 0.000 description 13
- 238000011069 regeneration method Methods 0.000 description 13
- 239000007853 buffer solution Substances 0.000 description 12
- 235000018102 proteins Nutrition 0.000 description 12
- 238000004296 chiral HPLC Methods 0.000 description 11
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 11
- 239000006166 lysate Substances 0.000 description 11
- 102200075243 rs118204105 Human genes 0.000 description 11
- 102220277039 rs532585602 Human genes 0.000 description 11
- BDAGIHXWWSANSR-UHFFFAOYSA-M Formate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 description 10
- 108010036197 NAD phosphite oxidoreductase Proteins 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 10
- 239000000872 buffer Substances 0.000 description 10
- 230000003247 decreasing effect Effects 0.000 description 10
- 239000007789 gas Substances 0.000 description 10
- 230000002209 hydrophobic effect Effects 0.000 description 10
- 241000589291 Acinetobacter Species 0.000 description 9
- 108010076504 Protein Sorting Signals Proteins 0.000 description 9
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 9
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 9
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 9
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 9
- 229960003174 lansoprazole Drugs 0.000 description 9
- 150000007523 nucleic acids Chemical class 0.000 description 9
- 239000008188 pellet Substances 0.000 description 9
- 239000007787 solid Substances 0.000 description 9
- 241000894007 species Species 0.000 description 9
- 239000011550 stock solution Substances 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 8
- 125000004432 carbon atom Chemical group C* 0.000 description 8
- 238000004587 chromatography analysis Methods 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 229910052760 oxygen Inorganic materials 0.000 description 8
- 239000001301 oxygen Substances 0.000 description 8
- 230000035484 reaction time Effects 0.000 description 8
- 238000004064 recycling Methods 0.000 description 8
- 239000002002 slurry Substances 0.000 description 8
- 150000003457 sulfones Chemical class 0.000 description 8
- 150000003462 sulfoxides Chemical class 0.000 description 8
- 239000006228 supernatant Substances 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- 108091026890 Coding region Proteins 0.000 description 7
- 239000011942 biocatalyst Substances 0.000 description 7
- 230000008859 change Effects 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 239000010410 layer Substances 0.000 description 7
- 238000007254 oxidation reaction Methods 0.000 description 7
- 239000012071 phase Substances 0.000 description 7
- 230000002829 reductive effect Effects 0.000 description 7
- 230000004044 response Effects 0.000 description 7
- SUBDBMMJDZJVOS-XMMPIXPASA-N (R)-omeprazole Chemical compound C([S@@](=O)C=1NC2=CC=C(C=C2N=1)OC)C1=NC=C(C)C(OC)=C1C SUBDBMMJDZJVOS-XMMPIXPASA-N 0.000 description 6
- 108090000698 Formate Dehydrogenases Proteins 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 125000003342 alkenyl group Chemical group 0.000 description 6
- 125000003545 alkoxy group Chemical group 0.000 description 6
- 239000002585 base Substances 0.000 description 6
- 239000011203 carbon fibre reinforced carbon Substances 0.000 description 6
- 238000005119 centrifugation Methods 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 238000010790 dilution Methods 0.000 description 6
- 239000012895 dilution Substances 0.000 description 6
- 239000011714 flavin adenine dinucleotide Substances 0.000 description 6
- VWWQXMAJTJZDQX-UYBVJOGSSA-N flavin adenine dinucleotide Chemical compound C1=NC2=C(N)N=CN=C2N1[C@@H]([C@H](O)[C@@H]1O)O[C@@H]1CO[P@](O)(=O)O[P@@](O)(=O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C2=NC(=O)NC(=O)C2=NC2=C1C=C(C)C(C)=C2 VWWQXMAJTJZDQX-UYBVJOGSSA-N 0.000 description 6
- 235000019162 flavin adenine dinucleotide Nutrition 0.000 description 6
- 229940093632 flavin-adenine dinucleotide Drugs 0.000 description 6
- 230000005714 functional activity Effects 0.000 description 6
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 6
- 230000003647 oxidation Effects 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 229960004157 rabeprazole Drugs 0.000 description 6
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 5
- 241000228245 Aspergillus niger Species 0.000 description 5
- 108030000430 Cyclohexanone monooxygenases Proteins 0.000 description 5
- MYMOFIZGZYHOMD-UHFFFAOYSA-N Dioxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 5
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 5
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 5
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 5
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 5
- IMNFDUFMRHMDMM-UHFFFAOYSA-N N-Heptane Chemical compound CCCCCCC IMNFDUFMRHMDMM-UHFFFAOYSA-N 0.000 description 5
- BAWFJGJZGIEFAR-NNYOXOHSSA-N NAD zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 BAWFJGJZGIEFAR-NNYOXOHSSA-N 0.000 description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 5
- 125000000304 alkynyl group Chemical group 0.000 description 5
- 125000004429 atom Chemical group 0.000 description 5
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 5
- 150000001721 carbon Chemical group 0.000 description 5
- 230000001186 cumulative effect Effects 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 229910001882 dioxygen Inorganic materials 0.000 description 5
- 238000006911 enzymatic reaction Methods 0.000 description 5
- 238000001914 filtration Methods 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 229960001031 glucose Drugs 0.000 description 5
- 125000005842 heteroatom Chemical group 0.000 description 5
- 230000006872 improvement Effects 0.000 description 5
- 238000011068 loading method Methods 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 5
- 238000003756 stirring Methods 0.000 description 5
- 240000006439 Aspergillus oryzae Species 0.000 description 4
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 101710088194 Dehydrogenase Proteins 0.000 description 4
- 108010035289 Glucose Dehydrogenases Proteins 0.000 description 4
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 4
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 102000016943 Muramidase Human genes 0.000 description 4
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 4
- 239000004698 Polyethylene Substances 0.000 description 4
- 238000003491 array Methods 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 229910002092 carbon dioxide Inorganic materials 0.000 description 4
- 230000001747 exhibiting effect Effects 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 125000001188 haloalkyl group Chemical group 0.000 description 4
- 229910052739 hydrogen Inorganic materials 0.000 description 4
- 230000000977 initiatory effect Effects 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 235000010335 lysozyme Nutrition 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 229910052751 metal Inorganic materials 0.000 description 4
- 239000002184 metal Substances 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- OJMIONKXNSYLSR-UHFFFAOYSA-N phosphorous acid Chemical compound OP(O)O OJMIONKXNSYLSR-UHFFFAOYSA-N 0.000 description 4
- 239000000376 reactant Substances 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 4
- 238000010626 work up procedure Methods 0.000 description 4
- SXGMVGOVILIERA-UHFFFAOYSA-N 2,3-diaminobutanoic acid Chemical compound CC(N)C(N)C(O)=O SXGMVGOVILIERA-UHFFFAOYSA-N 0.000 description 3
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 3
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 3
- JJDJLFDGCUYZMN-QMMMGPOBSA-N 3-chloro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(Cl)=C1 JJDJLFDGCUYZMN-QMMMGPOBSA-N 0.000 description 3
- 239000004382 Amylase Substances 0.000 description 3
- 108010065511 Amylases Proteins 0.000 description 3
- 102000013142 Amylases Human genes 0.000 description 3
- 102000016938 Catalase Human genes 0.000 description 3
- 108010053835 Catalase Proteins 0.000 description 3
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- HXEACLLIILLPRG-YFKPBYRVSA-N L-pipecolic acid Chemical compound [O-]C(=O)[C@@H]1CCCC[NH2+]1 HXEACLLIILLPRG-YFKPBYRVSA-N 0.000 description 3
- NTIZESTWPVYFNL-UHFFFAOYSA-N Methyl isobutyl ketone Chemical compound CC(C)CC(C)=O NTIZESTWPVYFNL-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- 235000019418 amylase Nutrition 0.000 description 3
- 239000003125 aqueous solvent Substances 0.000 description 3
- 125000003710 aryl alkyl group Chemical group 0.000 description 3
- GFZWHAAOIVMHOI-UHFFFAOYSA-N azetidine-3-carboxylic acid Chemical compound OC(=O)C1CNC1 GFZWHAAOIVMHOI-UHFFFAOYSA-N 0.000 description 3
- 125000000051 benzyloxy group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])O* 0.000 description 3
- BTANRVKWQNVYAZ-UHFFFAOYSA-N butan-2-ol Chemical compound CCC(C)O BTANRVKWQNVYAZ-UHFFFAOYSA-N 0.000 description 3
- 239000003054 catalyst Substances 0.000 description 3
- 230000006037 cell lysis Effects 0.000 description 3
- 239000003638 chemical reducing agent Substances 0.000 description 3
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 3
- 229960005091 chloramphenicol Drugs 0.000 description 3
- 125000004122 cyclic group Chemical group 0.000 description 3
- JHIVVAPYMSGYDF-UHFFFAOYSA-N cyclohexanone Chemical compound O=C1CCCCC1 JHIVVAPYMSGYDF-UHFFFAOYSA-N 0.000 description 3
- 108010057167 dimethylaniline monooxygenase (N-oxide forming) Proteins 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 208000021302 gastroesophageal reflux disease Diseases 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- HXEACLLIILLPRG-RXMQYKEDSA-N l-pipecolic acid Natural products OC(=O)[C@H]1CCCCN1 HXEACLLIILLPRG-RXMQYKEDSA-N 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 229950006238 nadide Drugs 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 239000012044 organic layer Substances 0.000 description 3
- 239000012074 organic phase Substances 0.000 description 3
- KQAOIKIZSJJTII-UHFFFAOYSA-N p-mercuribenzenesulfonic acid Chemical compound OS(=O)(=O)C1=CC=C([Hg])C=C1 KQAOIKIZSJJTII-UHFFFAOYSA-N 0.000 description 3
- 150000002978 peroxides Chemical class 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- HHAVHBDPWSUKHZ-UHFFFAOYSA-N propan-2-ol;propan-2-one Chemical compound CC(C)O.CC(C)=O HHAVHBDPWSUKHZ-UHFFFAOYSA-N 0.000 description 3
- 238000010791 quenching Methods 0.000 description 3
- 238000006722 reduction reaction Methods 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 239000000377 silicon dioxide Substances 0.000 description 3
- 150000003568 thioethers Chemical class 0.000 description 3
- 125000003396 thiol group Chemical group [H]S* 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 3
- FQRURPFZTFUXEZ-MRVPVSSYSA-N (2s)-2,3,3,3-tetrafluoro-2-(n-fluoroanilino)propanoic acid Chemical compound OC(=O)[C@](F)(C(F)(F)F)N(F)C1=CC=CC=C1 FQRURPFZTFUXEZ-MRVPVSSYSA-N 0.000 description 2
- BVAUMRCGVHUWOZ-ZETCQYMHSA-N (2s)-2-(cyclohexylazaniumyl)propanoate Chemical compound OC(=O)[C@H](C)NC1CCCCC1 BVAUMRCGVHUWOZ-ZETCQYMHSA-N 0.000 description 2
- PECGVEGMRUZOML-AWEZNQCLSA-N (2s)-2-amino-3,3-diphenylpropanoic acid Chemical compound C=1C=CC=CC=1C([C@H](N)C(O)=O)C1=CC=CC=C1 PECGVEGMRUZOML-AWEZNQCLSA-N 0.000 description 2
- GDMOHOYNMWWBAU-QMMMGPOBSA-N (2s)-2-amino-3-(3-bromophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(Br)=C1 GDMOHOYNMWWBAU-QMMMGPOBSA-N 0.000 description 2
- BURBNIPKSRJAIQ-QMMMGPOBSA-N (2s)-2-amino-3-[3-(trifluoromethyl)phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(C(F)(F)F)=C1 BURBNIPKSRJAIQ-QMMMGPOBSA-N 0.000 description 2
- YTHDRUZHNYKZGF-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(3-nitrophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=CC([N+]([O-])=O)=C1 YTHDRUZHNYKZGF-QMMMGPOBSA-N 0.000 description 2
- OGNSCSPNOLGXSM-UHFFFAOYSA-N 2,4-diaminobutyric acid Chemical compound NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 2
- MXLMTQWGSQIYOW-UHFFFAOYSA-N 3-methyl-2-butanol Chemical compound CC(C)C(C)O MXLMTQWGSQIYOW-UHFFFAOYSA-N 0.000 description 2
- JZRBSTONIYRNRI-VIFPVBQESA-N 3-methylphenylalanine Chemical compound CC1=CC=CC(C[C@H](N)C(O)=O)=C1 JZRBSTONIYRNRI-VIFPVBQESA-N 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- JJMDCOVWQOJGCB-UHFFFAOYSA-N 5-aminopentanoic acid Chemical compound [NH3+]CCCCC([O-])=O JJMDCOVWQOJGCB-UHFFFAOYSA-N 0.000 description 2
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 2
- 102000004580 Aspartic Acid Proteases Human genes 0.000 description 2
- 108010017640 Aspartic Acid Proteases Proteins 0.000 description 2
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 2
- 108010029675 Bacillus licheniformis alpha-amylase Proteins 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- RGHNJXZEOKUKBD-SQOUGZDYSA-N D-gluconic acid Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C(O)=O RGHNJXZEOKUKBD-SQOUGZDYSA-N 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 101100369308 Geobacillus stearothermophilus nprS gene Proteins 0.000 description 2
- 108010018962 Glucosephosphate Dehydrogenase Proteins 0.000 description 2
- 102000002794 Glucosephosphate Dehydrogenase Human genes 0.000 description 2
- 108010020056 Hydrogenase Proteins 0.000 description 2
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 2
- 229930194542 Keto Natural products 0.000 description 2
- 150000008575 L-amino acids Chemical class 0.000 description 2
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 2
- QEFRNWWLZKMPFJ-ZXPFJRLXSA-N L-methionine (R)-S-oxide Chemical compound C[S@@](=O)CC[C@H]([NH3+])C([O-])=O QEFRNWWLZKMPFJ-ZXPFJRLXSA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- VEYYWZRYIYDQJM-ZETCQYMHSA-N N(2)-acetyl-L-lysine Chemical compound CC(=O)N[C@H](C([O-])=O)CCCC[NH3+] VEYYWZRYIYDQJM-ZETCQYMHSA-N 0.000 description 2
- AKCRVYNORCOYQT-YFKPBYRVSA-N N-methyl-L-valine Chemical compound CN[C@@H](C(C)C)C(O)=O AKCRVYNORCOYQT-YFKPBYRVSA-N 0.000 description 2
- KSPIYJQBLVDRRI-UHFFFAOYSA-N N-methylisoleucine Chemical compound CCC(C)C(NC)C(O)=O KSPIYJQBLVDRRI-UHFFFAOYSA-N 0.000 description 2
- 208000008469 Peptic Ulcer Diseases 0.000 description 2
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 2
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 241000235403 Rhizomucor miehei Species 0.000 description 2
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 2
- 108010077895 Sarcosine Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 201000008629 Zollinger-Ellison syndrome Diseases 0.000 description 2
- 238000005273 aeration Methods 0.000 description 2
- 150000001299 aldehydes Chemical class 0.000 description 2
- 150000001335 aliphatic alkanes Chemical class 0.000 description 2
- 150000001336 alkenes Chemical class 0.000 description 2
- 150000001345 alkine derivatives Chemical class 0.000 description 2
- 125000004453 alkoxycarbonyl group Chemical group 0.000 description 2
- 108090000637 alpha-Amylases Proteins 0.000 description 2
- 102000004139 alpha-Amylases Human genes 0.000 description 2
- 229940024171 alpha-amylase Drugs 0.000 description 2
- 150000001408 amides Chemical class 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 239000012298 atmosphere Substances 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 2
- 239000005515 coenzyme Substances 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 239000005289 controlled pore glass Substances 0.000 description 2
- 239000013058 crude material Substances 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 229910052805 deuterium Inorganic materials 0.000 description 2
- 229910001873 dinitrogen Inorganic materials 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 239000012065 filter cake Substances 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 201000000052 gastrinoma Diseases 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 229910052736 halogen Inorganic materials 0.000 description 2
- 125000005843 halogen group Chemical group 0.000 description 2
- 238000012203 high throughput assay Methods 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 2
- 125000000468 ketone group Chemical group 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- VWHRYODZTDMVSS-QMMMGPOBSA-N m-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(F)=C1 VWHRYODZTDMVSS-QMMMGPOBSA-N 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- JYVLIDXNZAXMDK-UHFFFAOYSA-N pentan-2-ol Chemical compound CCCC(C)O JYVLIDXNZAXMDK-UHFFFAOYSA-N 0.000 description 2
- AQIXEPGDORPWBJ-UHFFFAOYSA-N pentan-3-ol Chemical compound CCC(O)CC AQIXEPGDORPWBJ-UHFFFAOYSA-N 0.000 description 2
- 208000011906 peptic ulcer disease Diseases 0.000 description 2
- HXEACLLIILLPRG-UHFFFAOYSA-N pipecolic acid Chemical compound OC(=O)C1CCCCN1 HXEACLLIILLPRG-UHFFFAOYSA-N 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 229920000573 polyethylene Polymers 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- XAEFZNCEHLXOMS-UHFFFAOYSA-M potassium benzoate Chemical compound [K+].[O-]C(=O)C1=CC=CC=C1 XAEFZNCEHLXOMS-UHFFFAOYSA-M 0.000 description 2
- 229910000160 potassium phosphate Inorganic materials 0.000 description 2
- 239000008057 potassium phosphate buffer Substances 0.000 description 2
- 235000011009 potassium phosphates Nutrition 0.000 description 2
- 238000011533 pre-incubation Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 125000006239 protecting group Chemical group 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 125000006413 ring segment Chemical group 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 230000000707 stereoselective effect Effects 0.000 description 2
- 125000000547 substituted alkyl group Chemical group 0.000 description 2
- 239000010414 supernatant solution Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 230000008961 swelling Effects 0.000 description 2
- 238000004885 tandem mass spectrometry Methods 0.000 description 2
- 150000003573 thiols Chemical class 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- PAPBSGBWRJIAAV-UHFFFAOYSA-N ε-Caprolactone Chemical compound O=C1CCCCCO1 PAPBSGBWRJIAAV-UHFFFAOYSA-N 0.000 description 2
- SPFMQWBKVUQXJV-BTVCFUMJSA-N (2r,3s,4r,5r)-2,3,4,5,6-pentahydroxyhexanal;hydrate Chemical compound O.OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O SPFMQWBKVUQXJV-BTVCFUMJSA-N 0.000 description 1
- NMDDZEVVQDPECF-LURJTMIESA-N (2s)-2,7-diaminoheptanoic acid Chemical compound NCCCCC[C@H](N)C(O)=O NMDDZEVVQDPECF-LURJTMIESA-N 0.000 description 1
- ZENNTZUZBRESKJ-ZETCQYMHSA-N (2s)-2-(1-benzothiophen-2-ylamino)propanoic acid Chemical compound C1=CC=C2SC(N[C@@H](C)C(O)=O)=CC2=C1 ZENNTZUZBRESKJ-ZETCQYMHSA-N 0.000 description 1
- IYKLZBIWFXPUCS-VIFPVBQESA-N (2s)-2-(naphthalen-1-ylamino)propanoic acid Chemical compound C1=CC=C2C(N[C@@H](C)C(O)=O)=CC=CC2=C1 IYKLZBIWFXPUCS-VIFPVBQESA-N 0.000 description 1
- MRTPISKDZDHEQI-YFKPBYRVSA-N (2s)-2-(tert-butylamino)propanoic acid Chemical compound OC(=O)[C@H](C)NC(C)(C)C MRTPISKDZDHEQI-YFKPBYRVSA-N 0.000 description 1
- NPDBDJFLKKQMCM-SCSAIBSYSA-N (2s)-2-amino-3,3-dimethylbutanoic acid Chemical compound CC(C)(C)[C@H](N)C(O)=O NPDBDJFLKKQMCM-SCSAIBSYSA-N 0.000 description 1
- WBZIGVCQRXJYQD-YFKPBYRVSA-N (2s)-2-amino-3-(1,3-thiazol-4-yl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CSC=N1 WBZIGVCQRXJYQD-YFKPBYRVSA-N 0.000 description 1
- JFVLNTLXEZDFHW-QMMMGPOBSA-N (2s)-2-amino-3-(2-bromophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1Br JFVLNTLXEZDFHW-QMMMGPOBSA-N 0.000 description 1
- NHBKDLSKDKUGSB-VIFPVBQESA-N (2s)-2-amino-3-(2-methylphenyl)propanoic acid Chemical compound CC1=CC=CC=C1C[C@H](N)C(O)=O NHBKDLSKDKUGSB-VIFPVBQESA-N 0.000 description 1
- NRCSJHVDTAAISV-QMMMGPOBSA-N (2s)-2-amino-3-(3,4-dichlorophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(Cl)C(Cl)=C1 NRCSJHVDTAAISV-QMMMGPOBSA-N 0.000 description 1
- ZHUOMTMPTNZOJE-VIFPVBQESA-N (2s)-2-amino-3-(3-cyanophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(C#N)=C1 ZHUOMTMPTNZOJE-VIFPVBQESA-N 0.000 description 1
- PEMUHKUIQHFMTH-QMMMGPOBSA-N (2s)-2-amino-3-(4-bromophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(Br)C=C1 PEMUHKUIQHFMTH-QMMMGPOBSA-N 0.000 description 1
- KWIPUXXIFQQMKN-VIFPVBQESA-N (2s)-2-amino-3-(4-cyanophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(C#N)C=C1 KWIPUXXIFQQMKN-VIFPVBQESA-N 0.000 description 1
- DQLHSFUMICQIMB-VIFPVBQESA-N (2s)-2-amino-3-(4-methylphenyl)propanoic acid Chemical compound CC1=CC=C(C[C@H](N)C(O)=O)C=C1 DQLHSFUMICQIMB-VIFPVBQESA-N 0.000 description 1
- IOABLDGLYOGEHY-QMMMGPOBSA-N (2s)-2-amino-3-[2-(trifluoromethyl)phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1C(F)(F)F IOABLDGLYOGEHY-QMMMGPOBSA-N 0.000 description 1
- GNVNKFUEUXUWDV-VIFPVBQESA-N (2s)-2-amino-3-[4-(aminomethyl)phenyl]propanoic acid Chemical compound NCC1=CC=C(C[C@H](N)C(O)=O)C=C1 GNVNKFUEUXUWDV-VIFPVBQESA-N 0.000 description 1
- GAJBPZXIKZXTCG-VIFPVBQESA-N (2s)-2-amino-3-[4-(azidomethyl)phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(CN=[N+]=[N-])C=C1 GAJBPZXIKZXTCG-VIFPVBQESA-N 0.000 description 1
- CRFFPDBJLGAGQL-QMMMGPOBSA-N (2s)-2-amino-3-[4-(trifluoromethyl)phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(C(F)(F)F)C=C1 CRFFPDBJLGAGQL-QMMMGPOBSA-N 0.000 description 1
- SDZGVFSSLGTJAJ-ZETCQYMHSA-N (2s)-2-azaniumyl-3-(2-nitrophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1[N+]([O-])=O SDZGVFSSLGTJAJ-ZETCQYMHSA-N 0.000 description 1
- ADJZXDVMJPTFKT-JTQLQIEISA-N (2s)-2-azaniumyl-4-(1h-indol-3-yl)butanoate Chemical compound C1=CC=C2C(CC[C@H](N)C(O)=O)=CNC2=C1 ADJZXDVMJPTFKT-JTQLQIEISA-N 0.000 description 1
- FMUMEWVNYMUECA-LURJTMIESA-N (2s)-2-azaniumyl-5-methylhexanoate Chemical compound CC(C)CC[C@H](N)C(O)=O FMUMEWVNYMUECA-LURJTMIESA-N 0.000 description 1
- XOZMVGJVSFVJGA-VIFPVBQESA-N (2s)-3-phenyl-2-(phosphonomethylamino)propanoic acid Chemical compound OP(=O)(O)CN[C@H](C(=O)O)CC1=CC=CC=C1 XOZMVGJVSFVJGA-VIFPVBQESA-N 0.000 description 1
- LJRDOKAZOAKLDU-UDXJMMFXSA-N (2s,3s,4r,5r,6r)-5-amino-2-(aminomethyl)-6-[(2r,3s,4r,5s)-5-[(1r,2r,3s,5r,6s)-3,5-diamino-2-[(2s,3r,4r,5s,6r)-3-amino-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-6-hydroxycyclohexyl]oxy-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl]oxyoxane-3,4-diol;sulfuric ac Chemical compound OS(O)(=O)=O.N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO LJRDOKAZOAKLDU-UDXJMMFXSA-N 0.000 description 1
- SBTVLCPCSXMWIQ-UHFFFAOYSA-N (3,5-dimethylphenyl) carbamate Chemical compound CC1=CC(C)=CC(OC(N)=O)=C1 SBTVLCPCSXMWIQ-UHFFFAOYSA-N 0.000 description 1
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- 125000006002 1,1-difluoroethyl group Chemical group 0.000 description 1
- 125000004776 1-fluoroethyl group Chemical group [H]C([H])([H])C([H])(F)* 0.000 description 1
- GWHQTNKPTXDNRM-UHFFFAOYSA-N 2-azaniumyl-3-(2,4-dichlorophenyl)propanoate Chemical compound OC(=O)C(N)CC1=CC=C(Cl)C=C1Cl GWHQTNKPTXDNRM-UHFFFAOYSA-N 0.000 description 1
- UEFLPVKMPDEMFW-UHFFFAOYSA-N 2-azaniumyl-3-(2,4-difluorophenyl)propanoate Chemical compound OC(=O)C(N)CC1=CC=C(F)C=C1F UEFLPVKMPDEMFW-UHFFFAOYSA-N 0.000 description 1
- CVZZNRXMDCOHBG-UHFFFAOYSA-N 2-azaniumyl-3-(2-chlorophenyl)propanoate Chemical compound OC(=O)C(N)CC1=CC=CC=C1Cl CVZZNRXMDCOHBG-UHFFFAOYSA-N 0.000 description 1
- OCDHPLVCNWBKJN-UHFFFAOYSA-N 2-azaniumyl-3-(2-cyanophenyl)propanoate Chemical compound OC(=O)C(N)CC1=CC=CC=C1C#N OCDHPLVCNWBKJN-UHFFFAOYSA-N 0.000 description 1
- PRAWYXDDKCVZTL-UHFFFAOYSA-N 2-azaniumyl-3-(3,4-difluorophenyl)propanoate Chemical compound OC(=O)C(N)CC1=CC=C(F)C(F)=C1 PRAWYXDDKCVZTL-UHFFFAOYSA-N 0.000 description 1
- WTOFYLAWDLQMBZ-UHFFFAOYSA-N 2-azaniumyl-3-thiophen-2-ylpropanoate Chemical compound OC(=O)C(N)CC1=CC=CS1 WTOFYLAWDLQMBZ-UHFFFAOYSA-N 0.000 description 1
- NYCRCTMDYITATC-UHFFFAOYSA-N 2-fluorophenylalanine Chemical compound OC(=O)C(N)CC1=CC=CC=C1F NYCRCTMDYITATC-UHFFFAOYSA-N 0.000 description 1
- JWUJQDFVADABEY-UHFFFAOYSA-N 2-methyltetrahydrofuran Chemical compound CC1CCCO1 JWUJQDFVADABEY-UHFFFAOYSA-N 0.000 description 1
- MLLSSTJTARJLHK-UHFFFAOYSA-N 3-aminocyclopentane-1-carboxylic acid Chemical compound NC1CCC(C(O)=O)C1 MLLSSTJTARJLHK-UHFFFAOYSA-N 0.000 description 1
- BXRLWGXPSRYJDZ-UHFFFAOYSA-N 3-cyanoalanine Chemical compound OC(=O)C(N)CC#N BXRLWGXPSRYJDZ-UHFFFAOYSA-N 0.000 description 1
- IRZQDMYEJPNDEN-UHFFFAOYSA-N 3-phenyl-2-aminobutanoic acid Natural products OC(=O)C(N)C(C)C1=CC=CC=C1 IRZQDMYEJPNDEN-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- CMUHFUGDYMFHEI-QMMMGPOBSA-N 4-amino-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N)C=C1 CMUHFUGDYMFHEI-QMMMGPOBSA-N 0.000 description 1
- XWHHYOYVRVGJJY-QMMMGPOBSA-N 4-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-QMMMGPOBSA-N 0.000 description 1
- PZNQZSRPDOEBMS-QMMMGPOBSA-N 4-iodo-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(I)C=C1 PZNQZSRPDOEBMS-QMMMGPOBSA-N 0.000 description 1
- GTVVZTAFGPQSPC-UHFFFAOYSA-N 4-nitrophenylalanine Chemical compound OC(=O)C(N)CC1=CC=C([N+]([O-])=O)C=C1 GTVVZTAFGPQSPC-UHFFFAOYSA-N 0.000 description 1
- IQPSEEYGBUAQFF-SANMLTNESA-N 6-(difluoromethoxy)-2-[(s)-(3,4-dimethoxypyridin-2-yl)methylsulfinyl]-1h-benzimidazole Chemical compound COC1=CC=NC(C[S@](=O)C=2NC3=CC=C(OC(F)F)C=C3N=2)=C1OC IQPSEEYGBUAQFF-SANMLTNESA-N 0.000 description 1
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 1
- 108010029731 6-phosphogluconolactonase Proteins 0.000 description 1
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 1
- 108091006112 ATPases Proteins 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 241000093895 Acidovorax sp. Species 0.000 description 1
- 102000057290 Adenosine Triphosphatases Human genes 0.000 description 1
- 241000588813 Alcaligenes faecalis Species 0.000 description 1
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 1
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- 241000534414 Anotopterus nikparini Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 101000961203 Aspergillus awamori Glucoamylase Proteins 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000301512 Bacillus cereus ATCC 14579 Species 0.000 description 1
- 101000695691 Bacillus licheniformis Beta-lactamase Proteins 0.000 description 1
- 241000194107 Bacillus megaterium Species 0.000 description 1
- 108010045681 Bacillus stearothermophilus neutral protease Proteins 0.000 description 1
- 101900040182 Bacillus subtilis Levansucrase Proteins 0.000 description 1
- DKPFZGUDAPQIHT-UHFFFAOYSA-N Butyl acetate Natural products CCCCOC(C)=O DKPFZGUDAPQIHT-UHFFFAOYSA-N 0.000 description 1
- 125000003601 C2-C6 alkynyl group Chemical group 0.000 description 1
- 229910014585 C2-Ce Inorganic materials 0.000 description 1
- 101150096674 C20L gene Proteins 0.000 description 1
- 125000000172 C5-C10 aryl group Chemical group 0.000 description 1
- 125000001313 C5-C10 heteroaryl group Chemical group 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 101710136644 Cyclohexanone 1,2-monooxygenase Proteins 0.000 description 1
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 1
- 102000003849 Cytochrome P450 Human genes 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- NBSCHQHZLSJFNQ-GASJEMHNSA-N D-Glucose 6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@H]1O NBSCHQHZLSJFNQ-GASJEMHNSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- RGHNJXZEOKUKBD-UHFFFAOYSA-N D-gluconic acid Natural products OCC(O)C(O)C(O)C(O)C(O)=O RGHNJXZEOKUKBD-UHFFFAOYSA-N 0.000 description 1
- 101100342470 Dictyostelium discoideum pkbA gene Proteins 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 101100385973 Escherichia coli (strain K12) cycA gene Proteins 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108020005637 FAD dependent oxidoreductase Proteins 0.000 description 1
- 102000007384 FAD-dependent oxidoreductase Human genes 0.000 description 1
- NIGWMJHCCYYCSF-UHFFFAOYSA-N Fenclonine Chemical compound OC(=O)C(N)CC1=CC=C(Cl)C=C1 NIGWMJHCCYYCSF-UHFFFAOYSA-N 0.000 description 1
- 108030006091 Flavin-containing monooxygenases Proteins 0.000 description 1
- 241000223221 Fusarium oxysporum Species 0.000 description 1
- 230000005526 G1 to G0 transition Effects 0.000 description 1
- 101150108358 GLAA gene Proteins 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 101100080316 Geobacillus stearothermophilus nprT gene Proteins 0.000 description 1
- VFRROHXSMXFLSN-UHFFFAOYSA-N Glc6P Natural products OP(=O)(O)OCC(O)C(O)C(O)C(O)C=O VFRROHXSMXFLSN-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241001480714 Humicola insolens Species 0.000 description 1
- 102100027612 Kallikrein-11 Human genes 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-UHFFFAOYSA-N L-Homoarginine Natural products OC(=O)C(N)CCCCNC(N)=N QUOGESRFPZDMMT-UHFFFAOYSA-N 0.000 description 1
- LOOZZTFGSTZNRX-VIFPVBQESA-N L-Homotyrosine Chemical compound OC(=O)[C@@H](N)CCC1=CC=C(O)C=C1 LOOZZTFGSTZNRX-VIFPVBQESA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- QWCKQJZIFLGMSD-VKHMYHEASA-N L-alpha-aminobutyric acid Chemical compound CC[C@H](N)C(O)=O QWCKQJZIFLGMSD-VKHMYHEASA-N 0.000 description 1
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical compound OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- QUOGESRFPZDMMT-YFKPBYRVSA-N L-homoarginine Chemical compound OC(=O)[C@@H](N)CCCCNC(N)=N QUOGESRFPZDMMT-YFKPBYRVSA-N 0.000 description 1
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical compound OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 description 1
- JTTHKOPSMAVJFE-VIFPVBQESA-N L-homophenylalanine Chemical compound OC(=O)[C@@H](N)CCC1=CC=CC=C1 JTTHKOPSMAVJFE-VIFPVBQESA-N 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-UHFFFAOYSA-N L-methionine sulphoxide Natural products CS(=O)CCC(N)C(O)=O QEFRNWWLZKMPFJ-UHFFFAOYSA-N 0.000 description 1
- DGYHPLMPMRKMPD-UHFFFAOYSA-N L-propargyl glycine Natural products OC(=O)C(N)CC#C DGYHPLMPMRKMPD-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 240000001929 Lactobacillus brevis Species 0.000 description 1
- 235000013957 Lactobacillus brevis Nutrition 0.000 description 1
- 241001468191 Lactobacillus kefiri Species 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- UIHCLUNTQKBZGK-UHFFFAOYSA-N Methyl isobutyl ketone Natural products CCC(C)C(C)=O UIHCLUNTQKBZGK-UHFFFAOYSA-N 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 101150096038 PTH1R gene Proteins 0.000 description 1
- 240000002390 Pandanus odoratissimus Species 0.000 description 1
- 235000005311 Pandanus odoratissimus Nutrition 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 229920001774 Perfluoroether Polymers 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 229920002873 Polyethylenimine Polymers 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 102220543923 Protocadherin-10_F16L_mutation Human genes 0.000 description 1
- 241000589614 Pseudomonas stutzeri Species 0.000 description 1
- 241001348364 Pseudooceanicola batsensis HTCC2597 Species 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 101000968489 Rhizomucor miehei Lipase Proteins 0.000 description 1
- 241000187561 Rhodococcus erythropolis Species 0.000 description 1
- 101900354623 Saccharomyces cerevisiae Galactokinase Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 244000191761 Sida cordifolia Species 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 101100309436 Streptococcus mutans serotype c (strain ATCC 700610 / UA159) ftf gene Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 108090000787 Subtilisin Proteins 0.000 description 1
- 241000548566 Sulfitobacter sp. Species 0.000 description 1
- 241000223258 Thermomyces lanuginosus Species 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Triethanolamine Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 101710152431 Trypsin-like protease Proteins 0.000 description 1
- 208000025865 Ulcer Diseases 0.000 description 1
- 101100445889 Vaccinia virus (strain Copenhagen) F16L gene Proteins 0.000 description 1
- 101100445891 Vaccinia virus (strain Western Reserve) VACWR055 gene Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 241000186864 Weissella minor Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- YNDUPGQMECPWKD-UHFFFAOYSA-N [O-][N+](=O)S[N+]([O-])=O Chemical compound [O-][N+](=O)S[N+]([O-])=O YNDUPGQMECPWKD-UHFFFAOYSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 108010048241 acetamidase Proteins 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 108010045649 agarase Proteins 0.000 description 1
- 238000005054 agglomeration Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 229940005347 alcaligenes faecalis Drugs 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 150000001338 aliphatic hydrocarbons Chemical group 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 150000001447 alkali salts Chemical class 0.000 description 1
- 125000004183 alkoxy alkyl group Chemical group 0.000 description 1
- 125000005083 alkoxyalkoxy group Chemical group 0.000 description 1
- 125000003282 alkyl amino group Chemical group 0.000 description 1
- 125000004414 alkyl thio group Chemical group 0.000 description 1
- WNNNWFKQCKFSDK-UHFFFAOYSA-N allylglycine Chemical compound OC(=O)C(N)CC=C WNNNWFKQCKFSDK-UHFFFAOYSA-N 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 239000000908 ammonium hydroxide Substances 0.000 description 1
- 239000012378 ammonium molybdate tetrahydrate Substances 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 125000002029 aromatic hydrocarbon group Chemical group 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 125000004659 aryl alkyl thio group Chemical group 0.000 description 1
- 125000002102 aryl alkyloxo group Chemical group 0.000 description 1
- 125000005110 aryl thio group Chemical group 0.000 description 1
- 125000004104 aryloxy group Chemical group 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- FIXLYHHVMHXSCP-UHFFFAOYSA-H azane;dihydroxy(dioxo)molybdenum;trioxomolybdenum;tetrahydrate Chemical compound N.N.N.N.N.N.O.O.O.O.O=[Mo](=O)=O.O=[Mo](=O)=O.O=[Mo](=O)=O.O=[Mo](=O)=O.O[Mo](O)(=O)=O.O[Mo](O)(=O)=O.O[Mo](O)(=O)=O FIXLYHHVMHXSCP-UHFFFAOYSA-H 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 238000011021 bench scale process Methods 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- WTOFYLAWDLQMBZ-LURJTMIESA-N beta(2-thienyl)alanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CS1 WTOFYLAWDLQMBZ-LURJTMIESA-N 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 230000002051 biphasic effect Effects 0.000 description 1
- 229910021538 borax Inorganic materials 0.000 description 1
- 125000001246 bromo group Chemical group Br* 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 229940043232 butyl acetate Drugs 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229960002713 calcium chloride Drugs 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- LLSDKQJKOVVTOJ-UHFFFAOYSA-L calcium chloride dihydrate Chemical compound O.O.[Cl-].[Cl-].[Ca+2] LLSDKQJKOVVTOJ-UHFFFAOYSA-L 0.000 description 1
- 229940052299 calcium chloride dihydrate Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 238000012219 cassette mutagenesis Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 125000001309 chloro group Chemical group Cl* 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- PZOSQGBRZPTYNZ-UHFFFAOYSA-L copper(1+);sulfate;pentahydrate Chemical compound O.O.O.O.O.[Cu+].[Cu+].[O-]S([O-])(=O)=O PZOSQGBRZPTYNZ-UHFFFAOYSA-L 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 125000001047 cyclobutenyl group Chemical group C1(=CCC1)* 0.000 description 1
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000000596 cyclohexenyl group Chemical group C1(=CCCCC1)* 0.000 description 1
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000000058 cyclopentadienyl group Chemical group C1(=CC=CC1)* 0.000 description 1
- 125000002433 cyclopentenyl group Chemical group C1(=CCCC1)* 0.000 description 1
- 125000001511 cyclopentyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000001559 cyclopropyl group Chemical group [H]C1([H])C([H])([H])C1([H])* 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 101150005799 dagA gene Proteins 0.000 description 1
- 238000007872 degassing Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- 229960000673 dextrose monohydrate Drugs 0.000 description 1
- 125000001028 difluoromethyl group Chemical group [H]C(F)(F)* 0.000 description 1
- 125000004982 dihaloalkyl group Chemical group 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- XQGPKZUNMMFTAL-UHFFFAOYSA-L dipotassium;hydrogen phosphate;trihydrate Chemical compound O.O.O.[K+].[K+].OP([O-])([O-])=O XQGPKZUNMMFTAL-UHFFFAOYSA-L 0.000 description 1
- UQGFMSUEHSUPRD-UHFFFAOYSA-N disodium;3,7-dioxido-2,4,6,8,9-pentaoxa-1,3,5,7-tetraborabicyclo[3.3.1]nonane Chemical compound [Na+].[Na+].O1B([O-])OB2OB([O-])OB1O2 UQGFMSUEHSUPRD-UHFFFAOYSA-N 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 201000006549 dyspepsia Diseases 0.000 description 1
- 229920001971 elastomer Polymers 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- ZSWFCLXCOIISFI-UHFFFAOYSA-N endo-cyclopentadiene Natural products C1C=CC=C1 ZSWFCLXCOIISFI-UHFFFAOYSA-N 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- KWORUUGOSLYAGD-YPPDDXJESA-N esomeprazole magnesium Chemical compound [Mg+2].C([S@](=O)C=1[N-]C2=CC=C(C=C2N=1)OC)C1=NC=C(C)C(OC)=C1C.C([S@](=O)C=1[N-]C2=CC=C(C=C2N=1)OC)C1=NC=C(C)C(OC)=C1C KWORUUGOSLYAGD-YPPDDXJESA-N 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 239000012527 feed solution Substances 0.000 description 1
- 229960004642 ferric ammonium citrate Drugs 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- 125000001153 fluoro group Chemical group F* 0.000 description 1
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000004817 gas chromatography Methods 0.000 description 1
- 101150019455 gdh gene Proteins 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 108010061330 glucan 1,4-alpha-maltohydrolase Proteins 0.000 description 1
- 239000000174 gluconic acid Substances 0.000 description 1
- 235000012208 gluconic acid Nutrition 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 150000004820 halides Chemical class 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 125000004446 heteroarylalkyl group Chemical group 0.000 description 1
- 125000005553 heteroaryloxy group Chemical group 0.000 description 1
- 125000005368 heteroarylthio group Chemical group 0.000 description 1
- FUZZWVXGSFPDMH-UHFFFAOYSA-N hexanoic acid Chemical compound CCCCCC(O)=O FUZZWVXGSFPDMH-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 150000004677 hydrates Chemical class 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- GPRLSGONYQIRFK-UHFFFAOYSA-N hydron Chemical compound [H+] GPRLSGONYQIRFK-UHFFFAOYSA-N 0.000 description 1
- 125000002768 hydroxyalkyl group Chemical group 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 150000003949 imides Chemical class 0.000 description 1
- 150000002466 imines Chemical class 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000011261 inert gas Substances 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- 125000002346 iodo group Chemical group I* 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 239000004313 iron ammonium citrate Substances 0.000 description 1
- 235000000011 iron ammonium citrate Nutrition 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 230000007762 localization of cell Effects 0.000 description 1
- 159000000003 magnesium salts Chemical class 0.000 description 1
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 1
- 229940061634 magnesium sulfate heptahydrate Drugs 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- ISPYRSDWRDQNSW-UHFFFAOYSA-L manganese(II) sulfate monohydrate Chemical compound O.[Mn+2].[O-]S([O-])(=O)=O ISPYRSDWRDQNSW-UHFFFAOYSA-L 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 125000006178 methyl benzyl group Chemical group 0.000 description 1
- 125000000250 methylamino group Chemical group [H]N(*)C([H])([H])[H] 0.000 description 1
- 239000011812 mixed powder Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 125000006682 monohaloalkyl group Chemical group 0.000 description 1
- 238000005817 monooxygenase reaction Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 125000004572 morpholin-3-yl group Chemical group N1C(COCC1)* 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 125000002757 morpholinyl group Chemical group 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- APVPOHHVBBYQAV-UHFFFAOYSA-N n-(4-aminophenyl)sulfonyloctadecanamide Chemical compound CCCCCCCCCCCCCCCCCC(=O)NS(=O)(=O)C1=CC=C(N)C=C1 APVPOHHVBBYQAV-UHFFFAOYSA-N 0.000 description 1
- 125000001624 naphthyl group Chemical group 0.000 description 1
- 210000003739 neck Anatomy 0.000 description 1
- 150000002825 nitriles Chemical class 0.000 description 1
- 229910052756 noble gas Inorganic materials 0.000 description 1
- 150000002835 noble gases Chemical class 0.000 description 1
- 101150105920 npr gene Proteins 0.000 description 1
- 101150017837 nprM gene Proteins 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 229920000620 organic polymer Polymers 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 108090000021 oryzin Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-O oxonium Chemical compound [OH3+] XLYOFNOQVPJJNP-UHFFFAOYSA-O 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 238000006213 oxygenation reaction Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 229960001639 penicillamine Drugs 0.000 description 1
- 125000005010 perfluoroalkyl group Chemical group 0.000 description 1
- 125000005004 perfluoroethyl group Chemical group FC(F)(F)C(F)(F)* 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000005191 phase separation Methods 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical compound OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- DFOXKPDFWGNLJU-UHFFFAOYSA-N pinacolyl alcohol Chemical compound CC(O)C(C)(C)C DFOXKPDFWGNLJU-UHFFFAOYSA-N 0.000 description 1
- 125000004194 piperazin-1-yl group Chemical group [H]N1C([H])([H])C([H])([H])N(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000004193 piperazinyl group Chemical group 0.000 description 1
- 125000000587 piperidin-1-yl group Chemical group [H]C1([H])N(*)C([H])([H])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 125000004574 piperidin-2-yl group Chemical group N1C(CCCC1)* 0.000 description 1
- 125000003386 piperidinyl group Chemical group 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 1
- 239000004810 polytetrafluoroethylene Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 150000003141 primary amines Chemical class 0.000 description 1
- 229940002612 prodrug Drugs 0.000 description 1
- 239000000651 prodrug Substances 0.000 description 1
- 125000002572 propoxy group Chemical group [*]OC([H])([H])C(C([H])([H])[H])([H])[H] 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 229940126409 proton pump inhibitor Drugs 0.000 description 1
- 239000000612 proton pump inhibitor Substances 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 101150108007 prs gene Proteins 0.000 description 1
- 101150086435 prs1 gene Proteins 0.000 description 1
- 101150070305 prsA gene Proteins 0.000 description 1
- YREYEVIYCVEVJK-UHFFFAOYSA-N rabeprazole Chemical compound COCCCOC1=CC=NC(CS(=O)C=2NC3=CC=CC=C3N=2)=C1C YREYEVIYCVEVJK-UHFFFAOYSA-N 0.000 description 1
- 235000008001 rakum palm Nutrition 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000001953 recrystallisation Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 102220029329 rs398123484 Human genes 0.000 description 1
- 102220090324 rs876657851 Human genes 0.000 description 1
- 101150025220 sacB gene Proteins 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 150000003335 secondary amines Chemical class 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 239000004328 sodium tetraborate Substances 0.000 description 1
- 235000010339 sodium tetraborate Nutrition 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000012289 standard assay Methods 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 125000000446 sulfanediyl group Chemical group *S* 0.000 description 1
- 125000000475 sulfinyl group Chemical group [*:2]S([*:1])=O 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 229920002994 synthetic fiber Polymers 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 229950008375 tenatoprazole Drugs 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 150000003512 tertiary amines Chemical class 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 1
- 125000004192 tetrahydrofuran-2-yl group Chemical group [H]C1([H])OC([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000003718 tetrahydrofuranyl group Chemical group 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- AUALKMYBYGCYNY-UHFFFAOYSA-E triazanium;2-hydroxypropane-1,2,3-tricarboxylate;iron(3+) Chemical compound [NH4+].[NH4+].[NH4+].[Fe+3].[Fe+3].[Fe+3].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O AUALKMYBYGCYNY-UHFFFAOYSA-E 0.000 description 1
- 125000000876 trifluoromethoxy group Chemical group FC(F)(F)O* 0.000 description 1
- 125000004385 trihaloalkyl group Chemical group 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 231100000397 ulcer Toxicity 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- RZLVQBNCHSJZPX-UHFFFAOYSA-L zinc sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Zn+2].[O-]S([O-])(=O)=O RZLVQBNCHSJZPX-UHFFFAOYSA-L 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0073—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen 1.14.13
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0083—Miscellaneous (1.14.99)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/16—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing two or more hetero rings
- C12P17/165—Heterorings having nitrogen atoms as the only ring heteroatoms
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
- C12P17/182—Heterocyclic compounds containing nitrogen atoms as the only ring heteroatoms in the condensed system
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/13—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen (1.14.13)
- C12Y114/13022—Cyclohexanone monooxygenase (1.14.13.22)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/99—Miscellaneous (1.14.99)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P20/00—Technologies relating to chemical industry
- Y02P20/50—Improvements relating to the production of bulk chemicals
- Y02P20/52—Improvements relating to the production of bulk chemicals using catalysts, e.g. selective catalysts
Definitions
- the present disclosure relates to biocatalysts and methods of using the biocatalysts.
- Esomeprazole is a proton pump inhibitor prescribed for the treatment of dyspepsia, peptic ulcer disease (PUD), gastroesophageal reflux disease (GORD/GERD) and Zollinger-Ellison syndrome.
- Esomeprazole is the S-enantiomer of omeprazole, which is a racemic mixture of the S and R isomers.
- Esomeprazole (also referred to herein as "(3 ⁇ 4 ) -5-methoxy-2-((4-methoxy-3,5- dimethylpyridin-2-yl)methylsulfinyl)- lH-benzo[d]imidazole" or "compound (2b)" has the following structure:
- Esomeprazole and its corresponding R-isomer is a prodrug that is converted to the active form in acidic environments. It is activated by a proton catalyzed process to form a sulphenamide, which interacts with the sulfhydryl groups of cysteine residues in the extracellular domain of H + K + -ATPases, thereby inhibiting its activity.
- the efficacy of the S-enantiomer is indicated as being greater than the racemic omeprazole.
- Esomeprazole is typically synthesized by chemical asymmetric oxidation of sulfides to sulfoxides, i.e., a Kagan-Sharpless type oxidation, as described in Cotton et al., 2000, Tetrahedron: Asymmetry 1 1 :3819. The process results in esomeprazole in about 94% enantiomeric excess. The enantiopurity of esomeprazole preparations can be increased substantially by preparing a magnesium salt followed by crystallization. Different salts and hydrates of esomeprazole have also been described. For example, WO 00/44744 discloses the potassium salt of esomeprazole.
- 6,162,816 discloses crystalline form A and less crystalline form B of neutral esomeprazole, prepared by a recrystallization from ethyl acetate, methylene chloride or toluene.
- U.S. Pat. No. 6,369,085 discloses esomeprazole magnesium trihydrate prepared from the corresponding potassium salt, precipitated with acetone, and treated with water.
- the present disclosure provides polypeptides, polynucleotides encoding the polypeptides and methods of using the polypeptides for the biocatalytic sulfoxidation of prazole compounds, in particular the conversion of the sulfide substrate of 5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2- yl)methylthio)-lH-benzo[d]imidazole (also referred to herein as "compound (1)”) to the (R)- and (S)- omeprazole product of 5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH- benzo[d]imidazole (also referred to herein as "compound (2)").
- CHMO cyclohexanone monooxygenase
- the non- naturally occurring monooxygenase polypeptides of the disclosure can carry out the reaction stereoselectively to the form the S-isomer, (2 ⁇ -5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2- yl)methylsulfinyl)-lH-benzo[d]imidazole (also referred to herein as "compound (2b)”) in
- monooxygenase polypeptides can produce compound (2b) as a substantially enantiomerically pure preparation.
- the polypeptide is capable of converting compound (1) to compound (2a) in enantiomeric excess.
- the R-enantioselective monooxygenase polypeptide comprises an amino acid sequence that is at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence selected from the group consisting of SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, and 208.
- the reference sequence is SEQ ID NO: 6, 166, 170, 174, 190, 192, 196, 200, 204, or 206.
- the polypeptide is capable of converting compound (1) to compound (2b) in enantiomeric excess.
- the S-enantioselective monooxygenase polypeptide comprises an amino acid sequence that is at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence selected from the group consisting of SEQ ID NO: 8, 10, 22, 52, 76, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 210, 212, 214, 216, 218, 2
- the present disclosure provides a monooxygenase polypeptide capable of converting compound (1) to compound (2) at a rate that is improved over the naturally occurring monooxygenase of SEQ ID NO:2.
- the non-naturally occurring monooxygenase polypeptide capable of converting compound (1) to compound (2) at a rate that is improved over the naturally occurring monooxygenase of SEQ ID NO:2.
- the non-naturally occurring monooxygenase polypeptide capable of converting compound (1) to compound (2) at a rate that is improved over the naturally occurring monooxygenase of SEQ ID NO:2.
- the polypeptide is capable of converting compound (1) to compound (2) at a rate that is greater than 1.5 fold the rate of SEQ ID NO:2.
- the polypeptide is capable of converting compound (1) to compound (2) at a rate that is greater than 1.5 fold the polypeptide of SEQ ID NO:2 and comprises an amino acid sequence that is at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence selected from SEQ ID NO: 4, 6, 8, 10, 22, 52, 76, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154
- the polypeptide capable of converting compound (1) to compound (2) with improved enantioselectivity and/or enzymatic activity having been derived from the naturally occurring cyclohexanone monooxygenase of Acinetobacter NCIMB9877 , comprises an amino acid sequence that has one or more residue differences as compared to the sequence of SEQ ID NO:2.
- the residue differences can occur at residue positions identified as being associated with desirable changes in enzyme activity, enantioselectivity, sulfone -byproduct formation, thermostability, solvent stability, expression, or various combinations thereof.
- the polypeptides of the disclosure can have one or more residue differences as compared to SEQ ID NO:2 at the following residue positions: X3; X14; X34; X43; X71 ; XI 1 1 ; X141 ; X149, X174; X209; X240; X246; X248; X288; X307; X326; X383; X386; X388; X390; X400; X415; X426; X432; X433; X435; X438; X448; X449; X481 ; X488; X489; X490; X499; X505; X516; X526; X537; and X540.
- residue differences as compared to SEQ ID NO:2 at the following residue positions: X3; X14; X34;
- the monooxygenase amino acid sequence has at least two or more, at least three or more, or at least four or more residue differences at the residue positions above as compared to the reference sequence of SEQ ID NO:2. Amino acid residues that can be present at these positions are provided in the detailed descriptions herein.
- the polypeptide comprises an amino acid sequence having one or more residue differences as compared to SEQ ID NO:2 at residue positions associated with changes in enantioselectivity, which positions are selected from the following: X246; X248; X326; X386; X432; X433; X435; X438; and X448.
- the monooxygenase has at least two or more, at least three or more, or at least four or more residue differences at these residues positions as compared to the reference sequence of SEQ ID NO:2.
- the non-naturally occurring monooxygenase polypeptide comprises an amino acid sequence having one or more residue differences as compared to SEQ ID NO:2 at residue positions associated with decreases in sulfone -byproduct formation, where the sulfone -byproduct has the following structure of co
- Residue positions associated with a decrease in the amount of sulfone-byproduct of compound (3) formed in the biocatalytic process can be selected from the following: X246, X248, X277, and X438.
- the polypeptide comprises an amino acid sequence having one or more residue differences as compared to SEQ ID NO:2 at residue positions associated with increases in thermostability and/or solvent stability, which positions are selected from the following: X43, X71, XI 1 1, X149, X174, X307; X341, X368, X388, X390, X400, X449, X481, and X488.
- the polypeptide comprises an amino acid sequence having residue differences as compared to SEQ ID NO:2 at residue positions associated with increases in protein expression, which position includes X3.
- the polypeptide capable of converting compound (1) to compound (2a) in enantiomeric excess comprises an amino acid sequence which comprises at least two or more of the following features: residue at position corresponding to X432 is an aliphatic amino acid residue; residue at position corresponding to X433 is a non-polar amino acid residue; residue at position corresponding to X435 is a hydroxyl-containing amino acid residue; and residue at position corresponding to X490 is a basic amino acid residue.
- the R- enantioselective monooxygenases have at least three of the above features, or at least all of the above features.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence in which at least residue at position corresponding to X326 is a cysteine (C).
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence in which at least residue at position corresponding to X386 is a hydroxyl-containing amino acid residue, particularly a S.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is a cysteine (C) and residue at position corresponding to X386 is a hydroxyl-containing amino acid residue, particularly a S.
- the S-enantioselective monooxygenase polypeptides comprises an amino acid sequence having, in addition to residue differences associated with S-enantioselectivity above, at least one or more residue differences at residue positions associated with increased enzyme activity for the pyrmetazole substrate and/or increased S-enantioselectivity, particularly at residue positions X432, X433, X435, X438, X448, and X490, more particularly at least one or more residue differences at residue positions X432, X433, X435, and X490.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is a cysteine (C); residue at position corresponding to X386 is a hydroxyl-containing amino acid residue; and residue at position corresponding to X432 is an aliphatic amino acid residue.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is a cysteine (C); residue at position
- corresponding to X386 is a hydroxyl-containing amino acid residue
- residue at position corresponding to X432 is an aliphatic amino acid residue
- residue at position corresponding to X433 is an aliphatic amino acid residue
- residue at position corresponding to X435 is a hydroxyl-containing amino acid residue
- residue at position corresponding to X490 is a basic amino acid residue.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is C; residue at position corresponding to X386 is S; residue at position corresponding to X432 is A, or L; residue at position corresponding to X433 is A, L, or V; residue at position corresponding to X435 is S; and residue at position corresponding to X490 is R.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X34 is K; residue at position corresponding to X209 is P; residue at position corresponding to X240 is F or K; residue at position corresponding to X288 is I; residue at position corresponding to X326 is C; residue at position corresponding to X386 is S;
- residue at position corresponding to X415 is A; residue at position corresponding to X432 is L;
- residue at position corresponding to X433 is A; residue at position corresponding to X435 is S;
- residue at position corresponding to X438 is I; residue at position corresponding to X448 is V; residue at position corresponding to X490 is R; residue at position corresponding to X516 is V; and residue at position corresponding to X537 is T.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to X141 is V; residue at position corresponding to X209 is P; residue at position corresponding to X240 is F or K; residue at position corresponding to X288 is I; residue at position corresponding to X326 is C; residue at position corresponding to X386 is S;
- residue at position corresponding to X415 is A; residue at position corresponding to X426 is F;
- residue at position corresponding to X432 is L; residue at position corresponding to X433 is A;
- residue at position corresponding to X435 is S; residue at position corresponding to X438 is I; residue at position corresponding to X448 is V; residue at position corresponding to X489 is C; residue at position corresponding to X490 is R; residue at position corresponding to X516 is V; and residue at position corresponding to X537 is T.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to XI 1 1 is T; residue at position corresponding to XI 41 is V; residue at position corresponding to X209 is P; residue at position corresponding to X240 is F or K; residue at position corresponding to X246 is Y; residue at position corresponding to X288 is I; residue at position corresponding to X307 is C or R; residue at position corresponding to X326 is C; residue at position corresponding to X386 is S; residue at position corresponding to X388 is K; residue at position corresponding to X415 is A; residue at position corresponding to X426 is F; residue at position corresponding to X432 is L; residue at position corresponding to X433 is A;
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to X43 is G; residue at position corresponding to X71 is M; residue at position corresponding to XI 1 1 is T; residue at position corresponding to XI 41 is V;
- residue at position corresponding to XI 49 is W; residue at position corresponding to X209 is P; residue at position corresponding to X240 is F or K; residue at position corresponding to X246 is Y; residue at position corresponding to X248 is I or V; residue at position corresponding to X277 is M; residue at position corresponding to X288 is I; residue at position corresponding to X307 is C or R; residue at position corresponding to X326 is C; residue at position corresponding to X386 is S;
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to X43 is G; residue at position corresponding to X71 is M; residue at position corresponding to XI 1 1 is T; residue at position corresponding to XI 41 is V;
- residue at position corresponding to XI 49 is W; residue at position corresponding to XI 74 is I;
- residue at position corresponding to X209 is P; residue at position corresponding to X240 is F or K; residue at position corresponding to X246 is Y; residue at position corresponding to X248 is I or V; residue at position corresponding to X277 is M; residue at position corresponding to X288 is I;
- residue at position corresponding to X307 is C or R; residue at position corresponding to X326 is C; residue at position corresponding to X341 is E; residue at position corresponding to X383 is G;
- residue at position corresponding to X386 is S; residue at position corresponding to X388 is K;
- residue at position corresponding to X390 is I; residue at position corresponding to X400 is I; residue at position corresponding to X415 is A; residue at position corresponding to X426 is F; residue at position corresponding to X432 is L; residue at position corresponding to X433 is A; residue at position corresponding to X435 is S; residue at position corresponding to X438 is I; residue at position corresponding to X448 is V; residue at position corresponding to X449 is F; residue at position corresponding to X481 is K; residue at position corresponding to X488 is K; residue at position corresponding to X489 is C; residue at position corresponding to X490 is R; residue at position corresponding to X499 is G, L, or R; residue at position corresponding to X505 is L; residue at position corresponding to X516 is V; residue at position corresponding to X526 is V; residue at position corresponding to X537 is T; and residue at
- polynucleotides encoding the monooxygenase polypeptides, expression vectors comprising the polynucleotides, and host cells capable of expressing the polypeptides.
- Ar 1 is an optionally substituted aryl or heteroaryl ring
- R is H, a lower alkyl, a heteroalkyl, or forms a 5 to 8 membered cycloalkyl, heteroalkyl, aryl or heteroaryl fused ring with a ring carbon of Ar 1 ;
- Ar 2 is an optionally substituted cycloalkyl, heterocycloalkyl, aryl, or heteroaryl ring fused to the imidazole ring.
- the process comprises contacting the compound of formula (I) with an engineered monooxygenase polypeptide described herein in presence of a cofactor under suitable reaction conditions for converting the substrate to the product compound of formula (II).
- suitable reaction conditions typically include a source of molecular oxygen 0 2 , and the cofactor is NADPH or NADH.
- processes of using the polypeptides of the disclosure can be used in the preparation of omeprazole analog compounds of structural formula (II) in enantiomeric excess, wherein the compounds of structural formula (II) are selected from: (R) or (S)-lansoprazole, (R) or (S)- tenatoprazole, (R) or (S)-rabeprazole, (R) or (S)-pantoprazole, (R) or (S)-ilaprazole, (R) or (S)- leminoprazole, (R) or (S)-saviprazole, and (R) or (S)-TY- 1 1345.
- the compounds of structural formula (II) are selected from: (R) or (S)-lansoprazole, (R) or (S)- tenatoprazole, (R) or (S)-rabeprazole, (R) or (S)-pantoprazole, (R) or (S)-
- Ar 1 is an optionally substituted phenyl or pyridyl.
- Ar 2 is an optionally substituted thienyl, phenyl or pyridyl.
- the compound of formula (I) is compound (1) and the product compound of formula (II) is compound (2).
- Exemplary polypeptides useful in the above process can comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78. 80.
- the process is used for the conversion of compound (1) to compound (2a) in enantiomeric excess.
- the process comprises contacting compound (1) with certain monooxygenase polypeptides described herein in presence of a cofactor under suitable reaction conditions for forming compound (2a) in enantiomeric excess.
- Exemplary polypeptides useful in the process for preparing compound (2a) in enantiomeric excess can comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, and 208.
- the process is used for the conversion of compound (1) to compound (2b) in enantiomeric excess.
- the process comprises contacting compound (1) with certain monooxygenase polypeptides described herein in presence of a cofactor under suitable reaction conditions for forming compound (2b) in enantiomeric excess.
- Exemplary polypeptides useful in the process for preparing compound (2b) in enantiomeric excess can comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78. 80.
- the process can comprise contacting compound (1) with a polypeptide of the disclosure in presence of a cofactor under suitable reaction conditions to form compound (2b) in at least 90% enantiomeric excess.
- Exemplary polypeptides for preparing compound (2b) in at least 90% enantiomeric excess can comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 242, 244, 248, 250, 254, 256, 258, 262, and 264.
- the process comprises contacting compound (1) with a polypeptide of the disclosure in presence of a cofactor under suitable reaction conditions to form compound (2b) in at least 99% enantiomeric excess.
- exemplary polypeptides for preparing compound (2b) in at least 99% enantiomeric excess can comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 16, 18, 20, 22, 24, 26, 28, 30, 32, 42, 44, 86, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, and 158.
- the reaction condition in the process comprises a temperature of 10°C to 50°C, and in particular 25°C to 40°C.
- the reaction condition in the process comprises a pH of about 8.5 to a pH of about 10, in particular a pH of about 8.5 to about 9.0.
- the reaction condition in the process comprises a partial pressure of O2 at greater than atmospheric pressure. Additionally, dissolved molecular oxygen in the process can be increased by sparging the reaction solution with ( ⁇ -containing gas or by use of bubble-free aeration with 02-containing gas.
- the reaction condition comprises a co-solvent, such as for example, MeOH, EtOH, isopropanol (IP A), acetone, toluene, MeCN, methyl tert-butyl ether (MTBE), N-methyl-2-pyrrolidone (NMP), dimethylacetamide (DMAc), dimethylformamide (DMF), propylene glycol, polyethylene glycol (PEG), tetramethylurea, N-ethylpyrollidinone, tetraglyme, 1,3- Dimethyl-3,4,5,6-tetrahydro-2(lH)-pyrimidinone (DMPU), DMIU, hexamethylphosphoramide ( ⁇ ) and dimethylsulfoxide (DMSO).
- a co-solvent such as for example, MeOH, EtOH, isopropanol (IP A), acetone, toluene, MeCN, methyl tert-butyl ether (MTBE), N-methyl-2-pyrrolidon
- the process further comprises converting NADP + or NAD+ formed from the NADPH or NADH, respectively, with a cofactor regenerating system.
- the cofactor regenerating system can use an appropriate dehydrogenase, such a glucose dehydrogenase, glucose- phosphate dehydrogenase, formate dehydrogenase, phosphite dehydrogenase, and
- ketoreductase/alcohol dehydrogenase and corresponding substrate for example, glucose, glucoses- phosphate, formate, phosphite, or alcohol, respectively.
- the co-factor regenerating system is a ketoreductase and a secondary alcohol, particularly isopropanol (IPA).
- the engineered monooxygenase polypeptides can be provided in the form of kits or arrays, particularly for use in the screening for activity on other prazole compounds of interest.
- the present disclosure provides efficient biocatalysts capable of mediating the conversion of the substrate, 5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylthio)-lH-benzo[d]imidazole ("compound (1)") to the product, 5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2- yl)methylsulfinyl)-lH-benzo[d]imidazole (“compound (2)").
- the biocatalysts are highly stereoselective, capable of mediating the conversion of compound (1) to either the (S)- isomer of compound (2b) or the (Rj-isomer of compound (2a) in enantiomeric excess.
- the biocatalysts described herein have been designed by changing the amino acid sequence of a naturally occurring cyclohexanone monooxygenase (CHMO) to form polypeptides with the desired enzymatic properties, e.g., enzyme activity, enantioselectivity, byproduct formation, thermostability, solvent stability, and expression. These polypeptides can also be applied to the sulfoxidation of other prazole compounds structurally similar to esomeprazole.
- CHMO cyclohexanone monooxygenase
- Protein "Protein”, “polypeptide,” and “peptide” are used interchangeably herein to denote a polymer of at least two amino acids covalently linked by an amide bond, regardless of length or post- translational modification (e.g., glycosylation, phosphorylation, lipidation, myristilation, ubiquitination, etc.). Included within this definition are D- and L-amino acids, and mixtures of D- and L-amino acids.
- Coding sequence refers to that portion of a nucleic acid (e.g., a gene) that encodes an amino acid sequence of a protein.
- Naturally occurring or wild-type refers to the form found in nature.
- a naturally occurring or wild-type polypeptide or polynucleotide sequence is a sequence present in an organism that can be isolated from a source in nature and which has not been intentionally modified by human manipulation.
- Recombinant or “engineered” or “non-naturally occurring” when used with reference to, e.g., a cell, nucleic acid, or polypeptide, refers to a material, or a material corresponding to the natural or native form of the material, that has been modified in a manner that would not otherwise exist in nature, or is identical thereto but produced or derived from synthetic materials and/or by manipulation using recombinant techniques.
- Non-limiting examples include, among others, recombinant cells expressing genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise expressed at a different level.
- Percentage of sequence identity refers to comparisons between polynucleotide sequences or polypeptide sequences, and are determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence for optimal alignment of the two sequences.
- the percentage is calculated by determining the number of positions at which either the identical nucleic acid base or amino acid residue occurs in both sequences or a nucleic acid base or amino acid residue is aligned with a gap to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity. Determination of optimal alignment and percent sequence identity is performed using the BLAST and BLAST 2.0 algorithms (see e.g., Altschul et al., 1990, J. Mol. Biol. 215: 403-410 and Altschul et al., 1977, Nucleic Acids Res. 3389-3402). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information website.
- the BLAST analyses involve first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as, the neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased.
- HSPs high scoring sequence pairs
- Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always ⁇ 0).
- M forward score for a pair of matching residues; always >0
- N penalty score for mismatching residues; always ⁇ 0.
- a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached.
- the BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment.
- W wordlength
- E expectation
- the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, 1989, Proc Natl Acad Sci USA 89: 10915).
- Reference sequence refers to a defined sequence to which another sequence is compared.
- a reference sequence may be a subset of a larger sequence, for example, a segment of a full-length gene or polypeptide sequence.
- a reference sequence is at least 20 nucleotide or amino acid residues in length, at least 25 residues in length, at least 50 residues in length, or the full length of the nucleic acid or polypeptide.
- two polynucleotides or polypeptides may each (1) comprise a sequence (i.e., a portion of the complete sequence) that is similar between the two sequences, and (2) may further comprise a sequence that is divergent between the two sequences, sequence comparisons between two (or more) polynucleotides or polypeptide are typically performed by comparing sequences of the two polynucleotides over a comparison window to identify and compare local regions of sequence similarity.
- Comparison sequence is not intended to be limited to wild-type sequences, and can include engineered or altered sequences.
- a “reference sequence” can be a previously engineered or altered amino acid sequence.
- Comparison window refers to a conceptual segment of at least about 20 contiguous nucleotide positions or amino acids residues wherein a sequence may be compared to a reference sequence of at least 20 contiguous nucleotides or amino acids and wherein the portion of the sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
- the comparison window can be longer than 20 contiguous residues, and includes, optionally 30, 40, 50, 100, or longer windows.
- “Corresponding to”, “reference to” or “relative to” when used in the context of the numbering of a given amino acid or polynucleotide sequence refers to the numbering of the residues of a specified reference sequence when the given amino acid or polynucleotide sequence is compared to the reference sequence.
- the residue number or residue position of a given polymer is designated with respect to the reference sequence rather than by the actual numerical position of the residue within the given amino acid or polynucleotide sequence.
- a given amino acid sequence such as that of an engineered monooxygenase, can be aligned to a reference sequence by introducing gaps to optimize residue matches between the two sequences. In these cases, although the gaps are present, the numbering of the residue in the given amino acid or polynucleotide sequence is made with respect to the reference sequence to which it has been aligned.
- Steposelectivity refers to the preferential formation in a chemical or enzymatic reaction of one stereoisomer over another. Stereoselectivity can be partial, where the formation of one stereoisomer is favored over the other, or it may be complete where only one stereoisomer is formed. When the stereoisomers are enantiomers, the stereoselectivity is referred to as enantioselectivity, the fraction (typically reported as a percentage) of one enantiomer in the sum of both.
- Highly stereoselective refers to a chemical or enzymatic reaction that is capable of converting a substrate (e.g., 5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylthio)-lH- benzo[d]imidazole) to its corresponding stereoisomeric product (e.g., (S)-5-methoxy-2-((4-methoxy- 3,5-dimethylpyridin-2-yl)methylsulfinyl)- lH-benzo[d]imidazole, or (R)-5-methoxy-2-((4-methoxy- 3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH-benzo[d]imidazole) with at least about 85%
- “Increased enzymatic activity” or “increased activity” refers to an improved property of an engineered enzyme, which can be represented by an increase in specific activity (e.g., product produced/time/weight protein) or an increase in percent conversion of the substrate to the product (e.g., percent conversion of starting amount of substrate to product in a specified time period using a specified amount of monooxygenase) as compared to a reference enzyme. Exemplary methods to determine enzyme activity are provided in the Examples. Any property relating to enzyme activity may be affected, including the classical enzyme properties of K m , V max or k cat , changes of which can lead to increased enzymatic activity.
- the monooxygenase activity can be measured by any one of standard assays used for measuring monooxygenases, such as change in substrate or product concentration, or change in concentration of the cofactor (in absence of a cofactor regenerating system). Comparisons of enzyme activities are made using a defined preparation of enzyme, a defined assay under a set condition, and one or more defined substrates, as further described in detail herein. Generally, when enzymes in cell lysates are compared, the numbers of cells and the amount of protein assayed are determined as well as use of identical expression systems and identical host cells to minimize variations in amount of enzyme produced by the host cells and present in the lysates.
- Conversion refers to the enzymatic transformation of a substrate to the corresponding product.
- Percent conversion refers to the percent of the substrate that is converted to the product within a period of time under specified conditions.
- the "enzymatic activity” or “activity” of a monooxygenase polypeptide can be expressed as “percent conversion” of the substrate to the product.
- thermo stable refers to a polypeptide that is resistant to inactivation when exposed to a set of temperature conditions (e.g., 40-80°C) for a period of time (e.g., 0.5-24 hrs) compared to the untreated enzyme, thus retaining a certain level of residual activity (e.g., more than 60% to 80% for example) after exposure to elevated temperatures.
- a set of temperature conditions e.g., 40-80°C
- a period of time e.g., 0.5-24 hrs
- solvent stable refers to a polypeptide that maintains similar activity (more than e.g., 60% to 80%) after exposure to varying concentrations (e.g., 5-99%) of solvent , (e.g., isopropyl alcohol, dimethylsulfoxide, tetrahydrofuran, 2-methyltetrahydrofuran, acetone, toluene, butylacetate, methyl tert-butylether, acetonitrile, etc.) for a period of time (e.g., 0.5-24 hrs) compared to the untreated enzyme.
- solvent e.g., isopropyl alcohol, dimethylsulfoxide, tetrahydrofuran, 2-methyltetrahydrofuran, acetone, toluene, butylacetate, methyl tert-butylether, acetonitrile, etc.
- Hydrophilic Amino Acid or Residue refers to an amino acid or residue having a side chain exhibiting a hydrophobicity of less than zero according to the normalized consensus hydrophobicity scale of Eisenberg et ah, 1984, J. Mol. Biol. 179: 125-142.
- Genetically encoded hydrophilic amino acids include Thr (T), Ser (S), His (H), Glu (E), Asn (N), Gin (Q), Asp (D), Lys (K) and Arg (R).
- Acidic Amino Acid or Residue refers to a hydrophilic amino acid or residue having a side chain exhibiting a pK value of less than about 6 when the amino acid is included in a peptide or polypeptide. Acidic amino acids typically have negatively charged side chains at physiological pH due to loss of a hydrogen ion. Genetically encoded acidic amino acids include Glu (E) and Asp (D).
- Base Amino Acid or Residue refers to a hydrophilic amino acid or residue having a side chain exhibiting a pK value of greater than about 6 when the amino acid is included in a peptide or polypeptide.
- Basic amino acids typically have positively charged side chains at physiological pH due to association with hydronium ion.
- Genetically encoded basic amino acids include Arg (R) and Lys (K).
- Poly Amino Acid or Residue refers to a hydrophilic amino acid or residue having a side chain that is uncharged at physiological pH, but which has at least one bond in which the pair of electrons shared in common by two atoms is held more closely by one of the atoms.
- Genetically encoded polar amino acids include Asn (N), Gin (Q), Ser (S) and Thr (T).
- Hydrophobic Amino Acid or Residue refers to an amino acid or residue having a side chain exhibiting a hydrophobicity of greater than zero according to the normalized consensus
- Genetically encoded hydrophobic amino acids include Pro (P), He (I), Phe (F), Val (V), Leu (L), Trp (W), Met (M), Ala (A) and Tyr (Y).
- Aromatic Amino Acid or Residue refers to a hydrophilic or hydrophobic amino acid or residue having a side chain that includes at least one aromatic or heteroaromatic ring. Genetically encoded aromatic amino acids include Phe (F), Tyr (Y) and Trp (W). Although owing to the its heteroaromatic ring side chain His (H) is classified as an aromatic residue, it may also be classified as a basic residue owing to pKa of its heteroaromatic nitrogen atom.
- Non-polar Amino Acid or Residue refers to a hydrophobic amino acid or residue having a side chain that is uncharged at physiological pH and which has bonds in which the pair of electrons shared in common by two atoms is generally held equally by each of the two atoms (i.e., the side chain is not polar).
- Genetically encoded non-polar amino acids include Gly (G), Leu (L), Val (V), He (I), Met (M) and Ala (A).
- Aliphatic Amino Acid or Residue refers to a hydrophobic amino acid or residue having an aliphatic hydrocarbon side chain. Genetically encoded aliphatic amino acids include Ala (A), Val (V), Leu (L) and He (I).
- the amino acid Cys (C) is unique in that it can form disulfide bridges with other Cys (C) amino acids or other sulfanyl- or sulfhydryl-containing amino acids.
- the ability of Cys (and other amino acids with -SH containing side chains) to exist in a polypeptide in either the reduced free -SH or oxidized disulfide-bridged form affects whether it contributes net hydrophobic or hydrophilic character to the polypeptide. While Cys exhibits a hydrophobicity of 0.29 according to the normalized consensus scale of Eisenberg (Eisenberg et al., 1984, supra), it is to be understood that for purposes of the present disclosure, Cys is classified into its own unique group.
- the amino acid Pro (P) is conformationally constrained nature. Although it has hydrophobic properties, as used herein, Pro (P) or other similar residues is classified as a “conformationally constrained”.
- Hydroxyl-containing Amino Acid or Residue refers to an amino acid or residue containing a hydroxyl (-OH) moiety. Genetically-encoded hydroxyl-containing amino acids include Ser (S) and Thr (T). While L-Tyr (Y) contains a hydroxyl moiety, it is classified herein as an aromatic amino acid or residue.
- amino acid difference or “residue difference” refers to a change in the residue at a specified position of a polypeptide sequence when compared to a reference sequence.
- a residue difference at position X3, where the reference sequence has a glutamine refers to a change of the residue at position X3 to any residue other than glutamine.
- an enzyme can include one or more residue differences relative to a reference sequence, where multiple residue differences typically are indicated by a list of the specified positions where changes are made relative to the reference sequence.
- Constant amino acid substitutions refer to the interchangeability of residues having similar side chains, and thus typically involves substitution of the amino acid in the polypeptide with amino acids within the same or similar defined class of amino acids.
- an amino acid with an aliphatic side chain may be substituted with another aliphatic amino acid, e.g., alanine, valine, leucine, and isoleucine; an amino acid with hydroxyl side chain is substituted with another amino acid with a hydroxyl side chain, e.g., serine and threonine; an amino acids having aromatic side chains is substituted with another amino acid having an aromatic side chain, e.g., phenylalanine, tyrosine, tryptophan, and histidine; an amino acid with a basic side chain is substituted with another amino acid with a basis side chain, e.g., lysine and arginine; an amino acid with an acidic side chain is substituted with another amino acid
- Non-conservative substitution refers to substitution of an amino acid in the polypeptide with an amino acid with significantly differing side chain properties. Non-conservative substitutions may use amino acids between, rather than within, the defined groups and affects (a) the structure of the peptide backbone in the area of the substitution (e.g., proline for glycine) (b) the charge or hydrophobicity, or (c) the bulk of the side chain.
- an exemplary non-conservative substitution can be an acidic amino acid substituted with a basic or aliphatic amino acid; an aromatic amino acid substituted with a small amino acid; and a hydrophilic amino acid substituted with a hydrophobic amino acid.
- “Deletion” refers to modification of the polypeptide by removal of one or more amino acids from the reference polypeptide.
- Deletions can comprise removal of 1 or more amino acids, 2 or more amino acids, 5 or more amino acids, 10 or more amino acids, 15 or more amino acids, or 20 or more amino acids, up to 10% of the total number of amino acids, or up to 20% of the total number of amino acids making up the polypeptide while retaining enzymatic activity and/or retaining the improved properties of an engineered monooxygenase enzyme.
- Deletions can be directed to the internal portions and/or terminal portions of the polypeptide.
- the deletion can comprise a continuous segment or can be discontinuous.
- Insertions refers to modification of the polypeptide by addition of one or more amino acids to the reference polypeptide.
- the improved engineered monooxygenase enzymes comprise insertions of one or more amino acids to the naturally occurring cyclohexanone monooxygenase polypeptide as well as insertions of one or more amino acids to other improved monooxygenase polypeptides. Insertions can be in the internal portions of the polypeptide, or to the carboxy or amino terminus. Insertions as used herein include fusion proteins as is known in the art. The insertion can be a contiguous segment of amino acids or separated by one or more of the amino acids in the naturally occurring polypeptide.
- Fragment refers to a polypeptide that has an amino-terminal and/or carboxy- terminal deletion, but where the remaining amino acid sequence is identical to the corresponding positions in the sequence. Fragments can typically have about 80%, 90%, 95%, 98%, and 99% of the full-length monooxygenase polypeptide, for example the polypeptide of SEQ ID NO:2.
- isolated polypeptide refers to a polypeptide which is substantially separated from other contaminants that naturally accompany it, e.g., protein, lipids, and polynucleotides.
- the term embraces polypeptides which have been removed or purified from their naturally-occurring environment or expression system (e.g., host cell or in vitro synthesis). The improved
- monooxygenase enzymes may be present within a cell, present in the cellular medium, or prepared in various forms, such as lysates or isolated preparations.
- the improved monooxygenase enzyme can be an isolated polypeptide.
- substantially pure polypeptide refers to a composition in which the polypeptide species is the predominant species present (i.e., on a molar or weight basis it is more abundant than any other individual macromolecular species in the composition), and is generally a substantially purified composition when the object species comprises at least about 50 percent of the macromolecular species present by mole or % weight.
- a substantially pure monooxygenase composition will comprise about 60 % or more, about 70% or more, about 80% or more, about 90% or more, about 95% or more, and about 98% or more of all macromolecular species by mole or % weight present in the composition.
- Solvent species, small molecules ( ⁇ 500 Daltons), and elemental ion species are not considered macromolecular species.
- the isolated improved monooxygenase polypeptide is a substantially pure polypeptide composition.
- Heterologous polynucleotide refers to any polynucleotide that is introduced into a host cell by laboratory techniques, and includes polynucleotides that are removed from a host cell, subjected to laboratory manipulation, and then reintroduced into a host cell.
- Codon optimized refers to changes in the codons of the polynucleotide encoding a protein to those preferentially used in a particular organism such that the encoded protein is efficiently expressed in the organism of interest.
- the polynucleotides encoding the monooxygenase enzymes may be codon optimized for optimal production from the host organism selected for expression.
- Control sequence is defined herein to include all components, which are necessary or advantageous for the expression of a polynucleotide and/or polypeptide of the present disclosure.
- Each control sequence may be native or foreign to the polynucleotide of interest.
- control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator.
- operably linked is defined herein as a configuration in which a control sequence is appropriately placed (i.e., in a functional relationship) at a position relative to a polynucleotide of interest such that the control sequence directs or regulates the expression of the polynucleotide and/or polypeptide of interest.
- Alkyl by itself or as part of another substituent refers to a saturated or unsaturated branched, straight-chain or cyclic monovalent hydrocarbon radical having the stated number of carbon atoms (i.e., Ci-C 6 means one to six carbon atoms) that is derived by the removal of one hydrogen atom from a single carbon atom of a parent alkane, alkene or alkyne.
- alkyl is specifically intended to include groups having any degree or level of saturation, i.e., groups having exclusively single carbon- carbon bonds, groups having one or more double carbon-carbon bonds, groups having one or more triple carbon-carbon bonds and groups having mixtures of single, double and triple carbon-carbon bonds.
- alkanyl alkenyl
- alkynyl alkynyl
- lower alkyl refers to alkyl groups composed of from 1 to 6 carbon atoms (Ci-C 6 ) , preferably 1-4 carbon atoms (C1-C4) .
- Alkanyl by itself or as part of another substituent refers to a saturated branched, straight-chain or cyclic alkyl derived by the removal of one hydrogen atom from a single carbon atom of a parent alkane.
- Alkanyl groups include, but are not limited to, methanyl; ethanyl; propanyls such as propan-l-yl, propan-2-yl (isopropyl), cyclopropan- 1 -yl, etc.; butanyls such as butan-l-yl, butan-2-yl (yec-butyl), 2-methyl-propan- 1 -yl (isobutyl), 2-methyl-propan-2-yl (?-butyl),
- the alkanyl groups are (Ci-Ce) alkyl.
- alkenyl by itself or as part of another substituent refers to an unsaturated branched, straight-chain or cyclic alkyl having at least one carbon-carbon double bond derived by the removal of one hydrogen atom from a single carbon atom of a parent alkene.
- the group may be in either the cis or trans conformation about the double bond(s).
- the alkenyl group is (C 2 -Ce) alkenyl.
- Alkynyl by itself or as part of another substituent refers to an unsaturated branched, straight-chain or cyclic alkyl having at least one carbon-carbon triple bond derived by the removal of one hydrogen atom from a single carbon atom of a parent alkyne.
- the alkynyl group is (C 2 -C 6 ) alkynyl.
- Alkoxy by itself or as part of another substituent refers to -OR a , where R a represents an alkyl or cycloalkyl group as defined herein. Typical alkoxy groups include, but are not limited to, methoxy, ethoxy, propoxy, butoxy, cyclohexyloxy, and the like.
- amino by itself or as part of another substituent refers to the group -NH 2 .
- Substituted amino refers to the group -NHR b , NR b R b , and NR b R b R b where each R b is independently selected from substituted or unsubstituted alkyl, cycloalkyl, cycloheteroalkyl, alkoxy, aryl, heteroaryl, heteroarylalkyl, acyl, alkoxycarbonyl, sulfanyl, sulfinyl, sulfonyl, and the like.
- Typical amino groups include, but are limited to, dimethylamino, diethylamino, trimethylamino, triethylamino, methylysulfonylamino, furanyl-oxy-sulfamino, and the like.
- Heteroalkyl Heteroalkanyl
- Heteroalkenyl Heteroalkynyl
- Heteroalkynyl by themselves or as part of another substituent refer to alkyl, alkanyl, alkenyl, and alkynyl groups, respectively, in which one or more of the carbon atoms are each independently replaced with the same or different heteroatoms or heteroatom groups.
- Heteroatoms and/or heteroatom groups which can replace the carbon atoms include, but are not limited to, -0-, -S-, -S-0-, -NR C , -PH-, -S(O)-, -S(0) 2 -, -S(O) NR C -, -S(0) 2 NR c -, and the like, including combinations thereof, where each R c is independently hydrogen or (Ci-C 6 ) alkyl.
- Cycloalkyl and Heterocycloalkyl by themselves or as part of another substituent refer to cyclic versions of “alkyl” and “heteroalkyl” groups, respectively.
- Typical cycloalkyl groups include, but are not limited to, cyclopropyl; cyclobutyls such as cyclobutanyl and cyclobutenyl; cyclopentyls such as cyclopentanyl and cyclopentenyl; cyclohexyls such as cyclohexanyl and cyclohexenyl; and the like.
- Typical heterocycloalkyl groups include, but are not limited to, tetrahydrofuranyl (e.g., tetrahydrofuran-2-yl, tetrahydrofuran-3-yl, etc.), piperidinyl (e.g., piperidin- 1 -yl, piperidin-2-yl, etc.), morpholinyl (e.g., morpholin-3-yl, morpholin-4-yl, etc.), piperazinyl (e.g., piperazin- 1 -yl, piperazin-2-yl, etc.), and the like.
- tetrahydrofuranyl e.g., tetrahydrofuran-2-yl, tetrahydrofuran-3-yl, etc.
- piperidinyl e.g., piperidin- 1 -yl, piperidin-2-yl, etc.
- morpholinyl e.g., morpholin-3
- Aryl by itself or as part of another substituent refers to a monovalent aromatic hydrocarbon group having the stated number of carbon atoms (i.e., C 5 -Ci 5 means from 5 to 15 carbon atoms) derived by the removal of one hydrogen atom from a single carbon atom of a parent aromatic ring system.
- the aryl group is (C5-C 10 ) aryl, with (C 5 -Cg) being even more preferred.
- the aryls are cyclopentadienyl, phenyl and naphthyl.
- Heteroaryl by itself or as part of another substituent refers to a monovalent heteroaromatic group having the stated number of ring atoms (e.g., "5-14 membered” means from 5 to 14 ring atoms) derived by the removal of one hydrogen atom from a single atom of a parent heteroaromatic ring system.
- the heteroaryl group is a 5-10 membered heteroaryl.
- the heteroaryl group is a 5-8 membered heteroaryl.
- Halogen or "halo" by themselves or as part of another substituent, unless otherwise stated, refer to fluoro, chloro, bromo and iodo.
- Haloalkyl by itself or as part of another substituent refers to an alkyl group in which one or more of the hydrogen atoms is replaced with a halogen.
- haloalkyl is meant to include monohaloalkyls, dihaloalkyls, trihaloalkyls, etc. up to perhaloalkyls.
- (C 1 -C 2 ) haloalkyl includes 1 -fluoromethyl, difluoromethyl, trifluoromethyl, 1- fluoroethyl, 1, 1-difluoroethyl, 1,2-difluoroethyl, 1 , 1 , 1 -trifluoroethyl, perfluoroethyl, etc.
- Optionally substituted as used herein means one or more hydrogen atoms (e.g., 1, 2, 3, 4, 5, or 6 hydrogen atoms) of the group can each be replaced with a substituent atom or group. Each substituent can be the same or different.
- substituents include, but are not limited to, alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, cycloheteroalkyl, heteroaryl, OR d (e.g., hydroxyl, alkoxy (e.g., methoxy, ethoxy, and propoxy), aryloxy, heteroaryloxy, aralkyloxy, ether, ester, carbamate, etc.), hydroxyalkyl, alkoxycarbonyl, alkoxyalkoxy, perhaloalkyl, perfluoroalkyl (e.g., CF 3 , CF 2 , CF 3 ), perfluoroalkoxy (e.g., OCF 3 , OCF 2 CF 3 ), alkoxyalkyl, SR d (e.g., thiol, alkylthio, arylthio, heteroarylthio, aralkylthio, etc
- the number of optional substituents is 1 to 5, wherein the substituents are groups as defined herein.
- 1 -3 optional substituents can be present.
- substitutions will typically number less than about 10 substitutions, more preferably about 1 to 5, with about 1 to 3 substitutions being preferred.
- a substituent is selected that does not act as a substrate for the monooxygenases of the present disclosure.
- Substituted alkyl, cycloalkyl, heterocycloalkyl, aryl, or heteroaryl refers to an alkyl, cycloalkyl, heterocycloalkyl, aryl, or heteroaryl group in which one or more hydrogen atoms is replaced with another substituent group.
- CHMO Cyclohexanone monooxygenases
- the biocatalytic reaction uses O2 and a co-factor NAPDH to generate the caprolactone, oxidized cofactor NADP+, and H 2 0.
- Cyclohexanone monooxygenases are flavin dependent enzymes and contain a flavin prosthetic group, generally flavin adenine dinucleotide (FAD). This FAD prosthetic group is bound to the enzyme and, without being bound by theory, believed to participate in the catalytic reaction by forming a peroxyflavin intermediate (see, e.g., Sheng et al., 2001,
- Cyclohexanone monooxygenases have also been used as biocatalysts for the enantioselective air-oxidation of prochiral thioethers to form chiral sulfoxides (see, e.g., Light et al., 1982,
- Cyclohexanone monooxygenases can recognize a variety of aryl-alkyl sulfide substrates, examples of which are described in Pasta et al., 1995, Tetrahedron: Asymmetry 6(4):933-936; Yeung and Rettie, 2005, "Prochiral Sulfoxidation as a probe for Flavin-Containing Monooxygenases, In Methods in Molecular Biology: Cytochrome P450 Protocols 320: 163- 172; Colonna et al., 2000, Chirality 13(l):40-42; and Alphand et al., 2003, Trends Biotechnology 21(7) :318-323.
- esomeprazole is the S-isomer of omeprazole, which is a racemic mixture containing the S and R isomers.
- Omeprazole and esomeprazole are inhibitors of H + K + ATPases, which are proton transporters involved in the production of acid in the stomach.
- Omeprazole and esomeprazole are prescribed for the treatment of ulcers, gastroesophageal reflux disease, and Zollinger-Ellison syndrome.
- the current synthesis of esomeprazole involves a Kagan-Sharpless type chemical oxidation of sulfides to sulfoxides (see Cotton et al., 2000, Tetrahedron: Asymmetry 1 1 : 3819).
- engineered monooxygenase polypeptides have been designed starting from the cyclohexanone monooxygenase of Acinetobacter NCIMB9 '877 to efficiently oxidize pyrmetazole ("compound (1)") to the sulfoxide product ("compound (2)").
- the naturally occurring cyclohexanone monooxygenase is inefficient in catalyzing this reaction.
- the engineered polypeptides are further capable of enantioselectively converting pyrmetazole to either the (R) or (S) form of omeprazole in enantiomeric excess.
- these engineered biocatalysts provide a highly efficient process for producing esomeprazole as substantially enantiomerically pure preparations.
- These engineered monooxygenase polypeptides can also be applied to the sulfoxidation of compounds structurally similar to pyrmetazole.
- the present disclosure provides polypeptides capable of converting compound (1) to compound (2), as illustrated in reaction Scheme 1 below, at a rate that is improved over the monooxygenase of SEQ ID NO:2:
- the conversion rate of the engineered monooxygenase is greater than 1.5 fold the conversion rate of the monooxygenase of SEQ ID NO: 2.
- the oxidation occurs in presence of molecular oxygen O 2 and an electron donor, such as cofactor NADPH or NADH.
- an electron donor such as cofactor NADPH or NADH.
- the cofactor NADPH is used.
- the reaction can be carried out in presence of a co-factor regenerating system, for example an appropriate dehydrogenase (e.g., glucose dehydrogenase, formate dehydrogenase, phosphite dehydrogenase, or ketoreductase/alcohol dehydrogenase) and a suitable dehydrogenase substrate, such as glucose, glucose-6-phosphate, formate, phosphite, or an alcohol, e.g., isopropanol.
- a co-factor regenerating system for example an appropriate dehydrogenase (e.g., glucose dehydrogenase, formate dehydrogenase, phosphite dehydrogenase, or ketoreductase/alcohol dehydrogenase) and a suitable dehydrogenase substrate, such as glucose, glucose-6-phosphate, formate, phosphite, or an alcohol,
- the polypeptide is capable of converting compound (1) to compound (2) at a rate that is greater than 1.5 fold the rate of the monooxygenase polypeptide of SEQ ID NO:2 and comprises an amino acid sequence that is at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence selected from the group consisting of SEQ ID NO: 4, 6, 10, 22, 52, 76, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170,
- Some embodiments of the engineered monooxygenase polypeptides are capable of converting compound (1) to (i?)-5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH- benzo[d]imidazole ("compound (2a)”) in enantiomeric excess while other embodiments of the monooxygenase polypeptides are capable of converting compound (1) to (5 ⁇ -5-methoxy-2-((4- methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH-benzo[d]imidazole (“compound (2b)”) in enantiomeric excess.
- the stereoselectivity of the engineered monooxygenases can be ascertained by using the substrate pyrmetazole and determining the amounts of (R) and (S) forms of the products produced.
- the stereoselectivity of the monooxygenases towards other prazole substrates can be determined in a similar manner.
- the polypeptides are capable of converting compound (1) to (R)-5- methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH-benzo[d]imidazole
- the polypeptide capable of converting compound (1) to compound (2a)in enantiomeric excess comprises an amino acid sequence that is at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence selected from the group consisting of SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, and 208.
- the reference sequence is SEQ ID NO: 6, 166, 170, 174, 190, 192, 196, 200, 204, or 206.
- the polypeptide is capable of converting compound (1) to (S)-5- methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH-benzo[d]imidazole
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence that is at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence selected from the group consisting of SEQ ID NO: 8, 10, 22, 52, 76, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240
- the polypeptides presented herein comprise an amino acid sequence that has one or more residue differences as compared to the reference sequence of the naturally occurring cyclohexanone monooxygenase of Acinetobacter sp NCIMB9871 represented by SEQ ID NO:2.
- the residue differences can be non-conservative changes or conservative changes.
- the residue differences can be conservative substitutions, non-conservative substitutions, or a combination of non-conservative and conservative substitutions.
- the amino acid residue position in the reference sequence is determined in the monooxygenase polypeptide beginning from the initiating methionine (M) residue (i.e., M represents residue position 1), although it will be understood by the skilled artisan that this initiating methionine residue may be removed by biological processing machinery, such as in a host cell or in vitro translation system, to generate a mature protein lacking the initiating methionine residue.
- M methionine
- the polypeptide sequence position at which a particular amino acid or amino acid change (“residue difference") is present is sometimes described herein as "Xn", or "position n", where n refers to the residue position with respect to the reference sequence.
- a specific substitution mutation which is a replacement of the specific residue in a reference sequence with a different specified residue may be denoted by the conventional notation "X(number)Y", where X is the single letter identifier of the residue in the reference sequence, "number” is the residue position in the reference sequence, and Y is the single letter identifier of the residue substitution in the engineered sequence.
- the residue differences as compared to SEQ ID NO:2 are present at one or more of the following residue positions: X3; X14; X34; X43; X71 ; XI 1 1 ; X141 ; X149, X174; X209; X240; X246; X248; X288; X307; X326; X383; X386; X388; X390; X400; X415; X426; X432; X433; X435; X438; X448; X449; X481 ; X488; X489; X490; X499; X505; X516; X526; X537; and X540.
- the monooxygenase has at least two or more, at least three or more, or at least four or more residue differences at the residue positions above as compared to the reference sequence of SEQ ID NO:2.
- residue positions X14, X34, X43; XI 11, X141, X386, X388, X426, X432, X433, X435, and X438 are within 8A of the FAD prosthetic group on the enzyme; residue positions X149, X209, X277, X326, X426, X432, X435, X438, X488, X489, and X490 are within 8A of enzyme-bound NADPH cofactor; and residue positions X277, X326, X426, X432, X433, X435, X438, X489, X490, and X505 are within 8A of enzyme -bound pyrmetazole.
- amino acid residues at these residue positions are in close proximity to bound substrate, FAD prosthetic group, and co-factor, it has been found that the amino acid residues at these residue positions as well as the others above can be varied to alter specific enzyme properties, including, among others, substrate binding, enzyme activity, enantioselectivity, and thermostability.
- CYMO_ACISP RecName: Full Cyclohexanone 1,2- monooxygenase; (c) gi
- oxidoreductase OS Sulfitobacter sp.
- residue differences as compared to SEQ ID NO:2 at the specified residue positions can be selected from the following features: residue at position corresponding to X3 is a hydroxyl-containing amino acid residue; residue at position corresponding to XI 4 is an aliphatic amino acid residue; residue at position corresponding to X34 is a basic amino acid residue; residue at position corresponding to X43 is an aliphatic amino acid residue; residue at position corresponding to X71 is a non-polar amino acid residue; residue at position corresponding to XI 1 1 is a hydroxyl- containing amino acid residue; residue at position corresponding to XI 41 is an aliphatic amino acid residue; residue at position corresponding to XI 49 is an aliphatic or aromatic amino acid residue; residue at position corresponding to XI 74 is an aliphatic amino acid residue; residue at position corresponding to X209 is a constrained amino acid residue; residue at position corresponding to X240 is a basic amino acid residue; residue
- corresponding to X540 is a polar or aliphatic amino acid residue.
- amino acid residue of SEQ ID NO:2 at the corresponding residue position are encompassed within the category of amino acids described for the specified position, a different amino acid within that amino acid category can be used in light of the guidance provided herein.
- residue differences as compared to SEQ ID NO:2 at the specified residue positions can be selected from the following features: residue at position corresponding to X3 is T; residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to X43 is G; residue at position corresponding to X71 is M; residue at position corresponding to XI 1 1 is T; residue at position corresponding to X 141 is I or V; residue at position corresponding to XI 49 is V or W; residue at position corresponding to XI 74 is L or I;
- residue at position corresponding to X209 is P; residue at position corresponding to X240 is F or K; residue at position corresponding to X246 is E, H, Q, S, R, W, or Y; residue at position corresponding to X248 is C, I, N, V, or S; residue at position corresponding to X277 is L, M, or Q; residue at position corresponding to X278 is G, N, or S; residue at position corresponding to X280 is G, or R; residue at position corresponding to X281 is A, or S; residue at position corresponding to X282 is S; residue at position corresponding to X248 is C, N, V, or S; residue at position corresponding to X288 is I; residue at position corresponding to X307 is R; residue at position corresponding to X326 is C or T; residue at position corresponding to X383 is I or G; residue at position corresponding to X386 is S; residue at position corresponding to X3
- the polypeptide comprises an amino acid sequence having one or more residue differences as compared to SEQ ID NO:2 at residue positions associated with increased enzyme activity, which positions are selected from the following: X14, X34, X43, X71, XI 1 1, X141, X149, X174, X209, X240, X246, X248, X277, X278, X280; X281 ; X282; X288, X307, X326, X341, X368, X386, X388, X390, X400, X415, X426, X432, X433, X435, X438, X448, X449, X481, X488, X489, X490, X499, X505, X516, X526, X537, and X540.
- the polypeptide amino acid sequence has
- the polypeptide comprises an amino acid sequence having one or more residue differences as compared to SEQ ID NO:2 at residue positions affecting enantioselectivity, which positions are selected from the following: X246; X248; X326; X386; X432; X433; X435; X438; and X448.
- the polypeptide amino acid sequence has at least two or more residue differences, at least three or more residue differences, or at least four or more residue differences as compared to SEQ ID NO: 2 at the residue positions associated with enantioselectivity.
- the polypeptide comprises an amino acid sequence having one or more residue differences as compared to SEQ ID NO:2 at residue positions associated with decreased amounts of sulfone-byproduct formation, where the sulfone-byproduct has the structure of compound (3) (shown above). Residue positions associated with decreased amounts of sulfone-byproduct of compound (3) are selected from the following: X246, X248, X277, and X438.
- the sulfone- byproduct of compound (3) is an undesirable product of the enzymatic reaction and is readily identified by its elution profile on a chromatographic column of 3,5-dimethylphenylcarbamate derivatized amylose immobilized on 5- ⁇ silica (CHIRALPAK IA®).
- An exemplary solvent condition for resolution of the sulfone-byproduct from pyrmetazole and esomeprazole is 45:55 Heptane/EtOH, 1.0 mL/min, 40°C, with detection of eluent at 300 nm.
- the polypeptides are capable of producing amounts of sulfone-byproduct of compound (3) that is decreased by 20% or more, 40% or more, or 90% or more than the amount produced by the polypeptide of SEQ ID NO:52 under the same assay conditions.
- the polypeptide capable of forming decreased amount of sulfone- byproduct of compound (3) as compared to the polypeptide of SEQ ID NO: 52 comprises an amino acid sequence which includes one or more of the following features: residue at position
- residue at position corresponding to X246 is Y; residue at position corresponding to X248 is S; and residue at position corresponding to X438 is I.
- the polypeptide comprises an amino acid sequence having one or more residue differences as compared to SEQ ID NO:2 at residue positions associated with increased thermostability and/or solvent stability, which positions are selected from the following: X43, X71, XI 1 1, X149, X174, X307; X341, X368, X388, X390, X400, X449, X481, and X488.
- Thermostability can be determined by preincubating the polypeptide at a set of defined temperatures and times, e.g., 4°C - 50°C for 18-24 hours, followed by measuring the % residual activity of the polypeptide using a defined assay.
- Thermostability can then be characterized as the temperature at which the variant retains 50% residual activity.
- the thermostable polypeptides have at least 50% residual activity under the exemplary preincubation condition. Accordingly, in some embodiments the polypeptides of the disclosure exhibit an increased thermostability of retaining 50% residual activity following 18 hours preincubation at 30°C, 32°C, 34°C, 36°C, 38°C, 40°C, 42°C, 44°C, 46°C, 48°C, 50°C, 52°C, or even higher temperatures.
- Thermostability can also be characterized as the activity FIOP when the activity assay is carried out at an elevated temperature, e.g., FIOP activity at 32°C.
- the polypeptides of the disclosure exhibit an increased thermostability as determined by an increase in activity FIOP at an elevated temperature 30°C, 32°C, 34°C, 36°C, 38°C, 40°C, 42°C, 44°C, 46°C, 48°C, 50°C, 52°C, or even higher temperatures.
- solvent stability can be determined by preincubating the polypeptide in a defined solvent at a defined temperature and time, and measuring the % residual activity of the polypeptide using a defined assay.
- the polypeptide with increased thermostability as compared to the polypeptide of SEQ ID NO: 6 comprises an amino acid sequence which includes one or more of the following features: residue at position corresponding to X43 is G; residue at position corresponding to X71 is M; residue at position corresponding to XI 1 1 is T; residue at position corresponding to XI 49 is V or W; residue at position corresponding to XI 74 is L or I, residue at position corresponding to X307 is R; residue at position corresponding to X341 is E; residue at position corresponding to X368 is N or V; residue at position corresponding to X388 is K; residue at position corresponding to X390 is R; residue at position corresponding to X400 is I; residue at position corresponding to X449 is M, F or L; residue at position corresponding to X481 is K; and residue at position corresponding to X488 is K.
- the polypeptide comprises an amino acid sequence having residue differences as compared to SEQ ID NO:2 at residue positions associated with increased protein expression, which positions include X3.
- the residue at position corresponding to X3 is T.
- the polypeptide can have additional residue differences as compared to SEQ ID NO:2 at other residue positions. Residue differences at these residue other positions provide for additional variations in the amino acid sequence without adversely affecting the ability of the polypeptide to carry out the conversion of compound (1) to compound (2), in particular the conversion to compound (2b) in enantiomeric excess.
- the polypeptide can comprise an amino acid sequence which includes in addition to the features above for the specified residue positions, one or more of the following features: residue at position corresponding to XI 5 is an aliphatic amino acid residue; residue at position corresponding to X22 is an aliphatic amino acid residue; residue at position corresponding to X32 is a polar amino acid residue; residue at position corresponding to X39 is an aliphatic amino acid residue; residue at position corresponding to X44 is an acidic amino acid residue; residue at position corresponding to X59 is an constrained amino acid residue; residue at position corresponding to X74 is an aliphatic amino acid residue; residue at position corresponding to X83 is an acidic amino acid residue; residue at position corresponding to X92 is a polar amino acid residue; residue at position corresponding to X107 is a basic amino acid residue; residue at position corresponding to XI 14 is a basic amino acid residue; residue at position corresponding to X123 is an acidic amino acid residue; residue at position corresponding to
- corresponding to XI 76 is an acidic amino acid residue; residue at position corresponding to XI 94 is a hydroxyl-containing amino acid residue; residue at position corresponding to XI 95 is an aliphatic amino acid residue; residue at position corresponding to XI 99 is a constrained amino acid residue; residue at position corresponding to X201 is a polar amino acid residue; residue at position corresponding to X244 is an aliphatic amino acid residue; residue at position corresponding to X245 is an aliphatic amino acid residue; residue at position corresponding to X329 is a polar amino acid residue; residue at position corresponding to X330 is hydroxyl-containing or aliphatic amino acid residue; residue at position corresponding to X354 is an aliphatic amino acid residue; residue at position corresponding to X367 is an acidic amino acid residue; residue at position corresponding to X368 is a polar or aliphatic amino acid residue; residue at position corresponding to X408 is an
- amino acid residue of SEQ ID NO:2 at the corresponding residue position is encompassed within the category of amino acids described for the specified positions of the engineered polypeptides
- a different amino acid within that amino acid category can be used in light of the guidance provided herein. Without being bound by theory, the influence of these residue positions on enzyme function is provided in Table 3.
- the polypeptide amino acid sequence can have for the specified residue positions, one or more of the following features: residue at position corresponding to XI 5 is A; residue at position corresponding to X22 is A; residue at position corresponding to X32 is N; residue at position corresponding to X38 is E; residue at position corresponding to X39 is G; residue at position corresponding to X44 is E; residue at position corresponding to X59 is P; residue at position corresponding to X64R; residue at position corresponding to X74 is G; residue at position corresponding to X83 is E; residue at position corresponding to X92 is N; residue at position corresponding to XI 07 is K; residue at position corresponding to XI 14 is R; residue at position corresponding to is D; residue at position corresponding to X143 is N or V; residue at position corresponding to XI 44 is A; residue at position corresponding to XI 46 is T or V; residue at position corresponding to X149 is F, M,
- the monooxygenase polypeptide can have additional residue differences as compared to SEQ ID NO:2 at residue positions other than those specified above.
- the polypeptides can have additionally 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, 1-10, 1-1 1, 1-12, 1-14, 1- 15, 1- 16, 1-18, 1-20, 1-22, 1-24, 1-26, 1-30, 1-35, 1-40 residue differences at other amino acid residue positions as compared to the reference sequence.
- the number of differences can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 14, 15, 16, 18, 20, 22, 24, 26, 30, 35, and 40 residue differences at other residue positions.
- residue difference at these other positions can include conservative changes or non-conservative changes.
- residue differences can comprise conservative substitutions and non-conservative substitutions as compared to the monooxygenase of SEQ ID NO:2.
- the other residue differences comprise conservative substitutions.
- residue positions can have effects on more than one enzyme property, and therefore can be used to affect more than one enzyme property.
- residues affecting enzyme activity and enantioselectivity can be used in combination to form polypeptides with increased enzymatic activity and enantioselectivity for the conversion of compound (1) to compound (2b), or increased enzymatic activity and enantioselectivity for the conversion of compound (1) to compound (2a).
- residue positions associated with increased thermostability, increased solvent stability, decreased sulfone-byproduct formation, and increased protein expression can be added to effect changes in the polypeptide with respect to such properties.
- a polypeptide capable of converting compound (1) to compound (2a)in enantiomeric excess at a rate that is greater than 1.5 fold the rate of the monooxygenase of SEQ ID NO:2 can comprise an amino acid sequence which includes at least two or more of the following features: residue at position corresponding to X432 is an aliphatic or hydroxyl- containing amino acid residue; residue at position corresponding to X433 is a non-polar or aliphatic amino acid residue; residue at position corresponding to X435 is a hydroxy 1-containing or aliphatic amino acid residue; and residue at position corresponding to X490 is a basic amino acid residue.
- the R-enantioselective monooxygenases have at least three or more of the above features, or at least all of the above features.
- the polypeptide capable of converting compound (1) to compound (2a) in enantiomeric excess at a rate that is greater than 1.5 fold the rate of the monooxygenase of SEQ ID NO:2 can comprise an amino acid sequence which includes at least two or more of the following features: residue at position corresponding to X432 is A, C, D, H, K, I, S, T, or Y; residue at position corresponding to X433 is A, F, G, K, or W; residue at position corresponding to X435 is A or S; and residue at position corresponding to X490 is E, P, or R.
- the R-enantioselective monooxygenase has an amino acid sequence that has at least three or more of the above features, or at least all of the above features.
- residue differences at other residue positions associated with desirable changes in other enzyme properties such as enzyme activity, sulfone-byproduct formation, thermostability, solvent stability and/or protein expression can be present.
- the polypeptide amino acid sequence can, in addition to the features above features, further include one or more of the following features: residue at position corresponding to X326 is a hydroxyl-containing residue; residue at position corresponding to X329 is a polar residue; residue at position corresponding to X383 is an aliphatic amino acid residue; and residue at position corresponding to X451 is a basic amino acid residue.
- amino acid residues at the residue positions are selected from the following: residue at position corresponding to X277 is V; residue at position corresponding to X278 is H; residue at position corresponding to X279 is Y; residue at position corresponding to X280 is W; residue at position corresponding to X281 is H; residue at position corresponding to X326 is A, D, L, S, or T; residue at position corresponding to X329 is N; residue at position corresponding to X383 is I; residue at position corresponding to X426 is H, or Q; residue at position corresponding to X451 is R; and residue at position corresponding to X489 is P.
- the polypeptide capable of converting compound (1) to compound (2a) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X432 is A; residue at position corresponding to X433 is A; residue at position corresponding to X435 is S; and residue at position corresponding to X490 is R.
- the polypeptide capable of converting compound (1) to compound (2a) can further comprise one or more of the following features: residue at position corresponding to X277 is V; residue at position corresponding to X278 is H; residue at position corresponding to X279 is Y; residue at position corresponding to X280 is W; residue at position corresponding to X281 is H; residue at position corresponding to X326 is A, D, L, S, or T; residue at position corresponding to X329 is N; residue at position corresponding to X383 is I; residue at position corresponding to X426 is H, or Q; residue at position corresponding to X451 is R; and residue at position corresponding to X489 is P.
- the polypeptide capable of converting compound (1) to compound (2a) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is T; residue at position corresponding to X329 is N; residue at position corresponding to X383 is I; residue at position corresponding to X432 is S; residue at position corresponding to X433 is G; residue at position corresponding to X435 is S;
- the polypeptide capable of converting compound (1) to compound (2a) can further comprise one or more of the following features: residue at position corresponding to X277 is V; residue at position corresponding to X278 is H; residue at position corresponding to X279 is Y; residue at position corresponding to X280 is W; residue at position corresponding to X281 is H; residue at position corresponding to X326 is A, D, L, S, or T; residue at position corresponding to X329 is N; residue at position corresponding to X383 is I; residue at position corresponding to X426 is H, or Q; residue at position corresponding to X451 is R; and residue at position corresponding to X489 is P.
- the R-enantioselective polypeptide can have additional residue differences as compared to SEQ ID NO:2 at other residue positions.
- the R-enantioselective polypeptides can have additionally 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1- 8, 1-9, 1-10, 1-1 1, 1-12, 1-14, 1-15, 1-16, 1-18, 1-20, 1-22, 1-24, 1-26, 1-30, 1-35, 1-40 residue differences at other amino acid residue positions as compared to the reference sequence.
- the number of differences can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 14, 15, 16, 18, 20, 22, 24, 26, 30, 35, and 40 residue differences at other residue positions.
- residue difference at these other positions can include conservative changes or non-conservative changes.
- residue differences can comprise conservative substitutions and non-conservative substitutions as compared to the monooxygenase of SEQ ID NO:2. In some embodiments, these other residue differences comprise conservative substitutions.
- some embodiments of the monooxygenases disclosed herein are capable of converting compound (1) to compound (2b) in enantiomeric excess.
- these compound (2b) enantioselective monooxygenases comprise an amino acid sequence having residue differences at residue positions associated with S-enantioselectivity, particularly at residue positions corresponding to X326 and X386.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence in which at least residue at position corresponding to X326 is a cysteine (C).
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence in which at least residue at position corresponding to X386 is a hydroxyl-containing amino acid residue, particularly a S.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence in which in which at least residue at position corresponding to X326 is a cysteine (C) and residue at position corresponding to X386 is a hydroxyl-containing amino acid residue, particularly a S.
- the S-enantioselective monooxygenase polypeptide comprises an amino acid sequence having, in addition to residue differences associated with S-enantioselectivity above, at least one or more residue differences at residue positions associated with increases in enzyme activity for the pyrmetazole substrate and/or further increases in S-enantioselectivity, particularly at residue positions X432, X433, X435, X438, X448, and X490; and more particularly at least one or more residue differences at residue positions X432, X433, X435, and X490.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is a cysteine (C); residue at position
- corresponding to X386 is a hydroxyl-containing amino acid residue; and residue at position corresponding to X432 is an aliphatic or hydroxyl-containing amino acid residue.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is C; residue at position corresponding to X386 is S; and residue at position corresponding to X432 is A, or L.
- the polypeptide amino acid sequence can further include one or more of the following features: residue at position corresponding to X433 is an aliphatic amino acid residue; residue at position corresponding to X435 is a hydroxyl-containing amino acid residue; residue at position corresponding to X438 is an aliphatic amino acid residue; residue at position corresponding to X448 is an aliphatic or aromatic amino acid residue; and residue at position corresponding to X490 is a basic amino acid residue.
- the amino acid residues at the residue positions associated with enzymatic activity can be selected from the following: residue at position corresponding to X433 is A, L, or V; residue at position corresponding to X435 is S; residue at position corresponding to X438 is I; residue at position corresponding to X448 is V or W; and residue at position corresponding to X490 is R.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises amino acid sequence which includes at least the following features: residue at position corresponding to X326 is a cysteine (C); residue at position
- corresponding to X386 is a hydroxyl-containing amino acid residue
- residue at position corresponding to X432 is an aliphatic or hydroxyl-containing amino acid residue
- residue at position corresponding to X433 is an aliphatic amino acid residue
- residue at position corresponding to X435 is a hydroxyl- containing amino acid residue
- residue at position corresponding to X490 is a basic amino acid residue.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is C; residue at position corresponding to X386 is S; residue at position corresponding to X432 is A, or L; residue at position corresponding to X433 is A, L, or V; residue at position corresponding to X435 is S; and residue at position corresponding to X490 is R.
- residue differences as compared to SEQ ID NO:2 at other residue positions associated with desirable changes in enzyme activity, enantioselectivity, sulfone-byproduct formation, thermostability, solvent stability, and expression can be used in combination with the features at the specific residue positions described above to form polypeptides with additional improvements in the biocatalytic conversion of compound (1) to compound (2b), or in some embodiments, the biocatalytic conversion of compound (1) to compound (2a).
- the polypeptide amino acid sequence can include, in addition to the above features, one or more of the following features: residue at position corresponding to X3 is a hydroxyl-containing amino acid residue; residue at position corresponding to X14 is an aliphatic amino acid residue; residue at position corresponding to X34 is a basic amino acid residue; residue at position corresponding to X43 is an aliphatic amino acid residue; residue at position corresponding to X71 is a non-polar amino acid residue; residue at position corresponding to X83 is a an acidic amino acid residue; residue at position corresponding to XI 1 1 is a hydroxyl-containing amino acid residue; residue at position corresponding to X141 is an aliphatic amino acid residue; residue at position corresponding to XI 49 is an aliphatic or aromatic amino acid residue; residue at position corresponding to XI 74 is an aliphatic amino acid residue; residue at position corresponding to X209 is a constrained amino acid residue; residue at position corresponding to corresponding to X
- corresponding to X540 is a polar or aliphatic amino acid residue.
- the polypeptide amino acid sequence can include at one or more of the specified residue positions the following features: residue at position corresponding to X3 is T;
- residue at position corresponding to X277 is L, M, or Q; residue at position corresponding to X278 is G, N, or S; residue at position corresponding to X280 is G, or R; residue at position corresponding to X281 is A, or S; residue at position corresponding to X282 is S; residue at position corresponding to X288 is I; residue at position corresponding to X307 is C or R; residue at position corresponding to X341 is E; residue at position corresponding to X373 is L; residue at position corresponding to X377 is G; residue at position corresponding to X388 is K; residue at position corresponding to X390 is R or I; residue at position corresponding to X400 is I; residue at position corresponding to X415 is A; residue at position corresponding to X426 is F; residue at position corresponding to X449 is M, F, or L; residue at position corresponding to X464 is D; residue at position corresponding to X481
- residue at position corresponding to X499 is L; residue at position corresponding to X505 is W or L; residue at position corresponding to X516 is V; residue at position corresponding to X526 is V;
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess can comprise an amino acid sequence which includes, in addition to the features above for the specified residue positions, residue differences at one or more of the following positions: X15; X22; X32; X38; X39; X44; X59; X64; X74; X92; X107; XI 14; X123; X143; X144; X146; X149; X155; X161 ; X176; X194; X195; X199; X201 ; X244; X245; X246; X272; X278; X279; X280;
- amino acid residues that can be used at these additional residue positions are described above.
- amino acid residue of SEQ ID NO:2 at the corresponding residue position are encompassed within the category of amino acids described for the specified positions of the engineered polypeptides, a different amino acid within that amino acid category can be used in light of the guidance provided herein.
- the polypeptide capable of converting compound (1) to compound (2) can comprise an amino acid sequence which includes at least the following features: residue at position corresponding to X34 is K; residue at position corresponding to X209 is P; residue at position corresponding to X240 is K; residue at position corresponding to X288 is I; residue at position corresponding to X326 is C; residue at position corresponding to X386 is S; residue at position corresponding to X415 is A; residue at position corresponding to X432 is L; residue at position corresponding to X433 is A, residue at position corresponding to X435 is S; residue at position corresponding to X438 is I; residue at position corresponding to X448 is V; residue at position corresponding to X490 is R; residue at position corresponding to X516 is V; and residue at position corresponding to X537 is T.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to X141 is V; residue at position corresponding to X209 is P; residue at position corresponding to X240 is K; residue at position corresponding to X288 is I; residue at position corresponding to X326 is C; residue at position corresponding to X386 is S; residue at position corresponding to X415 is A; residue at position corresponding to X426 is F; residue at position corresponding to X432 is L; residue at position corresponding to X433 is A; residue at position corresponding to X435 is S; residue at position corresponding to X438 is I; residue at position corresponding to X448 is V; residue at position corresponding to X489 is C; residue at position corresponding to corresponding to
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to XI 1 1 is T; residue at position corresponding to XI 41 is V; residue at position corresponding to X209 is P; residue at position corresponding to X240 is K;
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to X43 is G; residue at position corresponding to X71 is M; residue at position corresponding to XI 1 1 is T; residue at position corresponding to XI 41 is V;
- residue at position corresponding to XI 49 is W; residue at position corresponding to X209 is P; residue at position corresponding to X240 is K; residue at position corresponding to X246 is Y; residue at position corresponding to X248 is V; residue at position corresponding to X277 is M; residue at position corresponding to X288 is I; residue at position corresponding to X307 is R; residue at position corresponding to X326 is C; residue at position corresponding to X341 is E; residue at position corresponding to X386 is S; residue at position corresponding to X388 is K; residue at position corresponding to X390 is I; residue at position corresponding to X415 is A; residue at position corresponding to X426 is F; residue at position corresponding to X432 is L; residue at position corresponding to X433 is A; residue at position corresponding to X435 is S; residue at position corresponding to X438 is I; residue at position corresponding to
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X14 is A; residue at position corresponding to X34 is K; residue at position corresponding to X43 is G; residue at position corresponding to X71 is M; residue at position corresponding to X83 is E; residue at position corresponding to XI 1 1 is T; residue at position corresponding to X 141 is V; residue at position corresponding to XI 49 is W; residue at position corresponding to XI 74 is I; residue at position corresponding to X209 is P; residue at position corresponding to X240 is F or K; residue at position corresponding to X246 is Y; residue at position corresponding to X248 is I or V; residue at position corresponding to X277 is M; residue at position corresponding to X288 is I; residue at position corresponding to X307 is C or R;
- the above polypeptides can have additionally 1-2, 1-3, 1-4, 1-5, 1-6, 1- 7, 1-8, 1-9, 1-10, 1-1 1, 1- 12, 1-14, 1-15, 1- 16, 1-18, 1-20, 1-22, 1-24, 1-26, 1-30, 1-35, 1-40 residue differences at other amino acid residue positions as compared to the reference sequence.
- the number of differences can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 14, 15, 16, 18, 20, 22, 24, 26, 30, 35, and 40 residue differences at other residue positions.
- the residue difference at these other positions can include conservative changes or non-conservative changes.
- the residue differences can comprise conservative substitutions and non-conservative substitutions as compared to the monooxygenase of SEQ ID NO:2. In some embodiments, the residue differences comprise conservative substitutions.
- the polypeptide capable of forming compound (2b) in enantiomeric excess comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 112, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 23
- the polypeptide is capable of converting compound (1) to compound (2b) in at least 90% enantiomeric excess. In some embodiments, the polypeptide is capable of forming compound (2b) in at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more enantiomeric excess.
- the polypeptide capable of forming compound (2b) in at least 90% enantiomeric excess comprises an amino acid sequence which includes at least one or more, at least two or more, at least three or more, or at least four or more of the following features: residue at position corresponding to X246 is Y; residue at position corresponding to X248 is S; residue at position corresponding to X326 is C; residue at position corresponding to X386 is S; residue at position corresponding to X432 is L; residue at position corresponding to X433 is A; residue at position corresponding to X435 is S; residue at position corresponding to X438 is I; and residue at position corresponding to X448 is V.
- the polypeptide capable of forming compound (2b) in at least 90% enantiomeric excess comprises an amino acid sequence which includes at least the following features: residue at position corresponding to X326 is C; residue at position corresponding to X386 is S;
- residue at position corresponding to X438 is I; and residue at position corresponding to X448 is V.
- the polypeptide capable of forming compound (2b) in at least 90% enantiomeric excess comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126,128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 242, 244, 248, 250, 254, 256, 258, 262, and 264.
- the polypeptide is capable of forming compound (2b) in at least 99% enantiomeric excess and with at least 4-fold the enzymatic activity of the polypeptide of SEQ ID NO: 10.
- the polypeptide capable of forming compound (2b) in at least 99% enantiomeric excess and with at least 4 fold or more the enzymatic activity of the polypeptide of SEQ ID NO: 10 comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 16, 18, 20, 22, 24, 26, 28, 30, 32, 42, 44, 86, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, and 158.
- the monooxygenase polypeptides are capable of forming decreased amount of sulfone-byproduct of compound (3) as compared to the polypeptide of SEQ ID NO: 52.
- Exemplary monooxygenase polypeptides capable of forming decreased amount of sulfone-byproduct as compared to the polypeptide of SEQ ID NO: 52 can comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 54, 58, 60, 62, 64, 70, 72, 76, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 116, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, and 158.
- the polypeptide is capable of forming sulfone-byproduct of compound (3) in an amount that is decreased by about 20% or higher as compared to the amount produced by the polypeptide of SEQ ID NO: 52 and comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 54, 58, 60, 62, 64, 70, 72, 76, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, and 158.
- the polypeptide is capable of forming sulfone-byproduct of compound (3) in an amount that is decreased by about 40% or higher as compared to the amount produced by the polypeptide of SEQ ID NO: 52 and comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 54, 72, 76, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, and 158.
- the polypeptide is capable of forming sulfone-byproduct of compound (3) in an amount that is decreased by about 90% or higher as compared to the amount produced by the polypeptide of SEQ ID NO:52 and comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, and 158.
- the polypeptide amino acid sequence can have residue differences as compared to SEQ ID NO: 2 at residue positions associated with increases in protein expression. Accordingly, in some embodiments, the polypeptide amino acid sequences described herein can further include at the residue position corresponding to X3 a hydroxyl-containing residue. In some embodiments, the residue at position corresponding to X3 is T. This feature is particularly useful for enhanced expression in a bacterial host organism, particularly E. coli.
- Table 2A provides exemplary engineered polypeptides capable of converting compound (1) to compound (2b).
- Odd numbered SEQ ID NOs refer to the nucleotide sequence encoding the amino acid sequence provided by the even numbered SEQ ID NOs. The residue differences are based on comparison to reference sequence of SEQ ID NO:2, the naturally occurring cyclohexanone monooxygenase of Acinetobacter sp NCIMB9871. Activity improvements are presented as "fold improvement over parent" or "FIOP", where the parent polypeptide amino acid sequence used as reference is indicated by the SEQ ID NO in a separate column.
- polypeptides of SEQ ID NO: 4 and 6 have the ability to convert compound (1) to compound (2) but the product has enantiomeric excess of the R-isomer.
- the monooxygenase of SEQ ID NO: 4 has about 1.7 fold the activity of SEQ ID NO:2.
- the Activity FIOP and enantioselectivity (%ee) of the exemplary non-naturally occurring monooxygenase polypeptides ("engineered CHMO polypeptides") of Table 2A in carrying out the biocatalytic conversion of the substrate compound (1) to the product compound (2) were determined using the following general HTP assay conditions: 6 g/L substrate compound (1), 10 ⁇ ⁇ of lysate of the engineered CHMO polypeptide, 1 g/L KRED of SEQ ID NO: 268, 0.1 g/L NADP, in a solution of 50 mM potassium phosphate buffer, 8% (v/v) IPA, 2% acetone, 15% N-methyl-2-pyrrolidone (NMP), pH 9.0, 25°C reaction temperature and 24 h reaction time. Further details of the HTP assay used are described in the Examples.
- the polypeptide capable of converting compound (1) to compound (2a) in enantiomeric excess at a rate that is greater than 1.5 fold the rate of SEQ ID NO:2 can comprise an amino acid sequence that is at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a reference sequence selected from the group consisting of SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, and 208 with the proviso that the polypeptide amino acid sequence comprises any one of the set of residue differences as compared to SEQ ID NO:2 present in the polypeptide sequences represented by SEQ ID NO: 4, 6, 160, 162, 164, 166
- the polypeptides can have additionally 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, 1-10, 1-1 1, 1-12, 1-14, 1-15, 1-16, 1-18, 1-20, 1-22, 1-24, 1-26, 1-30, 1- 35, 1-40 residue differences at other amino acid residue positions as compared to the reference sequence.
- the number of differences can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 14, 15, 16, 18, 20, 22, 24, 26, 30, 35, and 40 residue differences at other residue positions.
- the residue difference at these other positions can include conservative changes or non-conservative changes.
- the residue differences can comprise conservative substitutions and non-conservative substitutions as compared to the reference sequence.
- the residue differences are conservative substitutions.
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess can comprise an amino acid sequence that is at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to a reference sequence of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144
- the polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess can comprise an amino acid sequence that is at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a reference sequence selected from the group consisting of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 116, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140
- the polypeptides can have additionally 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, 1- 10, 1- 1 1, 1- 12, 1-14, 1-15, 1-16, 1-18, 1-20, 1-22, 1-24, 1-26, 1-30, 1-35, 1-40 residue differences at other amino acid residue positions as compared to the reference sequence.
- the number of differences can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 14, 15, 16, 18, 20, 22, 24, 26, 30, 35, and 40 residue differences at other residue positions.
- the residue difference at these other positions can include conservative changes or non- conservative changes.
- the residue differences can comprise conservative substitutions and non-conservative substitutions as compared to the reference sequence.
- the residue differences are conservative substitutions.
- Table 2B below provides 53 exemplary non-naturally occurring monooxygenase polypeptides capable of converting compound (1) to compound (2) that differ by one amino acid residue difference from reference polypeptides of SEQ ID NO: 4, 16, 32, 52, 56, 58, and 76. Odd numbered SEQ ID NOs refer to the nucleotide sequence encoding the amino acid sequence provided by the even numbered SEQ ID NOs. The residue differences in Table 2B are provided based on comparison to the wild-type reference sequence of SEQ ID NO:2, as well as the reference polypeptides of SEQ ID NO: 4, 16, 32, 52, 56, 58, and 76.
- the Activity FIOP and enantioselectivity (%ee) of the 53 exemplary non-naturally occurring monooxygenase polypeptides of Table 2B in carrying out the biocatalytic conversion of the substrate compound (1) (pyrmetazole) to the product compound (2) ((R)- or (3 ⁇ 4 ) -omeprazole) were determined that following general HTP assay conditions: 5 g/L pyrmetazole substrate, 10 ⁇ ⁇ of lysate of the engineered CHMO polypeptide, 1 g/L KRED of SEQ ID NO: 268, 0.5 g/L NADP, in a solution of 50 mM potassium phosphate buffer, 10% (v/v) IPA proposition pH 9.0, 25°C reaction temperature and 24 h reaction time (with 400 rpm stirring). Further details of the HTP assay methods are described in the Examples.
- SEQ ID NO: 4 S489C; W490R; F505L; A516V; I537T;
- the present disclosure provides and engineered polypeptide capable of converting compound (1) to compound (2) can comprise an amino acid sequence that is at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to a reference sequence of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144
- the engineered polypeptide comprises a sequence at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to a sequence of any one of SEQ ID NO: 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, or 266.
- the engineered polypeptide is capable of converting compound (1) to either the (R)- or the (S)- en
- the present disclosure provides an engineered polypeptide is capable of converting a structurally similar analog of the substrate pyrmetazole (e.g., a compound of structural formula (I)) to either the (R)- or the (S)- enantiomer of the corresponding analog prazole compound (e.g., compound of structural formula (II)) in enantiomeric excess, which comprises an amino acid sequence that is at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to a reference sequence of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86,
- the engineered polypeptide is capable of converting an pyrmetazole analog substrate to an omeprazole analog compound selected from: (R) or (S)-lansoprazole, (R) or (S)-tenatoprazole, (R) or (S)- rabeprazole, (R) or (S)-pantoprazole, (R) or (S)-ilaprazole, (R) or (S)-leminoprazole, (R) or (S)- saviprazole, and (R) or (S)-TY- 1 1345.
- an omeprazole analog compound selected from: (R) or (S)-lansoprazole, (R) or (S)-tenatoprazole, (R) or (S)- rabeprazole, (R) or (S)-pantoprazole, (R) or (S)-ilaprazole, (R) or (S)-leminoprazole, (R) or
- the present disclosure provides an engineered polypeptide capable of converting the pyrmetazole analog substrate 5-(difluoromethoxy)-2-((3,4-dimethoxypyridin-2- yl)methylthio)- lH-benzo[d]imidazole to the omeprazole analog compound, 5)-pantoprazole in enantiomeric excess.
- the polypeptide capable of producing S)-pantoprazole in enantiomeric excess comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 70, 72, 76, 78, 80, 82, 84, 86, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 126, 128, 130, 228, 232, 244, 248, 256, 258, 260, 262, and 264.
- the present disclosure provides an engineered polypeptide capable of converting the pyrmetazole analog substrate 5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2- yl)methylthio)- lH-imidazo[4,5-b]pyridine to the omeprazole analog compound, (S)-tenatoprazole, in enantiomeric excess.
- the polypeptide capable of producing (S)-tenatoprazole comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 66, 102, 104, 1 14, 122, 124, 128, 232, 238, 244, 260, 262, and 264.
- the present disclosure provides an engineered polypeptide capable of converting the pyrmetazole analog substrate 2-((4-(3-methoxypropoxy)-3-methylpyridin-2- yl)methylthio)- lH-benzo[d]imidazole to the omeprazole analog compound, (S)-rabeprazole, in enantiomeric excess
- the polypeptide capable of producing (S)-rabeprazole comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 62, 76, 84, 86, 124, 238, 240, 250, 258, 262, and 264.
- the present disclosure also contemplates engineered CHMO polypeptides capable of converting compound (1) to compound (2), where the polypeptides comprise an amino acid sequence comprising mutations based on locations or regions in the structure of the parent polypeptide.
- a variant of a parent polypeptide can include an amino acid substitution at a particular residue at a location in the structure of the parent polypeptide as identified in Table 3.
- Exemplary substitutions at each of the relevant locations are also identified in Table 3.
- polypeptides can comprise deletions of the engineered amino acids
- the deletions can comprise one or more amino acids, 2 or more amino acids, 3 or more amino acids, 4 or more amino acids, 5 or more amino acids, 6 or more amino acids, 8 or more amino acids, 10 or more amino acids, 15 or more amino acids, or 20 or more amino acids, up to 10% of the total number of amino acids, up to 10% of the total number of amino acids, up to 20% of the total number of amino acids of the polypeptides, as long as the functional activity of the monooxygenase polypeptide as described herein is maintained.
- the functional activity is with respect to the conversion of compound (1) to compound (2) at a rate greater than 1.5 fold the rate of the monooxygenase of SEQ ID NO:2. In some embodiments, the functional activity of the polypeptide is with respect to the conversion of compound (1) to compound (2b) in
- the deletions can comprise, 1-2, 1-3, 1-4, 1-5, 1-6, 1-7, 1-8, 1-9, 1-10, 1-1 1, 1-12, 1-14, 1-15, 1-16, 1-18, 1-20, 1-22, 1-24, 1-26, 1-30, 1-35, or 1-40 amino acid residues.
- the number of deletions can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 14, 15, 16, 18, 20, 22, 24, 26, 30, 35, or 40 amino acids.
- the deletions can comprise deletions of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 18, or 20 amino acid residues.
- the polypeptides can comprise fragments of the engineered polypeptides described herein.
- the fragments can have about 80%, 90%, 95%, 98%, and 99% of the full-length polypeptide, e.g., the polypeptide of SEQ ID NO:8, as long as the functional activity of the polypeptide is maintained.
- the functional activity is with respect to the conversion of compound (1) to compound (2) at a rate greater than 1.5 fold the rate of the monooxygenase of SEQ ID NO: 2.
- the functional activity of the polypeptide with respect to the conversion of compound (1) to compound (2b) in enantiomeric excess is particularly useful for the conversion of compound (1) to compound (2b) in enantiomeric excess.
- the polypeptides of the disclosure can be in the form of fusion polypeptides in which the engineered polypeptides are fused to other polypeptides, such as, by way of example and not limitation, antibody tags (e.g., myc epitope), purifications sequences (e.g., His tags for binding to metals), and cell localization signals (e.g., secretion signals).
- antibody tags e.g., myc epitope
- purifications sequences e.g., His tags for binding to metals
- cell localization signals e.g., secretion signals
- polypeptides described herein are not restricted to the genetically encoded amino acids.
- the polypeptides described herein may be comprised, either in whole or in part, of naturally-occurring and/or synthetic non-encoded amino acids.
- non-encoded amino acids of which the polypeptides described herein may be comprised include, but are not limited to: the D- enantiomers of the genetically-encoded amino acids; 2,3-diaminopropionic acid (Dpr); ⁇ -aminoisobutyric acid (Aib); ⁇ -aminohexanoic acid (Aha); ⁇ -aminovaleric acid (Ava); N- methylglycine or sarcosine (MeGly or Sar); ornithine (Orn); citrulline (Cit); t-butylalanine (Bua); t- butylglycine (Bug); N-methylisoleucine (Melle); phenylglycine (Phg); cyclohexylalanine (Cha); norleucine (Nle); naphthylalanine (Nal); 2-chlorophenylalanine (Ocf); 3-chlorophenylalanine (M
- phosphothreonine pThr
- homoaspartic acid hAsp
- homoglutanic acid hGlu
- l-aminocyclopent-(2 or 3)-ene-4 carboxylic acid pipecolic acid (PA), azetidine-3-carboxylic acid (ACA); 1- aminocyclopentane-3-carboxylic acid
- allylglycine aOly
- propargylglycine pgGly
- homoalanine hAla
- norvaline nVal
- homoleucine hLeu
- homovaline hVal
- homoisolencine hlle
- amino acids or residues bearing side chain protecting groups may also comprise the polypeptides described herein.
- protected amino acids include (protecting groups listed in parentheses), but are not limited to: Arg(tos), Cys(methylbenzyl), Cys (nitropyridinesulfenyl), Glu(8-benzylester), Gln(xanthyl), Asn(N-8-xanthyl), His(bom), His(benzyl),
- Non-encoding amino acids that are conformationally constrained of which the polypeptides described herein may be composed include, but are not limited to, N-methyl amino acids
- the polypeptides can be present in whole cells transformed with gene(s) encoding the engineered monooxygenase enzyme, or as cell extracts, lysates, isolated polypeptide, or substantially purified, in a variety of different forms, including solid (e.g., lyophilized, spray-dried, and the like) or semisolid (e.g., a crude paste).
- the polypeptide described herein can be provided in the form of kits.
- the enzymes in the kits may be present individually or as a plurality of enzymes.
- the kits can further include reagents for carrying out the enzymatic reactions, substrates for assessing the activity of enzymes, as well as reagents for detecting the products.
- the kits can also include reagent dispensers and instructions for use of the kits.
- the polypeptides can be provided on a physical substrate.
- the polypeptides can be provided in the form of an array in which the polypeptides are arranged in positionally distinct locations. The array can be used to test a variety of aryl alkyl sulfides for conversion by the polypeptides. "Substrate,” “support,” “solid support,” “solid carrier,” or “resin” in the context of arrays refer to any solid phase material.
- Substrate also encompasses terms such as "solid phase,” "surface,” and/or “membrane.”
- a solid support can be composed of organic polymers such as polystyrene, polyethylene, polypropylene, polyfluoroethylene, polyethyleneoxy, and polyacrylamide, as well as co-polymers and grafts thereof.
- a solid support can also be inorganic, such as glass, silica, controlled pore glass (CPG), reverse phase silica or metal, such as gold or platinum.
- CPG controlled pore glass
- the configuration of a substrate can be in the form of beads, spheres, particles, granules, a gel, a membrane or a surface. Surfaces can be planar, substantially planar, or non-planar.
- Solid supports can be porous or non-porous, and can have swelling or non-swelling characteristics.
- a solid support can be configured in the form of a well, depression, or other container, vessel, feature, or location.
- a plurality of supports can be configured on an array at various locations, addressable for robotic delivery of reagents, or by detection methods and/or instruments.
- kits of the present disclosure include arrays comprising a plurality of different engineered monooxygenase polypeptides at different addressable position, wherein the different polypeptides are different variants of a reference sequence each having at least one different improved enzyme property.
- arrays comprising a plurality of engineered polypeptides and methods of their use are described in, e.g., WO2009/008908A2.
- polynucleotides encoding the polypeptides described herein.
- the polynucleotides may be operatively linked to one or more heterologous regulatory sequences that control gene expression to create a recombinant polynucleotide capable of expressing the monooxygenase polypeptide.
- Expression constructs containing a heterologous polynucleotide encoding the engineered monooxygenase can be introduced into appropriate host cells to express the corresponding polypeptide.
- the polynucleotides can be selected and/or engineered to comprise codons that are preferably selected to fit the host cell in which the protein is being produced.
- codons that are preferably selected to fit the host cell in which the protein is being produced.
- preferred codons used in bacteria are used to express the gene in bacteria
- preferred codons used in yeast are used for expression in yeast
- preferred codons used in mammals are used for expression in mammalian cells. Since not all codons need to be replaced to optimize the codon usage of the monooxygenases (e.g., because the natural sequence can have preferred codons and because use of preferred codons may not be required for all amino acid residues), codon optimized
- polynucleotides encoding the monooxygenase polypeptides may contain preferred codons at about 40%, 50%, 60%, 70%, 80%, or greater than 90% of codon positions of the full length coding region.
- the polynucleotide encodes a polypeptide comprising an amino acid sequence that has at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142
- the polynucleotide encodes a polypeptide comprising an amino acid sequence that has at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 4 and 6, wherein the polypeptide is capable of converting compound (1) to compound (2a) in enantiomeric excess and at a rate that is greater than 1.5 fold the rate of the monooxygenase of SEQ ID NO:2.
- the polynucleotide encodes a polypeptide capable of converting compound (1) to compound (2a) in enantiomeric excess and comprises an amino acid sequence that is at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a reference sequence based on SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, or 208, with the proviso that the amino acid sequence comprises any one of the set of residue differences as compared to SEQ ID NO:2 as present in the polypeptide sequences of SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 18
- the polynucleotide encodes a polypeptide comprising an amino acid sequence that has at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142,
- the polynucleotide encodes a polypeptide capable of converting compound (1) to compound (2b) in enantiomeric excess and comprises an amino acid sequence that is at least about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to a reference sequence selected from the group consisting of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132,
- the polynucleotides encoding the polypeptides are selected from the group consisting of SEQ ID NO: 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 119, 121, 123, 125, 126, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181,
- the polynucleotides are capable of hybridizing under highly stringent conditions to a polynucleotide comprising SEQ ID NO: 3, 5, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, or 207, or a complement thereof, where the highly stringently hybridizing polynucleotides encode a
- monooxygenase polypeptide capable of converting compound (1) to compound (2a) in enantiomeric excess at a rate greater than 1.5 fold the rate of the monooxygenase of SEQ ID NO:2.
- the polynucleotides are capable of hybridizing under highly stringent conditions to a polynucleotide selected from the group consisting of SEQ ID NO: 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 1 19, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 209, 21 1, 213, 215, 217, 219, 221, 223, 225, 227, 2
- the polynucleotides encode the polypeptides described herein but have about 80% or more sequence identity, about 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more sequence identity at the nucleotide level to a reference polynucleotide encoding the engineered monooxygenases described herein.
- the reference polynucleotide is selected from the group consisting of SEQ ID NO: 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 1 1 1, 1 13, 1 15, 1 17, 1 19, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193,
- An isolated polynucleotide encoding a polypeptide of the disclosure may be manipulated in a variety of ways to provide for expression of the polypeptide.
- the polypeptide may be manipulated in a variety of ways to provide for expression of the polypeptide.
- polynucleotides encoding the polypeptides can be provided as expression vectors where one or more control sequences is present to regulate the expression of the polynucleotides.
- Manipulation of the isolated polynucleotide prior to its insertion into a vector may be desirable or necessary depending on the expression vector.
- the techniques for modifying polynucleotides and nucleic acid sequences utilizing recombinant DNA methods are well known in the art. Guidance is provided in Sambrook et al., 2001, Molecular Cloning: A Laboratory Manual, 3rd Ed., Cold Spring Harbor Laboratory Press; and Current Protocols in Molecular Biology, Ausubel. F. ed., Greene Pub. Associates, 1998, updates to 2009, the disclosures of which are incorporated herein by reference.
- control sequences include among others, promoters, leader sequences, polyadenylation sequences, propeptide sequences, signal peptide sequences, and transcription terminators.
- Suitable promoters can be selected based on the host cells used.
- Exemplary bacterial promoters include E. coli lac operon, E. coli trp operon, bacteriophage ⁇ , Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis levansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene (amyL), beta-lactamase gene, and tac promoter.
- promoters for filamentous fungal host cells include promoters obtained from the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans acetamidase, and Fusarium oxysporum trypsin-like protease, and mutant, truncated, and hybrid promoters thereof.
- Exemplary yeast cell promoters can be from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GALl), Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP), and Saccharomyces cerevisiae 3-phosphoglycerate kinase.
- ENO-1 Saccharomyces cerevisiae enolase
- GALl Saccharomyces cerevisiae galactokinase
- ADH2/GAP Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase
- Saccharomyces cerevisiae 3-phosphoglycerate kinase Saccharomyces cerevisiae enolase
- GALl Saccharomyces cerevisi
- control sequence may also be a signal peptide coding region that codes for an amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded polypeptide into the cell's secretory pathway.
- the signal sequence typically depends on the type of host cell being used to express the polypeptide.
- Effective signal peptide coding regions for bacterial host cells are the signal peptide coding regions obtained from the genes for Bacillus NC1B 1 1837 maltogenic amylase, Bacillus stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus subtilis prsA.
- Exemplary signal peptide coding regions for filamentous fungal host cells can be the signal peptide coding regions obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger neutral amylase, Aspergillus niger glucoamylase,
- Rhizomucor miehei aspartic proteinase Rhizomucor miehei aspartic proteinase, Humicola insolens cellulase, and Humicola lanuginosa lipase.
- Useful signal peptides for yeast host cells can be from the genes for Saccharomyces cerevisiae alpha- factor and Saccharomyces cerevisiae invertase.
- Other control sequences, such as leader sequences, polyadenylation sequences, and transcription terminator sequences can use those available in the art (see Sambrook, supra, and Current Protocols in Molecular Biology, supra).
- the present disclosure is also directed to a recombinant expression vector comprising a polynucleotide encoding an engineered monooxygenase polypeptide or a variant thereof, and one or more expression regulating regions such as a promoter and a terminator, a replication origin, etc., depending on the type of hosts into which they are to be introduced.
- the recombinant expression vector may be any vector (e.g., a plasmid, cosmid, or virus), which can be conveniently subjected to recombinant DNA procedures and can bring about the expression of the polynucleotide sequence.
- the choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced.
- the vectors may be linear or closed circular plasmids.
- the expression vector may be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome.
- the vector may contain any means for assuring self-replication.
- the vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated.
- the expression vector preferably contains one or more selectable markers, which permit easy selection of transformed cells.
- a selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, resistance to chemical agents (e.g., antibiotics) and the like.
- the present disclosure provides a host cell comprising a polynucleotide encoding an engineered monooxygenase polypeptide of the present disclosure, the polynucleotide being operatively linked to one or more control sequences for expression of the monooxygenase polypeptide in the host cell.
- Host cells for use in expressing the monooxygenase polypeptides encoded by the expression vectors of the present invention are well known in the art and include but are not limited to, bacterial cells, such as E.
- coli Lactobacillus, Streptomyces and Salmonella typhimurium cells
- fungal cells such as yeast cells
- insect cells such as Drosophila S2 and Spodoptera Sf9 cells
- animal cells such as CHO, COS, BHK, 293, and Bowes melanoma cells
- plant cells Exemplary host cells are Escherichia coli BL21 and W3110.
- the monooxygenase polypeptides and nucleotides encoding such polypeptides can be prepared using methods commonly used by those skilled in the art.
- the naturally-occurring amino acid sequence and corresponding polynucleotide encoding the cyclohexanone monooxygenase enzyme of Acinetobacter sp NCIMB9871 (represented herein as SEQ ID NO:2) is described in Chen et al., 1988, J. Bacterid. 170 (2), 781-789 and Genbank Accession No. BAA86293.1 GL6277322.
- the parent polynucleotide sequence is codon optimized to enhance expression of the monooxygenase in a specified host cell.
- the engineered monooxygenases can be obtained by subjecting the polynucleotide encoding the naturally occurring cyclohexanone monooxygenase to mutagenesis and/or directed evolution methods (e.g., Stemmer, 1994, Proc Natl Acad Sci USA 91 : 10747- 10751 ; WO 95/22625; WO 97/0078; WO 97/35966; WO 98/27230; WO 00/42651 ; WO 01/75767 and U.S. Pat. 6,537,746; each of which is hereby incorporated by reference herein).
- mutagenesis and/or directed evolution methods e.g., Stemmer, 1994, Proc Natl Acad Sci USA 91 : 10747- 10751 ; WO 95/22625; WO 97/0078; WO 97/35966; WO 98/27230; WO 00/42651 ; WO 01/75767 and U.
- the clones obtained following mutagenesis treatment are screened for monooxygenases having a desired enzyme property.
- Measuring monooxygenase enzyme activity from the expression libraries can be performed using the standard techniques, such as separation of the product (e.g., by HPLC) and detection of the product by measuring UV absorbance of the separated substrate and products and/or by detection using tandem mass spectroscopy (e.g., MS/MS).
- Clones containing a polynucleotide encoding the desired polypeptides are then isolated, sequenced to identify the nucleotide sequence changes (if any), and used to express the enzyme in a host cell. Exemplary assays for monooxygenase activity are provided in Example 2.
- the polynucleotides encoding the enzyme can be prepared by standard solid-phase methods, according to known synthetic methods, e.g., the phosphoramidite method described by Beaucage et al., 1981, Tet Lett 22: 1859-69, or the method described by Matthes et al., 1984, EMBO J. 3:801-05.
- fragments of up to about 100 bases can be individually synthesized, then joined (e.g., by enzymatic or chemical litigation methods, or polymerase mediated methods) to form any desired continuous sequence.
- polypeptides can be expressed in appropriate cells, and recovered from the host cells and or the culture medium using any one or more of the well known techniques used for protein purification, including, among others, lysozyme treatment, sonication, filtration, salting-out, ultra- centrifugation, and chromatography. Chromatographic techniques for isolation of the
- monooxygenase polypeptide include, among others, reverse phase chromatography high performance liquid chromatography, ion exchange chromatography, gel electrophoresis, and affinity
- the engineered monooxygenases can be expressed as fusion proteins with purification tags, such as His-tags having affinity for metals, or antibody tags for binding to antibodies, e.g., myc epitope tag.
- polypeptide of the disclosure can be prepared in the form of crude extracts, lyophizates, powders, isolated preparations, and substantially pure preparations, as further described below.
- the monooxygenase polypeptides of the disclosure can be used in a process for conversion of various aryl-alkyl sulfide substrates to the corresponding sulfoxide. While the monooxygenase polypeptides herein are described with respect to the conversion of 5-methoxy-2- ((4-methoxy-3,5-dimethylpyridin-2-yl)methylthio)-lH-benzo[d]imidazole (compound (1)) to 5- methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH-benzo[d]imidazole (compound (2)), the engineered monooxygenase polypeptides can be applied to the conversion of other prazole compounds structurally similar to pyrmetazole. Accordingly in some embodiments, the
- monooxygenase polypeptides of the disclosure can be used in a process for the conversion of the compound of structural Formula (I) to the product of Formula (II):
- Ar 1 is an optionally substituted aryl or heteroaryl ring
- R is ⁇ , a lower alkyl, a heteroalkyl, or forms a 5 to 8 membered cycloalkyl, heteroalkyl, aryl or heteroaryl fused ring with a ring carbon of Ar 1
- Ar 2 is an optionally substituted cycloalkyl, heterocycloalkyl, aryl, or heteroaryl ring fused to the imidazole ring.
- Ar 1 is an optionally substituted phenyl or pyridyl.
- Ar 1 can have 1 to 5 substitutions, preferably 1, 2, or 3 substitutions. Substitutions in Ar 1 can be a substituted or unsubstituted: lower alkyl, lower alkoxy, amino or alkylamino group.
- the substituted alkyl is a haloalkyl.
- Ar 2 is selected from an optionally substituted thienyl, phenyl or pyridyl.
- Ar 2 can have 1 to 5 substitutions, preferably 1, 2 or 3 substitutions.
- Substitutions in Ar 1 can be a substituted or unsubstituted: lower alkyl, lower alkoxy, or a 5 to 7 membered heterocycloalkyl, aryl or heteroaryl ring.
- Prazole compounds that can be prepared by use of the monooxygenases described herein include, by way of example and not limitation, those shown in Table 4 below.
- the process can comprise contacting or incubating the compound of formula (I) above with an engineered CHMO polypeptide described herein in presence of an electron donor under suitable reaction conditions to convert the compound of formula (I) to the product compound of formula (II).
- suitable reaction conditions include a source of molecular oxygen (O 2 ), and the electron donor can be cofactor NADPH or NADH.
- the O 2 can be O 2 dissolved in a reaction solution. The enantioselectivity of the process can be determined by measuring the amount of (R) and (S) products formed in the reaction.
- Exemplary polypeptides for use in the process can be a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188,
- the engineered CHMO polypeptide useful in the process can comprise an amino acid sequence at least 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more identical to a reference sequence selected from the group consisting of SEQ ID NO: 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146,
- the monooxygenase polypeptides of the disclosure can be used in a process for the conversion of 5-methoxy-2-((4-methoxy-3,5- dimethylpyridin-2-yl)methylthio)-lH-benzo[d]imidazole ("pyrmetazole” or “compound (1)”) to 5- methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)-lH-benzo[d]imidazole ("R- and S- omeprazole” or "compound (2)”).
- the process comprises contacting or incubating compound (1) with a polypeptide described herein in presence of a an electron donor, e.g., a cofactor, under suitable reaction conditions to convert compound (1) to compound (2).
- a an electron donor e.g., a cofactor
- the monooxygenase polypeptides can be used in the conversion of compound (1) to compound (2a) in enantiomeric excess.
- the process comprises contacting or incubating compound (1) with a R-enantioselective polypeptide described herein in presence of an electron donor, e.g., a cofactor, under suitable reaction conditions to convert the compound (1) to compound (2a) in enantiomeric excess.
- an electron donor e.g., a cofactor
- Exemplary polypeptides useful for the process can comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, and 208, or an engineered CHMO polypeptide that comprises an amino acid sequence having the set of amino acid differences relative to SEQ ID NO: 2 that are present in any one of SEQ ID NO: 4, 6, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, or 208.
- the monooxygenase polypeptides can be used in the conversion of compound (1) to compound (2b) in enantiomeric excess.
- the process comprises contacting or incubating the compound (1) with a S-enantioselective polypeptide described herein in presence of an electron donor, e.g., a cofactor, under suitable reaction conditions to convert the compound (1) to compound (2b) in enantiomeric excess.
- an electron donor e.g., a cofactor
- Exemplary polypeptides useful for the process can comprise an amino acid sequence selected from SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 116, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254,
- the process comprises contacting or incubating the compound (1) with a S-enantioselective polypeptide described herein in presence of an electron donor, e.g., a cofactor, under suitable reaction conditions to convert the compound (1) to compound (2b) in at least 90% enantiomeric excess.
- an electron donor e.g., a cofactor
- Exemplary polypeptides useful for the process can comprise an amino acid sequence selected from SEQ ID NO: 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 242, 244, 248, 250, 254, 256, 258, 262, and 264, or an engineered CHMO polypeptide that comprises an amino acid sequence having the set of amino acid differences relative to SEQ ID NO: 2 that are present in any one of
- the process comprises contacting or incubating the compound (1) with a polypeptide described herein in presence of an electron donor, e.g., a cofactor, under suitable reaction conditions to convert the compound (1) to compound (2b) in at least 99% enantiomeric excess.
- an electron donor e.g., a cofactor
- Exemplary polypeptides useful for this process can comprise an amino acid sequence selected from SEQ ID NO: 16, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 16, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, and 158, or an engineered CHMO polypeptide that comprises an amino acid sequence having the set of amino acid differences relative to SEQ ID NO: 2 that are present in any one of SEQ ID NO: 16, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42,
- the monooxygenase polypeptides can be used in the preparation of an omeprazole analog compound of structural formula (II) in enantiomeric excess, wherein the compound of structural formula (II) is selected from: (R) or (5)-lansoprazole, (R) or (S)-tenatoprazole, (R) or (S)-rabeprazole, (R) or (S)-pantoprazole, (R) or (S)-ilaprazole, (R) or (S)-leminoprazole, (R) or (S)-saviprazole, and (R) or (S)-TY- 1 1345.
- the compound of structural formula (II) is selected from: (R) or (5)-lansoprazole, (R) or (S)-tenatoprazole, (R) or (S)-rabeprazole, (R) or (S)-pantoprazole, (R) or (S)-ilapra
- process comprises contacting or incubating a sulfide precursor of structural formula (I) for the omeprazole analog compound (i.e., the corresponding pyrmetazole substrate analog compound) with a polypeptide described herein in presence of an electron donor (e.g., a cofactor) under suitable reaction conditions to convert the sulfide precursor compound of formula (I) to the omeprazole analog compound of structural formula (II).
- a sulfide precursor of structural formula (I) for the omeprazole analog compound i.e., the corresponding pyrmetazole substrate analog compound
- an electron donor e.g., a cofactor
- this process for preparing an omeprazole analog compound can be carried out wherein the sulfide precursor compound of formula (I) is 5-(difluoromethoxy)-2-((3,4- dimethoxypyridin-2-yl)methylthio)- lH-benzo[d]imidazole and the compound of formula (II) is (S)- pantoprazole which is produced in enantiomeric excess.
- the process can be carried out wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 70, 72, 76, 78, 80, 82, 84, 86, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 126, 128, 130, 228, 232, 244, 248, 256, 258, 260, 262, and 264, or in which the polypeptide comprises an amino acid sequence having the set of amino acid differences relative to SEQ ID NO: 2 that are present in any one of ID NO: 70, 72, 76, 78, 80, 82, 84, 86, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 126, 128, 130, 228, 232, 244, 248, 256, 258, 260, 262, and 264,
- this process for preparing an omeprazole analog compound can be carried out wherein the sulfide precursor compound of formula (I) is 5-methoxy-2-((4-methoxy-3,5- dimethylpyridin-2-yl)methylthio)- lH-imidazo[4,5-b]pyridine and the compound of formula (II) is (S)-tenatoprazole, which is produced in enantiomeric excess.
- the process can be carried out wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 66, 102, 104, 1 14, 122, 124, 128, 232, 238, 244, 260, 262, and 264, or in which the polypeptide comprises an amino acid sequence having the set of amino acid differences relative to SEQ ID NO: 2 that are present in any one of SEQ ID NO: 66, 102, 104, 1 14, 122, 124, 128, 232, 238, 244, 260, 262, or 264.
- this process for preparing an omeprazole analog compound can be carried out wherein the sulfide precursor compound of formula (I) is 2-((4-(3-methoxypropoxy)-3- methylpyridin-2-yl)methylthio)- lH-benzo[d]imidazole and the compound of formula (II) is (S)- rabeprazole, which is produced in enantiomeric excess.
- the process can be carried out wherein the polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 62, 76, 84, 86, 124, 238, 240, 250, 258, 262, and 264, or in which the polypeptide comprises an amino acid sequence having the set of amino acid differences relative to SEQ ID NO: 2 that are present in any one of SEQ ID NO: 62, 76, 84, 86, 124, 238, 240, 250, 258, 262, or 264.
- this process for preparing an omeprazole analog compound can be carried out wherein the sulfide precursor compound of formula (I) is 2-((3-methyl-4-(2,2,2- trifluoroethoxy)pyridin-2-yl)methylthio)- lH-benzo[d]imidazole and the compound of formula (II) is (R)-lansoprazole which is produced in enantiomeric excess.
- the process can be carried out wherein the polypeptide comprises an amino acid sequence of SEQ ID NO: 6, or in which the polypeptide comprises an amino acid sequence having the set of amino acid differences relative to SEQ ID NO: 2 that are present in SEQ ID NO: 6.
- the process is carried out at a reaction condition temperature of 10°C to 50°C, and in particular 25°C to 40°C.
- the temperature can be chosen to maximize the reaction rate at higher temperatures while maintaining the activity of the enzyme for sufficient duration for efficient conversion of the substrate to the product.
- polypeptides with increased thermostability can be selected to carry out the process.
- the reaction condition comprises a pH of about 8.5 to a pH of about 10.
- the reaction condition is a pH of about 8.5 to about 9. This slightly basic condition limits the degradation of the product compound (2) that occurs below about pH 8.5.
- the pH of the reaction mixture may change.
- the pH of the reaction mixture may be maintained at a desired pH or within a desired pH range by the addition of an acid or a base during the course of the reaction.
- the pH may be controlled by using an aqueous solvent that comprises a buffer.
- Suitable buffers to maintain desired pH ranges are known in the art and include, for example, phosphate buffer, triethanolamine buffer, and the like. Combinations of buffering and acid or base addition may also be used.
- the reaction condition comprises a phosphate buffer concentration of about 5 to 50 mM. At the lower phosphate buffer concentrations, NADPH is shown to have greater stability.
- the process described herein consumes molecular oxygen, i.e., 0 2 , where an oxygen atom is transferred to a sulfide to yield the sulfoxide.
- the 0 2 is dissolved in the reaction solution. Dissolved 0 2 can be increased by direct sparging of O 2 gas into the reaction solution (e.g., US Patent No. 6,478,964), and/or by increasing the partial pressure of O 2 to pressures higher than the atmospheric pressure.
- oxygenation of the reaction solution is done by a bubble-free process. For example, oxygen mass transfer across PTFE membrane for bubble free aeration is described in Schneider et al., 1995, Enzyme and Microbial Technology 17(9):839-847 and EP0172478, incorporated herein by reference.
- a cofactor is used in the reduction reaction.
- the cofactor operates in combination with the polypeptides of the disclosure in the process.
- Suitable cofactors include, but are not limited to, NADP + (nicotinamide adenine dinucleotide phosphate), NADPH (the reduced form of NADP + ), NAD + (nicotinamide adenine dinucleotide) and NADH (the reduced form of NAD + ).
- the reduced form of the cofactor is added to the reaction mixture.
- the reduced NAD(P)H form can be optionally regenerated from the oxidized NAD(P) + form using a cofactor regeneration system.
- a cofactor recycling system is used to regenerate cofactor
- NADPH/NADH form NADP+/NAD+ produced in the reaction.
- an optional cofactor recycling system can be used to regenerate cofactor NADPH/NADH form NADP+/NAD+ produced in the reaction.
- a cofactor regeneration system refers to a set of reactants that participate in a reaction that reduces the oxidized form of the cofactor (e.g., NADP + to NADPH). Cofactors oxidized by the polypeptide reduction of the keto substrate are regenerated in reduced form by the cofactor regeneration system.
- Cofactor regeneration systems comprise a stoichiometric reductant that is a source of reducing hydrogen equivalents and is capable of reducing the oxidized form of the cofactor.
- the cofactor regeneration system may further comprise a catalyst, for example an enzyme catalyst that catalyzes the reduction of the oxidized form of the cofactor by the reductant.
- a catalyst for example an enzyme catalyst that catalyzes the reduction of the oxidized form of the cofactor by the reductant.
- Suitable exemplary cofactor regeneration systems include, but are not limited to, glucose and glucose dehydrogenase; formate and formate dehydrogenase; glucoses- phosphate and glucose-6-phosphate dehydrogenase; an alcohol (e.g., isopropanol) and a
- ketoreductase/alcohol dehydrogenase phosphite and phosphite dehydrogenase, molecular hydrogen and hydrogenase; and the like. These systems may be used in combination with either
- NADPVNADPH or NAD + /NADH as the cofactor.
- Electrochemical regeneration using hydrogenase may also be used as a cofactor regeneration system. See, e.g., U.S. Pat. Nos. 5,538,867 and
- Chemical cofactor regeneration systems comprising a metal catalyst and a reducing agent (for example, molecular hydrogen or formate) are also suitable. See, e.g., PCT publication WO 2000/053731, which is incorporated herein by reference.
- the cofactor recycling system can comprise glucose dehydrogenase (GDH), which is a NAD + or NADP + -dependent enzyme that catalyzes the conversion of D-glucose and NAD + or NADP + to gluconic acid and NADH or NADPH, respectively.
- GDH glucose dehydrogenase
- Glucose dehydrogenases suitable for use in the practice of the processes described herein include both naturally occurring glucose dehydrogenases, as well as non-naturally occurring glucose dehydrogenases.
- Naturally occurring glucose dehydrogenase encoding genes have been reported in the literature, e.g., the Bacillus subtilis 61297 GDH gene, B. cereus ATCC 14579 and B. megaterium.
- Non-naturally occurring glucose dehydrogenases generated using, for example, mutagenesis, directed evolution, and the like are provided in PCT publication WO 2005/018579, and US publication Nos. 2005/0095619 and 2005/0153417. All of these sequences are incorporated herein by reference.
- the co-factor regenerating system can comprise a formate
- dehydrogenase which is a NAD + or NADP + -dependent enzyme that catalyzes the conversion of formate and NAD + or NADP + to carbon dioxide and NADH or NADPH, respectively.
- Formate dehydrogenases that are suitable for use as cofactor regenerating systems in the monooxygenase reactions described herein include naturally occurring and non-naturally occurring formate dehydrogenases. Suitable formate dehydrogenases are described in PCT publication WO
- Formate may be provided in the form of a salt, typically an alkali or ammonium salt (for example, HC0 2 Na, KHC0 2 NH 4 , and the like), in the form of formic acid, typically aqueous formic acid, or mixtures thereof.
- a base or buffer may be used to provide the desired pH.
- the co-factor regenerating system can comprise a phosphite dehydrogenase, which catalyzes the conversion of phosphite and NAD" or NADP" to a phosphate and NADH or NADPH, respectively.
- Phosphite dehydrogenases that are suitable for use as cofactor regenerating systems in the processes described herein include naturally occurring and non-naturally occurring phosphite dehydrogenases.
- Naturally occurring phosphite dehydrogenases include those from, Pseudomonas stutzeri and Alcaligenes faecalis, and non-naturally occurring phosphite dehydrogenases include engineered phosphite dehydrogenases derived therefrom.
- Phosphite dehydrogenases are described in Johannes et al., 2005, Applied and Environmental Microbiology 71(10):5728-5734; Woodyer et al., 2003, Biochemistry 42 (40): 1 1604-1 1614; Vrtis et al., 2002, Angewandte Chemie 41(17):3257-3259; Johannes et al., 2006, Biotechnology and Bioengineering Volume 96(1): 18-26; and McLachlan et al., 2008, Biotechnology and Bioengineering 99(2):268-274.
- the co-factor regenerating system can comprise an alcohol dehydrogenase or ketoreductase, which is an NAD + or NADP + -dependent enzyme that catalyzes the conversion of an alcohol and NAD + or NADP + to an aldehyde or ketone and NADH or NADPH, respectively.
- Alcohol dehydrogenases and ketoreductases that are suitable for use as cofactor regenerating systems in the processes described herein include naturally occurring and non-naturally occurring alcohol dehydrogenases and ketoreductases.
- Naturally occurring alcohol dehydrogenases include known alcohol dehydrogenase/ketoreductase from, among others, Thermoanerobium brockii, Rhodococcus erythropolis, Saccharomyces cerevisiae, Lactobacillus kefiri, Lactobacillus minor, and Lactobacillus brevis, and non-naturally occurring alcohol dehydrogenase/ketoreductase include engineered alcohol dehydrogenase/ketoreductase derived therefrom.
- non- naturally occurring alcohol dehydrogenase/ketoreductases engineered for thermo- and solvent stability can be used.
- Suitable alcohols include lower secondary alkanols and aryl-alkyl carbinols.
- Examples of lower secondary alcohols include isopropanol, 2-butanol, 3-methyl-2-butanol, 2-pentanol, 3-pentanol, 3,3-dimethyl-2-butanol, and the like.
- the secondary alcohol is isopropanol.
- Suitable aryl-alkyl carbinols include unsubstituted and substituted 1-arylethanols.
- the volatile product can be removed by sparging the reaction solution with a non-reactive gas or by applying a vacuum to lower the reaction pressure and removing the volatile present in the gas phase.
- a non-reactive gas is any gas that does not react with the reaction components.
- Various non-reactive gases include nitrogen and noble gases (e.g., inert gases).
- the non-reactive gas is nitrogen gas.
- acetone formed by oxidation of isopropanol can be removed by sparging with nitrogen gas or applying a vacuum to the reaction solution and removing the acetone from the gas phase by an acetone trap, such as a condenser or other cold trap.
- an acetone trap such as a condenser or other cold trap.
- the polypeptides for carrying out the conversion of pyrmetazole to esomeprazole and any enzymes comprising the optional cofactor regeneration system may be added to the reaction mixture in the form of the purified enzymes, whole cells transformed with gene(s) encoding the enzymes, and/or cell extracts and/or lysates of such cells.
- the gene(s) encoding the polypeptides disclosed herein and the optional cofactor regeneration enzymes can be transformed into host cells separately or together into the same host cell.
- Whole cells transformed with gene(s) encoding the engineered ketoreductase enzyme and/or the optional cofactor regeneration enzymes, or cell extracts and/or lysates thereof may be employed in a variety of different forms, including solid (e.g., lyophilized, spray-dried, and the like) or semisolid (e.g., a crude paste).
- aqueous solvent e.g., water, buffer, and salts.
- aqueous solvents including water and/or co-solvent systems, are used.
- Co-solvents can reduce the formation of aggregates which can affect the rate and scalability of the process. At substrate loading of 75 g/L or higher, the use of a co-solvent is desirable.
- Suitable co-solvents include: MeOH, EtOH, isopropanol (IP A), acetone, toluene, MeCN, methyl tert-butyl ether (MTBE), N-methyl-2-pyrrolidone (NMP), dimethylacetamide (DMAc), dimethylformamide (DMF), propylene glycol, polyethylene glycol (PEG), tetramethylurea, N- ethylpyrollidinone, tetraglyme, l,3-Dimethyl-3,4,5,6-tetrahydro-2(lH)-pyrimidinone (DMPU), DMIU, hexamethylphosphoramide ( ⁇ ) and dimethylsulfoxide (DMSO).
- Choice of co-solvent can be based on evaluating a combination of factors including: compound solubility, compound stability, reaction/process safety, toxicity, allowable level of solvent in the product (e.g., an API product); the effectiveness of the co-solvent in preventing agglomeration of the product, and stability of the monooxygenase to the co-solvent.
- NMP is a particularly suitable co-solvent for reactions with substrate loading of 100 g/L.
- reactants are not critical.
- the reactants may be added together at the same time to a solvent (e.g., monophasic solvent, biphasic aqueous co-solvent system, and the like), or alternatively, some of the reactants may be added separately, and some together at different time points.
- a solvent e.g., monophasic solvent, biphasic aqueous co-solvent system, and the like
- reaction is generally allowed to proceed until essentially complete, or near complete, conversion of compound (1) with a polypeptide described herein under suitable reaction conditions to convert the compound (1) to compound (2b) is obtained.
- Conversion of substrate to product can be monitored using known methods by detecting substrate and/or product. Suitable methods include gas chromatography, HPLC, and the like.
- Example 1 Wild-type cyclohexanone monooxygenase (CHMO) gene acquisition
- CHMO cyclohexanone monooxygenase
- Acinetobacter sp NCIMB9871 (SEQ ID NO: 2) was designed for expression in E. coli using standard codon optimization to yield the nucleotide sequence of SEQ ID NO: 1.
- Standard codon-optimization software is reviewed in, e.g.,”OPTIMIZER: a web server for optimizing the codon usage of DNA sequences," Puigbo et al., Nucleic Acids Res. 2007 Jul; 35(Web Server issue): W126-31. Epub 2007
- the optimized gene was synthesized using oligonucleotides composed of 42 nucleotides and cloned into expression vector pCKl 10900 (which is depicted as FIG. 3 in US Patent Application
- the pCKl 10900 expression vector also contained the P15a origin of replication and the chloramphenicol resistance gene.
- the resulting plasmid was transformed into E. coli W31 10 using standard methods.
- This example illustrates a high throughput spectrophotometric assay in 96-well plate format that is used to assay relative activity of engineered CHMO polypeptides (as in Table 2A). This assay can also be used for real-time monitoring of bioprocesses using these polypeptides.
- the substrate, pyrmetazole (6 g/L), ketoreductase of SEQ ID NO: 268 (cofactor recycling enzyme) (1 g/L) and cofactor (0.1 g/L) were dissolved in 300 ⁇ volume of buffer (pH 9.0) with 8% isopropanol (IP A), 2% acetone and 15% N-methyl-2-pyrrolidone (NMP) in the wells of a 96-well plate.
- IP A isopropanol
- NMP N-methyl-2-pyrrolidone
- Cell lysate from a directed evolution sample or bioprocess sample containing the engineered CHMO polypeptide (3.3% of total reaction volume) was then added to the reaction mixture. Plates also contained negative control wells (empty vector) which provide background reaction levels.
- the reaction was shaken at ambient temperature for 24 h (or 0.5 h can be used for real-time monitoring of bioprocess) then quenched by dilution with 3.33 volumes of acetonitrile and mixed thoroughly to give a total 4.3-fold dilution.
- the quenched mixture is centrifuged at 4000 rpm for 10 min.
- a sample of the quenched mixture was then transferred to an empty 96 well plate.
- the plate was analyzed with chromatographic method by high performance liquid chromatography (HPLC) at 250 nm.
- HPLC high performance liquid chromatography
- the substrate, pyrmetazole (6g/L), ketoreductase (KRED) of SEQ ID NO: 268 (lg/L) and NADP + (O. lg/L) were dissolved in 300 volume of buffer (pH 9.0) with 8% IPA and 2% acetone in the wells of a 96-well plate.
- the reaction was shaken at 32°C for 24 hrs, then quenched by dilution with 3.3 volumes of acetonitrile and mixed thoroughly to give a total 4.3-fold dilution.
- the quenched mixture was centrifuged at 4000 rpm for 10 min. A sample of the quenched mixture was then transferred to an empty 96 well plate. The plate was analyzed with chromatographic method by HPLC at 250 nm.
- the substrate, pyrmetazole (2g/L), KRED (lg/L) and NADP + (O. lg/L) were dissolved in 300 ⁇ . volume of buffer (pH 9.0) with 8% IPA and 2% acetone in the wells of a 96-well plate.
- Cell lysate from a directed evolution or bioprocess sample containing the engineered CHMO polypeptide (60% of total reaction volume) was then added to the reaction mixture. Plates also contained negative controls (empty vector) which provided the background reaction levels.
- Shake flask powders include approximately 30% total protein and accordingly provide a more purified preparation of an engineered enzyme as compared to the cell lysate used in the high throughput assay of Example 2.
- a single microbial colony of E. coli containing a plasmid encoding an engineered CHMO gene of interest was inoculated into 50 mL Luria Bertani broth containing 30 ⁇ g/ml chloramphenicol and 1% glucose. Cells were grown overnight (at least 16 hours) in an incubator at 30°C with shaking at 250 rpm.
- the culture was diluted into 250 mL Terrific Broth (12 g/L bacto-tryptone, 24 g/L yeast extract, 4 mL/L glycerol, 65 mM potassium phosphate, pH 7.0, 1 mM MgSO/t) containing 30 ⁇ g/mL chloramphenicol, in a 1 L flask to an optical density of 0.2 at 600 nm (OD 6 oo)and allowed to grow at 30°C.
- CHMO gene was induced by addition of isopropyl- ⁇ -D-thiogalactoside (IPTG) to a final concentration of 1 mM when the OD 6 oo of the culture is 0.6 to 0.8, and incubation was then continued overnight (at least 16 hours).
- IPTG isopropyl- ⁇ -D-thiogalactoside
- Assays of activity, sulfone-byproduct production, and thermal stability were carried out as describe as in Example 2 except that the following reagents/conditions were used: 1-10 g/L shake flask powder of monooxygenase enzyme; 10-100 g/L pyrmetazole substrate; 1 g/L KRED coenzyme; 0.2-0.5 g/L NADP+ cofactor; 4% IPA; 96% phosphate buffer (pH 9); reaction temperature: room temperature; reaction time: 24 hr.
- Assay results for shake flask powder preparations of exemplary engineered CHMO polypeptides are shown below in Table 6.
- DSP powders include approximately 80% total protein and accordingly provide a more purified preparation of an engineered enzyme as compared to the cell lysate used in the high throughput assay of Example 2.
- the fermentor was inoculated with a late exponential culture of E. coli W31 10 containing the plasmid encoding the engineered CHMO gene of interest (grown in a shake flask as described in Example 3) to a starting OD 6 oo of 0.5 to 2.0.
- the fermentor was agitated at 500-1500 rpm with air supplied to the fermentation vessel at 2.0 -30 L/min to maintain a dissolved oxygen level of 55 % saturation or greater.
- the pH of the culture was maintained at 7.0 by addition of 28% v/v ammonium hydroxide.
- the cell pellet was resuspended in 2 volumes of 25 mM sodium phosphate buffer, pH 7.0 at 4°C to each volume of wet cell paste. By using IN sodium hydroxide pH has to be adjusted to 7.0 before lysis.
- Intracellular CHMO polypeptide was released from the cells by passing the suspension through a homogenizer fitted with a two-stage homogenizing valve assembly using a pressure of 12000 psig. The cell homogenate was collected in chilled container which was kept in ice bath immediately after disruption.
- Lysate pH has to be adjusted to 7.0 by using IN sodium hydroxide
- the resulting suspension was clarified by centrifugation at 6600 G in a standard laboratory centrifuge for 60 minutes at 4-8°C.
- the clear decanted supernatant was cooled to 4-8°C and pH adjusted to 7.0 by using IN NaOH then concentrated ten- fold using a cellulose ultrafiltration membrane with a molecular weight cut off of 30 KDa at a temperature of 10°C.
- This example illustrates processes at two substrate loadings and a product workup procedure for preparing esomeprazole ((S)-5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylsulfinyl)- lH-benzo[d]imidazole or "compound (2b)”) in enantiomeric excess by contacting the pyrmetazole substrate compound (1) with an engineered CHMO polypeptide of the disclosure (e.g., the polypeptides of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78.
- an engineered CHMO polypeptide of the disclosure e.g., the polypeptides of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38,
- the reaction mixture was a slurry that was stirred at 250 rpm for 24 h at room temperature ( ⁇ 25°C) in an oxygen atmosphere.
- the conversion of substrate to product was monitored periodically by HPLC.
- 23% of pyrmetazole substrate was converted to esomeprazole.
- Further addition of 150 mg (2 g/L) engineered CHMO polypeptide, 75 mg (1 g/L) KRED and 15 mg (0.2 g/L) NADP as solids 5 times over the course of a full reaction time of 69 h provides a final substrate conversion of 99.6%.
- Example 6 Process for Production of Esomeprazole at 30 g Scale Using an
- This example illustrates a process for preparing esomeprazole in enantiomeric excess at a 30 g scale via a biocatalytic conversion of the substrate pyrmetazole using an engineered CHMO polypeptide of the disclosure (e.g., a polypeptide of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78. 80.
- an engineered CHMO polypeptide of the disclosure e.g., a polypeptide of SEQ ID NO: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78. 80.
- A. Biocatalvtic reaction protocol A I L multi-neck jacketed reactor vessel equipped with baffle and overhead stirrer, fitted with an anchor shaped agitator was charged with a pre-mixed powder of 30 g pyrmetazole (from Sinojie (HK) Ltd.) and 1.5 g of an esomeprazole "seed" (prepared in a previous enzymatic reaction), and then 517 mL of 0.05 M potassium phosphate at pH 9.0 ("buffer solution”). This reaction mixture was stirred at 150 rpm at 25°C for 10 min in order to obtain a well- suspended slurry. A three way tap was attached to the necks of the vessel.
- the tap was fitted with 2 rubber balloons filled with oxygen and a vacuum line for conducting degassing steps prior to the start of the reaction. Vacuum was applied to the vessel (30 mbar, obtained within 5 min) and the evacuated flask was filled with oxygen via the three way tap. This evacuation-gas-filling cycle was repeated two more times.
- the reaction vessel under positive oxygen pressure then was charged sequentially with the following: 24 mL IPA (HPLC grade); 60 mg NADP in 4 mL buffer solution (pH 9.0); 300 mg ketoreductase of SEQ ID NO: 268 in 15 mL buffer solution (pH 9.0); 600 mg CHMO of SEQ ID NO: 158 in 40 mL buffer solution (pH 9.0); and 1.2 mL of catalase (Aspergillus niger catalase solution in buffer stabilized with sodium chloride and sorbitol with stated activity of 25000 ClU/g; available from Sigma-Aldrich). Catalase is added to neutralize peroxide that may form during the reaction and negatively affect the performance of the CHMO.
- Catalase is added to neutralize peroxide that may form during the reaction and negatively affect the performance of the CHMO.
- the aqueous phase was transferred back to the heated vessel and 45 mL of MIBK was added. Stirring at 300 rpm for 30 min and phase separation within 20 min afforded a lower slightly turbid, yellowish aqueous layer and a brownish upper organic phase. The aqueous layer was drained and discarded. The organic layer was drained and collected. The solution was submitted to warm Celite filtration after the first filtration was completed (same filter and Celite layer). The organic phases were combined and separated from the aqueous layer that was formed during filtration. The aqueous layer was discarded and the organic phase was transferred back to the vessel. The temperature was adjusted to 15°C and the solution stirred at 150 rpm for 1 hour.
- This example illustrates the use of exemplary engineered CHMO polypeptides disclosed in Tables 2A and 2B for the conversion of sulfide substrate that are structurally similar to pyrmetazole to their corresponding (R)- and/or (S)-prazole compounds that are structural analogs to (R)- and (S)- omeprazole.
- engineered CHMO polypeptides are screened for activity and identified that are useful for the preparation of the prazole compounds (S)-pantoprazole, (SMenatoprazole, (S)- rabeprazole, and (R)- and ( ⁇ -lansoprazole, as shown in Schemes 4, 5, 6, and 7 below.
- a cell-lysis stock solution (1 mg/mL lysozyme, 0.5 mg/mL PMBS, 100 mM phosphate buffer, pH 10) was added 600 ⁇ . per well to the wells of a deep-well plate each containing a cell pellet from E.coli expressing the engineered CHMO of interest.
- the plate was shaken vigorously with high speed for 1-5 min to disperse the cell pellet then followed by 1.5-2 hours of slow shaking at room temperature.
- the plate was then centrifuged at 4000 rpm for 20 minutes at 4 °C.
- the engineered CHMO polypeptides of SEQ ID NO: 236 and 240 were capable of converting the precursor sulfide substrate to 20% and 27% to the opposite enantiomer, (7?)-pantoprazole, in high enantiomeric excesses of 86 and 89%, respectively.
- This example illustrates the use of engineered CHMO polypeptides of the present disclosure for carrying out the biocatalytic conversion of the sulfide precursor substrate (and pyrmetazole analog), 5-methoxy-2-((4-methoxy-3,5-dimethylpyridin-2-yl)methylthio)-lH-imidazo[4,5-b]pyridine, to the prazole product compound SMenatoprazole in enantiomeric excess, as shown in Scheme 5.
- a cell-lysis stock solution (1 mg/mL lysozyme, 0.5 mg/mL PMBS, 100 mM phosphate buffer, pH 10) was added 600 ⁇ ⁇ per well to the wells of a deep-well plate each containing a cell pellet from E.coli expressing a CHMO of interest.
- the plate was shaken vigorously with high speed for 1-5 min to disperse the cell pellet then followed by 1.5-2 hours of slow shaking at room temperature.
- the plate was then centrifuged at 4000 rpm for 20 minutes at 4 °C.
- results As shown below in Table 16, the following engineered CHMO polypeptides were capable of converting the precursor sulfide substrate to between about 10 and 60 percent of the (S)- tenatoprazole product in 100%ee: 66, 102, 104, 1 14, 122, 124, 128, 232, 238, 244, 260, 262, and 264. This represents between about 10-fold and 60-fold improvement in percent conversion of this substrate to this product relative to the wild-type CHMO of SEQ ID NO: 2.
- This example illustrates the use of engineered CHMO polypeptides of the present disclosure for carrying out the biocatalytic conversion of the sulfide precursor substrate (and pyrmetazole analog), 2-((4-(3-methoxypropoxy)-3-methylpyridin-2-yl)methylthio)- lH-benzo[d]imidazole, to the prazole compound (3 ⁇ 4 ) -rabeprazole in enantiomeric excess, as shown in Scheme 6.
- a cell-lysis stock solution (1 mg/mL lysozyme, 0.5 mg/mL PMBS, 100 mM phosphate buffer, pH 10) was added 600 ⁇ ⁇ per well to the wells of a deep-well plate each containing a cell pellet from E.coli expressing the engineered CHMO of interest.
- the plate was shaken vigorously with high speed for 1-5 min to disperse the cell pellet then followed by 1.5-2 hours of slow shaking at room temperature. The plate was then centrifuged at 4000 rpm for 20 minutes at 4 °C.
- This example illustrates the use of engineered CHMO polypeptides of the present disclosure for carrying out the biocatalytic conversion of the sulfide precursor substrate (and pyrmetazole analog), 2-((3-methyl-4-(2,2,2-trifluoroethoxy)pyridin-2-yl)methylthio)- lH-benzo[d]imidazole, to either of the prazole compounds ( ⁇ -lansoprazole or ( ⁇ -lansoprazole in enantiomeric excess, as shown in Scheme 7.
- the engineered CHMO polypeptides of SEQ ID NO: 22, 72, and 80 are capable of converting the lansoprazole sulfide precursor substrate to ( ⁇ -lansoprazole in enantiomeric excess.
- the polypeptides of SEQ ID NO: 72 and 80 are capable of 27% and 19% conversion with enantioselectivity of about 72% ee and 92% ee, respectively.
- the engineered CHMO polypeptide of SEQ ID NO: 6 showed only 1% conversion it was confirmed to be selective for the ( ⁇ -lansoprazole product.
- the polypeptide of SEQ ID NO: 6 was also found to convert pyrmetazole to favor the ( ⁇ -omeprazole product over the (3 ⁇ 4 ) -omeprazole product in 98.9 % enantiomeric excess (see Table 2A). It is reasonable to expect that further directed evolution of the engineered polypeptide of SEQ ID NO: 6 will result in an engineered CHMO polypeptide capable of producing the ( ⁇ -lansoprazole product in high enantiomeric excess (e.g., 98% or greater) and much higher % conversion (e.g., 20% or greater).
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Enzymes And Modification Thereof (AREA)
- Peptides Or Proteins (AREA)
- Heterocyclic Carbon Compounds Containing A Hetero Ring Having Nitrogen And Oxygen As The Only Ring Hetero Atoms (AREA)
Abstract
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10836590.9A EP2510089B1 (fr) | 2009-12-08 | 2010-12-08 | Synthèse de composés prazoles |
US13/514,750 US8895271B2 (en) | 2009-12-08 | 2010-12-08 | Synthesis of prazole compounds |
CN201080054980.3A CN102884178B (zh) | 2009-12-08 | 2010-12-08 | 拉唑化合物的合成 |
SG2012041521A SG181535A1 (en) | 2009-12-08 | 2010-12-08 | Synthesis of prazole compounds |
SI201031063T SI2510089T1 (sl) | 2009-12-08 | 2010-12-08 | Sinteza prazolnih spojin |
US14/528,708 US9228216B2 (en) | 2009-12-08 | 2014-10-30 | Synthesis of prazole compounds |
US14/952,119 US9422534B2 (en) | 2009-12-08 | 2015-11-25 | Synthesis of prazole compounds |
US15/211,329 US9631181B2 (en) | 2009-12-08 | 2016-07-15 | Synthesis of prazole compounds |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US26781209P | 2009-12-08 | 2009-12-08 | |
US61/267,812 | 2009-12-08 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/514,750 A-371-Of-International US8895271B2 (en) | 2009-12-08 | 2010-12-08 | Synthesis of prazole compounds |
US14/528,708 Division US9228216B2 (en) | 2009-12-08 | 2014-10-30 | Synthesis of prazole compounds |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2011071982A2 true WO2011071982A2 (fr) | 2011-06-16 |
WO2011071982A3 WO2011071982A3 (fr) | 2011-12-22 |
Family
ID=44146157
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2010/059398 WO2011071982A2 (fr) | 2009-12-08 | 2010-12-08 | Synthèse de composés prazoles |
Country Status (6)
Country | Link |
---|---|
US (4) | US8895271B2 (fr) |
EP (1) | EP2510089B1 (fr) |
CN (1) | CN102884178B (fr) |
SG (1) | SG181535A1 (fr) |
SI (1) | SI2510089T1 (fr) |
WO (1) | WO2011071982A2 (fr) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012078800A2 (fr) | 2010-12-08 | 2012-06-14 | Codexis, Inc. | Biocatalyseurs et procédés de synthèse de l'armodafinil |
CN106496191A (zh) * | 2016-10-19 | 2017-03-15 | 齐鲁天和惠世制药有限公司 | 一种s‑泮托拉唑钠的制备方法 |
EP3418929A1 (fr) | 2013-09-27 | 2018-12-26 | Codexis, Inc. | Sélection automatique de variantes enzymatiques |
US10787651B2 (en) | 2018-03-08 | 2020-09-29 | Jiangsu Aosaikang Pharmaceutical Co., Ltd. | Bradyrhizobium monooxygenase and use thereof for preparation of chiral sulfoxide |
EP3561049A4 (fr) * | 2016-12-23 | 2020-10-28 | Zhejiang Jingxin Pharmaceutical Co., Ltd. | Bactérie génétiquement modifiée de coexpression de cyclohexanone monooxygénase et d'isopropanol déshydrogénase et utilisation associée |
FR3097234A1 (fr) * | 2019-06-17 | 2020-12-18 | Arkema France | Procede selectif de preparation de sulfoxydes par catalyse enzymatique |
US11345900B2 (en) | 2018-11-26 | 2022-05-31 | Jiangsu Aosaikang Pharmaceutical Co., Ltd. | Monooxygenase and use in preparation of optically pure sulfoxide |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SI2510089T1 (sl) * | 2009-12-08 | 2015-12-31 | Codexis, Inc. | Sinteza prazolnih spojin |
CN105695425B (zh) * | 2014-11-26 | 2019-05-17 | 上海弈柯莱生物医药科技有限公司 | 一种环己酮单加氧酶及其在合成埃索美拉唑中的应用 |
CN108118035B (zh) * | 2016-11-30 | 2021-09-17 | 浙江京新药业股份有限公司 | 一种环己酮单加氧酶及其应用 |
CN108251465A (zh) * | 2016-12-28 | 2018-07-06 | 浙江京新药业股份有限公司 | 一种埃索美拉唑的酶法制备方法 |
CN108251466B (zh) * | 2016-12-28 | 2021-07-02 | 浙江京新药业股份有限公司 | 一种酶法合成埃索美拉唑的方法 |
CN108570439A (zh) * | 2017-03-13 | 2018-09-25 | 浙江京新药业股份有限公司 | 氧化还原酶的融合蛋白、基因工程菌及其制备方法和用途 |
CN109423484B (zh) * | 2017-09-04 | 2021-06-25 | 尚科生物医药(上海)有限公司 | 一种酮还原酶及其在制备(s)-2-氯-1-(3,4-二氟苯基)乙醇上的应用 |
CN107892683A (zh) * | 2018-02-06 | 2018-04-10 | 江苏中邦制药有限公司 | 一种埃索美拉唑的制备方法 |
FR3107903B1 (fr) * | 2020-03-09 | 2023-05-05 | Arkema France | Procédé chimio-enzymatique de coproduction d’un disulfure et d’un sulfoxyde ou d’une sulfone |
US11666562B2 (en) | 2020-04-02 | 2023-06-06 | Northwestern University | Ilaprazole for inhibiting the release of enveloped viruses from cells |
US20230265069A1 (en) * | 2020-04-23 | 2023-08-24 | Northwestern University | Small Molecule Inhibitors That Block the Budding of Enveloped Viruses |
CN112410312A (zh) * | 2020-11-27 | 2021-02-26 | 江南大学 | 一种环己酮单加氧酶及其应用 |
CN113583985B (zh) * | 2021-08-02 | 2023-08-01 | 华东理工大学 | 一种可以在毕赤酵母高效分泌的单加氧酶突变体及应用 |
CN114480315B (zh) * | 2022-02-16 | 2023-09-19 | 四川奥邦古得药业有限公司 | 一种Baeyer-Villiger单加氧酶及其在布立西坦合成中的应用 |
Citations (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0172478A1 (fr) | 1984-08-03 | 1986-02-26 | Gesellschaft für Biotechnologische Forschung mbH (GBF) | Procédé et appareil pour l'aération de liquides sans formation de bulles, plus particulièrement de milieux de culture pour la propagation de tissus |
WO1995022625A1 (fr) | 1994-02-17 | 1995-08-24 | Affymax Technologies N.V. | Mutagenese d'adn par fragmentation aleatoire et reassemblage |
US5538867A (en) | 1988-09-13 | 1996-07-23 | Elf Aquitaine | Process for the electrochemical regeneration of pyridine cofactors |
WO1997000078A1 (fr) | 1995-06-14 | 1997-01-03 | Valio Oy | Procedes de prevention ou de traitement des allergies |
WO1997035966A1 (fr) | 1996-03-25 | 1997-10-02 | Maxygen, Inc. | Procedes et compositions pour biotechniques metaboliques et cellulaires |
WO1998027230A1 (fr) | 1996-12-18 | 1998-06-25 | Maxygen, Inc. | Procedes et compositions pour l'ingenierie des polypeptides |
US5840552A (en) | 1994-11-28 | 1998-11-24 | Astra Aktiebolag | Preparation of pharmaceutically active compounds by biooxidation |
WO2000042651A1 (fr) | 1999-01-13 | 2000-07-20 | Hitachi, Ltd. | Dispositif a semi-conducteurs |
WO2000044744A1 (fr) | 1999-01-28 | 2000-08-03 | Astrazeneca Ab | Sel de potassium, de ($i(s))-omeprazole |
WO2000053731A2 (fr) | 1999-03-11 | 2000-09-14 | Eastman Chemical Company | Reductions enzymatiques avec de l'hydrogene par regeneration de cofacteur a catalyse metallique |
US6162816A (en) | 1996-12-20 | 2000-12-19 | Astrazeneca Ab | Crystalline form of the S-enantiomer of omeprazole |
WO2001075767A2 (fr) | 2000-03-30 | 2001-10-11 | Maxygen, Inc. | Selection de sites de recombinaison par enjambement in silico |
US6369085B1 (en) | 1997-05-30 | 2002-04-09 | Astrazeneca Ab | Form of S-omeprazole |
US6478964B1 (en) | 2001-05-18 | 2002-11-12 | Midwest Water Management, Llp | Floating fine-bubble aeration system |
US6495023B1 (en) | 1998-07-09 | 2002-12-17 | Michigan State University | Electrochemical methods for generation of a biological proton motive force and pyridine nucleotide cofactor regeneration |
US6537746B2 (en) | 1997-12-08 | 2003-03-25 | Maxygen, Inc. | Method for creating polynucleotide and polypeptide sequences |
WO2005018579A2 (fr) | 2003-08-11 | 2005-03-03 | Codexis, Inc. | Processus enzymatiques pour la production de derives d'acide 3-hydroxybutyrique 4-substitue et d'esters d'acide carboxylique substitue par cyano vicinal, par hydroxy |
US20050095619A1 (en) | 2003-08-11 | 2005-05-05 | Codexis, Inc. | Glucose dehydrogenase polypeptides and related polynucleotides |
US20050153417A1 (en) | 2003-08-11 | 2005-07-14 | Codexis, Inc. | Halohydrin dehalogenases and related polynucleotides |
US20060195947A1 (en) | 2003-08-11 | 2006-08-31 | Codexis, Inc. | Ketoreductase polypeptides and related polynucleotides |
US20080318295A1 (en) | 2007-02-08 | 2008-12-25 | Codexis, Inc. | Ketoreductases and Uses Thereof |
WO2009008908A2 (fr) | 2007-02-12 | 2009-01-15 | Codexis, Inc. | Relations structure-activité |
US20090093031A1 (en) | 2007-08-24 | 2009-04-09 | Codexis, Inc. | Ketoreductase Polypeptides for the Production of (R)-3-Hydroxythiolane |
US20090155863A1 (en) | 2007-09-28 | 2009-06-18 | Codexis, Inc. | Ketoreductase polypeptides and uses thereof |
US20090162909A1 (en) | 2007-10-01 | 2009-06-25 | Codexis, Inc. | Ketoreductase Polypeptides for the Production of Azetidinone |
US20090191605A1 (en) | 2007-09-13 | 2009-07-30 | Codexis, Inc. | Ketoreductase Polypeptides for the Reduction of Acetophenones |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6117679A (en) | 1994-02-17 | 2000-09-12 | Maxygen, Inc. | Methods for generating polynucleotides having desired characteristics by iterative selection and recombination |
US6376246B1 (en) | 1999-02-05 | 2002-04-23 | Maxygen, Inc. | Oligonucleotide mediated nucleic acid recombination |
US6498242B1 (en) * | 1999-02-19 | 2002-12-24 | E. I. Du Pont De Nemours And Company | Biological method for the production of adipic acid and intermediates |
US7214520B2 (en) | 2000-07-18 | 2007-05-08 | National Research Council Of Canada | Cloning, sequencing and expression of a comamonas cyclopentanone 1,2-monooxygenase-encoding gene in Escherichia coli |
US7105296B2 (en) | 2001-08-29 | 2006-09-12 | E. I. Du Pont De Nemours And Company | Genes encoding Baeyer-Villiger monooxygenases |
US20050084907A1 (en) | 2002-03-01 | 2005-04-21 | Maxygen, Inc. | Methods, systems, and software for identifying functional biomolecules |
SE0203092D0 (en) * | 2002-10-18 | 2002-10-18 | Astrazeneca Ab | Method for the synthesis of a benzimidazole compound |
WO2007027328A2 (fr) | 2005-07-28 | 2007-03-08 | University Of Iowa Research Foundation | Sulfoxydation et amidation microbienne d'acides benzhydrylsulfanylcarboxyliques et utilisations de celles-ci |
US20090312196A1 (en) | 2008-06-13 | 2009-12-17 | Codexis, Inc. | Method of synthesizing polynucleotide variants |
SI2510089T1 (sl) | 2009-12-08 | 2015-12-31 | Codexis, Inc. | Sinteza prazolnih spojin |
-
2010
- 2010-12-08 SI SI201031063T patent/SI2510089T1/sl unknown
- 2010-12-08 SG SG2012041521A patent/SG181535A1/en unknown
- 2010-12-08 EP EP10836590.9A patent/EP2510089B1/fr active Active
- 2010-12-08 US US13/514,750 patent/US8895271B2/en active Active
- 2010-12-08 WO PCT/US2010/059398 patent/WO2011071982A2/fr active Application Filing
- 2010-12-08 CN CN201080054980.3A patent/CN102884178B/zh active Active
-
2014
- 2014-10-30 US US14/528,708 patent/US9228216B2/en active Active
-
2015
- 2015-11-25 US US14/952,119 patent/US9422534B2/en active Active
-
2016
- 2016-07-15 US US15/211,329 patent/US9631181B2/en active Active
Patent Citations (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0172478A1 (fr) | 1984-08-03 | 1986-02-26 | Gesellschaft für Biotechnologische Forschung mbH (GBF) | Procédé et appareil pour l'aération de liquides sans formation de bulles, plus particulièrement de milieux de culture pour la propagation de tissus |
US5538867A (en) | 1988-09-13 | 1996-07-23 | Elf Aquitaine | Process for the electrochemical regeneration of pyridine cofactors |
WO1995022625A1 (fr) | 1994-02-17 | 1995-08-24 | Affymax Technologies N.V. | Mutagenese d'adn par fragmentation aleatoire et reassemblage |
US5840552A (en) | 1994-11-28 | 1998-11-24 | Astra Aktiebolag | Preparation of pharmaceutically active compounds by biooxidation |
WO1997000078A1 (fr) | 1995-06-14 | 1997-01-03 | Valio Oy | Procedes de prevention ou de traitement des allergies |
WO1997035966A1 (fr) | 1996-03-25 | 1997-10-02 | Maxygen, Inc. | Procedes et compositions pour biotechniques metaboliques et cellulaires |
WO1998027230A1 (fr) | 1996-12-18 | 1998-06-25 | Maxygen, Inc. | Procedes et compositions pour l'ingenierie des polypeptides |
US6162816A (en) | 1996-12-20 | 2000-12-19 | Astrazeneca Ab | Crystalline form of the S-enantiomer of omeprazole |
US6369085B1 (en) | 1997-05-30 | 2002-04-09 | Astrazeneca Ab | Form of S-omeprazole |
US6537746B2 (en) | 1997-12-08 | 2003-03-25 | Maxygen, Inc. | Method for creating polynucleotide and polypeptide sequences |
US6495023B1 (en) | 1998-07-09 | 2002-12-17 | Michigan State University | Electrochemical methods for generation of a biological proton motive force and pyridine nucleotide cofactor regeneration |
WO2000042651A1 (fr) | 1999-01-13 | 2000-07-20 | Hitachi, Ltd. | Dispositif a semi-conducteurs |
WO2000044744A1 (fr) | 1999-01-28 | 2000-08-03 | Astrazeneca Ab | Sel de potassium, de ($i(s))-omeprazole |
WO2000053731A2 (fr) | 1999-03-11 | 2000-09-14 | Eastman Chemical Company | Reductions enzymatiques avec de l'hydrogene par regeneration de cofacteur a catalyse metallique |
WO2001075767A2 (fr) | 2000-03-30 | 2001-10-11 | Maxygen, Inc. | Selection de sites de recombinaison par enjambement in silico |
US6478964B1 (en) | 2001-05-18 | 2002-11-12 | Midwest Water Management, Llp | Floating fine-bubble aeration system |
US20050095619A1 (en) | 2003-08-11 | 2005-05-05 | Codexis, Inc. | Glucose dehydrogenase polypeptides and related polynucleotides |
WO2005018579A2 (fr) | 2003-08-11 | 2005-03-03 | Codexis, Inc. | Processus enzymatiques pour la production de derives d'acide 3-hydroxybutyrique 4-substitue et d'esters d'acide carboxylique substitue par cyano vicinal, par hydroxy |
US20050153417A1 (en) | 2003-08-11 | 2005-07-14 | Codexis, Inc. | Halohydrin dehalogenases and related polynucleotides |
US20060195947A1 (en) | 2003-08-11 | 2006-08-31 | Codexis, Inc. | Ketoreductase polypeptides and related polynucleotides |
US20080318295A1 (en) | 2007-02-08 | 2008-12-25 | Codexis, Inc. | Ketoreductases and Uses Thereof |
WO2009008908A2 (fr) | 2007-02-12 | 2009-01-15 | Codexis, Inc. | Relations structure-activité |
US20090093031A1 (en) | 2007-08-24 | 2009-04-09 | Codexis, Inc. | Ketoreductase Polypeptides for the Production of (R)-3-Hydroxythiolane |
US20090191605A1 (en) | 2007-09-13 | 2009-07-30 | Codexis, Inc. | Ketoreductase Polypeptides for the Reduction of Acetophenones |
US20090155863A1 (en) | 2007-09-28 | 2009-06-18 | Codexis, Inc. | Ketoreductase polypeptides and uses thereof |
US20090162909A1 (en) | 2007-10-01 | 2009-06-25 | Codexis, Inc. | Ketoreductase Polypeptides for the Production of Azetidinone |
Non-Patent Citations (55)
Title |
---|
ALPHAND ET AL., TRENDS BIOTECHNOLOGY, vol. 21, no. 7, 2003, pages 318 - 323 |
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 410 |
ALTSCHUL ET AL., NUCLEIC ACIDS RES., 1977, pages 3389 - 3402 |
AUSUBEL. F.: "Current Protocols in Molecular Biology", 1998, GREENE PUB. ASSOCIATES |
BEAUCAGE ET AL., TET LETT, vol. 22, 1981, pages 1859 - 69 |
BLACK ET AL., PROC NATL ACAD SCI USA, vol. 93, 1996, pages 3525 - 3529 |
BOTSTEIN ET AL., SCIENCE, vol. 229, 1985, pages 1193 - 1201 |
CALDWELL ET AL., PCR METHODS APPL., vol. 3, 1994, pages S136 - S140 |
CARTER: "Site-directed mutagenesis", BIOCHEM. J., vol. 237, 1986, pages 1 - 7 |
CHEN ET AL., J. BACTERIOL., vol. 170, no. 2, 1988, pages 781 - 789 |
CHRISTIANS ET AL., NATURE BIOTECH, vol. 17, 1999, pages 259 - 264 |
COLONNA ET AL., CHIRALITY, vol. 13, no. 1, 2000, pages 40 - 42 |
COTTON ET AL., TETRAHEDRON: ASYMMETRY, vol. 11, 2000, pages 3819 |
CRAMERI ET AL., NATURE BIOTECH, vol. 14, 1996, pages 315 - 319 |
CRAMERI ET AL., NATURE BIOTECH, vol. 15, 1997, pages 436 - 438 |
CRAMERI ET AL., NATURE, vol. 391, 1998, pages 288 - 291 |
DALE ET AL., METHODS MOL. BIOL., vol. 57, 1996, pages 369 - 74 |
EISENBERG ET AL., J. MOL. BIOL., vol. 179, 1984, pages 125 - 142 |
F. M. AUSUBEL ET AL.: "Current Protocols in Molecular Biology", 1995, GREENE PUBLISHING ASSOCIATES, INC. |
FASMAN: "CRC Practical Handbook of Biochemistry and Molecular Biology", 1989, CRC PRESS, pages: 3 - 70 |
HENIKOFF; HENIKOFF, PROC NATL ACAD SCI USA, vol. 89, 1989, pages 10915 |
JOHANNES ET AL., APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 71, no. 10, 2005, pages 5728 - 5734 |
JOHANNES ET AL., BIOTECHNOLOGY AND BIOENGINEERING, vol. 96, no. 1, 2006, pages 18 - 26 |
KAYSER ET AL., JOURNAL OF ORGANIC CHEMISTRY, vol. 71, 2006, pages 8424 - 30 |
KRAMER ET AL., CELL, vol. 38, 1984, pages 879 - 887 |
LIGHT ET AL., BIOCHEMISTRY, vol. 21, no. 10, 1982, pages 2490 - 8 |
LING ET AL., ANAL. BIOCHEM., vol. 254, no. 2, 1997, pages 157 - 78 |
MALITO ET AL., PRO. NATL ACAD SCI USA, vol. 101, no. 36, 2004, pages 13157 - 13162 |
MATTHES ET AL., EMBO J., vol. 3, 1984, pages 801 - O5 |
MCLACHLAN ET AL., BIOTECHNOLOGY AND BIOENGINEERING, vol. 99, no. 2, 2008, pages 268 - 274 |
MIHOVILOVIC ET AL., ORGANIC LETT., vol. 8, no. 6, 2006, pages 1221 - 1224 |
MINSHULL ET AL., CURR OPIN CHEM BIOL, vol. 3, 1999, pages 284 - 290 |
MOHOVILOVIC, ORGANIC LETTERS, vol. 8, 2006, pages 1221 - 4 |
NEEDLEMAN; WUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 |
PASTA ET AL., TETRAHEDRON: ASYMMETRY, vol. 6, no. 4, 1995, pages 933 - 936 |
PEARSON; LIPMAN, PROC. NATL. ACAD. SCI. USA, vol. 85, 1988, pages 2444 |
PUIGB6 ET AL.: "OPTIMIZER: a web server for optimizing the codon usage of DNA sequences", NUCLEIC ACIDS RES., vol. 35, 16 April 2007 (2007-04-16), pages W126 - 31 |
REETZ ET AL., ANGEW CHEM. INT. ED., vol. 43, 2004, pages 4078 - 4081 |
REETZ ET AL., ANGEW. CHEM. INT. ED., vol. 43, 2004, pages 4075 - 4078 |
REETZ ET AL., ANGEW. CHEM. INT. ED., vol. 43, 2004, pages 4078 - 4081 |
REETZ ET AL., ANGEWANDTE CHEMIE INTERNATIONAL, vol. 43, 2004, pages 4078 - 81 |
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual, 3rd ed.", 2001, COLD SPRING HARBOR LABORATORY PRESS |
SCHNEIDER ET AL., ENZYME AND MICROBIAL TECHNOLOGY, vol. 17, no. 9, 1995, pages 839 - 847 |
SECUNDO ET AL., TETRAHEDRON: ASYMMETRY, vol. 4, 1993, pages 1981 - 2 |
SHENG ET AL., BIOCHEMISTRY, vol. 40, no. 37, 2001, pages 11156 - 67 |
SMITH, ANN. REV. GENET., vol. 19, 1985, pages 423 - 462 |
SMITH; WATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482 |
STEMMER, NATURE, vol. 370, 1994, pages 389 - 391 |
STEMMER, PROC NATL ACAD SCI USA, vol. 91, 1994, pages 10747 - 10751 |
VRTIS ET AL., ANGEWANDTE CHEMIE, vol. 41, no. 17, 2002, pages 3257 - 3259 |
WELLS ET AL., GENE, vol. 34, 1985, pages 315 - 323 |
WOODYER ET AL., BIOCHEMISTRY, vol. 42, no. 40, 2003, pages 11604 - 11614 |
YEUNG; RETTIE: "Prochiral Sulfoxidation as a probe for Flavin-Containing Monooxygenases", METHODS IN MOLECULAR BIOLOGY: CYTOCHROME P450 PROTOCOLS, vol. 320, 2005, pages 163 - 172 |
ZHANG ET AL., PROC NATL ACAD SCI USA, vol. 94, 1997, pages 45 - 4,4509 |
ZHAO ET AL., NAT. BIOTECHNOL., vol. 16, 1998, pages 258 - 261 |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012078800A2 (fr) | 2010-12-08 | 2012-06-14 | Codexis, Inc. | Biocatalyseurs et procédés de synthèse de l'armodafinil |
US9938509B2 (en) | 2010-12-08 | 2018-04-10 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
EP2649187A4 (fr) * | 2010-12-08 | 2015-01-14 | Codexis Inc | Biocatalyseurs et procédés de synthèse de l'armodafinil |
US9267159B2 (en) | 2010-12-08 | 2016-02-23 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
US9365835B2 (en) | 2010-12-08 | 2016-06-14 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
US9528095B2 (en) | 2010-12-08 | 2016-12-27 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
EP2649187A2 (fr) * | 2010-12-08 | 2013-10-16 | Codexis, Inc. | Biocatalyseurs et procédés de synthèse de l'armodafinil |
US9765306B2 (en) | 2010-12-08 | 2017-09-19 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
US10961517B2 (en) | 2010-12-08 | 2021-03-30 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
US10087426B2 (en) | 2010-12-08 | 2018-10-02 | Codexis, Inc. | Polynucleotides encoding cyclohexanone monooxygenases |
US11384343B2 (en) | 2010-12-08 | 2022-07-12 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
US10781429B2 (en) | 2010-12-08 | 2020-09-22 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
US10400223B2 (en) | 2010-12-08 | 2019-09-03 | Codexis, Inc. | Biocatalysts and methods for the synthesis of armodafinil |
US10557126B2 (en) | 2010-12-08 | 2020-02-11 | Codexis, Inc. | Polynucleotides encoding cyclohexanone monooxygenases |
EP3418929A1 (fr) | 2013-09-27 | 2018-12-26 | Codexis, Inc. | Sélection automatique de variantes enzymatiques |
CN106496191A (zh) * | 2016-10-19 | 2017-03-15 | 齐鲁天和惠世制药有限公司 | 一种s‑泮托拉唑钠的制备方法 |
CN106496191B (zh) * | 2016-10-19 | 2018-10-30 | 齐鲁天和惠世制药有限公司 | 一种s-泮托拉唑钠的制备方法 |
EP3561049A4 (fr) * | 2016-12-23 | 2020-10-28 | Zhejiang Jingxin Pharmaceutical Co., Ltd. | Bactérie génétiquement modifiée de coexpression de cyclohexanone monooxygénase et d'isopropanol déshydrogénase et utilisation associée |
US10787651B2 (en) | 2018-03-08 | 2020-09-29 | Jiangsu Aosaikang Pharmaceutical Co., Ltd. | Bradyrhizobium monooxygenase and use thereof for preparation of chiral sulfoxide |
US11345900B2 (en) | 2018-11-26 | 2022-05-31 | Jiangsu Aosaikang Pharmaceutical Co., Ltd. | Monooxygenase and use in preparation of optically pure sulfoxide |
FR3097234A1 (fr) * | 2019-06-17 | 2020-12-18 | Arkema France | Procede selectif de preparation de sulfoxydes par catalyse enzymatique |
FR3097232A1 (fr) * | 2019-06-17 | 2020-12-18 | Arkema France | Procede selectif de preparation de sulfoxydes par catalyse enzymatique |
WO2020254744A1 (fr) * | 2019-06-17 | 2020-12-24 | Arkema France | Procédé sélectif de préparation de sulfoxydes par catalyse enzymatique |
CN114174525A (zh) * | 2019-06-17 | 2022-03-11 | 阿科玛法国公司 | 通过酶催化制备亚砜的选择性工艺 |
Also Published As
Publication number | Publication date |
---|---|
US20160076008A1 (en) | 2016-03-17 |
US9422534B2 (en) | 2016-08-23 |
US8895271B2 (en) | 2014-11-25 |
SG181535A1 (en) | 2012-07-30 |
US20150056668A1 (en) | 2015-02-26 |
CN102884178B (zh) | 2014-12-03 |
CN102884178A (zh) | 2013-01-16 |
EP2510089A4 (fr) | 2013-06-12 |
US20130017580A1 (en) | 2013-01-17 |
EP2510089A2 (fr) | 2012-10-17 |
SI2510089T1 (sl) | 2015-12-31 |
US20160319252A1 (en) | 2016-11-03 |
EP2510089B1 (fr) | 2015-10-21 |
US9228216B2 (en) | 2016-01-05 |
WO2011071982A3 (fr) | 2011-12-22 |
US9631181B2 (en) | 2017-04-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9631181B2 (en) | Synthesis of prazole compounds | |
US10961517B2 (en) | Biocatalysts and methods for the synthesis of armodafinil | |
US10995320B2 (en) | Biocatalysts for Ezetimibe synthesis | |
EP2780448A1 (fr) | Biocatalyseurs pour la préparation de carbamates hydroxy substitués | |
WO2012142302A2 (fr) | Procédé biocatalytique permettant de préparer de l'eslicarbazépine et ses analogues |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201080054980.3 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10836590 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13514750 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 5934/CHENP/2012 Country of ref document: IN Ref document number: 2010836590 Country of ref document: EP |